Gender-Based Disparities in COVID-19 Patient Outcomes: A Propensity-matched Analysis ==================================================================================== * Shailendra Singh * Monica Chowdhry * Arka Chatterjee * Ahmad Khan ## Abstract COVID-19 epidemiological data show higher mortality among males as compared to females. However, it remains unclear if this disparity is due to gender differences in high-risk characteristics. Our study, including a large cohort of male and female patients, showed that males have a higher risk for mortality, hospitalization and mechanical ventilation even when compared to a matched cohort of females with similar age, high-risk behavior, and comorbidities. This gender-based risk of poor outcomes among COVID-19 patients is especially more pronounced in advanced age. High-risk characteristics only partially explain the gender disparity, and further research is needed to understand the causes of this disparity. ## Introduction Coronavirus disease 2019 (COVID-19) has emerged as unprecedented global health, economic, and social crisis. More than 2 million COVID-19 cases and 160,00 deaths have been reported worldwide as of April 20, 20201,2. Epidemiological data reported across multiple countries show markedly higher mortality for males compared to females3,4. It needs to be determined if the gender disparity in the COVID-19 mortality is true or just an interaction of gender differences in risk characteristics5. Therefore, we studied the clinical characteristics of a large and diverse cohort of COVID-19 patients stratified by gender and determined the outcomes after extensive propensity matching for risk factors and comorbidities. ## Methods The study was conducted using TriNetX (Cambridge, MA, USA), a global federated health research network providing real-time access to electronic medical records (EMR) of more than 49 million patients from 37 healthcare organizations (HCOs) predominantly in the United States. COVID-19 Research Network was recently created, data inflow fast-tracked with COVID-19 specific diagnosis and terminology following the WHO and CDC COVID-19 guidelines6–8. As a federated network, TriNetX received a waiver from Western IRB since only aggregated counts, statistical summaries of de-identified information, but no protected health information is received, and no study-specific activities are performed in retrospective analyses. Both the patients and HCO’s as data sources stay anonymous. A search query was made (April 17-18, 2020) in the EMR of patients on the COVID-19 research network to identify male and female patients (≥ 10 years age) diagnosed with COVID-19 between January 20, 2020, and April 15, 2020. The primary outcome was risk of mortality, mechanical ventilation, and hospitalization within 30 days after the diagnosis of COVID-19. All statistical analyses were performed in real-time using TriNetX. TriNetX obfuscates patient counts to safeguard protected health information (PHI) by rounding patient counts in analyses up to the nearest 10. A 1:1 propensity score matching was performed using a greedy nearest-neighbor matching algorithm. For each outcome, the risk difference and risk ratio were calculated. We also performed Kaplan-Meier survival analyses to estimate the survival probability of each outcome at the end 30 days after the diagnosis of COVID-19. A-priori defined two-sided alpha of less than <0.05 was used for statistical significance. Details of data source, search criteria, the definition of variables, and statistical analysis are available in Supplementary. ## Results We identified a total of 5980 males and 7730 females diagnosed with COVID-19 from 20 HCOs. Of these, 45% of males and 50% of females were from the United States. Males were older than the females (54.9 ± 18.3 vs. 50.9 ± 18.4, p-value <0.0001). There were significant differences in patient demographics, risk behaviors, and comorbidities among male and female patients with COVID-19 (Table 1). Age, race, high-risk behavior including nicotine, and all comorbidities (mentioned in Table 1), including diabetes, hypertension, chronic lung disease, cardiovascular disease, chronic kidney disease, among others, were selected for propensity matching. The groups (N=5350 each group) were well balanced after matching (Table 1). Presenting signs and symptoms and most recent laboratory findings after COVID-19 are also compared in Table 1. Mean values of serum creatinine, liver function tests, and inflammatory markers such as Ferritin, CRP, and IL-6 were all significantly higher in males. View this table: [Table 1:](http://medrxiv.org/content/early/2020/05/12/2020.04.24.20079046/T1) Table 1: Comparison of patient characteristics, presenting signs and symptoms, and laboratory findings among COVID-19 male and female patients. Demographics and comorbidities are compared before and after propensity matching of cohorts. Males had a significantly higher risk for mortality before (Risk Ratio (RR) 2.1, 95% CI 1.8-2.4) and after matching (RR 1.4, 95% CI 1.21.7) (Table 2). Kaplan-Meier survival curves showed significantly different 30-day survival probability for mortality after diagnosis of COVID-19 in matched male (80%) and female cohorts (87%) (p-value <0.0001) (Figure 1). The risk of hospitalization (RR 1.3, 95%CI 1.2-1.4) and mechanical ventilation (RR 1.71, 95% CI 1.3-2.3) was significantly higher in males (Table 2) even after matching of cohorts. A sub-analysis of the cohorts based on age was also performed. Patients (≥ 50 years (mean age of cohort) were considered as advanced age, and 93.9% of overall mortality was in patients ≥ 50 years. Males with advanced age had higher mortality than matched females of similar age group (RR 1.6, 95% CI 1.4-1.8) (Table 2). View this table: [Table 2:](http://medrxiv.org/content/early/2020/05/12/2020.04.24.20079046/T2) Table 2: Outcomes among COVID-19 male and female patients. Outcomes are compared before and after propensity score matching of cohorts. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/12/2020.04.24.20079046/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2020/05/12/2020.04.24.20079046/F1) Figure 1: Kaplan-Meier survival curves estimating the 30-day survival probability among propensity-matched male and female cohort of COVID-19 patients. ## Discussion Our study, including a large cohort of male and female COVID 19 patients, showed that the overall risk for mortality, hospitalization, and mechanical ventilation was significantly higher for males as compared to females. We meticulously compared the characteristics of male and female patients with COVID-19 and found a higher rate of high-risk behaviors and comorbidities in male patients. Even after propensity score matching of the cohorts for age, race, nicotine use, diabetes, hypertension, chronic lung diseases, cardiovascular diseases, chronic kidney disease, and many other comorbidities/risk behaviors, gender disparity in outcomes persist. The vast majority of the deaths in COVID-19 occurred in patients with advanced age (≥50 years), and the gender-dependent risk of poor outcomes after COVID-19 was more pronounced with advancing age. The risk of mortality in patients with advanced age was higher compared to a matched group of females of similar age. In younger patients, the overall mortality is low, and therefore mortality differences might not be apparent. We observed higher levels of serum creatinine, liver function tests, and inflammatory markers like ferritin, CRP, and IL-6 in males, likely suggesting more severe disease. COVID-19 gender differences in mortality are similar to the 2002-2003 SARS-CoV epidemic and the later MERS-CoV outbreak9,10. Multiple factors likely contribute to the disparity in gender-specific outcomes. Age and high-risk comorbid conditions could only partially explain the stark differences reported with unadjusted data. The higher smoking rate in men is speculated as a possible reason11, we matched our cohorts for both past and current nicotine use and still found significant differences in outcome. Moreover, the difference in rates of smoking is unlikely to explain the magnitude of gender disparity seen in outcomes. Although gender-specific variations in angiotensin-converting enzyme 2 (ACE2) receptor expression have been suggested (preprint)12, this needs further confirmation. Immune-defense factors likely have a substantial role. Males and females respond differently to many virus infections, and generally, females mount a more robust protective immune response13,14. Differences in the production of sex-specific hormones, immune response X-linked genes, and the disease-susceptibility genes among males and females likely impact the immune response and disease severity13,14. Besides biological factors, many social, behavioral, and cultural differences might also have a role. Although our data represent one of the largest and most diverse cohorts of COVID-19 patients, our study has several limitations. The data derived from EMR based database is susceptible to errors in coding or data entry15. However, the ability of TriNetX to aggregate the data directly from the EMRs in a real-time fashion minimizes the risk of data collection errors at the investigator’s end. Patient counts were rounded up to the nearest ten, and that may influence results for infrequent outcomes; however, most of our outcomes had a relatively large number of patients. Gender differences in asymptomatic or mild cases of COVID-19 that remain undiagnosed or did not seek medical care were not accounted for16, and hence their role might be undervalued. In conclusion, males are more severely affected and have higher mortality from COVID-19. This gender-specific risk is especially more pronounced in advanced age. Gender disparity in poor outcomes can only be partially explained by differences in high-risk behavior and comorbidities. Further research is needed to understand the causes of this disparity and may be beneficial for decisions in both clinical and policy-making realms. ## Data Availability None ## Acknowledgment We acknowledge the West Virginia Clinical and Translational Science Institute to provide us access, and training to the TriNETX global healthcare network. We also acknowledge the TriNETX (Cambridge, MA, USA) healthcare network for design assistance to complete this project. * Received April 24, 2020. * Revision received May 7, 2020. * Accepted May 12, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU). [https://coronavirus.jhu.edu/map.html](https://coronavirus.jhu.edu/map.html). Accessed April 19, 2020. 2. 2.WHO Coronavirus Disease 2019 (COVID-19) Situation Report—91. [https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200420-sitrep-91-covid-19.pdf?sfvrsn=fcf0670b_4](https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200420-sitrep-91-covid-19.pdf?sfvrsn=fcf0670b_4). 3. 3.Team TNCPERE. The Epidemiological Characteristics of an Outbreak of 2019 Novel Coronavirus Diseases (COVID-19) — China, 2020. China CDC Wkly. 2020. 4. 4.COVID-19 sex-disaggregated data tracker, [https://globalhealth5050.org/covidl9/](https://globalhealth5050.org/covidl9/). 5. 5.Liu S, Zhang M, Yang L, et al. Prevalence and patterns of tobacco smoking among Chinese adult men and women: Findings of the 2010 national smoking survey. J Epidemiol Community Health. 2017. doi:10.1136/jech-2016-207805 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czo4OiI3MS8yLzE1NCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA1LzEyLzIwMjAuMDQuMjQuMjAwNzkwNDYuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 6. 6.TriNetX. [https://www.trinetx.com/covid-19/](https://www.trinetx.com/covid-19/). 7. 7.ICD-10-CM Official Coding Guidelines - Supplement Coding encounters related to COVID-19 Coronavirus Outbreak. [https://www.cdc.gov/nchs/data/icd/ICD-10-CM-Official-Coding-Gudance-lnterim-Advice-coronavirus-feb-20-2020.pdf](https://www.cdc.gov/nchs/data/icd/ICD-10-CM-Official-Coding-Gudance-lnterim-Advice-coronavirus-feb-20-2020.pdf). 8. 8.ICD-10-CM Official Coding and Reporting Guidelines April 1, 2020 through September 30, 2020. [https://www.cdc.gov/nchs/data/icd/COVID-19-guidelines-final.pdf](https://www.cdc.gov/nchs/data/icd/COVID-19-guidelines-final.pdf). 9. 9.Karlberg J, Chong DSY, Lai WYY. Do Men Have a Higher Case Fatality Rate of Severe Acute Respiratory Syndrome than Women Do? Am J Epidemiol. 2004. doi:10.1093/aje/kwh056 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwh056&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14742282&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F12%2F2020.04.24.20079046.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188614900004&link_type=ISI) 10. 10.Alghamdi IG, Hussain II, Almalki SS, Alghamdi MS, Alghamdi MM, El-Sheemy MA. The pattern of Middle east respiratory syndrome coronavirus in Saudi Arabia: A descriptive epidemiological analysis of data from the Saudi Ministry of Health. Int J Gen Med. 2014. doi:10.2147/IJGM.S67061 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/IJGM.S67061&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25187734&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F12%2F2020.04.24.20079046.atom) 11. 11.Cai H. Sex difference and smoking predisposition in patients with COVID-19. Lancet Respir Med. 2020. doi:10.1016/S2213-2600(20)30117-X [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2213-2600(20)30117-X&link_type=DOI) 12. 12.Zhao Y, Zhao Z, Wang Y, Zhou Y, Ma Y, Zuo W. Single-cell RNA expression profiling of ACE2, the putative receptor of Wuhan 2019-nCov. *bioRxiv*. January 2020:2020.01.26.919985. doi:10.1101/2020.01.26.919985 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wMS4yNi45MTk5ODV2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA1LzEyLzIwMjAuMDQuMjQuMjAwNzkwNDYuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 13. 13.Klein SL, Flanagan KL. Sex differences in immune responses. Nat Rev Immunol. 2016. doi:10.1038/nri.2016.90 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nri.2016.90&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27546235&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F12%2F2020.04.24.20079046.atom) 14. 14.Channappanavar R, Fett C, Mack M, Ten Eyck PP, Meyerholz DK, Perlman S. Sex-Based Differences in Susceptibility to Severe Acute Respiratory Syndrome Coronavirus Infection. J Immunol. 2017. doi:10.4049/jimmunol.1601896 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiamltbXVub2wiO3M6NToicmVzaWQiO3M6MTE6IjE5OC8xMC80MDQ2IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDUvMTIvMjAyMC4wNC4yNC4yMDA3OTA0Ni5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 15. 15.Bowman S. Impact of electronic health record systems on information integrity: quality and safety implications. Perspect Health Inf Manag. 2013. 16. 16.Nishiura H, Kobayashi T, Suzuki A, et al. Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19). Int J Infect Dis. 2020. doi:10.1016/j.ijid.2020.03.020 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2020.03.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F12%2F2020.04.24.20079046.atom)