medRxiv

Identifying novel factors associated with COVID-19 transmission and fatality using the machine learning approach

Mengyuan Li, Zhilan Zhang, Wenxiu Cao, Yijing Liu, Beibei Du, Canping Chen, Qian Liu, Md. Nazim Uddin, Shanmei Jiang, Cai Chen, Yue Zhang, Xiaosheng Wang
doi: https://doi.org/10.1101/2020.06.10.20127472
Mengyuan Li
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Zhilan Zhang
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Wenxiu Cao
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Yijing Liu
3School of Life Science and Technology, China Pharmaceutical University, Nanjing 211198, China
Beibei Du
3School of Life Science and Technology, China Pharmaceutical University, Nanjing 211198, China
Canping Chen
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Qian Liu
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Md. Nazim Uddin
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Shanmei Jiang
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
Cai Chen
4Department of Electrical and Computer Engineering, University of California, San Diego, La Jolla, CA 92093, USA
Yue Zhang
5Futian Hospital for Rheumatic Diseases, Shenzhen 518000, China
6Pinghu hospital of Shenzhen university, Shenzhen 440307, China
7Department of Rheumatology and Immunology, The First Clinical college of Harbin Medical University, Harbin 150001, China
Xiaosheng Wang
1Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing 211198, China
2Big Data Research Institute, China Pharmaceutical University, Nanjing 211198, China
  • For correspondence: xiaosheng.wang@cpu.edu.cn

Abstract

The COVID-19 virus has infected millions of people and resulted in hundreds of thousands of deaths worldwide. Using a logistic regression model, we identified novel critical factors associated with COVID-19 cases, deaths, and case fatality rates in 154 countries and in the 50 U.S. states. Among the numerous factors associated with COVID-19 risk, we found that the unitary state system was, counterintuitively, positively associated with COVID-19 cases and deaths. Blood type B was a protective factor for COVID-19 risk, while blood type A was a risk factor. The prevalence of HIV, of influenza and pneumonia, and of chronic lower respiratory diseases was associated with reduced COVID-19 risk. Obesity and unimproved water sources were associated with increased COVID-19 risk. Other factors included temperature, humidity, social distancing, smoking, and vitamin D intake. Our comprehensive identification of the factors affecting COVID-19 transmission and fatality may provide new insights into the COVID-19 pandemic and inform effective strategies for preventing and mitigating COVID-19 spread.
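
As a rough illustration of the kind of analysis the abstract describes, the sketch below fits a logistic regression to a binary outcome and scores it by cross-validated AUC. It is not the authors' pipeline: the data are synthetic, the two features are unnamed placeholders, and every function name (`fit_logistic`, `auc`, `cross_validated_auc`) and hyperparameter is an assumption made for this example. Only the Python standard library is used.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.1, epochs=500):
    """Fit weights (bias first) by batch gradient descent on the log-loss."""
    n, d = len(X), len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            err = sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))) - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def predict_proba(w, X):
    return [sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))) for xi in X]


def auc(y_true, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for s, t in zip(scores, y_true) if t == 1]
    neg = [s for s, t in zip(scores, y_true) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def cross_validated_auc(X, y, k=5, seed=0):
    """Mean AUC over k folds; each fold is held out exactly once."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    aucs = []
    for fold in folds:
        held = set(fold)
        train = [i for i in idx if i not in held]
        w = fit_logistic([X[i] for i in train], [y[i] for i in train])
        scores = predict_proba(w, [X[i] for i in fold])
        aucs.append(auc([y[i] for i in fold], scores))
    return sum(aucs) / k


# Synthetic stand-in data (NOT from the paper): 150 "regions", two standardized
# placeholder factors, and a binary label such as "high" vs "low" case rate.
rng = random.Random(42)
X, y = [], []
for _ in range(150):
    f1, f2 = rng.gauss(0, 1), rng.gauss(0, 1)
    X.append([f1, f2])
    y.append(1 if rng.random() < sigmoid(2.0 * f1 - 1.0 * f2) else 0)

weights = fit_logistic(X, y)          # [bias, coefficient of f1, coefficient of f2]
mean_auc = cross_validated_auc(X, y)  # held-out discrimination, between 0 and 1
```

In a sketch like this, the sign of each fitted coefficient indicates whether a factor is associated with higher or lower odds of the outcome, and the cross-validated AUC indicates how well the model separates the two classes on data it was not fitted to.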

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by the China Pharmaceutical University (grant number 3150120001 to XW).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data referred to in the manuscript are available.

  • List of abbreviations

    SARS-CoV-2: 2019 novel coronavirus
    CFRs: case fatality rates
    BMI: body mass index
    ρ: correlation coefficient
    AUC: the area under the receiver operating characteristic curve
    CV: cross validation
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Posted June 12, 2020.

    Subject Area

    • Epidemiology