Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Predicting Prognosis in COVID-19 Patients using Machine Learning and Readily Available Clinical Data

Thomas W. Campbell, Melissa P. Wilson, Heinrich Roder, Samantha MaWhinney, Robert W. Georgantas III, Laura K. Maguire, Joanna Roder, Kristine M. Erlandson
doi: https://doi.org/10.1101/2021.01.29.21250762
Thomas W. Campbell
1Biodesix
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: thomas.campbell@biodesix.com
Melissa P. Wilson
2Department of Medicine, Division of Personalized Medicine and Bioinformatics, University of Colorado-Anschutz Medical Campus, Aurora, CO
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heinrich Roder
1Biodesix
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Samantha MaWhinney
3Department of Biostatistics and Informatics, University of Colorado, Colorado School of Public Health
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert W. Georgantas III
1Biodesix
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Laura K. Maguire
1Biodesix
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joanna Roder
1Biodesix
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kristine M. Erlandson
4Department of Medicine, Division of Infectious Diseases, University of Colorado-Anschutz Medical Campus, Aurora, CO
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Rationale Prognostic tools for aiding in the treatment of hospitalized COVID-19 patients could help improve outcome by identifying patients at higher or lower risk of severe disease.

Objectives The study objective was to develop models to stratify patients by risk of severe outcomes during COVID-19 hospitalization using readily available information at hospital admission.

Methods Hierarchical ensemble classification models were trained on a set of 229 patients hospitalized with COVID-19 to predict severe outcomes, including ICU admission, development of ARDS, or intubation, using easily attainable attributes including basic patient characteristics, vital signs at admission, and basic lab results collected at time of presentation. Each test stratifies patients into groups of increasing risk. An additional cohort of 330 patients was used for blinded, independent validation. Shapley value analysis evaluated which attributes contributed most to the models’ predictions of risk.

Measurements and Main Results Test performance was assessed using precision (positive predictive value) and recall (sensitivity) of the final risk groups. All test cut-offs were fixed prior to blinded validation. In both development and validation, the tests achieved precision in the lowest risk groups near or above 0.9. The proportion of patients with severe outcomes significantly increased across increasing risk groups. While the importance of attributes varied by test and patient, CRP, LDH, and D-dimer were often found to be important in the assignment of risk label.

Conclusions Risk of severe outcomes for patients hospitalized with COVID-19 infection can be assessed using machine learning-based models based on attributes routinely collected at hospital admission.

Competing Interest Statement

All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: TWC, JR, HR, LM, RWG have a provisional patent method assigned to Biodesix for predicting risk of unfavorable outcome in COVID-19. TWC, JR, HR, LM, RWG have stocks and/or stock options in Biodesix. KME reports grants from NIH, during the conduct of the study; grants from Gilead, personal fees from Theratechnologies, personal fees from ViiV, outside the submitted work

Funding Statement

Supported by NIH/NIA R01 AG054366-05S1 and NIH/NCATS Colorado CTSA Grant Number UL1 TR002535. Contents are the authors' sole responsibility and do not necessarily represent official NIH views. Supported by the Health Data Compass Data Warehouse project (healthdatacompass.org)

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study was reviewed and approved by the Colorado Multiple Institutional Review Board.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵* co-senior authors

  • Supported by NIH/NIA R01 AG054366-05S1 and NIH/NCATS Colorado CTSA Grant Number UL1 TR002535. Contents are the authors’ sole responsibility and do not necessarily represent official NIH views.

  • Supported by the Health Data Compass Data Warehouse project (healthdatacompass.org)

Data Availability

Data and trained models may be made available upon request to the corresponding author: thomas.campbell{at}biodesix.com

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted February 01, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Predicting Prognosis in COVID-19 Patients using Machine Learning and Readily Available Clinical Data
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Predicting Prognosis in COVID-19 Patients using Machine Learning and Readily Available Clinical Data
Thomas W. Campbell, Melissa P. Wilson, Heinrich Roder, Samantha MaWhinney, Robert W. Georgantas III, Laura K. Maguire, Joanna Roder, Kristine M. Erlandson
medRxiv 2021.01.29.21250762; doi: https://doi.org/10.1101/2021.01.29.21250762
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Predicting Prognosis in COVID-19 Patients using Machine Learning and Readily Available Clinical Data
Thomas W. Campbell, Melissa P. Wilson, Heinrich Roder, Samantha MaWhinney, Robert W. Georgantas III, Laura K. Maguire, Joanna Roder, Kristine M. Erlandson
medRxiv 2021.01.29.21250762; doi: https://doi.org/10.1101/2021.01.29.21250762

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Intensive Care and Critical Care Medicine
Subject Areas
All Articles
  • Addiction Medicine (228)
  • Allergy and Immunology (504)
  • Anesthesia (110)
  • Cardiovascular Medicine (1240)
  • Dentistry and Oral Medicine (206)
  • Dermatology (147)
  • Emergency Medicine (282)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (531)
  • Epidemiology (10023)
  • Forensic Medicine (5)
  • Gastroenterology (499)
  • Genetic and Genomic Medicine (2453)
  • Geriatric Medicine (238)
  • Health Economics (479)
  • Health Informatics (1643)
  • Health Policy (753)
  • Health Systems and Quality Improvement (636)
  • Hematology (248)
  • HIV/AIDS (533)
  • Infectious Diseases (except HIV/AIDS) (11864)
  • Intensive Care and Critical Care Medicine (626)
  • Medical Education (252)
  • Medical Ethics (75)
  • Nephrology (268)
  • Neurology (2281)
  • Nursing (139)
  • Nutrition (352)
  • Obstetrics and Gynecology (454)
  • Occupational and Environmental Health (537)
  • Oncology (1245)
  • Ophthalmology (377)
  • Orthopedics (134)
  • Otolaryngology (226)
  • Pain Medicine (158)
  • Palliative Medicine (50)
  • Pathology (324)
  • Pediatrics (730)
  • Pharmacology and Therapeutics (313)
  • Primary Care Research (282)
  • Psychiatry and Clinical Psychology (2280)
  • Public and Global Health (4834)
  • Radiology and Imaging (837)
  • Rehabilitation Medicine and Physical Therapy (492)
  • Respiratory Medicine (651)
  • Rheumatology (285)
  • Sexual and Reproductive Health (238)
  • Sports Medicine (227)
  • Surgery (267)
  • Toxicology (44)
  • Transplantation (125)
  • Urology (99)