Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

COVID-19 diagnosis prediction in emergency care patients: a machine learning approach

André Filipe de Moraes Batista, João Luiz Miraglia, Thiago Henrique Rizzi Donato, Alexandre Dias Porto Chiavegatto Filho
doi: https://doi.org/10.1101/2020.04.04.20052092
André Filipe de Moraes Batista
1Hospital Israelita Albert Einstein - Big Data Analytics, Av. Albert Einstein, 627/701, Morumbi, 05652-900, São Paulo, SP, Brazil
2Department of Epidemiology, School of Public Health, University of Sao Paulo, 715 Av Dr Arnaldo, Sao Paulo, SP, Brazil 01246-904
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: andre.fmbatista@einstein.br
João Luiz Miraglia
3Hospital Israelita Albert Einstein - Big Data Analytics
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thiago Henrique Rizzi Donato
3Hospital Israelita Albert Einstein - Big Data Analytics
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandre Dias Porto Chiavegatto Filho
5Department of Epidemiology, School of Public Health, University of Sao Paulo
6Programa de Apoio ao Desenvolvimento Institucional do SUS (PROADI-SUS)
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The coronavirus disease (COVID-19) pandemic has increased the necessity of immediate clinical decisions and effective usage of healthcare resources. Currently, the most validated diagnosis test for COVID-19 (RT-PCR) is in shortage in most developing countries, which may increase infection rates and delay important preventive measures. The objective of this study was to predict the risk of positive COVID-19 diagnosis with machine learning, using as predictors only results from emergency care admission exams. We collected data from 235 adult patients from the Hospital Israelita Albert Einstein in São Paulo, Brazil, from 17 to 30 of March, 2020, of which 102 (43%) received a positive diagnosis of COVID-19 from RT-PCR tests. Five machine learning algorithms (neural networks, random forests, gradient boosting trees, logistic regression and support vector machines) were trained on a random sample of 70% of the patients, and performance was tested on new unseen data (30%). The best predictive performance was obtained by the support vector machines algorithm (AUC: 0.85; Sensitivity: 0.68; Specificity: 0.85; Brier Score: 0.16). The three most important variables for the predictive performance of the algorithm were the number of lymphocytes, leukocytes and eosinophils, respectively. In conclusion, we found that targeted decisions for receiving COVID-19 tests using only routinely-collected data is a promising new area with the use of machine learning algorithms.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

We received funding from the Ministry of Health’s Institutional Development Program of the Brazilian National Health System (PROADI-SUS) "Utilização de Técnicas Avançadas de Análise de Dados (Big Data) e Inovação para Apoio ao Planejamento e Desenvolvimento de Políticas em Saúde" (NUP: 25000.028646/2018-10).

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Due to the nature of this research, participants of this study did not agree to share publicly their individual data, so supporting data is not available.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted April 14, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
COVID-19 diagnosis prediction in emergency care patients: a machine learning approach
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
COVID-19 diagnosis prediction in emergency care patients: a machine learning approach
André Filipe de Moraes Batista, João Luiz Miraglia, Thiago Henrique Rizzi Donato, Alexandre Dias Porto Chiavegatto Filho
medRxiv 2020.04.04.20052092; doi: https://doi.org/10.1101/2020.04.04.20052092
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
COVID-19 diagnosis prediction in emergency care patients: a machine learning approach
André Filipe de Moraes Batista, João Luiz Miraglia, Thiago Henrique Rizzi Donato, Alexandre Dias Porto Chiavegatto Filho
medRxiv 2020.04.04.20052092; doi: https://doi.org/10.1101/2020.04.04.20052092

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (70)
  • Allergy and Immunology (168)
  • Anesthesia (50)
  • Cardiovascular Medicine (451)
  • Dentistry and Oral Medicine (83)
  • Dermatology (55)
  • Emergency Medicine (157)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (191)
  • Epidemiology (5258)
  • Forensic Medicine (3)
  • Gastroenterology (195)
  • Genetic and Genomic Medicine (757)
  • Geriatric Medicine (80)
  • Health Economics (213)
  • Health Informatics (698)
  • Health Policy (358)
  • Health Systems and Quality Improvement (223)
  • Hematology (99)
  • HIV/AIDS (163)
  • Infectious Diseases (except HIV/AIDS) (5867)
  • Intensive Care and Critical Care Medicine (361)
  • Medical Education (104)
  • Medical Ethics (25)
  • Nephrology (83)
  • Neurology (764)
  • Nursing (43)
  • Nutrition (130)
  • Obstetrics and Gynecology (142)
  • Occupational and Environmental Health (231)
  • Oncology (479)
  • Ophthalmology (152)
  • Orthopedics (38)
  • Otolaryngology (95)
  • Pain Medicine (39)
  • Palliative Medicine (20)
  • Pathology (141)
  • Pediatrics (223)
  • Pharmacology and Therapeutics (136)
  • Primary Care Research (96)
  • Psychiatry and Clinical Psychology (862)
  • Public and Global Health (2011)
  • Radiology and Imaging (348)
  • Rehabilitation Medicine and Physical Therapy (158)
  • Respiratory Medicine (285)
  • Rheumatology (94)
  • Sexual and Reproductive Health (74)
  • Sports Medicine (76)
  • Surgery (109)
  • Toxicology (25)
  • Transplantation (29)
  • Urology (39)