Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A novel high specificity COVID-19 screening method based on simple blood exams and artificial intelligence

View ORCID ProfileFelipe Soares, View ORCID ProfileAline Villavicencio, View ORCID ProfileMichel José Anzanello, View ORCID ProfileFlávio Sanson Fogliatto, View ORCID ProfileMarco A. P. Idiart, Mark Stevenson
doi: https://doi.org/10.1101/2020.04.10.20061036
Felipe Soares
1University of Sheffield (Department of Computer Science) - United Kingdom
2Universidade Federal do Rio Grande do Sul (Department of Industrial Engineering) - Brazil
M.Phil
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Felipe Soares
  • For correspondence: fs@felipesoares.net
Aline Villavicencio
1University of Sheffield (Department of Computer Science) - United Kingdom
3Universidade Federal do Rio Grande do Sul (Institute of Informatics) - Brazil
Ph.D
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Aline Villavicencio
Michel José Anzanello
2Universidade Federal do Rio Grande do Sul (Department of Industrial Engineering) - Brazil
Ph.D
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michel José Anzanello
Flávio Sanson Fogliatto
2Universidade Federal do Rio Grande do Sul (Department of Industrial Engineering) - Brazil
Ph.D
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Flávio Sanson Fogliatto
Marco A. P. Idiart
4Universidade Federal do Rio Grande do Sul (Department of Physics) - Brazil
Ph.D
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marco A. P. Idiart
Mark Stevenson
1University of Sheffield (Department of Computer Science) - United Kingdom
Ph.D
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Summary

Background The SARS-CoV-2 virus responsible for COVID-19 poses a significant challenge to healthcare systems worldwide. Despite governmental initiatives aimed at containing the spread of the disease, several countries are experiencing unmanageable increases in the demand for ICU beds, medical equipment, and larger testing capacity. Efficient COVID-19 diagnosis enables healthcare systems to provide better care for patients while protecting caregivers from the disease. However, many countries are constrained by the limited amount of test kits available, the lack of equipment and trained professionals. In the case of patients visiting emergency rooms (ERs) with a suspect of COVID-19, a prompt diagnosis can improve the outcome and even provide information for efficient hospital management. In this context, a quick, inexpensive and readily available test to perform an initial triage at ER could help to smooth patient flow, provide better patient care, and reduce the backlog of exams.

Methods In this Case-control quantitative study, we developed a strategy backed by artificial intelligence to perform an initial screening of suspect COVID-19 cases. We developed a machine learning classifier that takes widely available simple blood exams as input and predicts if that suspect case is likely to be positive (having SARS-CoV-2) or negative(not having SARS-CoV-2). Based on this initial classification, positive cases can be referred for further highly sensitive testing (e.g. CT scan, or specific antibodies).

We used publicly available data from the Albert Einstein Hospital in Brazil from 5,644 patients. Focussing on using simple blood exams, a sample of 599 subjects that had the fewest missing values for 16 common exams were selected. From these 599 patients, only 81 were positive for SARS-CoV-2 (determined by RT-PCR).

Based on this data, we built an artificial intelligence classification framework, ER-CoV, aiming at determining which patients were more likely to be negative for SARS-CoV-2 when visiting an ER and that were categorized as a suspect case by medical professionals.

The primary goal of this investigation is to develop a classifier with high specificity and high negative predictive values, with reasonable sensitivity.

Findings We identified that our framework achieved an average specificity of 92.16% [95% CI 91.73 - 92.59] and negative predictive value (NPV) of 95.29% [95% CI 94.65% - 95.90%]. Those values are completely aligned with our goal of providing an effective low-cost system to triage suspected patients at ERs. As for sensitivity, our model achieved an average of 63.98% [95% CI 59.82% - 67.50%] and positive predictive value (PPV) of 48.00% [95% CI 44.88% - 51.56%].

An error analysis identified that, on average, 45% of the false negative results would have been hospitalized anyway, thus the model is making mistakes for severe cases that would not be overlooked, partially mitigating the fact that the test is not high-sensitive.

All code for our AI model, called ER-CoV is publicly available at https://github.com/soares-f/ER-CoV.

Interpretation Based on the capacity of our model to accurately predict which cases are negative from suspected patients arriving at emergency rooms, we envision that this framework can play an important role in patient triage. Probably the most important outcome is related to testing availability, which at this point is extremely low in many countries. Considering the achieved specificity, we would reduce by at least 90% the number of SARS-CoV-2 tests performed at emergency rooms, with the chance of getting a false negative at around 5%. The second important outcome is related to patient management in hospitals. Patients predicted as positive by our framework could be immediately separated from the other patients while waiting for the results of confirmatory tests. This could reduce the spread rate inside hospitals since in many hospitals all suspected cases are kept in the same ward. In Brazil, where the data was collected, rate infection is starting to quickly spread, the lead time of a SARS-CoV-2 can be up to 2 weeks.

Funding University of Sheffield provided financial support for the Ph.D scholarship for Felipe Soares

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

University of Sheffield provided financial support for the Ph.D scholarship for Felipe Soares.

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Source code is shared under AFPL License. No commercial derivatives are allowed without formal consent from the first author.

https://github.com/soares-f/ER-CoV

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted April 14, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A novel high specificity COVID-19 screening method based on simple blood exams and artificial intelligence
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A novel high specificity COVID-19 screening method based on simple blood exams and artificial intelligence
Felipe Soares, Aline Villavicencio, Michel José Anzanello, Flávio Sanson Fogliatto, Marco A. P. Idiart, Mark Stevenson
medRxiv 2020.04.10.20061036; doi: https://doi.org/10.1101/2020.04.10.20061036
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A novel high specificity COVID-19 screening method based on simple blood exams and artificial intelligence
Felipe Soares, Aline Villavicencio, Michel José Anzanello, Flávio Sanson Fogliatto, Marco A. P. Idiart, Mark Stevenson
medRxiv 2020.04.10.20061036; doi: https://doi.org/10.1101/2020.04.10.20061036

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (271)
  • Allergy and Immunology (559)
  • Anesthesia (135)
  • Cardiovascular Medicine (1774)
  • Dentistry and Oral Medicine (239)
  • Dermatology (173)
  • Emergency Medicine (316)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (662)
  • Epidemiology (10826)
  • Forensic Medicine (8)
  • Gastroenterology (595)
  • Genetic and Genomic Medicine (2963)
  • Geriatric Medicine (289)
  • Health Economics (534)
  • Health Informatics (1935)
  • Health Policy (836)
  • Health Systems and Quality Improvement (746)
  • Hematology (294)
  • HIV/AIDS (634)
  • Infectious Diseases (except HIV/AIDS) (12530)
  • Intensive Care and Critical Care Medicine (696)
  • Medical Education (300)
  • Medical Ethics (89)
  • Nephrology (325)
  • Neurology (2815)
  • Nursing (152)
  • Nutrition (433)
  • Obstetrics and Gynecology (560)
  • Occupational and Environmental Health (600)
  • Oncology (1475)
  • Ophthalmology (444)
  • Orthopedics (172)
  • Otolaryngology (258)
  • Pain Medicine (190)
  • Palliative Medicine (56)
  • Pathology (381)
  • Pediatrics (869)
  • Pharmacology and Therapeutics (368)
  • Primary Care Research (340)
  • Psychiatry and Clinical Psychology (2650)
  • Public and Global Health (5387)
  • Radiology and Imaging (1019)
  • Rehabilitation Medicine and Physical Therapy (598)
  • Respiratory Medicine (727)
  • Rheumatology (330)
  • Sexual and Reproductive Health (293)
  • Sports Medicine (279)
  • Surgery (328)
  • Toxicology (48)
  • Transplantation (150)
  • Urology (127)