medRxiv

Augmented Curation of Clinical Notes from a Massive EHR System Reveals Symptoms of Impending COVID-19 Diagnosis

Tyler Wagner, FNU Shweta, Karthik Murugadoss, Samir Awasthi, AJ Venkatakrishnan, Sairam Bade, Arjun Puranik, Martin Kang, Brian W. Pickering, John C. O’Horo, Philippe R. Bauer, Raymund R. Razonable, Paschalis Vergidis, Zelalem Temesgen, Stacey Rizza, Maryam Mahmood, Walter R. Wilson, Douglas Challener, Praveen Anand, Matt Liebers, Zainab Doctor, Eli Silvert, Hugo Solomon, Akash Anand, Rakesh Barve, Gregory J. Gores, Amy W. Williams, William G. Morice II, John Halamka, Andrew D. Badley, Venky Soundararajan
doi: https://doi.org/10.1101/2020.04.19.20067660
1 nference, Cambridge MA, USA
2 Mayo Clinic, Rochester MN, USA
3 nference Labs, Bangalore, India
4 Mayo Clinic Laboratories, Rochester MN, USA

Tyler Wagner (1), FNU Shweta (2), Karthik Murugadoss (1), Samir Awasthi (1), AJ Venkatakrishnan (1), Sairam Bade (3), Arjun Puranik (1), Martin Kang (1), Brian W. Pickering (2), John C. O’Horo (2), Philippe R. Bauer (2), Raymund R. Razonable (2), Paschalis Vergidis (2), Zelalem Temesgen (2), Stacey Rizza (2), Maryam Mahmood (2), Walter R. Wilson (2), Douglas Challener (2), Praveen Anand (3), Matt Liebers (1), Zainab Doctor (1), Eli Silvert (1), Hugo Solomon (1), Akash Anand (3), Rakesh Barve (3), Gregory J. Gores (2), Amy W. Williams (2), William G. Morice II (2, 4), John Halamka (2), Andrew D. Badley (2), Venky Soundararajan (1)

For correspondence: Badley.Andrew{at}mayo.edu venky{at}nference.net

Abstract

Understanding temporal dynamics of COVID-19 patient symptoms could provide fine-grained resolution to guide clinical decision-making. Here, we use deep neural networks over an institution-wide platform for the augmented curation of clinical notes from 77,167 patients subjected to COVID-19 PCR testing. By contrasting Electronic Health Record (EHR)-derived symptoms of COVID-19-positive (COVIDpos; n=2,317) versus COVID-19-negative (COVIDneg; n=74,850) patients for the week preceding the PCR testing date, we identify anosmia/dysgeusia (27.1-fold), fever/chills (2.6-fold), respiratory difficulty (2.2-fold), cough (2.2-fold), myalgia/arthralgia (2-fold), and diarrhea (1.4-fold) as significantly amplified in COVIDpos over COVIDneg patients. The combination of cough and fever/chills has 4.2-fold amplification in COVIDpos patients during the week prior to PCR testing, and along with anosmia/dysgeusia, constitutes the earliest EHR-derived signature of COVID-19. This study introduces an Augmented Intelligence platform for the real-time synthesis of institutional biomedical knowledge. The platform holds tremendous potential for scaling up curation throughput, thus enabling EHR-powered early disease diagnosis.
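
The fold values quoted above can be read as a ratio of symptom prevalence between the two cohorts. The following is a minimal sketch under that assumption; the abstract itself does not spell out the formula, and the function name `fold_amplification` and the symptom counts in the usage lines are hypothetical, chosen only for illustration. Only the cohort sizes (2,317 and 74,850) come from the abstract.

```python
def fold_amplification(pos_with_symptom, pos_total, neg_with_symptom, neg_total):
    """Ratio of symptom prevalence in the positive cohort to the negative cohort.

    Assumption: "fold amplification" means (share of COVIDpos patients with the
    symptom) / (share of COVIDneg patients with the symptom).
    """
    if pos_total <= 0 or neg_total <= 0:
        raise ValueError("cohort sizes must be positive")
    if neg_with_symptom == 0:
        raise ValueError("ratio is undefined when no negative patient has the symptom")
    pos_rate = pos_with_symptom / pos_total
    neg_rate = neg_with_symptom / neg_total
    return pos_rate / neg_rate


# Cohort sizes as reported in the abstract.
COVID_POS_TOTAL = 2_317
COVID_NEG_TOTAL = 74_850

# Hypothetical symptom counts (NOT study data), used only to show the call.
ratio = fold_amplification(50, COVID_POS_TOTAL, 800, COVID_NEG_TOTAL)
print(f"{ratio:.1f}-fold")
```

Because the negative cohort is roughly 32 times larger than the positive one, comparing raw counts would be misleading; dividing each count by its cohort size first is what makes the two groups comparable.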

Competing Interest Statement

The authors are all employees of nference or the Mayo Clinic. The authors from nference have financial interests in the company. One or more of the investigators associated with this project and Mayo Clinic have a Financial Conflict of Interest in technology used in the research, and the investigator(s) and Mayo Clinic may stand to gain financially from the successful outcome of the research. This research has been reviewed by the Mayo Clinic Conflict of Interest Review Board and is being conducted in compliance with Mayo Clinic Conflict of Interest policies. ADB is a consultant for AbbVie, is on scientific advisory boards for nference and Zentalis, and is founder and President of Splissen Therapeutics.

Funding Statement

ADB is supported by Grants AI 110173 and AI120698 from NIAID, 109593-62-RGRL from amfAR, and the HH Sheikh Khalifa Bin Zayed Al-Nahyan named professorship from Mayo Clinic.

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The Mayo Clinic EHR dataset on which augmented curation was conducted was accessed under IRB 20-003278, "Study of COVID-19 patient characteristics with augmented curation of Electronic Health Records (EHR) to inform strategic and operational decisions". The EHR data cannot be shared or released due to HIPAA regulations. Contact the corresponding authors for additional details regarding the IRB, and refer to the Mayo Clinic IRB website for further details on our commitment to patient privacy (https://www.mayo.edu/research/institutional-review-board/overview). The summary statistics derived from the EHRs are enclosed within the manuscript.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted June 11, 2020.
Subject Area

  • Infectious Diseases (except HIV/AIDS)