PT - JOURNAL ARTICLE AU - Juan Zhao AU - Monika E Grabowska AU - Vern Eric Kerchberger AU - Joshua C. Smith AU - H. Nur Eken AU - QiPing Feng AU - Josh F. Peterson AU - S. Trent Rosenbloom AU - Kevin B. Johnson AU - Wei-Qi Wei TI - ConceptWAS: a high-throughput method for early identification of COVID-19 presenting symptoms AID - 10.1101/2020.11.06.20227165 DP - 2020 Jan 01 TA - medRxiv PG - 2020.11.06.20227165 4099 - http://medrxiv.org/content/early/2020/11/10/2020.11.06.20227165.short 4100 - http://medrxiv.org/content/early/2020/11/10/2020.11.06.20227165.full AB - Objective Identifying symptoms highly specific to COVID-19 would improve the clinical and public health response to infectious outbreaks. Here, we describe a high-throughput approach – Concept-Wide Association Study (ConceptWAS) that systematically scans a disease’s clinical manifestations from clinical notes. We used this method to identify symptoms specific to COVID-19 early in the course of the pandemic.Methods Using the Vanderbilt University Medical Center (VUMC) EHR, we parsed clinical notes through a natural language processing pipeline to extract clinical concepts. We examined the difference in concepts derived from the notes of COVID-19-positive and COVID-19-negative patients on the PCR testing date. We performed ConceptWAS using the cumulative data every two weeks for early identifying specific COVID-19 symptoms.Results We processed 87,753 notes 19,692 patients (1,483 COVID-19-positive) subjected to COVID-19 PCR testing between March 8, 2020, and May 27, 2020. We found 68 clinical concepts significantly associated with COVID-19. We identified symptoms associated with increasing risk of COVID-19, including “absent sense of smell” (odds ratio [OR] = 4.97, 95% confidence interval [CI] = 3.21–7.50), “fever” (OR = 1.43, 95% CI = 1.28–1.59), “with cough fever” (OR = 2.29, 95% CI = 1.75–2.96), and “ageusia” (OR = 5.18, 95% CI = 3.02–8.58). Using ConceptWAS, we were able to detect loss sense of smell or taste three weeks prior to their inclusion as symptoms of the disease by the Centers for Disease Control and Prevention (CDC).Conclusion ConceptWAS is a high-throughput approach for exploring specific symptoms of a disease like COVID-19, with a promise for enabling EHR-powered early disease manifestations identification.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThe project was supported by the National Institutes of Health (NIH), National Institute of General Medical Studies (P50 GM115305), National Heart, Lung, and Blood Institute (R01 HL133786), National Library of Medicine (T15 LM007450, R01 LM010685), and the American Heart Association (18AMTG34280063), as well as the Vanderbilt Biomedical Informatics Training Program, Vanderbilt Faculty Research Scholar Fund, and the Vanderbilt Medical Scientist Training Program. The datasets used for the analyses described were obtained from Vanderbilt University Medical Center's resources and the Synthetic Derivative, which are supported by institutional funding and by the Vanderbilt National Center for Advancing Translational Science grant (UL1 TR000445) from NCATS/NIH. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:RE: IRB #200731 "EHR Patterns of COVID-19" Dear Wei-qi Wei: A designee of the Institutional Review Board reviewed the Request for Exemption application identified above. It was determined the study poses minimal risk to participants. This study meets 45 CFR 46.104 (d) category (4) for Exempt Review. Any changes to this proposal that may alter its exempt status should be presented to the IRB for approval prior to implementation of the changes. DATE OF IRB APPROVAL: 5/21/2020 Sincerely, Erin Leigh Johnson BS, CIP Institutional Review Board Behavioral Sciences Committee Electronic Signature: Erin Leigh Johnson/VUMC/Vanderbilt : (2c8072da56409aeae2a38cff74965eed) Signed On: 05/21/2020 3:24:30 PM CDTAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe study used data from patients represented in the Vanderbilt University Medical Center.The study was approved by the Vanderbilt University Medical Center Institutional Review Board (IRB #200512). The summary statistics derived from the EHRs are enclosed within the manuscript.