RT Journal Article SR Electronic T1 How closely is COVID-19 related to HCoV, SARS, and MERS? : Clinical comparison of coronavirus infections and identification of risk factors influencing the COVID-19 severity using common data model (CDM) JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.11.23.20237487 DO 10.1101/2020.11.23.20237487 A1 Yeon Hee Kim A1 YeHee Ko A1 Soo Young Kim A1 Kwangsoo Kim YR 2020 UL http://medrxiv.org/content/early/2020/11/24/2020.11.23.20237487.abstract AB South Korea was one of the epicenters for both the 2015 MERS and 2019 COVID-19 outbreaks. However, there has been a lack of published literature, especially using the EMR records, that provides a comparative summary of the prognostic factors present in the coronavirus-derived diseases patients. Therefore, in this study, we aimed to compare and evaluate the distinct clinical traits between the patients of different coronaviruses, including the lesser pathogenic HCoV strains, SARS-CoV, MERS-CoV, and SARS-CoV-2. We also conducted observed the risk factors by the COVID severity to investigate the extent of resemblance in clinical features between the disease groups and to identify unique factor that may influence the prognosis of the COVID-19 patients. Here, we utilize the common data model (CDM), which is the database that houses the EMR records transformed into the common format to be used by the multiple institutions. For the comparative analyses between the disease groups, we used independent t-test, Scheffe post-hoc test, and Games-howell post-hoc test and for the continuous variables, chi-square test and Fisher’s exact test. Based on the analyses, we selected the variables with p-values less than 0.05 to predict COVID-19 severity by nominal logistic regression with adjustments to age and gender. From the study, we observed diabetes, cardio and cerebrovascular diseases, cancer, pulmonary disease, gastrointestinal disease, and renal disease in all patient groups. Of all, the proportions of cancer patients were highest in all groups with no statistical significance. Most interestingly, we observed a high degree of clinical similarity between the COVID-19 and SARS patients with more than 50% of measured clinical variables to show statistical similarities between two groups. Our research reflects the great significance within the bioinformatics field that we were able to effectively utilize the integrated CDM to reflect real-world challenges in the context of coronavirus. We expect the results from our study to provide clinical insights that can serve as predicator of risk factors from the future coronavirus outbreak as well as the prospective guidelines for the clinical treatments.Competing Interest StatementThe authors have declared no competing interest.Funding StatementYeon Hee Kim, YeHee Ko, Soo Young Kim, and Kwangsoo Kim are funded by Seoul National University HospitalAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:IRB No. E-2004-001-1113All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data that support the findings of this study are available on request from the corresponding author, Yeon Hee Kim. The data are not publicly available as they contain information that could compromise the privacy of research participants.