PT - JOURNAL ARTICLE AU - Nakamichi, Kenji AU - Shen, Jolie Z. AU - Lee, Cecilia S. AU - Lee, Aaron AU - Roberts, Emma A. AU - Simonson, Paul D. AU - Roychoudhury, Pavitra AU - Andriesen, Jessica AU - Randhawa, April K. AU - Mathias, Patrick C. AU - Greninger, Alex L. AU - Jerome, Keith R. AU - Van Gelder, Russell N. TI - Outcomes associated with SARS-CoV-2 viral clades in COVID-19 AID - 10.1101/2020.09.24.20201228 DP - 2020 Jan 01 TA - medRxiv PG - 2020.09.24.20201228 4099 - http://medrxiv.org/content/early/2020/09/25/2020.09.24.20201228.short 4100 - http://medrxiv.org/content/early/2020/09/25/2020.09.24.20201228.full AB - Background The COVID-19 epidemic of 2019-20 is due to the novel coronavirus SARS-CoV-2. Following first case description in December, 2019 this virus has infected over 10 million individuals and resulted in at least 500,000 deaths world-wide. The virus is undergoing rapid mutation, with two major clades of sequence variants emerging. This study sought to determine whether SARS-CoV-2 sequence variants are associated with differing outcomes among COVID-19 patients in a single medical system.Methods Whole genome SARS-CoV-2 RNA sequence was obtained from isolates collected from patients registered in the University of Washington Medicine health system between March 1 and April 15, 2020. Demographic and baseline medical data along with outcomes of hospitalization and death were collected. Statistical and machine learning models were applied to determine if viral genetic variants were associated with specific outcomes of hospitalization or death.Findings Full length SARS-CoV-2 sequence was obtained 190 subjects with clinical outcome data. 35 (18.4%) were hospitalized and 14 (7.4%) died from complications of infection. A total of 289 single nucleotide variants were identified. Clustering methods demonstrated two major viral clades, which could be readily distinguished by 12 polymorphisms in 5 genes. A trend toward higher rates of hospitalization of patients with Clade 2 was observed (p=0.06). Machine learning models utilizing patient demographics and co-morbidities achieved area-under-the-curve (AUC) values of 0.93 for predicting hospitalization. Addition of viral clade or sequence information did not significantly improve models for outcome prediction.Conclusion SARS-CoV-2 shows substantial sequence diversity in a community-based sample. Two dominant clades of virus are in circulation. Among patients sufficiently ill to warrant testing for virus, no significant difference in outcomes of hospitalization or death could be discerned between clades in this sample. Major risk factors for hospitalization and death for either major clade of virus include patient age and comorbid conditions.Funding Supported by NIH P30EY001730, the Mark J. Daily, MD Research Fund (RVG), the Alida and Christopher Latham Research Fund (RVG, AYL, CSL), NIH K23EY029246 (AYL), US Food and Drug Administration (QYL)Competing Interest StatementThe authors have declared no competing interest.Funding StatementSupported by NIH P30EY001730, the Mark J. Daily, MD Research Fund (RVG), the Alida and Christopher Latham Research Fund (RVG, AYL, CSL), NIH K23EY029246 (AYL), US Food and Drug Administration (QYL)Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Institutional Review Board (IRB) approval for this study was obtained from the University of Washington, and all research was conducted in compliance with the Declaration of Helsinki. This study was exempted by the IRB from informed consent requirement as a retrospective chart review study. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll sequence data generated in this study has been uploaded to public databases. All computer programs used in analysis are available on request to the corresponding author.