TY - JOUR T1 - Updated SARS-CoV-2 Single Nucleotide Variants and Mortality Association JF - medRxiv DO - 10.1101/2021.01.29.21250757 SP - 2021.01.29.21250757 AU - Shuyi Fang AU - Sheng Liu AU - Jikui Shen AU - Alex Z Lu AU - Yucheng Zhang AU - Kailing Li AU - Juli Liu AU - Lei Yang AU - Chang-Deng Hu AU - Jun Wan Y1 - 2021/01/01 UR - http://medrxiv.org/content/early/2021/02/01/2021.01.29.21250757.abstract N2 - Since its outbreak in December 2019, COVID-19 has caused 100,5844,555 cases and 2,167,313 deaths as of Jan 27, 2021. Comparing our previous study of SARS-CoV-2 single nucleotide variants (SNVs) before June 2020, we found out that the SNV clustering had changed considerably since June 2020. Apart from that the group SNVs represented by two non-synonymous mutations A23403G (S: D614G) and C14408T (ORF1ab: P4715L) became dominant and carried by over 95% genomes, a few emerging groups of SNVs were recognized with sharply increased monthly occurrence ratios up to 70% in November 2020. Further investigation revealed that several SNVs were strongly associated with the mortality, but they presented distinct distribution in specific countries, e.g., Brazil, USA, Saudi Arabia, India, and Italy. SNVs including G25088T, T25A, G29861T and G29864A were adopted in a regularized logistic regression model to predict the mortality status in Brazil with the AUC of 0.84. Protein structure analysis showed that the emerging subgroups of non-synonymous SNVs and those mortality-related ones in Brazil were located on protein surface area. The clashes in protein structure introduced by these mutations might in turn affect virus pathogenesis through conformation changes, leading to the difference in transmission and virulence. Particularly, we found that SNVs tended to occur in intrinsic disordered regions (IDRs) of Spike (S) and ORF1ab, suggesting a critical role of SNVs in protein IDRs to determine protein folding and immune evasion.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was partially supported by the National Institutes of Health (P30CA082709 to J.W.), the Walther Cancer Foundation, and Center for Computational Biology and Bioinformatics (CCBB) at Indiana University School of Medicine (Pilot grant to L.Y. and J.W.). Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:NO IRB for this study.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).Yes I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesYes. The data/results will be available upon requests. ER -