RT Journal Article SR Electronic T1 Common, low-frequency, rare, and ultra-rare coding variants contribute to COVID-19 severity JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.09.03.21262611 DO 10.1101/2021.09.03.21262611 A1 Chiara Fallerini A1 Nicola Picchiotti A1 Margherita Baldassarri A1 Kristina Zguro A1 Sergio Daga A1 Francesca Fava A1 Elisa Benetti A1 Sara Amitrano A1 Mirella Bruttini A1 Maria Palmieri A1 Susanna Croci A1 Mirjam Lista A1 Giada Beligni A1 Floriana Valentino A1 Ilaria Meloni A1 Marco Tanfoni A1 Francesca Colombo A1 Enrico Cabri A1 Maddalena Fratelli A1 Chiara Gabbi A1 Stefania Mantovani A1 Elisa Frullanti A1 Marco Gori A1 Francis P. Crawley A1 Guillaume Butler-Laporte A1 Brent Richards A1 Hugo Zeberg A1 Miklos Lipcsey A1 Michael Hultstrom A1 Kerstin U. Ludwig A1 Eva C. Schulte A1 Erola Pairo-Castineira A1 John Kenneth Baillie A1 Axel Schmidt A1 Robert Frithiof A1 WES/WGS working group within the HGI A1 GenOMICC Consortium A1 GEN-COVID Multicenter Study A1 Francesca Mari A1 Alessandra Renieri A1 Simone Furini YR 2021 UL http://medrxiv.org/content/early/2021/09/14/2021.09.03.21262611.abstract AB The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole exome sequencing data of about 4,000 SARS-CoV-2-positive individuals were used to define an interpretable machine learning model for predicting COVID-19 severity. Firstly, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthly, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.Competing Interest StatementThe authors have declared no competing interest.Funding StatementMIUR project Dipartimenti di Eccellenza 2018-2020 to Department of Medical Biotechnologies University of Siena, Italy (Italian D.L. n.18 March 17, 2020). Private donors for COVID-19 research. Bando Ricerca COVID-19 Toscana project to Azienda Ospedaliero-Universitaria Senese. Charity fund 2020 from Intesa San Paolo dedicated to the project N. B/2020/0119 Identificazione delle basi genetiche determinanti la variabilita clinica della risposta a COVID-19 nella popolazione italiana. The Italian Ministry of University and Research for funding within the Bando FISR 2020 in COVID-19 and the Istituto Buddista Italiano Soka Gakkai for funding the project PAT-COVID: Host genetics and pathogenetic mechanisms of COVID-19 (ID n. 2020-2016_RIC_3). GenOMICC was funded by Sepsis Research (the Fiona Elizabeth Agnew Trust), the Intensive Care Society, a Wellcome-Beit Prize award to J. K. Baillie (Wellcome Trust 103258/Z/13/A) and a BBSRC Institute Program Support Grant to the Roslin Institute (BBS/E/D/20002172, BBS/E/D/10002070 and BBS/E/D/30002275). The Richards research group is supported by the Canadian Institutes of Health Research (CIHR: 365825; 409511, 100558), the McGill Interdisciplinary Initiative in Infection and Immunity (MI4), the Lady Davis Institute of the Jewish General Hospital, the Jewish General Hospital Foundation, the Canadian Foundation for Innovation, the NIH Foundation, Cancer Research UK, Genome Quebec, the Public Health Agency of Canada, McGill University, Cancer Research UK [grant number C18281/A29019] and the Fonds de Recherche Quebec Sante (FRQS). JBR is supported by a FRQS Merite Clinical Research Scholarship. Support from Calcul Quebec and Compute Canada is acknowledged. TwinsUK is funded by the Welcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy s and St Thomas NHS Foundation Trust in partnership with King s College London. These funding agencies had no role in the design, implementation or interpretation of this study. JBR has served as an advisor to GlaxoSmithKline and Deerfield Capital. His institution has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline and Biogen for projects unrelated to this research. He is the founder of 5 Prime Sciences. SweCovid received funding from the European Union s Horizon 2020 research and innovation program under grant agreement No 824110, the SciLifeLab/KAW national Covid-19 research program project grants to MH (KAW 2020.0182 and KAW 2020.0241) and the Swedish Research Council grants to RF (2014-02569 and 2014-07606). Sequencing was performed by the SNP&SEQ Technology Platform in Uppsala. The facility is part of the National Genomics Infrastructure (NGI) Sweden and Science for Life Laboratory. The SNP&SEQ Platform is supported by the Swedish Research Council and the Knut and Alice Wallenberg Foundation.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was conducted according to the guidelines of the Declaration of Helsinki. The GEN-COVID is a multicentre academic observational study that was approved by the Internal Review Boards (IRB) of each participating centre (protocol code 16917, dated March 16, 2020 for GEN-COVID at the University Hospital of Siena). BQC19 received ethical approval from the Jewish General Hospital (JGH) research ethics board (2020-2137). The Swedish part of the study was performed at the general intensive care unit (ICU) of Uppsala University Hospital, Sweden (a tertiary hospital) and approved by the National Ethical Review Agency (No. 2020-01623). The Declaration of Helsinki and its subsequent revisions were followed. DeCOI received ethical approval by the Ethical Review Board (ERB) of the participating hospitals/centres (Technical University Munich, Munich, Germany; Medical Faculty Bonn, Bonn, Germany; Medical Board of the Saarland, Germany; University Duisburg-Essen, Germany; Medical Faculty Duesseldorf, Duesseldorf, Germany). GenOMICC and ISARIC4C were approved by the appropriate research ethics committees (Scotland, 15/SS/0110; England, Wales and Northern Ireland, 19/WM/0247).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data and samples referenced here are housed in the GEN-COVID Patient Registry and the GEN-COVID Biobank and are available for consultation. You may contact the corresponding author, Prof. Alessandra Renieri (e-mail: alessandra.renieri{at}unisi.it).