PT - JOURNAL ARTICLE AU - Daniel J. Weiner AU - Emi Ling AU - Serkan Erdin AU - Derek J.C. Tai AU - Rachita Yadav AU - Jakob Grove AU - Jack M. Fu AU - Ajay Nadig AU - Caitlin E. Carey AU - Nikolas Baya AU - Jonas Bybjerg-Grauholm AU - iPSYCH Consortium AU - ASD Working Group of the Psychiatric Genomics Consortium AU - ADHD Working Group of the Psychiatric Genomics Consortium AU - Sabina Berretta AU - Evan Z. Macosko AU - Jonathan Sebat AU - Luke J. O’Connor AU - David M. Hougaard AU - Anders D. Børglum AU - Michael E. Talkowski AU - Steve A. McCarroll AU - Elise B. Robinson TI - Statistical and functional convergence of common and rare variant risk for autism spectrum disorders at chromosome 16p AID - 10.1101/2022.03.23.22272826 DP - 2022 Jan 01 TA - medRxiv PG - 2022.03.23.22272826 4099 - http://medrxiv.org/content/early/2022/03/23/2022.03.23.22272826.short 4100 - http://medrxiv.org/content/early/2022/03/23/2022.03.23.22272826.full AB - The dominant human genetics paradigm for converting association to mechanism (“variant-to-function”) involves iteratively mapping individual associations to specific SNPs and to the proximal genes through which they act. In contrast, here we demonstrate the feasibility of extracting biological insight from a very large (>10Mb) region of the genome, and leverage this approach to derive insight into autism spectrum disorder (ASD). Using a novel statistical framework applied in an unbiased scan of the genome, we identified the 33Mb p-arm of chromosome 16 (16p) as harboring the greatest excess of common polygenic risk for ASD. This region includes the recurrent 16p11.2 copy number variant (CNV) – one of the largest single genetic risk factors for ASD, and whose pathogenic mechanisms are undefined. Analysis of bulk and single-cell RNA-sequencing data from post-mortem human brain samples revealed that common polygenic risk for ASD within 16p associated with decreased average expression of genes throughout this 33-Mb region. Similarly, analysis of isogenic neuronal cell lines with CRISPR/Cas9-mediated deletion of 16p11.2 revealed that the deletion also associated with depressed average gene expression across 16p. The effects of the rare deletion and diffuse common variation were correlated at the level of individual genes. Finally, analysis of chromatin contact patterns by Hi-C revealed patterns which may explain this transcriptional convergence, including elevated contact throughout 16p, and between 16p11.2 and a distal region on 16p (Mb 0-5.2) which showed the greatest gene expression changes in both the common and rare variant analyses. These results demonstrate that elevated 3D chromatin contact may coordinate genetic and transcriptional disease liability across large genomic regions, exemplifying a novel approach for extracting biological insight from genetic association data. As applied to ASD, our analyses highlight the 33Mb p-arm of chromosome 16 as a novel locus for ASD liability and provide insight into disease liability originating from the 16p11.2 CNV.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was generously supported by the Simons Foundation Autism Research Initiative (704413 to Elise Robinson and Luke O'Connor), the National Institute of Mental Health (F30MH129009 to Daniel Weiner, R01MH111813 to Elise Robinson, U01MH119689 to Elise Robinson, R01MH115957 to Michael Talkowski, 1R01MH124851-01 to Anders D. Borglum), the National Institute of General Medical Sciences (T32GM007753 to Ajay Nadig), the National Institute of Child Health and Development (R01HD096326 to Michael Talkowski), the National Institute of Neurological Disorders and Stroke (R01NS093200 to Michael Talkowski), the National Library of Medicine (T15LM007092 to Daniel Weiner), the Stanley Center for Psychiatric Research (to Steve McCarroll), the Lundbeck Foundation (R102-A9118, R155-2014-1724, and R248-2017-2003 to iPSYCH), the Novo Nordisk Foundation (to the Danish National Biobank) and the Universities and University Hospitals of Aarhus and Copenhagen. High-performance computer capacity for handling and statistical analysis of iPSYCH data on the GenomeDK HPC facility was provided by the Center for Genomics and Personalized Medicine and the Centre for Integrative Sequencing, iSEQ, Aarhus University, Denmark in a grant to Anders D. Borglum.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was reviewed and approved by Partners Human Research of Partners HealthCare. The study name is Molecular Study of Cognitive and Behavioral Variation (IRB: 2015P002376). The Principal Investigator is Elise Robinson.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData is publicly available, or will be made available from principal investigators upon publication of the manuscript.