Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A cross-disorder dosage sensitivity map of the human genome

View ORCID ProfileRyan L. Collins, Joseph T. Glessner, Eleonora Porcu, Lisa-Marie Niestroj, Jacob Ulirsch, Georgios Kellaris, Daniel P. Howrigan, Selin Everett, Kiana Mohajeri, Xander Nuttle, Chelsea Lowther, Jack Fu, Philip M. Boone, Farid Ullah, Kaitlin E. Samocha, Konrad Karczewski, Diane Lucente, Epi25 Consortium, James F. Gusella, Hilary Finucane, Ludmilla Matyakhina, Swaroop Aradhya, Jeanne Meck, Dennis Lal, Benjamin M. Neale, Jennelle C. Hodge, Alexandre Reymond, Zoltan Kutalik, Nicholas Katsanis, Erica E. Davis, Hakon Hakonarson, Shamil Sunyaev, Harrison Brand, Michael E. Talkowski
doi: https://doi.org/10.1101/2021.01.26.21250098
Ryan L. Collins
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
3Division of Medical Sciences, Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ryan L. Collins
Joseph T. Glessner
4Department of Pediatrics, Children’s Hospital of Philadelphia
5Department of Pediatrics, Division of Human Genetics, Perelman School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eleonora Porcu
6Center for Integrative Genomics, University of Lausanne
7Swiss Institute of Bioinformatics
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lisa-Marie Niestroj
8Cologne Center for Genomics, University of Cologne
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jacob Ulirsch
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
3Division of Medical Sciences, Harvard Medical School
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Georgios Kellaris
10Advanced Center for Translational and Genetic Medicine, Stanley Manne Children’s Research Institute, Lurie Children’s Hospital
11Departments of Pediatrics and Cellular and Molecular Biology, Northwestern University School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel P. Howrigan
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
12Stanley Center for Psychiatric Research, Broad Institute of M.I.T. and Harvard
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Selin Everett
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kiana Mohajeri
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
3Division of Medical Sciences, Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xander Nuttle
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chelsea Lowther
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jack Fu
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Philip M. Boone
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
14Division of Genetics and Genomics, Boston Children’s Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Farid Ullah
10Advanced Center for Translational and Genetic Medicine, Stanley Manne Children’s Research Institute, Lurie Children’s Hospital
11Departments of Pediatrics and Cellular and Molecular Biology, Northwestern University School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kaitlin E. Samocha
15Human Genetics Programme, Wellcome Sanger Institute, Wellcome Genome Campus
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Konrad Karczewski
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Diane Lucente
1Center for Genomic Medicine, Massachusetts General Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James F. Gusella
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
16Department of Genetics, Blavatnik Institute, Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hilary Finucane
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ludmilla Matyakhina
17GeneDx, Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Swaroop Aradhya
17GeneDx, Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeanne Meck
17GeneDx, Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dennis Lal
8Cologne Center for Genomics, University of Cologne
12Stanley Center for Psychiatric Research, Broad Institute of M.I.T. and Harvard
18Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic
19Epilepsy Center, Neurological Institute, Cleveland Clinic
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Benjamin M. Neale
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
12Stanley Center for Psychiatric Research, Broad Institute of M.I.T. and Harvard
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jennelle C. Hodge
20Department of Medical and Molecular Genetics, Indiana University School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandre Reymond
6Center for Integrative Genomics, University of Lausanne
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zoltan Kutalik
7Swiss Institute of Bioinformatics
21Center for Primary Care and Public Health, University of Lausanne
22Department of Computational Biology, University of Lausanne
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicholas Katsanis
10Advanced Center for Translational and Genetic Medicine, Stanley Manne Children’s Research Institute, Lurie Children’s Hospital
11Departments of Pediatrics and Cellular and Molecular Biology, Northwestern University School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Erica E. Davis
10Advanced Center for Translational and Genetic Medicine, Stanley Manne Children’s Research Institute, Lurie Children’s Hospital
11Departments of Pediatrics and Cellular and Molecular Biology, Northwestern University School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hakon Hakonarson
4Department of Pediatrics, Children’s Hospital of Philadelphia
5Department of Pediatrics, Division of Human Genetics, Perelman School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shamil Sunyaev
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
23Division of Genetics, Brigham and Women’s Hospital
24Department of Medicine, Harvard Medical School
25Department of Biomedical Informatics, Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harrison Brand
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael E. Talkowski
1Center for Genomic Medicine, Massachusetts General Hospital
2Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard
9Analytic and Translational Genetics Unit, Massachusetts General Hospital
12Stanley Center for Psychiatric Research, Broad Institute of M.I.T. and Harvard
13Department of Neurology, Massachusetts General Hospital and Harvard Medical School
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: talkowsk@broadinstitute.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

SUMMARY

Rare deletions and duplications of genomic segments, collectively known as rare copy number variants (rCNVs), contribute to a broad spectrum of human diseases. To date, most disease-association studies of rCNVs have focused on recognized genomic disorders or on the impact of haploinsufficiency caused by deletions. By comparison, our understanding of duplications in disease remains rudimentary as very few individual genes are known to be triplosensitive (i.e., duplication intolerant). In this study, we meta-analyzed rCNVs from 753,994 individuals across 30 primarily neurological disease phenotypes to create a genome-wide catalog of rCNV association statistics across disorders. We discovered 114 rCNV-disease associations at 52 distinct loci surpassing genome-wide significance (P=3.72×10−6), 42% of which involve duplications. Using Bayesian fine-mapping methods, we further prioritized 38 novel triplosensitive disease genes (e.g., GMEB2 in brain abnormalities), including three known haploinsufficient genes that we now reveal as bidirectionally dosage sensitive (e.g., ANKRD11 in growth abnormalities). By integrating our results with prior literature, we found that disease-associated rCNV segments were enriched for genes constrained against damaging coding variation and identified likely dominant driver genes for about one-third (32%) of rCNV segments based on de novo mutations from exome sequencing studies of developmental disorders. However, while the presence of constrained driver genes was a common feature of many pathogenic large rCNVs across disorders, most of the rCNVs showing genome-wide significant association were incompletely penetrant (mean odds ratio=11.6) and we also identified two examples of noncoding disease-associated rCNVs (e.g., intronic CADM2 deletions in behavioral disorders). Finally, we developed a statistical model to predict dosage sensitivity for all genes, which defined 3,006 haploinsufficient and 295 triplosensitive genes where the effect sizes of rCNVs were comparable to deletions of genes constrained against truncating mutations. These dosage sensitivity scores classified disease genes across molecular mechanisms, prioritized pathogenic de novo rCNVs in children with autism, and revealed features that distinguished haploinsufficient and triplosensitive genes, such as insulation from other genes and local cis-regulatory complexity. Collectively, the cross-disorder rCNV maps and metrics derived in this study provide the most comprehensive assessment of dosage sensitive genomic segments and genes in disease to date and set the foundation for future studies of dosage sensitivity throughout the human genome.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

These studies were supported by the National Institutes of Health grants HD081256, NS093200, HD096326, and MH106826. R.L.C. was supported by NHGRI T32HG002295 and NSF GRFP #2017240332. H.B. was supported by NIDCR K99DE026824. This work was supported by grants from the Swiss National Science Foundation (31003A_182632 to A.R. and 310030-189147, 32473B-166450 to Z.K.). M.E.T. was supported by Desmond and Ann Heathwood.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study was approved by the Partners Healthcare Institutional Review Board Protocol #2013P000323. Data from the UK BioBank was accessed via application #50765 (PI: Talkowski), and data from the Simons Foundation for Autism Research Initiative was accessed via SFARIbase application #573206

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵† Invitae Corp.

Data Availability

Code Availability: All code used in this study has been provided in a single repository on GitHub (https://github.com/talkowski-lab/rCNV2). Where applicable, scripts have been provided with documentation and help text. We also have provided a Docker image hosted on DockerHub (https://hub.docker.com/r/talkowski/rcnv) and Google Container Registry (https://gcr.io/gnomad-wgs-v2-sv/rcnv) that contains all dependencies necessary to execute the code identically as presented in this study. Data Availability: Most data generated in this study, including summary statistics from association tests, have been provided as Supplemental Tables or Supplemental Files. Large Supplemental Data Files have been temporarily hosted in a public Google Cloud Storage Bucket until formal publication in a peer-reviewed journal, as described in the Supplemental Information. Data from existing publications or public resources can be accessed according to their original source, as described in the corresponding Methods section detailing their curation. All other data not otherwise described here or in the Methods will be made available upon request.

https://storage.googleapis.com/rcnv_project/public/collins_medrxiv_2021/sliding_window_sumstats.tar.gz

https://storage.googleapis.com/rcnv_project/public/collins_medrxiv_2021/gene_based_sumstats.tar.gz

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted January 28, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A cross-disorder dosage sensitivity map of the human genome
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A cross-disorder dosage sensitivity map of the human genome
Ryan L. Collins, Joseph T. Glessner, Eleonora Porcu, Lisa-Marie Niestroj, Jacob Ulirsch, Georgios Kellaris, Daniel P. Howrigan, Selin Everett, Kiana Mohajeri, Xander Nuttle, Chelsea Lowther, Jack Fu, Philip M. Boone, Farid Ullah, Kaitlin E. Samocha, Konrad Karczewski, Diane Lucente, Epi25 Consortium, James F. Gusella, Hilary Finucane, Ludmilla Matyakhina, Swaroop Aradhya, Jeanne Meck, Dennis Lal, Benjamin M. Neale, Jennelle C. Hodge, Alexandre Reymond, Zoltan Kutalik, Nicholas Katsanis, Erica E. Davis, Hakon Hakonarson, Shamil Sunyaev, Harrison Brand, Michael E. Talkowski
medRxiv 2021.01.26.21250098; doi: https://doi.org/10.1101/2021.01.26.21250098
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A cross-disorder dosage sensitivity map of the human genome
Ryan L. Collins, Joseph T. Glessner, Eleonora Porcu, Lisa-Marie Niestroj, Jacob Ulirsch, Georgios Kellaris, Daniel P. Howrigan, Selin Everett, Kiana Mohajeri, Xander Nuttle, Chelsea Lowther, Jack Fu, Philip M. Boone, Farid Ullah, Kaitlin E. Samocha, Konrad Karczewski, Diane Lucente, Epi25 Consortium, James F. Gusella, Hilary Finucane, Ludmilla Matyakhina, Swaroop Aradhya, Jeanne Meck, Dennis Lal, Benjamin M. Neale, Jennelle C. Hodge, Alexandre Reymond, Zoltan Kutalik, Nicholas Katsanis, Erica E. Davis, Hakon Hakonarson, Shamil Sunyaev, Harrison Brand, Michael E. Talkowski
medRxiv 2021.01.26.21250098; doi: https://doi.org/10.1101/2021.01.26.21250098

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (239)
  • Allergy and Immunology (521)
  • Anesthesia (124)
  • Cardiovascular Medicine (1418)
  • Dentistry and Oral Medicine (217)
  • Dermatology (158)
  • Emergency Medicine (291)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (582)
  • Epidemiology (10288)
  • Forensic Medicine (6)
  • Gastroenterology (527)
  • Genetic and Genomic Medicine (2625)
  • Geriatric Medicine (254)
  • Health Economics (496)
  • Health Informatics (1729)
  • Health Policy (789)
  • Health Systems and Quality Improvement (671)
  • Hematology (266)
  • HIV/AIDS (564)
  • Infectious Diseases (except HIV/AIDS) (12083)
  • Intensive Care and Critical Care Medicine (648)
  • Medical Education (273)
  • Medical Ethics (83)
  • Nephrology (288)
  • Neurology (2456)
  • Nursing (144)
  • Nutrition (377)
  • Obstetrics and Gynecology (491)
  • Occupational and Environmental Health (566)
  • Oncology (1320)
  • Ophthalmology (400)
  • Orthopedics (146)
  • Otolaryngology (235)
  • Pain Medicine (168)
  • Palliative Medicine (51)
  • Pathology (342)
  • Pediatrics (778)
  • Pharmacology and Therapeutics (329)
  • Primary Care Research (296)
  • Psychiatry and Clinical Psychology (2395)
  • Public and Global Health (4999)
  • Radiology and Imaging (893)
  • Rehabilitation Medicine and Physical Therapy (525)
  • Respiratory Medicine (681)
  • Rheumatology (309)
  • Sexual and Reproductive Health (255)
  • Sports Medicine (244)
  • Surgery (297)
  • Toxicology (45)
  • Transplantation (140)
  • Urology (108)