Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Genetic constraint at single amino acid resolution improves missense variant prioritisation and gene discovery

View ORCID ProfileXiaolei Zhang, View ORCID ProfilePantazis I. Theotokis, Nicholas Li, the SHaRe Investigators, View ORCID ProfileCaroline F. Wright, View ORCID ProfileKaitlin E. Samocha, View ORCID ProfileNicola Whiffin, View ORCID ProfileJames S. Ware
doi: https://doi.org/10.1101/2022.02.16.22271023
Xiaolei Zhang
1National Heart & Lung Institute, Imperial College London, London W12 0NN, United Kingdom
2MRC London Institute of Medical Sciences, Imperial College London, London W12 0NN, United Kingdom
3Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London SW3 6NP, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Xiaolei Zhang
Pantazis I. Theotokis
1National Heart & Lung Institute, Imperial College London, London W12 0NN, United Kingdom
2MRC London Institute of Medical Sciences, Imperial College London, London W12 0NN, United Kingdom
3Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London SW3 6NP, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Pantazis I. Theotokis
Nicholas Li
1National Heart & Lung Institute, Imperial College London, London W12 0NN, United Kingdom
2MRC London Institute of Medical Sciences, Imperial College London, London W12 0NN, United Kingdom
3Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London SW3 6NP, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Caroline F. Wright
4University of Exeter Medical School, Institute of Biomedical and Clinical Science, Royal Devon and Exeter Hospital, Exeter EX2 5DW, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Caroline F. Wright
Kaitlin E. Samocha
5Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
6Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kaitlin E. Samocha
Nicola Whiffin
7Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nicola Whiffin
  • For correspondence: nwhiffin@well.ox.ac.uk j.ware@imperial.ac.uk
James S. Ware
1National Heart & Lung Institute, Imperial College London, London W12 0NN, United Kingdom
2MRC London Institute of Medical Sciences, Imperial College London, London W12 0NN, United Kingdom
3Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London SW3 6NP, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for James S. Ware
  • For correspondence: nwhiffin@well.ox.ac.uk j.ware@imperial.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

The clinical impact of most germline missense variants in humans remains unknown. Genetic constraint identifies genomic regions under negative selection, where variations likely have functional impacts, but the spatial resolution of existing constraint metrics is limited. Here we present the Homologous Missense Constraint (HMC) score, which measures genetic constraint at quasi single amino-acid resolution by aggregating signals across protein homologues. We identify one million possible missense variants under strong negative selection. HMC precisely distinguishes pathogenic variants from benign variants for both early-onset and adult-onset disorders. It outperforms existing constraint metrics and pathogenicity meta-predictors in prioritising de novo mutations from probands with developmental disorders (DD), and is orthogonal to these, adding power when used in combination. We demonstrate utility for gene discovery by identifying seven genes newly-significant associated with DD that could act through an altered-function mechanism. Overall, HMC is a novel and strong predictor to improve missense variant interpretation.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by Medical Research Council (UK), British Heart Foundation [RE/18/4/34215], the NIHR Imperial College Biomedical Research Centre, and the Wellcome Trust [107469/Z/15/Z, 200990/A/16/Z]. NW is currently supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 220134/Z/20/Z) and funding from the Rosetrees Trust.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study involves only openly available human data, which can be obtained from: gnomAD, ClinVar, code repository (https://github.com/ImperialCardioGenetics/homologous-missense-constraint) and references indicated in the manuscript.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data produced are available online at www.cardiodb.org/hmc.

https://www.cardiodb.org/hmc

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted February 21, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Genetic constraint at single amino acid resolution improves missense variant prioritisation and gene discovery
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Genetic constraint at single amino acid resolution improves missense variant prioritisation and gene discovery
Xiaolei Zhang, Pantazis I. Theotokis, Nicholas Li, the SHaRe Investigators, Caroline F. Wright, Kaitlin E. Samocha, Nicola Whiffin, James S. Ware
medRxiv 2022.02.16.22271023; doi: https://doi.org/10.1101/2022.02.16.22271023
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Genetic constraint at single amino acid resolution improves missense variant prioritisation and gene discovery
Xiaolei Zhang, Pantazis I. Theotokis, Nicholas Li, the SHaRe Investigators, Caroline F. Wright, Kaitlin E. Samocha, Nicola Whiffin, James S. Ware
medRxiv 2022.02.16.22271023; doi: https://doi.org/10.1101/2022.02.16.22271023

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (230)
  • Allergy and Immunology (507)
  • Anesthesia (111)
  • Cardiovascular Medicine (1264)
  • Dentistry and Oral Medicine (207)
  • Dermatology (148)
  • Emergency Medicine (283)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (538)
  • Epidemiology (10056)
  • Forensic Medicine (5)
  • Gastroenterology (502)
  • Genetic and Genomic Medicine (2486)
  • Geriatric Medicine (240)
  • Health Economics (482)
  • Health Informatics (1653)
  • Health Policy (757)
  • Health Systems and Quality Improvement (638)
  • Hematology (250)
  • HIV/AIDS (538)
  • Infectious Diseases (except HIV/AIDS) (11896)
  • Intensive Care and Critical Care Medicine (627)
  • Medical Education (255)
  • Medical Ethics (75)
  • Nephrology (269)
  • Neurology (2304)
  • Nursing (140)
  • Nutrition (354)
  • Obstetrics and Gynecology (458)
  • Occupational and Environmental Health (537)
  • Oncology (1259)
  • Ophthalmology (377)
  • Orthopedics (134)
  • Otolaryngology (226)
  • Pain Medicine (158)
  • Palliative Medicine (50)
  • Pathology (326)
  • Pediatrics (737)
  • Pharmacology and Therapeutics (315)
  • Primary Care Research (282)
  • Psychiatry and Clinical Psychology (2295)
  • Public and Global Health (4850)
  • Radiology and Imaging (846)
  • Rehabilitation Medicine and Physical Therapy (493)
  • Respiratory Medicine (657)
  • Rheumatology (289)
  • Sexual and Reproductive Health (241)
  • Sports Medicine (228)
  • Surgery (273)
  • Toxicology (44)
  • Transplantation (131)
  • Urology (100)