Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources

View ORCID ProfileMarina T. DiStefano, View ORCID ProfileScott Goehringer, View ORCID ProfileLawrence Babb, View ORCID ProfileFowzan S. Alkuraya, Joanna Amberger, Mutaz Amin, View ORCID ProfileChristina Austin-Tse, Marie Balzotti, View ORCID ProfileJonathan S. Berg, View ORCID ProfileEwan Birney, Carol Bocchini, View ORCID ProfileElspeth A. Bruford, View ORCID ProfileAlison J. Coffey, Heather Collins, View ORCID ProfileFiona Cunningham, View ORCID ProfileLouise C. Daugherty, Yaron Einhorn, View ORCID ProfileHelen V. Firth, View ORCID ProfileDavid R. Fitzpatrick, View ORCID ProfileRebecca E. Foulger, Jennifer Goldstein, View ORCID ProfileAda Hamosh, View ORCID ProfileMatthew R. Hurles, Sarah E. Leigh, View ORCID ProfileIvone US. Leong, View ORCID ProfileSateesh Maddirevula, View ORCID ProfileChrista L. Martin, View ORCID ProfileEllen M. McDonagh, Annie Olry, Arina Puzriakova, View ORCID ProfileKelly Radtke, View ORCID ProfileErin M. Ramos, View ORCID ProfileAna Rath, View ORCID ProfileErin Rooney Riggs, View ORCID ProfileAngharad M. Roberts, View ORCID ProfileCharlotte Rodwell, View ORCID ProfileCatherine Snow, View ORCID ProfileZornitza Stark, Jackie Tahiliani, View ORCID ProfileSusan Tweedie, View ORCID ProfileJames S. Ware, Phillip Weller, View ORCID ProfileEleanor Williams, View ORCID ProfileCaroline F. Wright, T Michael. Yates, View ORCID ProfileHeidi L. Rehm
doi: https://doi.org/10.1101/2022.01.03.21268593
Marina T. DiStefano
1Geisinger Health System, Danville, PA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marina T. DiStefano
Scott Goehringer
1Geisinger Health System, Danville, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Scott Goehringer
Lawrence Babb
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lawrence Babb
Fowzan S. Alkuraya
3Department of Translational Genomics, Center for Genomic Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, 11211, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Fowzan S. Alkuraya
Joanna Amberger
4Department of Genetic Medicine, Online Mendelian Inheritance in Man (OMIM), Johns Hopkins University School of Medicine, Baltimore, MD, 21287-4922, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mutaz Amin
5Inserm, US14 - Orphanet, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christina Austin-Tse
6Department of Pathology, Massachusetts General Hospital, Boston, MA, USA
7MGB Laboratory for Molecular Medicine, Cambridge, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christina Austin-Tse
Marie Balzotti
8Myriad Women’s Health, San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan S. Berg
9Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jonathan S. Berg
Ewan Birney
10European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SD, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ewan Birney
Carol Bocchini
4Department of Genetic Medicine, Online Mendelian Inheritance in Man (OMIM), Johns Hopkins University School of Medicine, Baltimore, MD, 21287-4922, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elspeth A. Bruford
11HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SD, UK
12Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, CB2 0PT, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elspeth A. Bruford
Alison J. Coffey
13Illumina Clinical Services Laboratory, Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alison J. Coffey
Heather Collins
14National Library of Medicine, Bethesda, MD, USA
15ICF, 9300 Lee Highway, Fairfax, VA, 22031, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fiona Cunningham
16Genome Interpretation, Genome Assembly and Annotation (GAA), European Molecular Biology Laboratory, European Bioinformatics Institute,, Wellcome Genome Campus, Hinxton,, Cambridge, CB10 1SD, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Fiona Cunningham
Louise C. Daugherty
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
18Healx Ltd., Charter House, 66-68 Hills Rd, Cambridge, CB2 1LA, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Louise C. Daugherty
Yaron Einhorn
19Franklin by Genoox, Palo Alto, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Helen V. Firth
20Department of Genetics, Addenbrooke’s Hospital, Cambridge, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Helen V. Firth
David R. Fitzpatrick
21MRC Human Genetics Unit, MRC IGMM, The University of Edinburgh, Edinburgh, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David R. Fitzpatrick
Rebecca E. Foulger
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
22SciBite Limited, BioData Innovation Centre, Wellcome Genome Campus, Hinxton, CB10 1DR, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rebecca E. Foulger
Jennifer Goldstein
9Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ada Hamosh
4Department of Genetic Medicine, Online Mendelian Inheritance in Man (OMIM), Johns Hopkins University School of Medicine, Baltimore, MD, 21287-4922, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ada Hamosh
Matthew R. Hurles
23Wellcome Sanger Institute, Hinxton, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Matthew R. Hurles
Sarah E. Leigh
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ivone US. Leong
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ivone US. Leong
Sateesh Maddirevula
3Department of Translational Genomics, Center for Genomic Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, 11211, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sateesh Maddirevula
Christa L. Martin
1Geisinger Health System, Danville, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christa L. Martin
Ellen M. McDonagh
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
24Open Targets, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1DR, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ellen M. McDonagh
Annie Olry
5Inserm, US14 - Orphanet, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arina Puzriakova
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kelly Radtke
25AmbryGenetics, Aliso Viejo, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kelly Radtke
Erin M. Ramos
26National Human Genome Research Institute, National Institutes of Health, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Erin M. Ramos
Ana Rath
5Inserm, US14 - Orphanet, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ana Rath
Erin Rooney Riggs
27Autism & Developmental Medicine Institute, Geisinger Health System, Danville, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Erin Rooney Riggs
Angharad M. Roberts
28National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London, UK
29Great Ormond Street Hospital, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Angharad M. Roberts
Charlotte Rodwell
5Inserm, US14 - Orphanet, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Charlotte Rodwell
Catherine Snow
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Catherine Snow
Zornitza Stark
30Australian Genomics, Melbourne, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Zornitza Stark
Jackie Tahiliani
31Invitae, San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Susan Tweedie
11HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SD, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Susan Tweedie
James S. Ware
28National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London, UK
32Royal Brompton & Harefield Hospitals, Guy’s and St. Thomas’ NHS Foundation Trust, London, UK
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for James S. Ware
Phillip Weller
27Autism & Developmental Medicine Institute, Geisinger Health System, Danville, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eleanor Williams
17Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Eleanor Williams
Caroline F. Wright
33Institute of Biomedical and Clinical Science, University of Exeter Medical School, Royal Devon & Exeter Hospital, Exeter, EX2 5DW, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Caroline F. Wright
T Michael. Yates
21MRC Human Genetics Unit, MRC IGMM, The University of Edinburgh, Edinburgh, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heidi L. Rehm
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
34Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Heidi L. Rehm
  • For correspondence: HREHM@mgh.harvard.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

PURPOSE Several groups and resources provide information that pertains to the validity of gene-disease relationships used in genomic medicine and research; however, universal standards and terminologies to define the evidence base for the role of a gene in disease, and a single harmonized resource were lacking. To tackle this issue, the Gene Curation Coalition (GenCC) was formed.

METHODS The GenCC drafted harmonized definitions for differing levels of gene-disease validity based on existing resources, and performed a modified Delphi survey with three rounds to narrow the list of terms. The GenCC also developed a unified database to display curated gene-disease validity assertions from its members.

RESULTS Based on 241 survey responses from the genetics community, a consensus term set was chosen for grading gene-disease validity and database submissions. As of December 2021, the database contains 15,241 gene-disease assertions on 4,569 unique genes from 12 submitters. When comparing submissions to the database from distinct sources, conflicts in assertions of gene-disease validity ranged from 5.3% to 13.4%.

CONCLUSION Terminology standardization, sharing of gene-disease validity classifications, and resolution of curation conflicts will facilitate collaborations across international curation efforts and in turn, improve consistency in genetic testing and variant interpretation.

Competing Interest Statement

Conflict of Interest: R.E.F. is an employee of SciBite Ltd, an Elsevier company. Her work towards this paper was performed when employed by Genomics England. The following authors are an employee for a commercial laboratory that offers clinical genetic testing: M.B.; A.J.C.; K.R.; J.T.. All other authors have nothing to disclose.

Funding Statement

This study was supported by the National Human Genome Research Institute of the National Institutes of Health under award U24HG006834. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or other affiliations. This work was supported by the Intramural Research Program at the National Library of Medicine. PanelApp Australia is supported by Australian Genomics (NHMRC Grants GNT1113531 and GNT2000001). This work was supported by Wellcome Trust [107469/Z/15/Z; 200990/A/16/Z], Medical Research Council (UK), British Heart Foundation [RE/18/4/34215], the NIHR Imperial College Biomedical Research Centre. We thank all PanelApp reviewers and those who have contributed feedback or gene lists to help in the development of PanelApp;individual panels show the names and affiliations of contributors. We thank all participants in the 100,000 Genomes Project. This research was made possible through access to the data and findings generated by the 100,000 Genomes Project. The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health). The 100,000 Genomes Project is funded by the NIHR and NHSE. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the NHSE as part of their care and support. Open Targets is supported by Open Targets. The work performed by authors at EMBLEBI was supported by the Wellcome Trust [WT200990/Z/16/Z].

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data from the GenCC website is openly available in multiple download formats and can be accessed here (https://search.thegencc.org/download). A snapshot of the GenCC database (Dec 2021) relevant to the figures and analysis in this manuscript can be found in the supplemental files. Deidentified Delphi survey responses are available upon request (email mdistefa{at}broadinstitute.org).

https://search.thegencc.org/download

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted January 03, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources
Marina T. DiStefano, Scott Goehringer, Lawrence Babb, Fowzan S. Alkuraya, Joanna Amberger, Mutaz Amin, Christina Austin-Tse, Marie Balzotti, Jonathan S. Berg, Ewan Birney, Carol Bocchini, Elspeth A. Bruford, Alison J. Coffey, Heather Collins, Fiona Cunningham, Louise C. Daugherty, Yaron Einhorn, Helen V. Firth, David R. Fitzpatrick, Rebecca E. Foulger, Jennifer Goldstein, Ada Hamosh, Matthew R. Hurles, Sarah E. Leigh, Ivone US. Leong, Sateesh Maddirevula, Christa L. Martin, Ellen M. McDonagh, Annie Olry, Arina Puzriakova, Kelly Radtke, Erin M. Ramos, Ana Rath, Erin Rooney Riggs, Angharad M. Roberts, Charlotte Rodwell, Catherine Snow, Zornitza Stark, Jackie Tahiliani, Susan Tweedie, James S. Ware, Phillip Weller, Eleanor Williams, Caroline F. Wright, T Michael. Yates, Heidi L. Rehm
medRxiv 2022.01.03.21268593; doi: https://doi.org/10.1101/2022.01.03.21268593
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources
Marina T. DiStefano, Scott Goehringer, Lawrence Babb, Fowzan S. Alkuraya, Joanna Amberger, Mutaz Amin, Christina Austin-Tse, Marie Balzotti, Jonathan S. Berg, Ewan Birney, Carol Bocchini, Elspeth A. Bruford, Alison J. Coffey, Heather Collins, Fiona Cunningham, Louise C. Daugherty, Yaron Einhorn, Helen V. Firth, David R. Fitzpatrick, Rebecca E. Foulger, Jennifer Goldstein, Ada Hamosh, Matthew R. Hurles, Sarah E. Leigh, Ivone US. Leong, Sateesh Maddirevula, Christa L. Martin, Ellen M. McDonagh, Annie Olry, Arina Puzriakova, Kelly Radtke, Erin M. Ramos, Ana Rath, Erin Rooney Riggs, Angharad M. Roberts, Charlotte Rodwell, Catherine Snow, Zornitza Stark, Jackie Tahiliani, Susan Tweedie, James S. Ware, Phillip Weller, Eleanor Williams, Caroline F. Wright, T Michael. Yates, Heidi L. Rehm
medRxiv 2022.01.03.21268593; doi: https://doi.org/10.1101/2022.01.03.21268593

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (215)
  • Allergy and Immunology (495)
  • Anesthesia (106)
  • Cardiovascular Medicine (1093)
  • Dentistry and Oral Medicine (195)
  • Dermatology (141)
  • Emergency Medicine (274)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (499)
  • Epidemiology (9757)
  • Forensic Medicine (5)
  • Gastroenterology (480)
  • Genetic and Genomic Medicine (2303)
  • Geriatric Medicine (222)
  • Health Economics (462)
  • Health Informatics (1553)
  • Health Policy (732)
  • Health Systems and Quality Improvement (602)
  • Hematology (236)
  • HIV/AIDS (501)
  • Infectious Diseases (except HIV/AIDS) (11631)
  • Intensive Care and Critical Care Medicine (616)
  • Medical Education (236)
  • Medical Ethics (67)
  • Nephrology (256)
  • Neurology (2139)
  • Nursing (134)
  • Nutrition (335)
  • Obstetrics and Gynecology (426)
  • Occupational and Environmental Health (517)
  • Oncology (1172)
  • Ophthalmology (363)
  • Orthopedics (128)
  • Otolaryngology (220)
  • Pain Medicine (145)
  • Palliative Medicine (50)
  • Pathology (309)
  • Pediatrics (694)
  • Pharmacology and Therapeutics (298)
  • Primary Care Research (265)
  • Psychiatry and Clinical Psychology (2172)
  • Public and Global Health (4645)
  • Radiology and Imaging (775)
  • Rehabilitation Medicine and Physical Therapy (455)
  • Respiratory Medicine (623)
  • Rheumatology (274)
  • Sexual and Reproductive Health (225)
  • Sports Medicine (208)
  • Surgery (250)
  • Toxicology (43)
  • Transplantation (120)
  • Urology (94)