Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Comprehensive analysis of GBA using a novel algorithm for Illumina whole-genome sequence data or targeted Nanopore sequencing

View ORCID ProfileMarco Toffoli, Xiao Chen, View ORCID ProfileFritz J Sedlazeck, Chiao-Yin Lee, Stephen Mullin, Abigail Higgins, Sofia Koletsi, Monica Emili Garcia-Segura, Esther Sammler, Sonja W. Scholz, Anthony HV Schapira, View ORCID ProfileMichael A. Eberle, View ORCID ProfileChristos Proukakis
doi: https://doi.org/10.1101/2021.11.12.21266253
Marco Toffoli
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marco Toffoli
Xiao Chen
2Illumina Inc., San Diego, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fritz J Sedlazeck
3Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Fritz J Sedlazeck
Chiao-Yin Lee
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephen Mullin
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
4Institute of Translational and Stratified Medicine, University of Plymouth School of Medicine, Plymouth, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Abigail Higgins
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sofia Koletsi
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Monica Emili Garcia-Segura
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Esther Sammler
5MRC Protein Phosphorylation and Ubiquitylation Unit, School of Life Sciences, University of Dundee
6Molecular and Clinical Medicine, School of Medicine, University of Dundee
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sonja W. Scholz
7Neurodegenerative Diseases Research Unit, National Institute of Neurological Disorders and Stroke, Bethesda, MD 20892, USA
8Department of Neurology, Johns Hopkins University Medical Center, Baltimore, MD 21287, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anthony HV Schapira
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael A. Eberle
2Illumina Inc., San Diego, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michael A. Eberle
  • For correspondence: c.proukakis@ucl.ac.uk meberle@illumina.com
Christos Proukakis
1Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, WC1N 3BG, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christos Proukakis
  • For correspondence: c.proukakis@ucl.ac.uk meberle@illumina.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

GBA variants cause the autosomal recessive Gaucher disease, and carriers are at increased risk of Parkinson’s disease (PD) and Lewy body dementia (LBD). The presence of a highly homologous nearby pseudogene (GBAP1) predisposes to a range of structural variants arising from either gene conversion or reciprocal recombination, the latter resulting in copy number gains or losses, complicating genetic testing and analysis. To date, short-read sequencing has not been able to fully resolve these or other variants in the key homology region, and targeted long-read sequencing has not previously resolved reciprocal recombinants. We present and validate two independent methods to resolve recombinant alleles and other variants in GBA: Gauchian, a novel bioinformatics tool for short-read, whole-genome sequencing data analysis, and Oxford Nanopore long-read sequencing after enrichment with appropriate PCR. The methods were concordant for 42 samples including 30 with a range of recombinants and GBAP1-related mutations, and Gauchian outperforms the GATK Best Practices pipeline. Applying Gauchian to Illumina sequencing of over 10,000 individuals from publicly available cohorts shows that copy number variants (CNVs) spanning GBAP1 are relatively common in Africans. CNV frequencies in PD and LBD are similar to controls, but gains may coexist with other mutations in patients, and a modifying effect cannot be excluded. Gauchian detects a higher frequency of GBA variants in LBD than PD, especially severe ones. These findings highlight the importance of accurate GBA mutation detection in these patients, which is possible by either Gauchian analysis of short-read whole genome sequencing, or targeted long-read sequencing.

Competing Interest Statement

XC and MAE are employees of Illumina Inc. S.W.S. serves on the Scientific Advisory Council of the Lewy Body Dementia Association. S.W.S. is an editorial board member for the Journal of Parkinson Disease and JAMA Neurology. AHVS is supported by the UCLH NIHR BRC.

Funding Statement

This study was supported in part by the Intramural Research Program of the National Institutes of Health (National Institute of Neurological Disorders and Stroke, project numbers: 1ZIANS003154) and the JPND through the MRC grant code MR/T046007/1.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

National Research Ethics Service London–Hampstead Ethics Committee gave ethical approval for research involving RAPSODI samples. National Research Ethics Service Committee central–London gave ethical approval for research involving samples obtained from Queen Square Brain Bank. UCL Ethics Committee gave ethical approval for research involving PPMI samples.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Gauchian will be a part of Version 3.10 of the Illumina DRAGEN (Dynamic Read Analysis for GENomics) Bio–IT platform. ONT and UNCALLED scripts used will be downloadable at https://github.com/marcotoffoli. Individual–level genome sequence data for the PD patients, LBD patients, and neurologically healthy controls are available at AMP–PD (https://amp–pd.org). The datasets of DNA from QSBB brain samples and NHGRI samples generated during this study (Illumina WGS and targeted ONT sequencing) will be made available on the European Nucleotide Archive (https://www.ebi.ac.uk/ena/browser/home), ascession number PRJEB48317. The datasets only include read alignments to GBA/GBAP1 regions (other regions of the genome have been removed or masked) to minimize the amount of genetic information made available for public access. The datasets of DNA from PPMI samples generated during this study (targeted ONT sequencing) will be made available on the PPMI repository (https://www.ppmi–info.org/). ONT sequencing data on living individuals are not available due to consent / IRB restrictions. Additional data produced will be available upon reasonable request to the authors

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted November 13, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Comprehensive analysis of GBA using a novel algorithm for Illumina whole-genome sequence data or targeted Nanopore sequencing
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Comprehensive analysis of GBA using a novel algorithm for Illumina whole-genome sequence data or targeted Nanopore sequencing
Marco Toffoli, Xiao Chen, Fritz J Sedlazeck, Chiao-Yin Lee, Stephen Mullin, Abigail Higgins, Sofia Koletsi, Monica Emili Garcia-Segura, Esther Sammler, Sonja W. Scholz, Anthony HV Schapira, Michael A. Eberle, Christos Proukakis
medRxiv 2021.11.12.21266253; doi: https://doi.org/10.1101/2021.11.12.21266253
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Comprehensive analysis of GBA using a novel algorithm for Illumina whole-genome sequence data or targeted Nanopore sequencing
Marco Toffoli, Xiao Chen, Fritz J Sedlazeck, Chiao-Yin Lee, Stephen Mullin, Abigail Higgins, Sofia Koletsi, Monica Emili Garcia-Segura, Esther Sammler, Sonja W. Scholz, Anthony HV Schapira, Michael A. Eberle, Christos Proukakis
medRxiv 2021.11.12.21266253; doi: https://doi.org/10.1101/2021.11.12.21266253

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (161)
  • Allergy and Immunology (416)
  • Anesthesia (91)
  • Cardiovascular Medicine (860)
  • Dentistry and Oral Medicine (159)
  • Dermatology (97)
  • Emergency Medicine (250)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (394)
  • Epidemiology (8562)
  • Forensic Medicine (4)
  • Gastroenterology (384)
  • Genetic and Genomic Medicine (1753)
  • Geriatric Medicine (167)
  • Health Economics (373)
  • Health Informatics (1244)
  • Health Policy (621)
  • Health Systems and Quality Improvement (468)
  • Hematology (196)
  • HIV/AIDS (374)
  • Infectious Diseases (except HIV/AIDS) (10303)
  • Intensive Care and Critical Care Medicine (553)
  • Medical Education (192)
  • Medical Ethics (51)
  • Nephrology (212)
  • Neurology (1678)
  • Nursing (97)
  • Nutrition (251)
  • Obstetrics and Gynecology (326)
  • Occupational and Environmental Health (451)
  • Oncology (929)
  • Ophthalmology (263)
  • Orthopedics (102)
  • Otolaryngology (172)
  • Pain Medicine (114)
  • Palliative Medicine (40)
  • Pathology (253)
  • Pediatrics (536)
  • Pharmacology and Therapeutics (253)
  • Primary Care Research (208)
  • Psychiatry and Clinical Psychology (1770)
  • Public and Global Health (3841)
  • Radiology and Imaging (624)
  • Rehabilitation Medicine and Physical Therapy (320)
  • Respiratory Medicine (520)
  • Rheumatology (208)
  • Sexual and Reproductive Health (168)
  • Sports Medicine (158)
  • Surgery (190)
  • Toxicology (36)
  • Transplantation (101)
  • Urology (76)