Abstract
Purpose Genomic sequencing offers the opportunity to screen for hundreds of rare genetic conditions with a single test. To minimise potential negative impact on families and clinical services, it is crucial to reduce false positive results while prioritising clinical utility. Here we present an automated variant prioritisation approach for genomic newborn screening and the assessment of its validity across the conditions included in the Generation Study, a research study investigating genomic sequencing in 100,000 newborns in England. Prioritised variants will subsequently undergo manual review by a registered clinical scientist, and a specialist clinician, before being reported back to parents. Methods The automated variant prioritisation approach was evaluated for its ability to predict the presence or absence of relevant genetic variants, subsequently informing gene and variant-specific inclusion in the study. Specificity was estimated using a cohort of 34,410 samples not enriched for rare diseases, while sensitivity was assessed with 546 samples from participants with diagnostic variants in genes relevant to newborn screening. Coverage and CNV callability metrics were evaluated as proxies for variant detection. Results Assessing validity of automated variant prioritisation on a gene level led to changes in rules used in variant prioritisation and conditions included. We estimated that 3-5% of samples will have prioritised variants that require manual review and that <1% of samples will be taken forward for orthogonal confirmatory testing and genetic confirmation. We prioritised variants in ~80% of samples with diagnostic variants across genes included in the Generation Study. Conclusion Gene-specific assessment of variant prioritisation is crucial to establish analytical validity prior to inclusion in genomic newborn screening.
Competing Interest Statement
JK, PS, DD, MG, IL, APu, KS, GA, JS, PQ, MT, KS, DB, APi, RS, ET, and DK are employees of Genomics England.
Funding Statement
This study did not receive external funding. This study is run by Genomics England, a company owned by the UK Department of Health and Social Care.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The Generation study has been approved by the Health Research Authority and Cambridge Central Research Ethics Committee. Clearance number: 324562
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
Data from the National Genomic Research Library (NGRL) supporting this study are available within the secure Genomics England Research Environment. Data used or referred to in this publication include: 100,000 Genomes Project data from Rare Disease arm, 100,000 Genomes Project data from Cancer arm, Covid-19 aggregate data, NHS Genomic Medicine Service data for rare disease participants. Access is restricted to approved researchers who are members of the Genomics England Research Network, subject to a data access agreement and participant-led governance. For more information on data access, visit: https://www.genomicsengland.co.uk/research