Medical Evaluation of Unanticipated Monogenic Disease Risks Identified through Newborn Genomic Screening: Findings from the BabySeq Project

Genomic sequencing of healthy newborns to screen for medically important genetic information has long been anticipated but data around downstream medical consequences are lacking. Among 159 infants randomized to the sequencing arm in the BabySeq Project, an unanticipated monogenic disease risk (uMDR) was discovered in 18 (11.3%). We assessed uMDR actionability by visualizing scores from a modified ClinGen Actionability SemiQuantitative Metric and tracked medical outcomes in these infants for 3-5 years. All uMDRs scored as highly actionable (mean 9, range: 7-11 on a 0-12 scale) and had readily available clinical interventions. In 4 cases, uMDRs revealed unsuspected genetic etiologies for existing phenotypes, and in the remaining 14 cases provided risk stratification for future surveillance. In 8 cases, uMDRs prompted screening for multiple at-risk family members. These results suggest that actionable uMDRs are more common than previously thought and support ongoing efforts to evaluate population-based newborn genomic screening.

Recent advances in the clinical deployment of genome-scale sequencing now make it possible to determine an infant's complete genome sequence shortly after birth, enabling the identification of deleterious variants associated with monogenic disorders along with other health-related traits 1 . Genomic sequencing as a screening tool in newborns is being explored 2 and supplementary genomic screening panels are now offered by several commercial laboratories 3 , but there are a number of evidentiary, ethical, and cost concerns that must be considered 1, [4][5][6][7] . The BabySeq Project is a series of randomized clinical trials of newborn screening by genomic sequencing (GS) that has generated empirical data on mechanisms of consent, gene curation, variant interpretation, disclosure methods, as well as medical, behavioral and economic outcomes, with institutional review board approval and informed consent from all participants [8][9][10][11][12][13] . In the initial pilot of the project, we recruited infants from both a wellbaby nursery and intensive care units (ICU) who were randomized to receive either standard of care newborn screening (NBS) or NBS plus GS. Participants randomized to GS underwent whole exome sequencing (WES) with clinical reporting of pathogenic or likely pathogenic variants (PLPVs) for any genetic condition that was childhood-onset and highly penetrant, or childhood-actionable and at least moderately penetrant 9,14 . A limited subset of actionable adult-onset only conditions were also offered to be returned to families 12 . Results were disclosed to participants' parents in a counseling session and a disclosure letter was delivered to the family and the newborn's clinician(s). After disclosure, the study team monitored utility of, and reactions to, the genomic findings by . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; chart review and parental and physician surveys. Parental saliva samples were also collected at enrollment and variant specific testing was completed and disclosed for the parents of all infants with an identified uMDR. The randomized trial was powered to compare psychosocial outcomes and primary results were previously reported. 10 In this report, we examine the subset of newborns who had findings associated with an unanticipated monogenic disease risk (uMDR) and describe the specific genes and variants detected and disclosed, assess their degree of actionability, and document subsequent medical evaluations and outcomes for these at-risk infants and families.
Of the 325 newborns enrolled in the pilot phase of the BabySeq Project,159 underwent WES and of these, 18 (11.3%) were found to have a PLPV in a gene associated with a childhood-onset-actionable condition, or an actionable adult-onset condition (Table 1) 14 . uMDRs were discovered in 11 of 127 (8.7%) newborns recruited from the well-baby nursery, and 7 of 32 (21.8%) newborns recruited from the ICU. uMDRs in the ICU were not associated with the presenting symptoms and signs of the infants.
Among the 18 infants with uMDRs, we identified PLPVs in 14 unique genes. We detected heterozygous variants in 11 genes associated with autosomal dominant inheritance; a hemizygous variant in 1 gene (G6PD) associated with X-linked recessive inheritance, and compound heterozygous variants in two genes (BTD and CYP21A2) associated with autosomal recessive inheritance. One de novo variant (in ANKRD11) provided an unsuspected molecular explanation for a syndromic phenotype under active evaluation. All other variants were inherited. In 3 infants, genomic findings prompted discovery of a clinical phenotype that had not been previously recognized (ELN, BTD, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 18, 2022. ; and GLMN, Table 1), for example echocardiogram revealed mild but abnormal aortic stenosis in the infant with a PV in ELN (case 2). In the remainder of the cases (14/18) molecular findings were associated with future disease risk with PLPVs in the following genes: TTN in 4 cases; BRCA2 in 2 cases; and G6PD, SLC7A9, KCNQ4, CD46, VCL, MYBPC3, MSH2, and CYP21A2 with 1 case each. In 8 of those 14 cases, postdisclosure review of the newborn's family history raised the possibility of additional previously unsuspected and undiagnosed family members with the condition. For example, the maternal grandfather of an infant with a maternally inherited likely pathogenic variant in TTN (case 6) had previously been clinically diagnosed with dilated cardiomyopathy and, as a result of the BabySeq finding, was confirmed to carry the same variant. Consequently, the infant's mother is now routinely followed by echocardiography.
Clinical actionability of uMDRs was classified using the SemiQuantitative Metric (CASQM) adapted in modified form from the ClinGen Actionability Working Group (Supplemental Table 1) 15,16 . The CASQM provides a score of 0-3 on four domains: disease severity, likelihood of developing the disease, efficacy of intervention, and nature of intervention when the patient is carrying the deleterious variant 15 . We scored each of our findings on these top-level concepts, modified the labels for clarity and visualized each of the uMDRs on a radar graph for easier visual interpretation as shown in Figure 1. We also separately assessed the available knowledge base regarding all domains with a score between 0-3 17 . The uMDRs discovered in our study had a mean score of 9, out of a maximum possible score of 12 (range: 7-11). Clinical interventions are available for all the 14 conditions found in the 18 infants, ranging from the initiation . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 18, 2022. ; of surveillance for cancer risk, hearing loss, and cardiac abnormalities, to biotin supplementation in the case of partial biotinidase deficiency.
The infants with uMDRs, and their families, were followed for a median of 44.7 months after disclosure (range: 37.6 to 60.3 months). We reviewed medical records to ascertain what actions were taken for these participants based on their genomic findings (Table 1). We also conducted semi-structured phone interviews with parents 3 years after the conclusion of the study to elicit any relevant information that was not captured in the medical record. We tracked all medical interventions taken on behalf of the infant, as well as medical interventions taken by other family members as a result of the uMDR findings in the sequenced infant. In all but one case (G6PD deficiency), the disclosure of uMDR findings generated recommendations for specialist consultation for the child and/or at least one of the parents. Two-thirds (12/18) of the infants received immediate referrals for specialist care, of which 11 completed an initial referral with 9 of the 11 continuing to receive ongoing specialty care. In one-sixth of these (3/18) cases, an adult-onset uMDR was identified in the infant, and parents were encouraged to alert their child when older to seek appropriate screening and care in adulthood.
Confirmatory parental testing was performed for all infants with a uMDR, and in 72% of the cases (13/18), genomic findings prompted specialist evaluations and/or diagnostic procedures for one or more family members. To date, at least 4 of these family member evaluations (Cases 4, 6, 7 and 8 in Table 1) have led to medically significant findings.
Of the 6 parents discovered to have genetic cardiovascular risk through their infant's uMDR findings, all 6 had screening workups, and 2 continue to receive ongoing surveillance and care. Of the 3 parents discovered to have genetic cancer risk through . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; their infant's uMDR findings, all 3 are undergoing ongoing enhanced surveillance, and 2 of the 3 opted for risk-reducing surgeries.
Our study found that 18 out of 159 (11.3%) infants sequenced had a uMDR for a childhood-onset or adult-onset condition, all of which were actionable on a semiquantitative metric of clinical actionability. In 4 of these cases, the genotype led to the discovery of a phenotype that had not been considered genetic or had not been previously noted, and in 8 of these cases, the genotype findings led to discovery of hitherto unknown at-risk family members. In the 37-60 months following disclosure of the unanticipated genetic findings, two-thirds of the infants and all of their at-risk firstdegree family members with uMDRs were referred for specialty consultation, surveillance, or treatment.
Among the 18 infants identified with uMDRs, 3 were subsequently found to have previously unrecognized symptoms or laboratory measures consistent with the underlying condition that had not been previously recognized, and thus were penetrantalbeit with mild or subclinical features. This finding highlights how difficult it is to determine the true penetrance of most monogenic conditions and the importance of considering differences in expressivity over time. While it is well recognized that estimations of penetrance for many genetic conditions are biased toward higher estimates because clinicians are more likely to recognize genetic disease in families with robust inheritance patterns, our findings suggest that penetrance may also be grossly underestimated when genetic diseases present with milder or subclinical symptoms, or in families without obvious family history. Indeed, the very concept of penetrance depends on which particular phenotype is being measured. For example, in . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022.
an epidemiological study where subclinical features include asymptomatic thickening of the cardiac septum or electrophysiological abnormalities, penetrance around the early features of cardiomyopathy could be recognized earlier than in a study where the first signs of the disease were exercise intolerance or sudden cardiac death 18 .
In the BabySeq Project, we evaluated variants in all disease-associated genes in the genome but only reported variants from those in genes that met our criteria, including 954 pre-curated genes selected for definitive and strong disease-gene association and high penetrance regardless of actionability (884) or moderate evidence or moderate penetrance but high actionability (70) 9 . In addition, 7 genes for adult-onset hereditary breast and ovarian cancer (HBOC) and colon cancer syndromes were offered to all parents. Since the majority of genes listed were selected without regard for actionability, it was surprising to find that all the uMDR discovered in sequenced infants were associated with conditions that were scored as actionable on a modified version of the CASQM. This suggests that the differential in results disclosed between "parents who just want actionable information" and "parents who want any information that is of medical significance to their infant" may not be as large as previously considered.
The concept of actionability is difficult to discuss with patients or research participants, and sometimes even with clinicians and researchers, because for some, the ability to provide enhanced surveillance or even knowledgeable anticipation are considered actionable, while to others this concept only includes conditions where a treatment can slow or stop progression, or unequivocally improve the patient's prognosis. Figure 1 illustrates how the modified CASQM domains can be represented visually so as to facilitate interpretation. A full diamond with score of 3 in each domain . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; represents the most actionable condition whereas alternative shapes can signal variations in which, for example, penetrance is expected to be lower, or treatment is expected to be more burdensome. This type of visualization could eventually be adapted in a way which would allow clinicians and patients to intuitively understand the specific pros and cons around the actionability of the genetic information they are choosing or are receiving.
Two-thirds of the cases identified with uMDRs were seen by specialists knowledgeable about the genetic condition, and 72% of the cases prompted specialist evaluations and/or follow-up procedures for one or more family members. This speaks to the benefits of alerting family members to a genotype carried by one individual and to the powerful advantages of cascade testing, a phenomenon that has been recognized as far more efficient and cost-effective than population screening 19,20 , but that is implemented inconsistently and with variable success in clinical genetics today 21 .
The BabySeq Project is the first rigorous examination of uMDR findings in an unselected cohort of newborns and has several limitations. Participation in our project was offered to the parents of several thousand infants born in a tertiary care medical center, and those who enrolled were more educated and of higher socio-economic status than the general population 10,13 . The sample size of infants carrying uMDRs is small and the constellation of unanticipated genetic findings cannot be presumed to be representative of what might be found in population-level sequencing. In 5 cases of uMDR disclosure, neither the physical examination, subsequent testing, nor the family history revealed any new or useful information, leaving these 5 families with risk information that may be less likely to lead to penetrant disease. A much larger study . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; with a more diverse population would be needed to ascertain whether the anxiety to parents and cost to the healthcare system associated with specialty consultations and surveillance around these risks is worthwhile. However, it is reassuring that in a separate report we recently demonstrated that neither sequencing, nor receiving uMDRs, was associated with significantly greater parental distress or disruption of the parent-child bond, and that parents felt empowered by both positive and negative results 10 . In assessing the reactions of parents in a study like this, it is important to keep in mind that parents self-select to enroll, and their self-selection reflects a desire to receive risk information and may be biased toward greater optimism around the results.
In summary, these early medical outcomes from the BabySeq Project suggest that a large proportion of infants identified as having unanticipated monogenic disease risk had actionable conditions and in nearly all cases the genomic findings resulted in a change in medical care.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ;

ONLINE METHODS:
The BabySeq Project is a series of randomized controlled trials of whole exome sequencing (WES) in newborns from the newborn nursery (NBN) or intensive care unit (ICU) who were randomized to conventional care plus family history assessment and genetic counseling alone or with WES. Detailed methodology for the study design and recruitment can be found elsewhere 1,2 . In brief, parents and newborns from the well-

Gene Selection and Variant Analysis
Detailed description of the rationale for gene selection and variant analysis in the BabySeq Project have been previously published 3,4 . In brief, exomes were annotated, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; filtered and results analyzed to identify pathogenic or likely pathogenic variants (PLPVs) that met criteria for return, including variants found in genes with definitive or strong disease-gene association and high penetrance regardless of actionability or variants found in genes with moderate evidence or moderate penetrance but high actionability.

Disclosure protocol
Parents attended a disclosure session facilitated by a study physician and genetic counselor, on average four months after enrollment (range: 1.2 to 10.2 months), where they were informed of their randomization status and given their family history report. Parents in the ES arm also received the NGSR, and results of an IBA (if performed). At disclosure, the study physicians (who are all trained in either clinical genetics, neurology and/or neonatology) performed detailed physical examination to identify features that might have been previously missed. After disclosure of results, the genetic counselor and physician prepared a note summarizing the visit. These notes, along with the family history report, NBS report, and, for those in the sequencing arm, the NGSR/IBA, were then mailed to the parents and faxed to the infant's pediatrician . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; and other providers and all these documents were uploaded to the infant's medical record at BWH or BCH.

Health Outcomes in this Analysis
We reviewed all available medical records to ascertain downstream actions taken both for the proband and family members. We also conducted phone interviews 46.8 months (mean; range: 37.6 to 60.3 months) after the conclusion of the study to identify any relevant information that may not have been captured in the medical records. We collected a range of outcomes to study the process and effect of integrating WGS into primary care. We evaluated downstream healthcare use attributable to the genomic results both for primary care as well as specialty care. We also evaluated family member cascade testing and family member health care utilization (where available).

Actionability Analysis
Clinical severity of potential conditions identified and available interventions were graded by modifying the ClinGen Actionability Semiquantitative Metric (SQM) 5 . In the BabySeq Project, the SQM was applied to assess potential actionability over the lifetime of a newborn screened as a primary quest for identifying childhood-onset or childhoodactionable condition, or an actionable adult-onset condition, all of which have clinical interventions available. Hence the ClinGen Actionability SQM was adapted such that for each gene-disease pair, we scored outcome-intervention pairs for outcomes with relatively high penetrance when more than one outcome was possible. Additionally, we . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 18, 2022. ; scored outcome-intervention pairs based on all available evidence as opposed to only pediatric actionability, given the important cascade testing implications.     : Radar plots that illustrate the pattern of clinical actionability in the specific genes in which unanticipated pathogenic and likely pathogenic variants (PLPVs) were identified. The plot utilizes a modified Semi-Quantitative Metric adapted from ClinGen as described in the text and online methods. As shown in Panel A, the four points of the diamond on the radar graph (starting from the top and moving clockwise with each figure) represent severity of the fully expressed genetic condition, the penetrance or likelihood that the condition will manifest over an individual's lifetime, the effectiveness of the specific intervention shown in the figure and the tolerability of the intervention, i.e. its burden and acceptability to patients. A radar plot with maximum area within the diamond would represent a severe genetic condition that has high penetrance with a highly effective intervention that is particularly acceptable to patients because it is minimally invasive or dangerous. Panel B shows sample clinical actionability radar plots for three genes from the ACMG secondary findings list. Panel C demonstrates clinical actionability of the 14 genes in which PLPVs were identified in the 18 infants with an unanticipated monogenic disease risk. Figure 1: Radar plots that illustrate the pattern of clinical actionability in the specific genes in which unanticipated pathogenic and likely pathogenic variants (PLPVs) were identified. The plot utilizes a modified Semi-Quantitative Metric adapted from ClinGen as described in the text and online methods. As shown in Panel A, the four points of the diamond on the radar graph (starting from the top and moving clockwise with each figure) represent severity of the fully expressed genetic condition, the penetrance or likelihood that the condition will manifest over an individual's lifetime, the effectiveness of the specific intervention shown in the figure and the tolerability of the intervention, i.e., its burden and acceptability to patients. A radar plot with maximum area within the diamond would represent a severe genetic condition that has high penetrance with a highly effective intervention that is particularly acceptable to patients because it is minimally invasive or dangerous. Panel B shows sample clinical actionability radar plots for three genes from the ACMG secondary findings list. Panel C demonstrates clinical actionability of the 14 genes in which PLPVs were identified in the 18 infants with an unanticipated monogenic disease risk. Colors indicate different genes.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 18, 2022. ; https://doi.org/10.1101/2022.03.18.22272284 doi: medRxiv preprint