Abstract
Heterozygous missense variants and in-frame indels in SMC3 are a cause of Cornelia de Lange syndrome (CdLS), marked by intellectual disability, growth deficiency, and dysmorphism, via an apparent dominant-negative mechanism. However, the spectrum of manifestations associated with SMC3 loss-of-function variants has not been reported, leading to hypotheses of alternative phenotypes or even developmental lethality. We used matchmaking servers, patient registries, and other resources to identify individuals with heterozygous, predicted loss-of-function (pLoF) variants in SMC3, and analyzed population databases to characterize mutational intolerance in this gene. Here, we show that SMC3 behaves as an archetypal haploinsufficient gene: it is highly constrained against pLoF variants, strongly depleted for missense variants, and pLoF variants are associated with a range of developmental phenotypes. Among 13 individuals with SMC3 pLoF variants, phenotypes were variable but coalesced on low growth parameters, developmental delay/intellectual disability, and dysmorphism reminiscent of atypical CdLS. Comparisons to individuals with SMC3 missense/in-frame indel variants demonstrated a milder presentation in pLoF carriers. Furthermore, several individuals harboring pLoF variants in SMC3 were nonpenetrant for growth, developmental, and/or dysmorphic features, some instead having intriguing symptomatologies with rational biological links to SMC3 including bone marrow failure, acute myeloid leukemia, and Coats retinal vasculopathy. Analyses of transcriptomic and epigenetic data suggest that SMC3 pLoF variants reduce SMC3 expression but do not result in a blood DNA methylation signature clustering with that of CdLS, and that the global transcriptional signature of SMC3 loss is model-dependent. Our finding of substantial population-scale LoF intolerance in concert with variable penetrance in subjects with SMC3 pLoF variants expands the scope of cohesinopathies, informs on their allelic architecture, and suggests the existence of additional clearly LoF-constrained genes whose disease links will be confirmed only by multi-layered genomic data paired with careful phenotyping.
Competing Interest Statement
M.E.T. is supported by research funding and/or reagents from Illumina Inc., Microsoft Inc., Ionis Therapeutics, and Levo Therapeutics. M.P.N, J.H., and J.J. are employees of GeneDx, LLC.
Funding Statement
P.M.B. is supported by award K08NS117891 from the US National Institute of Neurological Disorders and Stroke (NINDS), and an award from the Boston Children's Hospital Office of Faculty Development. K.N.W.F. and the Clinic for Cornelia de Lange Syndrome and Related Disorders are supported by gifts from Peter and Kathy Wagner and Julie and Frank Mairano. M.E.T, P.M.B., L.W., and D.B-I were supported by grants from the US National Institutes of Health (NIH) (U01HG011755, R01MH115957, HD081256, and P50HD104224). D.B is supported by T32HG002295 from the US National Human Genome Research Institute. S.S. is supported by award K23NS119666 from the US NINDS. T.B.H. is supported by the Deutsche Forschungsgemeinschaft (German Research Foundation) - 418081722, 433158657. Some sequencing and analysis was provided by the Broad Institute Center for Mendelian Genomics and was funded by NIH grants UM1HG008900 and R01HG009141. K.M.B is supported by US National Eye Institute (R01EY026904, R01EY012910, P30EY014104) and the Foundation Fighting Blindness (EGI-GE-1218-0753-UCSD). This work was supported in part by the National Institutes of Health of the United States of America grants R01HD078641 and P50HD096723 to D.F.C. S.B. is supported by the NIHR Manchester Biomedical Research Centre (NIHR203308). A.S. is supported by RC2DK122533. M.S-B was supported by NHLBI R01HL143295. Analysis by E.G., S.T. and A.O'D.L was supported by U01HG0011755 and R01HG009141. This work was supported by the Boston Children's Hospital Rare Disease Cohorts Initiative (CRDC). This study makes use of data generated by the DECIPHER community. A full list of centers who contributed to the generation of the data is available from https://deciphergenomics.org/about/stats and via email from contact{at}deciphergenomics.org. Funding for the DECIPHER project was provided by Wellcome (grant number WT223718/Z/21/Z).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The IRB of Boston Children's Hospital gave ethical approval for this work prior to any research activities.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present work are contained in the manuscript.