Nanopore sequencing-based episignature detection ================================================ * Mathilde Geysens * Benjamin Huremagic * Erika Souche * Jeroen Breckpot * Koenraad Devriendt * Hilde Peeters * Griet Van Buggenhout * Hilde Van Esch * Kris Van Den Bogaert * Joris Robert Vermeesch ## ABSTRACT **Background** A subset of developmental disorders (DD) is characterized by disease-specific genome-wide methylation changes. These episignatures inform about underlying pathogenic mechanisms and can be used to assess the pathogenicity of genomic variants as well as confirm clinical diagnoses. Currently, episignature detection requires the use of indirect methylation profiling microarrays. We hypothesized that long-read whole genome sequencing would not only enable the detection of single nucleotide variants and structural variants but also episignatures. **Methods** Genome-wide nanopore sequencing was performed in forty controls and twenty patients with confirmed or suspected episignature-associated DD, representing thirteen distinct diseases. Following variant and methylome calling, hierarchical clustering and dimensional reduction were used to determine the compatibility with microarray-based episignatures. Subsequently, we developed a support vector machine for each DD. **Results** Nanopore sequencing based methylome patterns were concordant with microarray-based episignatures. Our classifier identified episignatures in 17/20 disease samples and none of the controls. The remaining three patient samples were classified as controls by both our classifier and a commercial microarray assay. In addition, we identified all underlying pathogenic single nucleotide and structural variants and showed haplotype-aware skewed X-inactivation evaluation directs clinical interpretation. **Conclusion** This proof-of-concept study demonstrates nanopore sequencing enables concurrent haplotyped genomic and epigenomic analyses. ## INTRODUCTION The epigenome plays a central role in regulating differential gene expression. Epigenetic variation is therefore involved in several adaptative but also pathological processes. Repeat expansion-associated modifications of promotor methylation have long been recognized as a cause of developmental disorders [1], [2]. Imprinting defects are also known to disturb development, growth, and metabolism [3]. More recently, methylome studies in patients with unexplained DD revealed rare epigenetic changes in 23% of patients with a 2.8 fold excess of *de novo* epivariants [4], suggesting that these may be involved in their etiology. Yet, apart from a handful of targeted methylation assays for well-known imprinted regions, epigenomic analyses are currently not routinely performed in the diagnostic work-up of patients with DD. Besides localized epigenetic variants, genome-wide methylation studies in DD have recently revealed multi-loci, disorder-specific methylation disturbances, called episignatures. Episignatures have initially been identified in disorders caused by known chromatin and/or methylation regulatory gene disruptions [5]. An increasing number of disorders are characterized by episignatures and recognizable methylation pattern changes have now been detected in over sixty diseases [6]. For some, such as certain microdeletion syndromes [7], the mechanism causing reproducible genome wide methylation changes remains unclear. Mapping the methylome in DD will therefore improve our understanding of pathogenic mechanisms and might leverage new insights in the origins of phenotypic variability. In addition, episignatures have a high diagnostic value in assessing the pathogenicity of genomic variants of unknown significance (VUS) and in confirming clinical diagnoses in patients for which the underlying genomic variant could not be identified [8]. The current method used to identify episignatures is based on bisulfite conversion of methylated cytosines followed by methylation analysis using Illumina Epic/Infinium methylation microarrays [6] and commercialized as the EpiSign assay. A clinical implementation study of this method reported the detection of an episignature in 11% of patients with DD without conclusive genetic finding and in 35% of patients with a VUS in a gene for which an episignature is described [9], demonstrating its diagnostic value. One drawback of this assay is that it represents an extra step in the diagnostic odyssey of patients which comes with additional diagnostic delays and increased costs. With the advent of long-read sequencing technologies, detection of single nucleotide variants (SNVs), structural variants (SVs), and base modifications in a single assay becomes reality [10]. Both prevailing long-read methodologies, nanopore (Oxford Nanopore Technologies) and single molecule real-time sequencing (PacBio), enable methylation detection from the native DNA strands without amplification, bisulfite or enzymatic conversion biases [11]. Moreover, the generated long reads enable haplotyping [12], allowing the phasing of genetic and epigenetic variation. Recent studies showed the ability of nanopore sequencing to detect X-chromosome inactivation (XCI) [13] as well as methylation disturbances at imprinted loci [14] and short tandem repeats causing DD [15]. Considering the potential of nanopore sequencing to detect localized methylation levels and disease states, we reasoned it should be possible to map methylome-wide disturbances in chromatinopathies. Hence, we explored the potential to concurrently detect episignatures and underlying pathogenic variants. We performed long-read whole genome nanopore sequencing (lrWGS) of twenty patients representing thirteen different rare DD with known episignatures, as well as forty controls. We report non-inferiority of nanopore sequencing for detecting episignatures and demonstrate concurrent detection of underlying SNVs or SVs. We also confirm the detection of imprinting as well as haplotype-aware skewed XCI. In addition, our results illustrate some limitations of current episignatures. Overall, our study underscores the diagnostic value of lrWGS which might ease rare diseases diagnosis by offering simultaneous genome and epigenome analysis. ## MATERIAL AND METHODS ### Patients’ inclusion and sequencing Twenty patients with a confirmed or a suspected DD for which an episignature is described (Table 1, Suppl. Table 1), as well as forty healthy controls, were included. For the first phase of our study, we included eleven patients representing four diseases as well as five controls. For the second phase, nine additional patients with different diseases were included to increase the amount of evaluated episignatures as well as thirty-five other controls. Two of the patients were selected based on the presence of a VUS and not a (likely) pathogenic variant, as evaluation and illustration of one of the major applications of episignature detection. The genomic variants of all patients had been identified by chromosomal microarray, targeted Sanger sequencing or trio-based exome sequencing (WES). View this table: [Table 1.](http://medrxiv.org/content/early/2024/04/22/2024.04.19.24305959/T1) Table 1. Concurrent detection of episignatures (SVM based) and underlying genomic variants The seven first columns of the table describe patient characteristics and (likely) pathogenic genomic variants or VUS (marked by a “?”) detected by standard of care methods. The next column indicates the result of the SVM based classifier for each sample. Patients for which no episignature is identified by the classifiers are classified as control (italic). The last column specifies if SNV calling (Clair3) or SV calling (Sniffles2 and QDNAseq) allowed the identification of the genomic variant by nanopore sequencing. Fresh or frozen (−80°C) blood was used for HMW DNA extraction (Promega Wizard®, Monarch®). For four patients and two controls, DNA extracted with the automated Revvity – Chemagic® workflow and stored at −20°C was retrieved (Suppl. Table 2). Libraries were prepared from 3µg of DNA using the Oxford Nanopore Technologies (ONT) Ligation Sequencing Kit (SQK-LSK110). Each sample was loaded on a R.9.4.1 Flow Cell and reloaded once after 24-48h, for a total sequencing period of 72-96h on a PromethION®. From HMW DNA extracted from fresh blood, we obtained a median output of 95.6 GB and a median N50 of 40.4 kb. Using frozen blood, the median output was 116.25 GB, and the median N50 31.9 kb. Using stored non-HMW DNA, the median output and N50 were 113.7 GB and 17 kb, respectively (Suppl. Table 2). For patient EPI_13, additional urine and buccal swab samples were collected. DNA extraction of these tissues was performed using the Qiagen® and MagCore® chemistries, respectively. Using the SQK-LSK110 chemistry and r.9.4.1 Flow Cells we generated an output of 100 GB for the urine (N50 = 9.79 kb) and 146.7 GB for the buccal swab (N50 = 7.25 kb). ### (Modified) basecalling, mapping, SVs and SNVs calling To extract methylation information from raw ONT data, we performed modified basecalling with Dorado (v0.3.0) using the dna\_r9.4.1_e8_sup@v3.3_5mCG model[17]. After basecalling, reads were aligned to hg38 with minimap2 (v2.24) [18]. Subsequently, bedMethyl files, used in downstream analysis, were generated from aligned BAM files using modkit (v0.1.13 with pileup --preset traditional --only-tabs) [19]. Ultimately, starting from aligned BAMs, SVs were called using Sniffles2 (v2.0.2) [16] and QDNAseq (v1.3.8) [17]. Large copy number variants (>50 kb) are known to be difficult to detect with current nanopore SV callers such as Sniffles2 [18], and read-depth based softwares, such as QDNAseq, can therefore be used to improve the sensitivity for these variants [19]. SNVs were called with Clair3 (v1.0.4)[20], which was also used to generate haplotagged BAM files. Methylation was visualized with Methylartist [21] and Integrative Genomics Viewer (IGV) [22]. ### Detection of differentially methylated regions: imprinting and X-inactivation Imprinting was assessed by quantifying and visualizing methylation of both haplotypes at six loci located in three imprinted regions associated with DD (11p15.5, 14q32, and 15q11-q13). The following CpGs were assessed: 242 CpGs at the *H19* (chr11:1997509-2003349, hg38), 194 CpG at the *KCNQ1* (chr11:2698155-2701028, hg38), 188 CpG at the *MEG3* (chr14:100824185-100827640, hg38), 45 CpG at the *MEG8* (chr14:100904325-100905081, hg38), 116 CpG at the *SNRPN* (chr15:24954564-24956828, hg38) and 52 CpG at the *MAGEL2* (chr15:23647178-23648424, hg38) imprinted loci. To evaluate XCI, we assessed methylation at two validated loci, also recently targeted by CRISPR-Cas9 nanopore sequencing for this purpose: 115 CpGs at the *AR* gene loci (chrX:67543761-67546170, hg38) and 57 CpGs at the *RP2* gene loci (chrX:46836539-46837273, hg38) [23]. In addition, 99 CpGs in the promotor region of *PHF6* were evaluated (chrX:134372110-134374891, hg38). Differential methylation at these loci was quantified and visualized in the six females of our cohort. Loci with <10x total coverage or <5x coverage for one of the haplotypes were excluded from subsequent analyses (3/18 loci for both imprinting and XCI analyses). For the mother of EPI_19, XCI was evaluated by the standard of care method, *i.e.* comparing the relative amount of PCR amplification products of both *AR* gene CAG-repeat haplotypes after methylation-sensitive restriction enzyme digestion. ### Distinction of episignatures by dimensional reduction and clustering Thirty-four disease-specific episignatures have been identified through microarrays and published by E. Aref-Eshghi *et al.* [4]. The most significant microarray probes for each disease and methylation levels at these loci have been made available. We extracted the DNA methylation beta values from this published dataset at all episignatures probes’ loci for the thirty-four diseases and the included control. This extraction provided, for every episignature locus, one array-based example of the episignature in presence of the disorder and thirty-four examples of methylation levels at these loci in the absence of the disorder. Using pyliftover [24], all the probes (loci) encoding episignatures were lifted from hg19 to hg38. During the liftover, only a single CpG locus in the SBBYSS episignature was lost. Methylation levels were extracted from the nanopore sequencing data at the episignature loci using hg38 coordinates. During this process, the percentage of methylated reads was extracted from bedMethyl files. Using python (v3.9.15), three approaches were used to determine the similarity between array- and nanopore-based episignatures: UMAP (umap-learn\_v0.5.4, n\_neighbors=2) and t-SNE (sklearn\_v1.2.2, n_components=2, perplexity=2) for dimensional reduction, and hierarchical clustering (scipy_v1.9.3, seaborn_v0.12.2). ### Development of a SVM classifier We adopted the One-vs-All approach and trained 34 individual Support Vector Machines (SVMs) using disorder episignature loci DNA methylation median beta values [4]. Every SVM (sklearn v1.2.2) was trained to predict the presence of a specific disorder episignature considering all the other cases as controls. These SVMs were used to classify the nanopore-based methylome profiles. To predict the sample’s class, we ran all the classifiers and assigned the sample to the class with the highest confidence score. Confidence scores represent the decision function indicating the signed distance of a sample from the separating hyperplane. If none of the classifiers returned a positive confidence score or all the confidence scores were <0.30, the sample was classified as control. While this approach guaranteed one and thirty-four examples of methylation values in the presence and in the absence of the disorder, respectively, it also introduced an unbalance between the positive (disorder presence) and negative (disorder absence) classes that could affect the SVM training. To overcome this issue, we introduced a parameter adjusting the weight of the classes during the training of SVMs. The weight for the positive class training sample was set to ten, and the weight for the negative class training samples to one. ## RESULTS ### Imprinting detection To verify the haplotype-aware methylation detection, six loci localized in three imprinted regions (11p15.5, 14q32, and 15q11-q13) were investigated in six patients (Figure 1, Suppl. Figure 1A). Visualizing both haplotypes’ methylation, we observe consistent methylation of most CpGs of one allele in contrast to the absence of methylation of most CpGs of the other haplotype at all loci in all patients. The *H19* and *SNRPN* loci in EPI_12, and *H19* locus in EPI_01, could not be assessed given the low coverage. Further quantification of methylation at these loci shows mean percentages of methylation between 89.3% and 96.3% for the methylated haplotypes and 4.1% to 16.5% for the unmethylated alleles (Suppl. Figure 1A, Suppl. Table 3A). These results are concordant with expectations of mono-allelic expression at imprinted loci and illustrate direct allelic methylation measurement as comparison to standard technologies where disturbances of one allele’s methylation are inferred from total methylation levels. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/22/2024.04.19.24305959/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/04/22/2024.04.19.24305959/F1) Figure 1. Detection of differential methylation at imprinted loci Haplotype aware methylation visualization of EPI_02 at 6 imprinted loci (middle and right part). The maternal allele is represented in orange and paternal allele in blue. The upper part of each plot represents the reads, with methylated CpGs visualized as black circles. In the lower part of each plot, after translation from genome to CpG-only coordinate space, raw log-likelihood ratios are plotted above a smoothened graph of methylation fraction. The left part illustrates gene localization and expression at these 3 imprinted loci. Active genes are represented as full boxes with arrows, whereas untranslated genes are represented by empty boxes. Full circles represent methylated CpG islands whereas empty circles represent unmethylated CpG islands. ### Episignature distinction using microarray data as reference To evaluate the potential of nanopore sequencing-based episignature detection, seven patients were initially sequenced: two with Kabuki, Sotos, and Wiedemann-Steiner syndrome, and one with Cornelia de Lange syndrome. Two-dimensional reduction analyses of episignatures (Figure 2A, Suppl. Figure 2), comparing our samples to the microarray-based disease reference as well as five healthy controls, co-located nanopore methylation values close to the reference microarray data for each disease sample at their disease-specific loci. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/22/2024.04.19.24305959/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/04/22/2024.04.19.24305959/F2) Figure 2. Nanopore methylation analysis allow to distinguish disease samples from controls and other diseases samples UMAP scatter plots (Fig. 2A), hierarchical clustering (Fig. 2B, left part) and heatmaps (Fig. 2B, right part) based on methylation levels at Sotos, Kabuki, Wiedemann-Steiner (WDSTS) and Cornelia de Lange (CdLS) syndromes episignature loci. Nanopore sequencing data of five healthy controls and eleven (seven known and four blinded) patients with one of the four disorders are represented, along with the array-based reference for the respective episignature. Patients sequenced with nanopore cluster in close proximity to their respective array-based episignature reference. Blinded samples can be assigned the correct diagnosis. To further confirm the hypothesis that nanopore-based methylome analysis can be used to recognize episignatures, we sequenced an additional four blinded cases (one of each disease). Hierarchical clustering (Figure 2B) of methylation levels at the disease’s episignature loci combined with the microarray methylome profiles used as a reference, clustered all blinded samples with the disease reference and other samples with the same disorder. Similarly, the blinded samples clustered with samples with the same disease in the UMAP analysis (Figure 2A). Methylation levels at other disease-specific loci are discriminative for some diseases (*e.g.* CdLS and WDSTS loci for Sotos), but not for all (*e.g.* Kabuki loci for Sotos), as was shown in the original publications [25]. Moreover, the heatmaps show how strong the methylation differences are in some diseases (*e.g.* Sotos), but subtle in others. Nonetheless, even in those diseases with small methylation variation, hierarchical clustering enables to recognize each samples’ disease. ### SVM based episignature detection Our primary results showed, through different approaches, a high similarity between the microarray reference and nanopore episignatures. However, methods like UMAP require interpretation upon visual representation, and in the case of hierarchical clustering, determining the cutoffs or number of clusters is not trivial. For this reason, we developed an automated and generic approach to screen samples for 34 shared episignatures [5] using SVM classifiers. Our classifiers were tested on the eleven patients assessed in our first analyses and nine additional samples, representing thirteen diseases, as well as forty controls. In 17/20 patients with a (suspected) DD, the classifiers recognized an episignature and assigned the sample to the right disease (Table 1). All healthy individuals were classified as controls. For patient EPI\_19, both a maternally inherited, hemizygous c.7263\_7265del p.(Gln2425del) VUS in *ATRX* and a pathogenic but mosaic c.3002_3005del p.(Leu1001Profs*15) variant in *KMT2D* (present in 10-21% of cells according to WES and Sanger sequencing) were identified. The detection of the mosaic *KMT2D* variant was driven by clinical suspicion, but an additional ATRX syndrome could not be excluded solely based on the patient’s phenotype. The SVMs classified this patient as Kabuki syndrome, the syndrome caused by pathogenic variants in *KMT2D,* but not as ATRX syndrome. Interestingly, the confidence score value returned by the SVM was lower compared to other Kabuki patients: 0.64 in EPI_19 in contrast to 1.94, 1.78, and 2.30 for the three other Kabuki samples (Suppl. Table 4), likely as consequence of the mosaic status of the *KMT2D* variant. Similarly, EpiSign also identified the Kabuki and not the ATRX signature. In addition, XCI testing was performed in the mother and showed a 41/59 inactivation ratio. Hence, there is no skewed XCI in maternal blood. Moreover, segregation analysis showed the presence of the *ATRX* c.7263_7265del p.(Gln2425del) variant in a healthy brother of the patient, further supporting the *ATRX* variant to be benign. Together with the result of our classifier, those results indicate the *KMT2D* c.3002_3005del p.(Leu1001Profs*15) variant to be pathogenic, but not the *ATRX* c.7263_7265del p.(Gln2425del) variant. In three samples (EPI_13, EPI_18 and EPI20), the classifier did not recognize any of the reference episignatures (Table 1). For those cases, clinical testing using Episign also returned negative. ### Supporting evidence from phased X-inactivation analysis Patient EPI_13 had a diagnosis of Börjeson-Forssman-Lehmann syndrome (BFLS), after identification of a *de novo* pathogenic c.306del p.(Tyr103Thrfs*40) variant in *PHF6*. BFLS is an X-linked disorder with variable penetrance and expression in females. Functional mosaicism, a phenomenon where the allele on which the pathogenic variant is located is active in some but not all cells due to XCI, can explain, at least in part, this variation [26]. Hence, we hypothesize that the female patient with BFLS (EPI_13) might express the allele with the pathogenic variant in relevant tissues but have skewed XCI in blood with predominant inactivation of the abnormal allele. This could explain the absence of episignature in blood. Methylation analysis of the two *AR*, *RP2*, and *PHF6* promotor haplotypes on the patient’s blood sample showed divergent methylation levels between both haplotypes: 75%:2% at the *AR* loci, 80%:5% at the *RP2* loci and 79%:10% at the *PHF6* loci (coverage of 37x for the three loci). These results point to skewing of XCI. In comparison, 26% to 68% of the reads of each of the two haplotypes are methylated at these loci in the five other females of the cohort (Figure 3, Suppl. Table 3B). Evaluation of XCI of the patient’s urine and buccal swab samples also revealed an imbalanced inactivation, the percentages of methylation of both alleles being measured at 76%:19% (*AR*) and 75%:12% (*RP2*) for the urine sample and 71%:3% (*AR*) and 83%:3% (*RP2)* for the buccal swab sample (Suppl. Figure 1B). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/22/2024.04.19.24305959/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/04/22/2024.04.19.24305959/F3) Figure 3. Evaluation of X-inactivation Phased nanopore sequencing methylation levels at AR and RP2 loci, as well as at the promotor of the PHF6 gene, in patient EPI\_13 (top) and EPI_01 (bottom). Orange and blue represent the two different haplotypes of each sample. In the upper part of each plot, the genome coordinates, genes and reads are illustrated. On each read, modified bases are represented by full, dark circles, unmodified bases by open circles. In the lower part of the plot, after translation from genome to CpG-only coordinate space, raw log-likelihood ratios are plotted above a smoothened graph of methylation fraction. About half reads of each allele are methylated at the three loci in patient EPI_01 whereas in patient EPI_13 one allele (orange) is methylated in all reads in contrast to the other one which is unmethylated. To further substantiate our hypothesis, we confirmed EPI_13’s *PHF6* c.306del p.(Tyr103Thrfs*40) pathogenic variant to be localized on the hypermethylated inactive allele by haplotyping. Phasing is enabled by the long reads spanning the 5’UTR differentially methylated region to the pathogenic variant. Taking further advantage of this property, we could determine that the *de novo PHF6* pathogenic variant occurred on the maternal allele. Available trio WES data showed an informative maternally inherited SNV 521 base pairs downstream of the pathogenic variant and nanopore long reads co-localized these two variants to be in *cis* (Suppl. Fig. 3). The *RP2* locus did not contain any informative SNV. The *AR* differentially methylated region, however, also contained an informative SNV which confirms the inactivation of the mutated maternal allele. These results support the hypothesis that this variant is pathogenic but, due to skewed X-inactivation, is not causing an epigenetic fingerprint in white blood cells. ### Concurrent epigenetic and genetic diagnosis Given that long-read sequencing offers genome in parallel with methylome sequencing, we wanted to evaluate and confirm the potential of long-read sequencing to enable concurrent episignature, SV, and SNV detection. Hence, we assessed whether the (likely) pathogenic variants (or VUS for EPI\_18 and EPI\_19) could be detected in the nanopore sequencing data. Eighteen of nineteen SNVs were identified using the standard bioinformatic analysis (see M&M). The mosaic *KMT2D* variant in EPI_19 was only present in 17% (5/29) of the nanopore reads and could not be detected. However, the variant allele frequency in previous short-read WES was also too low (22%, 38/174 reads) to be detected after default variant allele frequency-based filtering. Both short- and long-read methods did allow the *KMT2D* variant detection after manual curation driven by clinical suspicion. Two patients were heterozygous for a pathogenic SV and both were detected: the multi-exonic deletion in the *EP300* gene in EPI\_17 and the 7q11.23 microdeletion in EPI_16. Additionally, the long-read sequencing data allowed mapping the breakpoints of the *EP300* deletion (NM_001429: g.41168065_41171580del, hg38). ## DISCUSSION Episignatures are diagnostically important. Their presence provides evidence for the pathogenicity of VUS or can act as biomarker for a disease when the underlying genomic variant cannot be identified [6], [8]. This proof-of-concept study illustrates the potential of lrWGS for episignature detection. Despite known technological differences, we show that microarray reference data can be used for several diseases. Moreover, nanopore sequencing enables to concurrently identify the episignature inducing SNVs and SVs. In addition, we show that phased methylation maps inform about imprinting and skewed XCI. Altogether, long-read sequencing haplotype-aware mapping of genetic and methylation variation provides a comprehensive genome analysis, combining several analyses that currently require multiple technologies. While this study opens the door for concurrent genomic and episignature assessment, it also illustrates limitations of current episignatures. For two of the genes of which the episignature could not be detected, *UBE2A* and *PHF6,* the reference microarray data we used are based on a limited sample set [5] without test nor validation cohort, and might therefore be imprecise. Both genes, as well as *ZNF711*, are also listed among the genes for which the commercial microarray assay is reported to have a lower sensitivity [27]. Moreover, in the last years, sub-episignatures have been described for different types of variants in the same gene [28],[29]. While not yet been reported for *UBE2A* and *ZNF711*, such sub-episignatures could confound the classification of EPI 18 and EPI_20, which both harbor missense variants. Segregation, maternal XCI analyses, and functional analyses performed in a previous study have confirmed the pathogenicity of the c.19C>T p.(Arg7Trp) variant in *UBE2A* identified in *EPI_20* [30]. Segregation and functional analyses could not be performed for the c.1223A>G p.(His408Arg) VUS in *ZNF711* identified in EPI_18. In this case, the absence of the disease-associated episignature might therefore indicate the variant to be benign. However, one must carefully draw conclusions based on the absence of episignatures as a sub-episignature cannot be excluded. An evaluation of published episignatures also recently showed their high specificity but variable sensitivity [31]. Another constraint highlighted by this study is XCI. For the *PHF6* c.306del p.(Tyr103Thrfs*40) variant (EPI_13), we hypothesized that skewed XCI in white blood cells might lead to predominant expression of the normal allele, resulting in a normal methylation pattern. Skewed XCI has already been reported in other females with BFLS [32]. Using haplotype-aware nanopore methylation mapping, we confirmed the unbalanced skewing in this patient and subsequently localized the variant on the inactivated allele. As the patient presents high phenotypic similarities with BFLS, we hypothesized that other tissues probably lack XCI skewing and might present the episignature. However, both urine and buccal swab samples also showed predominant inactivation of the affected allele. Other tissues could be investigated to substantiate our hypothesis further, but this requires more invasive procedures and might require tissue-specific epigenomic maps, which are currently lacking. Concurrent XCI and episignature evaluation in blood of carrier mothers of males with a X-linked disease episignature could, therefore, provide an indirect alternative to validate this hypothesis in the future. An important potential pitfall in the diagnostic process of DD is mosaicism. Post-zygotic *de novo* pathogenic variants can lead to mosaic variants which often escape detection by the default variant calling algorithms. Interestingly, our classifier did detect the episignature in a mosaic heterozygous patient (EPI_19), albeit with a lower confidence score, reflecting the low-grade mosaicism for the pathogenic variant. Mosaicism detection can be improved by including mosaic samples in the training set and lowering the positive confidence score threshold [33]. Episignature evaluation was important in this boy as it contributed to classifying the X-linked *ATRX* variant as benign and reassuring the parents of low recurrence risk for future pregnancies. The *ATRX* episignature has indeed been reported as having a high sensitivity [31], and this conclusion was corroborated by segregation and maternal XCI analyses. We expect that episignature detection can also potentially direct the clinical diagnostic laboratory to search for mosaic variants in specific genes. The correlation between microarray- and nanopore-based methylomes has been estimated at around 0.85 [34]. Here we show that despite these inter-assay biases and the subtle methylation changes that characterize the episignatures of some disorders, publicly available microarray-based methylation ratio profiles and/or positions can be used for nanopore sequencing episignature detection for several disorders. Still, it is likely that comparing nanopore methylation to microarray reference data reduces the sensitivity. Microarray references also have the disadvantage of being based on a restricted amount of CpGs, varying with the product’s version and each sample’s hybridization [35]. We envision that large-scale nanopore sequencing of patients and controls in the future will leverage high-resolution whole methylomes, enabling refinement of existing episignatures, potentially improving their sensitivity and specificity, and likely leading to the discovery of new episignatures. Important drawbacks of LRS platforms have long been their lower accuracy and high costs. However, the costs of nanopore sequencing have been dropping and currently (for 30x, genome-wide LRS) slightly exceed the costs of clinical episignature testing by microarrays in a large-scale setting. With the improving accuracy of LRS chemistries and analytical tools, the cost-effectiveness balance might change as concurrent genetic and epigenetic testing would enable to replace the combination of WES and methylation-sensitive microarray analyses. The other side of the coin of all the analyses enabled by LRS remains the high amount of resources that are needed to store, process and analyze the data. To conclude, lrWGS enables concurrent phased genome and methylome variant detection, opening the door to large-scale studies about genomic and epigenomic variation and their interactions. While this study has used nanopore sequencing, other sequencing platforms combining methylation and base calling will probably produce similar results. XCI and imprinting detection have already been demonstrated with single-molecule real-time sequencing too [36], [37]. With its improving SNV calling accuracy [10] and more comprehensive SV detection, it seems likely that long-read sequencing will be implemented as the first-tier test for the diagnosis of DD in the future. Beyond known and potential new episignatures, we envision that shedding light on secondary epivariation will improve our understanding of genomic, especially non-coding, variants, and give us new insights into molecular mechanisms underlying diseases. In addition, an excess of *de novo* primary epivariants has been identified in patients with unexplained DD [4], and the presence of such epivariants has been associated with outlier gene expression [1]. Further exploration of the whole epigenome might, therefore, help us solve another part of the remaining unexplained DD. ## Supporting information SupplementaryTable 1 [[supplements/305959_file03.xlsx]](pending:yes) SupplementaryTable 2 [[supplements/305959_file04.xlsx]](pending:yes) SupplementaryTable 3 [[supplements/305959_file05.xlsx]](pending:yes) SupplementaryTable 4 [[supplements/305959_file06.xlsx]](pending:yes) SupplementaryTable 5 [[supplements/305959_file07.xlsx]](pending:yes) Supplementary Figure 1a [[supplements/305959_file08.jpg]](pending:yes) Supplementary Figure 1b [[supplements/305959_file09.jpg]](pending:yes) Supplementary Figure 2 [[supplements/305959_file10.jpg]](pending:yes) Supplementary Figure 3 [[supplements/305959_file11.jpg]](pending:yes) ## Data Availability Data produced are partially available as supplementary with the manuscript and on GitHub. [https://github.com/JorisVermeeschLab/NSBEpi](https://github.com/JorisVermeeschLab/NSBEpi) ## DATA AVAILABILITY Methylation values at used genomic positions are shared in supplementary table 5. The used data and code are available on GitHub: JorisVermeeschLab/NSBEpi: Nanopore sequencing-based episignature detection ([github.com](http://github.com)). Bedmethyl files are available upon request. ## FUNDING STATEMENT This work has been made possible by FWO-TBM grant T-003819N, FWO grant G0A2622N and KU Leuven grants, C1-C14/18/092 and C14/22/125 to JRV. BH was supported by a Collaborative Doctoral Partnership Agreement of the Joint Research Center F.7/KU LEUVEN B&G 35332. JB is supported by a senior clinical investigator fellowship of the FWO Flanders. ## AUTHOR CONTRIBUTIONS Conceptualization: MG, BH, JRV; Data curation: MG, BH, ES; Formal analysis: MG, BH; Funding acquisition: KDVB, JRV; Investigation: MG; Methodology: MG, BH; Project administration: KVDB, JRV; Resources: MG, JB, KDV, HP, GVB, HVE, KDVB; Software: BH, ES; Supervision: KVDB, JRV; Validation: MG, BH, ES; Visualization: MG, BH; Writing – original draft: MG, BH, KVDB, JRV; Writing – review and editing: MG, BH, ES, JB, KDV, HP, GVB, HVE, KVDB, JRV ## ETHICS AND CONSENT DECLARATION This study was approved by the Ethical committee research UZ/KU Leuven (S65304 – S65991) and informed consent was obtained from all participants. ## CONFLIC OF INTEREST The authors declare no conflicts of interest. MG presented her research at the ACLF conference 2023 for Oxford Nanopore Technologies. Travel costs were reimbursed by the company. ## ADDITIONAL FILES **Supplementary table 1.** **Patients’ phenotypes and variant classification** Description of the phenotypes of the included patients, the underlying genomic (potential) pathogenic variants or VUS, and the classification of these variants according to the ACMG criteria [1], [2]. **Supplementary table 2.** **DNA extraction and sequencing summary** Description of the used methodologies for DNA extraction and sequencing as well as summary of the data output for each sample (estimated and basecalled bases as well as read length N50). **Supplementary table 3**. **Differentially methylated regions : quantification** Mean methylation levels at six imprinted loci (first tab) and the *AR, RP2* and *PHF6* differentially methylated loci (second tab) for each haplotype of the six female patients’ blood samples (as well as the urine and buccal swab samples of EPI_13 for the loci on chromosome X). The values for which total coverage <10x or with <5x coverage for one of the haplotypes are represented in grey. **Supplementary figure 1.** **Differentially methylated regions : visualization** Phased nanopore sequencing methylation levels at the same 6 imprinted loci (**Supplementary figure 1A**), as well as the *AR*, *RP2*, and *PHF6* genes (**Supplementary figure 1B**), in blood of the 6 female patients (as well as the urine and buccal swab samples of EPI_13 for the loci on chromosome X). Orange and blue represent the two different haplotypes of each sample. In the upper part of each Methylartist plot [3], the genome coordinates, genes and reads are illustrated. On each read, modified bases are represented by full, dark circles, unmodified bases by open circles. In the lower part of the plot, after translation from genome to CpG-only coordinate space, raw log-likelihood ratios are plotted above a smoothened graph of methylation fraction. **Supplementary figure 2.** **Nanopore methylation analysis allow to distinguish disease samples from controls and other diseases samples (t-SNE)** t-SNE scatter plot from methylation levels at Sotos, Kabuki, Wiedemann-Steiner (WDSTS) and Cornelia de Lange (CdLS) syndromes episignature loci. Nanopore sequencing data of five healthy controls and seven patients with one of the four disorders are represented, along with the array-based reference [4] for the respective episignature. Patients sequenced with nanopore cluster in close proximity to their respective array-based episignature reference. **Supplementary table 4. SVM classifiers results** The excel sheets contains two tabs. The first one details the SVM Confidence Scores. In each row the values of the confidence score from each disorder’s SVM is shared for each sample. The second tab summarizes for each sample the highest confidence score (second row), and assigned disease (third row). Sample without recognized episignature (confidence score <0.3) are classified as control. **Supplementary table 5**. **Methylation values at known episignature loci : reference array data** [5] **and ONT data** The excel sheet contains a tab for every evaluated disorder. For every disorder (tab), the first 2 columns indicate the chromosome and the end position of the CpG loci of the disorder specific episignature. For every disorder specific episignature locus, Illumina derived methylation beta values in all disorders and controls shared in E. Aref-Eshghi *et al.* are shown (data was extracted from the provided supplementary table) [4]. The last 60 columns of every tab indicate the methylation values extracted from our Nanopore samples at every specific locus. **Supplementary figure 3.** **Phasing of methylome and genome variation** Screenshots of the visualization of EPI_13’s Oxford Nanopore long read sequencing and Illumina short read sequencing BAM files in IGV [5] at two loci of interest: *PHF6* and *AR* genes. At the *PHF6* locus we observe the pathogenic c.306delA variant in cis with a maternally inherited SNV as well as methylation of the promotor region of the gene. At the *AR* locus we observe a paternally inherited SNV in *cis* with unmethylation of the promotor region of the gene. ## ACKNOWLEDGEMENTS The authors thank the patients and their families who participated in this research. We thank Jonas Demeulemeester for sharing his expertise about nanopore sequencing methylation calling at the start of the project and also thank Anne Bassett and Donna McDonnald-McGuinn for their critical review of the manuscript. Part of Figure 1 was created with [BioRender.com](http://BioRender.com). ## Footnotes * * joint first authors * Received April 19, 2024. * Revision received April 19, 2024. * Accepted April 22, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## REFERENCES 1. [1]. P. Garg et al., “A Survey of Rare Epigenetic Variation in 23,116 Human Genomes Identifies Disease-Relevant Epivariations and CGG Expansions,” Am J Hum Genet, vol. 107, no. 4, pp. 654–669, Oct. 2020, doi: 10.1016/j.ajhg.2020.08.019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.08.019&link_type=DOI) 2. [2]. C. Depienne and J. L. Mandel, “30 years of repeat expansion disorders: What have we learned and what are the remaining challenges?,” American Journal of Human Genetics, vol. 108, no. 5. Cell Press, pp. 764–785, May 06, 2021. doi: 10.1016/j.ajhg.2021.03.011. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2021.03.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) 3. [3]. D. Monk, D. J. G Mackay, T. Eggermann, E. R. Maher, and A. Riccio, “Genomic imprinting disorders: lessons on how genome, epigenome and environment interact,” Nature Reviews, vol. 20, pp. 235–248, 2019, doi: 10.1038/s41576-018-0092-0. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-018-0092-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) 4. [4]. M. Barbosa et al., “Identification of rare de novo epigenetic variations in congenital disorders,” Nature Communications 2018 9:1, vol. 9, no. 1, pp. 1–11, May 2018, doi: 10.1038/s41467-018-04540-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-018-03376-9&link_type=DOI) 5. [5]. E. Aref-Eshghi et al., “Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders,” The American Journal of Human Genetics, vol. 106, no. 3, pp. 356–370, 2020, doi: 10.1016/j.ajhg.2020.01.019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.01.019&link_type=DOI) 6. [6]. M. A. Levy et al., “Novel diagnostic DNA methylation episignatures expand and refine the epigenetic landscapes of Mendelian disorders,” Human Genetics and Genomics Advances, vol. 3, no. 1, p. 100075, 2022, doi: 10.1016/j.xhgg.2021.100075. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.xhgg.2021.100075&link_type=DOI) 7. [7]. K. Rooney et al., “Identification of a DNA Methylation Episignature in the 22q11.2 Deletion Syndrome,” International Journal of Molecular Sciences Article, 2021, doi: 10.3390/ijms22168611. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijms22168611&link_type=DOI) 8. [8]. B. Sadikovic, M. A. Levy, and E. Aref-Eshghi, “Functional annotation of genomic variation: DNA methylation episignatures in neurodevelopmental Mendelian disorders,” Hum Mol Genet, vol. 29, pp. 1–27, 2020, doi: 10.1093/hmg/ddaa144. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddz215&link_type=DOI) 9. [9]. B. Sadikovic et al., “Clinical epigenomics: genome-wide DNA methylation analysis for the diagnosis of Mendelian disorders,” Genetics in Medicine, vol. 23, no. 6, pp. 1065–1074, 2021, doi: 10.1038/s41436-020-01096-4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41436-020-01096-4&link_type=DOI) 10. [10]. M. Kolmogorov et al., “Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation,” Nat Methods, vol. 20, no. 10, pp. 1483–1492, 2023, doi: 10.1038/s41592-023-01993-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41592-023-01993-x&link_type=DOI) 11. [11]. G. A. Logsdon, M. R. Vollger, and E. E. Eichler, “Long-read human genome sequencing and its applications,” Nature Reviews, vol. 21, pp. 597–614, 2020, doi: 10.1038/s41576-020-0236-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-020-0236-x&link_type=DOI) 12. [12]. P. Ebert et al., “Haplotype-resolved diverse human genomes and integrated analysis of structural variation,” Science (1979), vol. 372, no. 6537, Apr. 2021, doi: 10.1126/SCIENCE.ABF7117. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/SCIENCE.ABF7117&link_type=DOI) 13. [13]. J. Johansson et al., “A novel quantitative targeted analysis of X-chromosome inactivation (XCI) using nanopore sequencing,” Sci Rep, vol. 13, no. 1, p. 12856, 2023, doi: 10.1038/s41598-023-34413-3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-023-34413-3&link_type=DOI) 14. [14]. M. Yamada et al., “Diagnosis of Prader-Willi syndrome and Angelman syndrome by targeted nanopore long-read sequencing,” Eur J Med Genet, vol. 66, no. 2, Feb. 2023, doi: 10.1016/j.ejmg.2022.104690. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ejmg.2022.104690&link_type=DOI) 15. [15]. P. Giesselmann et al., “Analysis of short tandem repeat expansions and their methylation state with nanopore sequencing,” Nat Biotechnol, vol. 37, pp. 1478–1481, 2019, doi: 10.1038/s41587-019-0293-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41587-019-0293-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) 16. [16]. F. J. Sedlazeck et al., “Accurate detection of complex structural variations using single-molecule sequencing,” Nat Methods, vol. 15, no. 6, pp. 461–468, Jun. 2018, doi: 10.1038/s41592-018-0001-7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41592-018-0001-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29713083&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) 17. [17]. I. Scheinin et al., “DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly,” Genome Res, vol. 24, 2022, doi: 10.1101/gr.175141.114. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjEwOiIyNC8xMi8yMDIyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDQvMjIvMjAyNC4wNC4xOS4yNDMwNTk1OS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. [18]. J. Cuenca-Guardiola, B. de la Morena-Barrio, J. L. García, A. Sanchis-Juan, J. Corral, and J. T. Fernández-Breis, “Improvement of large copy number variant detection by whole genome nanopore sequencing,” J Adv Res, vol. 50, pp. 145–158, Aug. 2023, doi: 10.1016/j.jare.2022.10.012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jare.2022.10.012&link_type=DOI) 19. [19]. S. Romagnoli, N. Bartalucci, and A. M. Vannucchi, “Resolving complex structural variants via nanopore sequencing,” Frontiers in Genetics, vol. 14. 2023. doi: 10.3389/fgene.2023.1213917. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fgene.2023.1213917&link_type=DOI) 20. [20]. Zheng Zhenxian, Li Shumin, Su Junhao, Leung Amy Wing-Sze, Lam Tak-Wah, and Luo Ruibang, “Symphonizing pileup and full-alignment for deep learning-based long-read variant calling,” Nat Comput Sci, vol. 2, pp. 797–803, 2022, doi: 10.1038/s43588-022-00387-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s43588-022-00387-x&link_type=DOI) 21. [21]. S. W. Cheetham, M. Kindlova, and A. D. Ewing, “Methylartist: tools for visualizing modified bases from nanopore sequence data,” Bioinformatics, vol. 38, no. 11, pp. 3109–3112, 2022, doi: 10.1093/bioinformatics/btac292. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btac292&link_type=DOI) 22. [22]. J. T. Robinson et al., “Integrative genomics viewer,” Nature Biotechnology 2011 29:1, vol. 29, no. 1, pp. 24–26, Jan. 2011, doi: 10.1038/nbt.1754. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nbt.1754&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21221095&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286048900013&link_type=ISI) 23. [23]. J. Johansson et al., “A novel quantitative targeted analysis of X-chromosome inactivation (XCI) using nanopore sequencing,” Sci Rep, vol. 13, no. 1, p. 12856, 2023, doi: 10.1038/s41598-023-34413-3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-023-34413-3&link_type=DOI) 24. [24].“pyliftover: Pure-python implementation of UCSC liftOver genome coordinate conversion.” Accessed: Sep. 25, 2023. [Online]. Available: [https://github.com/konstantint/pyliftover](https://github.com/konstantint/pyliftover) 25. [25]. E. Aref-Eshghi et al., “Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders,” The American Journal of Human Genetics, vol. 106, no. 3, pp. 356–370, Mar. 2020, doi: 10.1016/J.AJHG.2020.01.019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.01.019&link_type=DOI) 26. [26]. C. B. Gerber et al., “Further characterization of Borjeson-Forssman-Lehmann syndrome in females due to de novo variants in PHF6,” Clin Genet, vol. 102, no. 3, pp. 182–190, Sep. 2022, doi: 10.1111/CGE.14173. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/CGE.14173&link_type=DOI) 27. [27].Amsterdam UMC, “EpiSign complete.” Accessed: Nov. 14, 2023. [Online]. Available: [https://genoomdiagnostiek.nl/en/product/episign-complete/](https://genoomdiagnostiek.nl/en/product/episign-complete/) 28. [28]. E. G. Bend et al., “Gene domain-specific DNA methylation episignatures highlight distinct molecular entities of ADNP syndrome,” Clin Epigenetics, vol. 11, no. 1, pp. 1–17, Apr. 2019, doi: 10.1186/S13148-019-0658-5/FIGURES/6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13148-019-0635-z&link_type=DOI) 29. [29]. R. H. van Jaarsveld et al., “Delineation of a KDM2B-related neurodevelopmental disorder and its associated DNA methylation signature,” Genetics in Medicine, vol. 25, no. 1, pp. 49–62, Jan. 2023, doi: 10.1016/j.gim.2022.09.006. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.gim.2022.09.006&link_type=DOI) 30. [30]. D. M. Haddad et al., “Mutations in the Intellectual Disability Gene Ube2a Cause Neuronal Dysfunction and Impair Parkin-Dependent Mitophagy,” Mol Cell, vol. 50, no. 6, pp. 831–843, Jun. 2013, doi: 10.1016/j.molcel.2013.04.012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.molcel.2013.04.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23685073&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000321319900007&link_type=ISI) 31. [31]. T. Husson et al., “Episignatures in practice: independent evaluation of published episignatures for the molecular diagnostics of ten neurodevelopmental disorders,” European Journal of Human Genetics, 2023, doi: 10.1038/s41431-023-01474-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41431-023-01474-x&link_type=DOI) 32. [32]. C. Zweier et al., “A new face of Borjeson-Forssman-Lehmann syndrome? De novo mutations in PHF6 in seven females with a distinct phenotype,” J Med Genet, vol. 50, pp. 838–847, 2013, doi: 10.1136/jmedgenet-2013-101918. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjk6IjUwLzEyLzgzOCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA0LzIyLzIwMjQuMDQuMTkuMjQzMDU5NTkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 33. [33]. K. Oexle et al., “Episignature analysis of moderate effects and mosaics,” European Journal of Human Genetics, vol. 31, no. 9, pp. 1032–1039, 2023, doi: 10.1038/s41431-023-01406-9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41431-023-01406-9&link_type=DOI) 34. [34]. C. Silva, M. Machado, J. Ferrão, A. Sebastião Rodrigues, and L. Vieira, “Whole human genome 5’-mC methylation analysis using long read nanopore sequencing,” Epigenetics, vol. 17, no. 13, pp. 1961–1975, 2022, doi: 10.1080/15592294.2022.2097473. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/15592294.2022.2097473&link_type=DOI) 35. [35]. E. Giuili et al., “Comprehensive evaluation of the implementation of episignatures for diagnosis of neurodevelopmental disorders (NDDs),” Hum Genet, vol. 142, no. 12, pp. 1721–1735, Dec. 2023, doi: 10.1007/s00439-023-02609-2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-023-02609-2&link_type=DOI) 36. [36]. M. R. Vollger, et al., “Synchronized long-read genome, methylome, epigenome, and transcriptome for resolving a Mendelian condition,” bioRxiv preprint, 2023, doi: 10.1101/2023.09.26.559521. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMy4wOS4yNi41NTk1MjF2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA0LzIyLzIwMjQuMDQuMTkuMjQzMDU5NTkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 37. [37]. O. Y. Olivia Tse et al., “Genome-wide detection of cytosine methylation by single molecule real-time sequencing,” PNAS, vol. 118, no. 5, 2021, doi: 10.1073/pnas.2019768118/-/DCSupplemental. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1073/pnas.2019768118/-/DCSupplemental&link_type=DOI) 38. [38]. S. W. Cheetham, M. Kindlova, and A. D. Ewing, “Methylartist: tools for visualizing modified bases from nanopore sequence data,” Bioinformatics, vol. 38, no. 11, 2022, doi: 10.1093/bioinformatics/btac292. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btac292&link_type=DOI) ## ADDITIONAL FILES REFERENCES 1. 1.Riggs, E. R. et al. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). (2020) doi:10.1038/s41436. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41436&link_type=DOI) 2. 2.Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine 17, 405–424 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/gim.2015.30&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25741868&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) 3. 3.Cheetham, S. W., Kindlova, M. & Ewing, A. D. Methylartist: tools for visualizing modified bases from nanopore sequence data. Bioinformatics 38, (2022). 4. 4.Aref-Eshghi, E. et al. Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders. The American Journal of Human Genetics 106, 356–370 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.01.019&link_type=DOI) 5. 5.Robinson, J. T. et al. Integrative genomics viewer. Nature Biotechnology 2011 29:1 29, 24–26 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nbt.1754&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21221095&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F04%2F22%2F2024.04.19.24305959.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286048900013&link_type=ISI)