Using single molecule Molecular Inversion Probes as a cost-effective, high-throughput sequencing approach to target all genes and loci associated with macular diseases
=======================================================================================================================================================================

* Rebekkah J. Hitti-Malin
* Claire-Marie Dhaenens
* Daan M. Panneman
* Zelia Corradi
* Mubeen Khan
* Anneke I. den Hollander
* G. Jane Farrar
* Christian Gilissen
* Alexander Hoischen
* Maartje van de Vorst
* Femke Bults
* Erica G.M. Boonen
* Patrick Saunders
* MD Study Group
* Susanne Roosing
* Frans P.M. Cremers

## Abstract

**Background** Macular diseases (MDs) are a subgroup of retinal disorders characterized by central vision loss that represent a major cause of vision impairment. Despite the identification of numerous genes associated with inherited MD (iMD) and risk factors associated with age-related MD (AMD), the extent of genetic and non-genetic factors that influence the expression of MD is still not fully explained. Single molecule Molecular Inversion Probes (smMIPs) have proven effective in sequencing the *ABCA4* gene in patients with Stargardt disease in a cost-effective manner to identify associated coding and non-coding variation, however a large portion of patients with MDs still remain genetically unexplained.

**Methods** For MD patients, we hypothesized that the missing heritability may be revealed by smMIPs-based sequencing of all MD-associated genes and risk factors. We used 17,394 smMIPs to sequence the coding regions of 105 genes and non-coding or regulatory loci associated with iMD and AMD, known pseudo-exons, and the mitochondrial genome in two test cohorts that had previously been screened for variants in the entire *ABCA4* gene.

**Results** Sequencing of 379 probands achieved an average nucleotide coverage of 431× across all targets. Following detailed sequencing analysis of 110 probands, a diagnostic yield of 38% was observed. This established an ‘‘MD-smMIPs panel’’ that allows a genotype-first approach in a high-throughput and cost-effective manner, whilst achieving uniform and high coverage across targets.

**Conclusions** Further analysis will identify known and novel variants in MD-associated genes to offer an accurate clinical diagnosis to patients and their families. Furthermore, this will reveal new genetic associations for MD and potential genetic overlaps between iMD and AMD.

Keywords
*   macular diseases
*   targeted gene sequencing
*   cost-effective
*   high-throughput
*   smMIPs

## Introduction

Inherited retinal diseases (IRDs) encompass a variety of disorders impacting retinal function, subsequently resulting in vision impairment and often blindness. IRDs can be classified based on disease progression and the retinal photoreceptor cells that are initially affected. For example, retinitis pigmentosa (RP) is characterized as a rod-cone dystrophy, where rod photoreceptor cells degenerate before cone photoreceptor cells. Since rods are more abundant in the peripheral retina, defects in peripheral vision are initially observed. In contrast, cone dystrophies and cone-rod dystrophies are characterized by the degeneration of photoreceptor cells situated in the central part of the retina, termed the macula, and the neighboring retinal pigment epithelium. The macula harbors the highest concentration of cones, thus central vision defects are prominent in cone- and cone-rod dystrophies. In some cases, degeneration of rod photoreceptors follows, which expands into the mid-periphery of the retina.

IRDs affecting the macula are termed macular degenerations (MDs). These can be broadly categorized as inherited MDs (iMDs) that are relatively rare and present at an early age, and age-related MD (AMD), which are more prevalent and occurs later in life, typically affecting adults over the age of 55 [1]. Late-onset iMDs can exhibit shared clinical features to AMD, and vice-versa, therefore identification of the causal genetic defect is the only way to distinguish between late-onset iMDs, AMD and other MD-phenocopies to offer an accurate clinical diagnosis to patients and their families. Extensive clinical and genetic heterogeneity is observed amongst IRDs, whereby variants in 280 genes currently implicate an IRD phenotype in humans, including 185 genes associated with non-syndromic IRDs ([https://sph.uth.edu/retnet/](https://sph.uth.edu/retnet/)). This clinical and genetic overlap between IRDs can hinder a clear diagnosis, since one gene can be associated with more than one form of IRD. One example includes variants in the *RPGR* gene, which can lead to a broad range of phenotypes, including an early-onset MD or a RP phenotype, primarily affecting males [2-4]. In total, 28 genes are known to be mutated amongst individuals with either MD, RP or even LCA.

Several high-throughput next-generation sequencing (NGS) technologies using short-read sequencing have accelerated molecular diagnoses for IRD patients, ranging from custom gene panel approaches [5,6] to whole exome sequencing (WES) [7] and whole genome sequencing (WGS) [8,9]. Each technique involves complex outputs to consider in terms of financials, total data generated (and its required storage capacity) and hands-on analysis time per sample. Another important consideration is the total sequencing coverage achieved, ensuring that this is sufficient and uniform across all genomic targets to increase the diagnostic yield. An advantage of a gene-panel or a targeted sequencing approach includes the ability to customize targets to cover a selection of coding and non-coding regions of IRD genes, whilst keeping costs, data generation and storage to a minimum. Approximately two-thirds of patients with IRDs can be genetically explained by targeting the coding regions of IRD-associated genes [10], favoring targeted approaches as a first assessment. Molecular Inversion Probes (MIPs) have been used to study 108 non-syndromic IRD-associated genes [11,12] as well as autism spectrum disorders [13] and neurodevelopmental disorders [14], and are even used in a diagnostic setting to screen for familial breast cancers [15]. In particular, a MIP approach uses library-free target enrichment, which is faster and easier to scale, thus making this a multiplexed and high-throughput approach. More recently, single molecule MIPs (smMIPs) have proven successful in providing a molecular diagnosis to patients suffering with *ABCA4*-associated Stargardt disease (STGD1) [16,17]. When compared to alternative NGS approaches, a smMIPs approach can achieve high sequencing coverage at a reasonable cost per sample, depending on the sample numbers and platform used. As a multiplex-targeted sequencing approach, numerous smMIPs are designed, each targeting one unique sequence, and all smMIPs in the design are pooled to incorporate thousands of smMIPs in one assay that do not interfere with one another. The desired targets are captured and undergo amplification to generate DNA libraries for sequencing. By incorporating patient-specific barcodes to each amplicon, DNA libraries from numerous DNA samples can be pooled and sequenced simultaneously in one sequencing run. A previous study used smMIPs to capture and sequence the entire *ABCA4* gene (coding and non-coding regions) in 1,054 probands, hereafter referred to as *ABCA4*-smMIPs sequencing, demonstrated sequencing coverages up to 700× per nucleotide for ∼210 patients in one series using an Illumina NextSeq 500 platform [17]. Although different applications and platforms are generally used for WGS and WES, coverages achieved are typically 30-50× and 100× coverage, respectively. Despite the advantages and success of WES, even with 100x coverage, many exome kits contain regions of low coverage, most often in exon 1 or GC-rich regions [18,19]. Uniform coverage in excess of 100× is advantageous for the detection of CNVs and rare (non-germline/somatic; mosaic) variants with a higher degree of confidence.

Despite advances in NGS applications, many MD cases still remain genetically unexplained, by which the detection of likely causative variants following WES in patients with MD (excluding *ABCA4*-associated STGD1) was 28% [7]. Although many genes and risk factors for MDs are known, knowledge of genetic and non-genetic factors influencing phenotypic MD expression are still lacking. Thus, we hypothesized that by sequencing all iMD- and AMD-associated genes and single nucleotide polymorphisms (SNPs), known as risk factors, overlapping genetic causes may be revealed, which may further distinguish between late-onset iMDs and AMD or may yield genetic modifiers of MD. We sought to implement a new smMIPs platform to combine gene selection and high-throughput sequencing for this clinical subgroup, improving the smMIPs-based approach used previously in our laboratory [17], to detect genetic associations in MD cohorts. We designed and synthesized smMIPs for massively parallel sequencing of exons and pseudo-exons of 105 iMD and AMD-associated genes, in addition to AMD risk factor SNPs, in 360 iMD patients simultaneously in a cost-effective manner; ensuring single nucleotide variants (SNVs), small insertions and deletions (indels) and copy number variants (CNVs) could be detected. With this approach, we established a smMIPs assay that allows a genetics-first approach for a heterogeneous subgroup of disorders, in an affordable and high-throughput manner. This will provide a proof-of-concept for our ongoing, long-term objective to sequence large cohorts of MD patients.

## Materials and Methods

### Inclusion of genes

Gene inclusion criteria was based on genes associated with iMD and/or AMD listed on the Retinal Information Network online resource ([https://sph.uth.edu/retnet/](https://sph.uth.edu/retnet/); accessed 07 August 2020) and those reported in literature. The Human Gene Mutation Database (HGMD) ([http://www.hgmd.cf.ac.uk/ac/index.php](http://www.hgmd.cf.ac.uk/ac/index.php); accessed 07 August 2020) was also used to highlight genes associated with reported iMD and AMD phenotypes. Specifically, iMD-associated genes comprised genes involved in MDs and cone-prominent IRDs.

### smMIPs design

Molecular Loop Biosciences Inc.’s smMIPs design incorporates a capture region of 225 nucleotides (nt), which is flanked by 5’ and 3’ extension and ligation probe arms, respectively, comprising 40 nt in total. Each smMIP is a single-stranded DNA-oligonucleotide that is dual-indexed with custom, distinct index adapter sequences, two index primer sequences (barcodes) of 8 nt in length, or 10 nt when the total number of patients exceeds 96 in a given series. These index primers act as a patient barcoding system in order to generate uniquely tagged libraries. In addition, each smMIP molecule contains dual 5-nt randomers that are adjacent to each probe arm, incorporated as molecular barcoding to uniquely tag each smMIP molecule in a sample library. Dual 5-nt randomers are used to enable detection of duplicate reads and increase the detection of unique reads prior to data analysis.

### Generation of the MD-smMIPs panel

We sought to assemble a panel of genomic regions associated with iMD and AMD phenotypes to target using smMIPs. The 5’ untranslated regions (UTRs), protein-coding exons and alternate protein-coding exons of 105 genes/loci associated with iMD and AMD were selected as targets for our new panel. Transcript numbers were selected from Alamut Visual software version 2.13 (Interactive Biosoftware), selecting the protein-coding transcript or the longest transcript, and checked using the UCSC Table Browser [20]. Genomic coordinates were extracted from UCSC Genome Browser, hg19 (GRCh37) [21]. All transcripts were also evaluated for the presence of alternative exons and alternative 5’ UTRs using the Ensembl Genome Browser (GRCh37; Ensembl release 101) [22]. A complete list of genes included can be found in **Supplementary Table S1**. In addition to gene targets, 89 AMD-associated risk factor variants and loci reported in genome- and transcriptome-wide association studies [23,24] were included. Furthermore, 60 literature-reported and eight unpublished deep-intronic variants (DIVs) (G. Arno, unpublished data; Z. Corradi, unpublished data) and resulting pseudo-exons were included. Only DIVs with published functional evidence of pseudo-exon generation, intron retention, exon skipping, or an effect on promotor activity were included. Details of all DIVs are listed in **Supplementary Table S2**. Finally, additional regions, including SNPs on chromosome 6, were selected to ensure equal representation across all chromosomes and for (partial) uniparental disomy to be assessed. **Supplementary Figure S1** shows the genomic distribution of all nuclear DNA targets selected. As mitochondrial DNA (mtDNA) variants and mtDNA haplogroups are implicated in (A)MD phenotypes [25,26], the entire mitochondrial genome (16,569 nt) was also targeted to investigate potential biomarkers and mtDNA variants associated with MDs.

Genomic coordinates of all selected regions were provided to Molecular Loop Biosciences Inc., USA, who designed 16,973 smMIPs covering the desired genomic targets, including 20 nt of flanking sequence up- and downstream (amounting to 436,017 nt), and 421 smMIPs targeting the mtDNA. Taken together, a grand total of 452,586 nt were targeted by 17,394 smMIPs. Thus, on average, each targeted nucleotide was covered by eight smMIPs. This smMIPs panel is further referred to as the MD-smMIPs panel.

### Patient cohorts

To validate the MD-smMIPs panel, we focused on iMD patients. All iMD probands selected were clinically diagnosed with Stargardt disease (STGD), a “Stargardt-like” phenotype (STGD-like), MD or a retinal degeneration characterized by central vision defects, including cone dystrophy (CD) or cone-rod dystrophy (CRD). In some instances, an RP phenotype was also included. Cohorts consisted of two main patient groups: a) patients that had previously undergone a different smMIPs-based sequencing approach, targeting the entire *ABCA4* gene [17] and *PRPH2* exonic regions, yet remained unsolved; b) patients that had not undergone *ABCA4*/*PRPH2* screening using a smMIPs-sequencing method, but may have undergone alternative screening methods, such as WES or targeted gene analysis. Moreover, positive controls were selected to validate the MD-smMIPs workflow in addition to assist in CNV analysis, including patients carrying previously identified CNVs in genes selected for the design. DNA from genetically unexplained probands that were collected as part of a previous study [17] were selected. These DNA samples were collected by 17 international collaborators and one national collaborator, of whom written informed consent was obtained and DNA isolations were performed in the respective laboratories (**Supplementary Table S3**).

### Sample preparation

Genomic DNA samples were quantified and diluted within a DNA concentration range of 15-25 nanograms per microliter (ng/μl). One hundred ng of each patient DNA was analyzed using agarose gel electrophoresis to determine DNA quality and concentration. Crucially, samples were discerned as high molecular weight (MW) DNA or low MW DNA, also by gel electrophoresis. DNA was considered high MW when the DNA fragments were ≥23 kilobases (kb), as measured using the lambda DNA-*Hin*dIII marker. Samples with DNA of high MW were plated into a 96-well capture plate, which was pre-treated by incubating the DNA at 92°C for five minutes prior to library preparation steps to denature DNA. Following the pre-shearing step, samples with low MW DNA were added to the capture plate for further processing.

### Library preparation

DNA libraries were prepared for each proband using the High Input DNA Capture Kit, Chemistry 2.3.0H produced by Molecular Loop Biosciences Inc. Protocol version 2.4.1H was followed, using an incubation time of 18-hours for the probe hybridization. A fill-in step was performed, followed by a combined clean-up and PCR step using the following parameters: 45°C for 15 minutes, 95°C for 3 minutes, 17 cycles of 98°C for 15 seconds, 60°C for 15 seconds, and 72°C for 30 seconds. Prior to library pooling, 5 μl of each individual library was evaluated using agarose gel electrophoresis to assess library quality and size, where a distinct gel band at approximately 400 bp represented successful amplification of the DNA library. Library purification was performed using 1× AMPure XP beads (Beckman Coulter, California, USA) using the standard protocol and eluted in low-TE buffer. Quantitation was performed using the Qubit Fluorometer (dsDNA HS assay kit; ThermoFisher Scientific, Massachusetts, USA) and the 2200 TapeStation system (HS D1000 DNA kit; Agilent Technologies, California, USA) to determine the library concentration and exact library size (in base pairs), respectively. A final dilution of 1.5 nanomole (nM) in 100 μl was prepared. Steps performed are summarized in **Figure 1**.

![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/24/2022.03.17.22272534/F1.medium.gif)

[Figure 1:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/F1)

Figure 1: 
A summary of the DNA preparation and library preparation workflow to prepare DNA libraries for 360 DNA samples from MD cases.

Two sample sets were used for validation of the workflow. For the first set of samples, sample set 1, 46 libraries were prepared in two series of 24 libraries (pools A and B), each comprising 23 solved or unsolved probands and one no template control (NTC) of ultrapure water, to test the efficiency and overall coverage of the MD-smMIPs panel (referred to as the ‘‘MD test run’’). Library pools A and B were pooled into one final mega-pool of 46 individual capture reactions before quantitation and preparation of a final library dilution for sequencing. Sample set 2 consisted of 384 libraries, prepared in series of 96 libraries, with a focus on genetically unexplained iMD probands. Four final library pools were prepared (pools A, B, C and D), which were combined into one mega-pool of 384 individual capture reactions, comprising 360 probands that are genetically unexplained, 20 positive controls and four NTCs at fixed positions in the capture plate. This larger sequencing run will be referred to as ‘‘MD run 01’’.

### NovaSeq 6000 SP sequencing

The final library mega-pools (from the MD test run and MD run 01) were denatured according to Illumina’s NovaSeq 6000 System Denature and Dilute Libraries Guide, resulting in two 300 pM final libraries. Each library was sequenced by paired-end sequencing on a NovaSeq 6000 platform (Illumina, California, USA) using two SP reagent kits v1.5 (300 cycles), each with a capacity of 1.3 – 1.6 billion paired end reads per run.

### Variant calling and annotation

For each NovaSeq 6000 run, sequencing reads were converted to raw sequencing data (FASTQ) files using bcl2fastq (v2.20). Raw FASTQ files were processed through an in-house bioinformatics pipeline, as previously described [16]. In brief, random identifiers were trimmed from the sequencing reads and stored within the read identifier for later use. Duplicated reads were removed while the remaining unique reads were written to a single BAM file per patient based on the index barcode. To generate the number of mapped reads to determine overall average smMIPs coverages, the number of forward reads was added to the number of reverse reads, and this value divided by two.

### smMIPs performance

The performance and evenness of smMIPs were evaluated using the average read coverages of the MD test run data. The average read count was calculated for each smMIP across all samples in the sequencing run and sorted in descending order. To identify high performing and low performing smMIPs, a log10 plot of the ranked evenness was generated.

### Average coverages per nucleotide

To determine the number of all reads covering each nucleotide of nuclear targets (428,562 nt) and mtDNA targets (16,569 nt) in MD run 01, the base calls of aligned reads to a reference sequence were counted in individual sample BAM files using the ‘pileups’ function of SAMtools [27]. The following parameters were used for the generation of pileups data: minimum mapping quality = 0; minimum base quality = 12; anomalous read pairs were discarded; overlapping base pairs from a single paired read as a depth of 1 were counted. An average coverage per nucleotide was generated for each nucleotide position across all samples sequenced in MD run 01, followed by an average coverage for all genes/loci targeted in the MD-smMIPs panel. The average coverage per nucleotide for *RPGR* was calculated excluding exon 15 of the *RPGR-ORF15* transcript, and the coverage of exon 15 of the *RPGR-ORF15* transcript was determined independently. Coverage plots for all reads across each gene/loci were generated. The average coverages per nucleotide were used to assess whether regions were poorly covered (≤10 reads), moderately covered (11-49 reads), or well-covered (≥50 reads).

### Variant prioritization

First, using an Excel script as previously described [17], CNV analysis was conducted for all samples. The presence of six consecutive smMIPs with a normalized coverage across all samples included in the sequencing run of ≤0.65 assumed a deletion, whereas those with a normalized coverage ≥1.20 suspected a duplication. The second phase of analysis focused on SNVs and indels that were present in *ABCA4*, compared to a list of previously identified variants, as listed in the Leiden Open (source) Variation Database (LOVD; [https://databases.lovd.nl/shared/genes/ABCA4](https://databases.lovd.nl/shared/genes/ABCA4)), to highlight frequent *ABCA4* variants, e.g. c.2588G>C and c.5603A>T [28-30], and/or previously published causative variants in *ABCA4*. In addition, the presence of known pathogenic DIVs targeted by the MD-smMIPs panel were highlighted. Next, all homozygous and compound heterozygous SNVs and indels with a minor allele frequency ≤0.5% and all heterozygous variants in genes associated with autosomal dominant retinal dystrophies with a minor allele frequency of ≤0.1% in an in-house cohort, containing 24,488 individuals with numerous phenotypes, and the Genome Aggregation Databases (gnomAD) for control exome and genome populations were assessed. Variants with an allele frequency variation between 35-80% were considered as heterozygous variants, and multiple variants present in a given gene were considered compound heterozygous candidates. Variants with an allele frequency variation ≥80% were classified as homozygous variants. Thus, variants of all inheritance patterns were considered. For probands in MD run 01, variants present in at least 10% of probands (i.e. ≥ 38) in the sequencing run were excluded to filter out sequencing artefacts. Only variants with >10× coverage (i.e. present in >10 unique sequencing reads, following de-duplication) were assessed.

Variants were prioritized based on their predicted protein effect and pathogenicity, and variant types. Stop-gain, frameshift, stop loss, start loss, and canonical splice site variants, along with in-frame indels, that were present following filtering for allele frequencies were considered. The pathogenicity of missense variants were assessed based on score thresholds using the following *in silico* prediction tools: PhyloP (range: −14.1_6.4; predicted pathogenic ≥2.7) [31], CADD-PHRED (range: 1_99; predicted pathogenic ≥15) [32] and Grantham (range: 0_215; predicted pathogenic ≥80) [33]. Variants that met all three criteria (allele frequencies, predicted protein effect and variant type) were prioritized, followed by those that met the threshold scores of any *in silico* tools used. The genomic positions of all candidate variants were manually visualized in patient BAM files to include or exclude in final data interpretation steps. Truncating variants with a clear effect in AMD-associated genes were considered, however missense variants in AMD-associated genes were not considered in this initial analysis. A separate analysis will be performed for such variants, with a focus on rare, low frequency variants with large effect sizes, also taking into consideration an AMD cohort.

For non-canonical splice site (NCSS) variants, near-exon variants and DIVs, *in silico* tools Splice Site Finder-like [34], MaxEntScan [35], NNSPLICE [36] and GeneSplicer [37] were used in Alamut Visual software version 2.13 (Interactive Biosoftware) to predict the impact on splicing, using parameters described by Fadaie, et al. [38], and ESEfinder [39] to predict effects on exon splicing enhancers (ESEs). SpliceAI [40] was used via the BROAD Institute web interface tool ([https://spliceailookup.broadinstitute.org/#](https://spliceailookup.broadinstitute.org/#)) to further predict splicing effects, using a 10,000-bp (5,000-bp upstream and 5000-bp downstream) window. Variants with a predicted delta score of ≥0.2 (range: 0–1) for at least one of the four predictions (acceptor loss, donor loss, acceptor gain, donor gain) were considered potential candidates.

### Variant classification

All prioritized variants were queried through the Franklin Genoox platform ([https://franklin.genoox.com/](https://franklin.genoox.com/): Accessed 10 December 2021) to obtain suggested classifications using the ACMG-AMP guidelines to gather annotations and evidence associated with each variant. The ACMG classification system divides variants into five classes: class 1 (benign); class 2 (likely benign); class 3 (variant of uncertain significance i.e. VUS); class 4 (likely pathogenic) and class 5 (pathogenic) [41]. Only variants classified as class 3, 4 or 5 using ACMG guidelines were considered. In addition, all variants were investigated for in public online databases, including the LOVDs ([https://www.lovd.nl/](https://www.lovd.nl/): Accessed 10 December 2021) and ClinVar ([https://www.ncbi.nlm.nih.gov/clinvar/](https://www.ncbi.nlm.nih.gov/clinvar/): Accessed 10 December 2021). Variant data present in these databases were assessed on the pathogenicity interpretation submitted by previous reports. For novel candidate variants absent from these databases with no previous reports or functional evidence, *in silico* tools Sorting Intolerant From Tolerant (SIFT) (range 0_1; predicted pathogenic 0-0.05) [42] and MutationTaster (probability value 0-1 where 1 indicates a high security of the prediction; deleterious) [43] were used to further evaluate predicted pathogenicity of candidate coding variants. A final verdict was made based on a majority vote of allele frequencies, suggested ACMG classifications and previous reports in online databases.

Since segregation analysis was not performed for probands included in the study, but are an important criterion following full ACMG guidelines, patients were defined as genetically ‘possibly solved’, ‘very likely solved’ or ‘unsolved’. Furthermore, individuals carrying two different (rare) variants in one gene were assumed to carry these in a biallelic state, and therefore were presumed compound heterozygous, since segregation analysis was not performed to confirm the phase of variants. All modes of inheritance were considered for each proband, and phenotypes that had been previously published for genes were taken into account. When homozygous class 4 or class 5 variants were identified in genes previously implicated in autosomal recessive IRD, probands were deemed to be ‘very likely solved’. Genomic regions spanning homozygous variants were also assessed in the CNV analysis to ensure that a deletion in the second allele was not responsible for a homozygous call i.e. hemizygous. When (presumed) compound heterozygous variants with one or two class 5 variant(s), or two heterozygous class 4 variants were identified, we considered a proband to be ‘very likely solved’. When two class 3 variants (VUS), or a class 3 and a class 4 variant were identified, and presumed compound heterozygous, a patient was ‘possibly solved’. A homozygous class 3 variant in a gene associated with an autosomal recessive phenotype gave a proband a verdict of ‘possibly solved’. Monoallelic cases carrying one class 4/5 variant in a gene associated with a recessive phenotype remained ‘unsolved’. A proband with a heterozygous class 4 or class 5 variant present in a gene associated with an autosomal dominant phenotype was considered ‘very likely solved’. A heterozygous class 3 variant in a gene implicated in an autosomal dominant IRD was considered insufficient to provide a genetic diagnosis, therefore in this case the proband remained ‘unsolved’.

## Results

### MD test run characteristics

The purpose of the test run was to determine the average coverage of the MD-smMIPs pool (nuclear and mtDNA targets combined) when using the NovaSeq 6000 platform and determine the overall performance of the designed smMIPs. Forty-two probands (24 positive controls, 18 unexplained cases) were used for the MD test run, including four samples in duplicate to test the efficiency of the pre-treatment step on DNA obtained using different extraction methods. Therefore, in total, 46 DNA libraries were prepared and underwent smMIPs-sequencing.

After the removal of duplicates, the total number of reads obtained across all 46 samples in the NovaSeq 6000 sequencing run were 377,179,002 reads, with an average of 8,199,544 reads per sample. An overall average coverage of 472× was achieved for all smMIPs in the MD-smMIPs pool. On average, eight smMIPs target each nucleotide position, therefore the effective average coverage was ∼3,776×. The performance of each individual smMIP was assessed based on the average count of each smMIP (17,394 smMIPs in total) across all samples in the sequencing run, highlighting the reproducibility across samples and the number of smMIPs that were high performing or low performing. Even read coverage across smMIPs was observed without the need for rebalancing (**Figure 2**). Across all samples included in the test run, the percentage of targeted bases sequenced to 40-fold or greater coverage was a median of 95.7% **(Supplementary Figure S2)**. To further assess smMIP performance, the average coverage of each smMIP targeting nuclear DNA across all samples was calculated, resulting in an overall average coverage of 463×.

![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/24/2022.03.17.22272534/F2.medium.gif)

[Figure 2:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/F2)

Figure 2: 
Average read coverage per smMIP in the MD-smMIPs panel.

The MD test run was also performed to determine the ratio of nuclear smMIPs to mtDNA smMIPs to use in subsequent library preparations. Furthermore, since various DNA isolation methods were used to isolate the DNA samples used in the study, the MD test run also assessed if different DNA isolation methods impact the mtDNA copy number and coverage. Using an initial ratio at 100:1 of nuclear DNA smMIPs relative to mtDNA smMIPs, on average there were 3.6 times more mtDNA reads compared to the nuclear DNA reads **(Supplementary Figure S3)**. Furthermore, when focusing on smMIPs targeting the mtDNA, an overall average coverage of 836× was achieved for mtDNA regions, which was 1.8 times the coverage achieved for nuclear regions (463×). This led to the decision to use a ratio of 300:1 of nuclear to mtDNA smMIPs in subsequent larger sequencing runs in order to optimise the smMIPs ratio in the final MD-smMIPs pool and provide a more even representation of the mtDNA and nuclear DNA targets. DNA from probands included in the MD test run were isolated using 16 different DNA extraction methods by the collaborators (**Supplementary Table S4**). Comparing the ratios of mtDNA to nuclear DNA across these isolation methods showed that these were similar and therefore the adjusted MD-smMIPs pool could be used across DNA samples isolated using varying isolation methods.

All samples were considered for analysis, however unsolved samples with an average smMIPs coverage of <25× were considered to be unsolved due to low sequencing coverage, due to extremely low DNA input or quality, and therefore were excluded from the total numbers. This included four probands that remained genetically unsolved (three were duplicate samples) and one proband that was previously solved.

### MD run 01 characteristics

The first large NovaSeq 6000 run (MD run 01) aimed to sequence 380 DNA samples. Due to a small shortage of reagents and poor DNA quality for seven positive control samples and one unsolved proband, respectively, DNA libraries were not generated for eight samples. Thus, 13 positive controls (including one technical control used for every capture plate i.e. four times) and DNA libraries for 359 samples were sequenced.

Following the removal of duplicates, 628,049,102 reads were obtained across all samples, with an average of 1,688,304 reads per sample. Using SAMtools pileups (based on sequencing alignment files, i.e. BAM files), an average nucleotide coverage of ∼431× was determined for BAM files of all 372 probands (**Supplementary Table S5**) (average coverages per nucleotide for nuclear targets and mitochondrial targets are 427× and 434×, respectively). This consists of an overall average smMIPs coverage of 97× and 88× targeting nuclear DNA and mtDNA, respectively. Taken together, an overall average coverage of 97× was achieved for all smMIPs in the sequencing run. Pileups data for the nuclear DNA regions are represented as coverage plots, demonstrating the average nucleotide coverages per gene (**Supplementary Figure S4**), and were also used to determine performance of the smMIPs at the nucleotide level. For mtDNA targets, the average coverages per nucleotide are represented as a coverage plot across the mitochondrial genome (chrM:1-16,569) in **Supplementary Figure S5**. Regions covered by ≤10 reads were considered poorly covered, 11-49 reads considered moderately covered, or ≥50 reads considered well-covered. From the 428,562 nt of all nuclear targets (105 genes/loci), 3,581 nt were not or poorly covered (0.8%), 11,516 nt were moderately covered (2.7%) and 413,465 nt were well-covered (96.5%). Of the 89 AMD-associated risk variants that were targeted, 85 were moderately to well covered, as depicted in **Supplementary Table S6**. Of these, 52 are independently associated common and rare AMD risk variants (reported by Fritsche et al. [23]), which will ultimately be used to generate polygenic risk score calculations. Notably, 50 of these 52 AMD-risk variants are well covered, where only two have low coverage: one in C2/CFB locus (rs181705462) and one in *C3* (rs12019136). The five genes/loci with the lowest average coverages were *GNAT2*, the last exon of the *RPGR-ORF15* transcript, *SLC16A8*, the opsin locus control region (LCR), and *RAX2*, with average coverages over the entire (coding) gene/locus of 26×, 141× (repetitive region: 115×), 141×, 148× and 163×, respectively (**Supplementary Figure S4; Supplementary Table S5**). We suspected *RPGR-ORF15* to prove refractory to sequence due to its repetitive and purine-rich nature **(Supplementary Figure S6)**, where the repetitive region chrX:38,144,788-38,146,098 shows a purine content of 92%. For this reason, several variants could not be called confidently based on visualization of sequencing data in BAM files. *SLC16A8* and *RAX2* have a high GC content (70% and 71%, respectively), thus this could explain why these genes achieved an overall lower sequencing coverage, where smMIPs were likely challenging to hybridize. Secondary structures in smMIPs or target DNA, or PCR bias may also be responsible. There are no highly repetitive regions of *GNAT2* and the opsin-LCR, and neither are considered GC-rich (47% and 54% respectively) to explain why these regions may be suboptimal in comparison to other targets. Despite this, we still considered these regions to have sufficient coverage to perform variant calling.

For the purpose of this study, smMIPs sequencing analysis was performed for the positive controls included across the whole run of 384 samples and 92 unsolved probands included in pool A. Three probands achieved an average smMIPs coverage of <25× per sample, and therefore the possibility that these probands remained genetically unexplained due to low sequencing coverage could not be ruled out.

### smMIPs sequencing analysis

Variant prioritization was performed for all nuclear targets. All 46 (previously known) variants in positive control samples across both sequencing runs, including SNVs plus small and large CNVs, were successfully called, hence reaching 100% concordance. CNV analysis for each NovaSeq 6000 run was performed for all samples included in the run and was normalized using positive control samples that were included in each dataset, in order to exclude samples where false positives were overestimated. Prior to variant prioritization, 110 probands were genetically unsolved across both the MD test run and 92 samples taken forward for analysis from MD run 01 pool A. Prioritization of variants within the selected parameters, and retaining only variants with an ACMG classification of class 3-5, resulted in 134 variants in 40 genes (**Supplementary Table S7**), comprising two CNVs and 132 SNVs or indels. Of the SNVs or indels, 89 were classified as class 3 variants (VUS), 23 were class 4 variants (likely pathogenic) and 20 were class 5 variants (pathogenic) (**Figure 3**).

![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/24/2022.03.17.22272534/F3.medium.gif)

[Figure 3:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/F3)

Figure 3: 
Variant types of the 134 identified variants classified as class 3-5.

Using our stringent filtering and considering the gene harboring each variant, and the context of variants found in a proband, 42 probands could be confidently genetically ‘solved’ (**Tables 1a and 1b; Supplementary Table S8**), i.e. very likely or possibly solved, by variants in 24 genes (**Figure 4**). A diagnostic yield of 38% was achieved, whereby 34 samples were considered very likely solved and 8 samples considered possibly solved. A class 4 variant in *CRX* (c.122G>C) and class 3 and class 4 variants in *CNGB3* (c.886A>T;887_896del(;)1844A>G) were identified as potential causal variants in proband 070583. Since segregation analysis was not performed, and as the initial clinical diagnosis was STGD, *CRX* is more likely the causal gene. Another proband (patient 070682) harbors presumed compound heterozygous loss-of-function variants in *EYS* (a splice region variant, c.-448+5G>A, located in non-coding exon 1 of *EYS*, and c.3024C>A). Since truncating variants are often implicated in *EYS*-associated RP [44], the *EYS* variants identified in this patient are compelling. Furthermore, the c.-448+5G>A variant has been previously reported in the literature as a likely cause of disease when in *trans* with a second truncating variant in *EYS* [45]. The same proband could also possibly be solved by a heterozygous variant in *BEST1* (c.1067G>T), which is classified as a likely pathogenic variant by our assessment criteria, however, c.1067G>T is reported in LOVD and ClinVar as a VUS. The *BEST1* variant could be involved in recessive bestrophinopathy, therefore can be fortuitous. Moreover, the RP phenotype for this patient does not match with *BEST1*-associated disease, proposing the *EYS* variants as the more likely cause of disease for this proband. Segregation analysis will further aid in verifying this hypothesis. Finally, we identified two possibilities to genetically explain proband 070227. The c.3181G>C in *CACNA1F* may segregate as X-linked CD or CRD, or c.668G>A variant in *ROM1* as autosomal dominant RP. Both variants offer to genetically ‘possibly solve’ this proband based on our assessment, where we only took into account the phenotype recorded upon patient submission to the study. Although no further clinical information was considered following variant identification, we have doubts that either of these variants genetically explain this case, based on the high allele frequency of *CACNA1F* c.3181G>C in sub-populations for a X-linked causal variant and that the patient was submitted with an STGD phenotype, and not RP.

View this table:
[Table 1:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/T1)

Table 1: 
Novel likely causative variants identified in the MD-smMIPs panel sequencing study.

![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/24/2022.03.17.22272534/F4.medium.gif)

[Figure 4:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/F4)

Figure 4: 
Genes for which candidate variants are present, which are considered to solve 42 probands.

Of the variants that were considered to genetically solve probands, 17 are unique, novel variants, i.e. not previously reported in the literature, depicted in **Table 1**. Within the solved cohort, 40% of probands were proposed to be affected with an autosomal dominant or X-linked IRD, whereas in 60% of the probands an autosomal recessive inheritance was presumed. *ABCA4, CNGA3, PROM1* and *RPGR-ORF15*, were amongst the most frequently implicated genes within the solved cohort, with five (*ABCA4*) or three (*CNGA3, PROM1, RPGR-ORF15*) cases per gene. Digenic triallelic variants in *CNGA3* and *CNGB3* were observed in proband 070748 (homozygous for c.1208G>A in *CNGB3*, and heterozygous for *CNGA3* variant c.985G>T. All proposed modes of inheritance and phenotypes are listed for each proband in **Table 2a and 2b** and all variant data was uploaded to LOVD.

View this table:
[Table 2a:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/T2)

Table 2a: 
Autosomal dominant or X-linked causal alleles considered to very likely or possibly solve probands.

View this table:
[Table 2b:](http://medrxiv.org/content/early/2022/03/24/2022.03.17.22272534/T3)

Table 2b: 
Autosomal recessive causal alleles considered to very likely or possibly solve probands.

## Discussion

### Coverage comparisons

The performance of our MD-smMIPs can be compared to other MIP/smMIP studies. To further develop MIP sequencing methods, a previous study tested MIP pools, including the capture and sequencing of 44 candidate genes in 2,446 probands diagnosed with autism spectrum disorders [13]. Using an Illumina HiSeq 2000 sequencing platform, ∼92% of coding target bases were sequenced to 25× coverage or greater. In comparison, the average percentage of targets with a coverage greater than 20× and 30× in the MD-smMIPs panel test run using a NovaSeq 6000 platform, which attains a higher sequencing capacity than a HiSeq 2000 platform, was 97.0% and 96.2%, respectively. A separate study described the use of smMIPs to validate genetic testing for *BRCA1* and *BRCA2* genes using 166 samples in a clinical setting [15]. Using a double-tiling smMIPs design, 478 smMIPs, with a target region of 112 nt per smMIP, were designed. Final pooled libraries were sequenced using an Illumina NextSeq 500, achieving a mean coverage of 359× and 289× per smMIP for *BRCA1* and *BRCA2*, respectively. Finally, the *ABCA4*-smMIPs sequencing study analyzed 1,054 samples for variants in *ABCA4*, which included several unsolved iMD probands in the current study, and achieved an overall average coverage of 377× [17]. Since a double-tiling smMIPs design was used with a target region of 110 nt per smMIP, and two smMIPs targeting most nucleotides, an effective average coverage of ∼700× was achieved using the NextSeq 500 platform. The percentage of *ABCA4* targets that were considered well covered was comparable between the Molecular Loop Biosciences Inc. smMIPs design and the previous *ABCA4*-smMIPs sequencing design (97.6% versus 97.4%, respectively). However, an improvement in the regions that were previously not or poorly covered was observed. Overall, the smMIPs coverage and performance was comparable or increased in the current study compared to aforementioned MIP/smMIPs studies, where instead of using single- or double-tiling smMIPs, on average, eight smMIPs cover each nucleotide (i.e. ‘octa-tiling’) and are distributed across both DNA strands for the MD-smMIPs panel. An increase in the number of independent smMIPs targeting a nucleotide allows variants to be detected by multiple smMIPs, which may also offer advantages for CNV detection. Coupled with an increase in the number of gene targets, this aids variant validation and increases the diagnostic yield.

### Novel findings

The clinical heterogeneity of MDs has led to the existence of many STGD phenocopies, which subsequently can make a clinical diagnosis challenging and result in misdiagnoses. This is further exemplified in the present study by the identification of likely causative variants in 40 MD-associated genes, including proposed inheritance patterns of autosomal dominant, autosomal recessive, and X-linked inheritance. Furthermore, a possibility of digenic triallelism was observed for *CNGA3* and *CNGB3* variants in proband 070748. Digenic triallelism has previously been described in the literature, involving two variants in *CNGB3* (among which the c.1208G>A) and one in *CNGA3* [46]. The phenotype for this patient must be confirmed, e.g. based on colour vision tests and photopic electroretinogram, to confirm the involvement of digenic triallelism. Seventeen novel variants were identified, which are considered to be causative variants, including a novel heterozygous 1-bp deletion in *ROM1*, c.320del; p.(Gly107Alafs*15), which is predicted to introduce a premature stop codon. This was the only putative pathogenic variant identified for this individual, suggesting a potential autosomal dominant RP (adRP) phenotype, however it is important to note that non-coding variants and structural variation that is not captured in the MD-smMIPs panel may have been overlooked. Variants in *ROM1* and the implication in IRDs are not completely understood. Heterozygous *ROM1* variants have been reported with heterozygous *PRPH2* variants to cause digenic RP [47] in addition to few putative associations of heterozygous *ROM1* variants with adRP with incomplete penetrance [48,49]. An additional *ROM1* stop-gain variant was identified in another individual in our cohort, c.339dup; p.(Leu114Alafs*18), which was previously reported in literature as ‘‘most likely not pathogenic’’ [50]. Since our evaluation classifies c.339dup as a likely pathogenic variant, further analysis is required to evaluate this variant. The analysis of additional samples through the MD-smMIPs panel may reveal further *ROM1* stop-gain mutations to decide whether stop mutations in *ROM1* should be considered as a cause of adRP.

Although several iMD probands previously underwent *ABCA4*-smMIPs sequencing, four *ABCA4* variants identified in the present study were formerly overlooked. A CNV encompassing a deletion of exon 41 was identified in one proband (patient 070560), which was not previously reported. Conversely, visualization of the CNV analysis from the previous *ABCA4*-smMIP sequencing for the same proband indicated that the deletion was present, yet since the density of smMIPs was reduced and only double-tiling was used, the deletion was unconvincing. The homozygous c.834del variant was identified in one proband (patient 070901), which was reported as heterozygous in prior *ABCA4*-smMIPs sequencing analysis [17]. Re-visualization of both datasets show that the c.834del is present and is homozygous in both datasets, however the MD-smMIPs approach showed coverages of 1,300× surrounding the region in contrast to the 240 reads obtained from the *ABCA4*-smMIPs approach. Since this is considered a severe variant, this will lead to early-onset STGD1 disease in this individual. Similarly, c.2813T>C was identified in *ABCA4* in patient 070902. Despite the variant being present in previous *ABCA4*-smMIPs sequencing data of the same proband, at the time of study, c.2813T>C was assigned a class 3 ACMG classification (VUS) [29]. More recently, when categorized by severity based on statistical comparisons of allele frequencies across patient and general populations, c.2813T>C is classed as a mild or moderate variant [51]. Further functional studies are required to determine whether this variant alone can lead to STGD1 in a homozygous state. Finally, a monoallelic *ABCA4* case with the c.2992dup variant (patient 070674) remained unsolved following smMIPs sequencing previously [14]. However, in the current study, c.6119G>A was newly identified as the putative second allele. This variant was again present in both the *ABCA4*-smMIPs sequencing and the MD-smMIPs sequencing data of the proband. Segregation analysis is required to verify all variant findings and confirm the proposed modes of inheritance based on independent results/findings. Moreover, these findings may aid in reverse refinement of phenotypes based on the genotypic data.

### Mitochondrial DNA analysis

Mitochondrial DNA could be reliably sequenced using the MD-smMIPs panel, however since mtDNA variant analysis was not performed in the stringent filtering described in this study, further analysis of the mtDNA targets will be performed in the next phases of the study. In addition to identifying mitochondrial variants associated with MD, mtDNA analysis will determine heteroplasmy levels. Heteroplasmy reflects the percentage of mutant mtDNA among the multiple copies of mtDNA in a specific tissue. The retina is a highly metabolically active tissue, whereby oxidative damage can accumulate and increase with age, consequently resulting in an accumulation of heteroplasmy in the mtDNA [52]. Despite an expected lower heteroplasmy in blood than in retina, NGS technologies can be used for accurate detection of heteroplasmy in blood, including low-level heteroplasmy if sufficient read coverage is obtained. However, a negative result in blood does not exclude the presence of a mutant mtDNA in the tissue expressing the disease, in which the heteroplasmy level is higher. Base calling errors and sequencing artefacts associated with NGS data could result in false-positive nucleotide variant calls as they, incorrectly, might be considered heteroplasmic [53,54]. A variant must also be present on both DNA strands, i.e. double-strand validation, thereby eliminating strand bias and sequencing errors, in order to consider the variant call trustworthy as opposed to a sequencing artefact, which holds true for variant calling of both heteroplasmic and homoplasmic variants. In MD run 01, comprising 359 DNA samples, the overall average smMIPs coverage of mitochondrial targets was 88×, with an average coverage per nucleotide of 434× based on pileups data (**Supplementary Table S5**). Incorporating mtDNA sequencing in the current study enabled sequencing of multiple nuclear and mitochondrial DNA targets simultaneously in a large number of individuals. Although this arguably results in a lower overall mtDNA coverage than we would have achieved by sequencing the mtDNA alone in a small number of samples, which may require a higher minor allele frequency to call heteroplasmy, the current strategy is a more cost-efficient approach for analyzing genomic variation alongside mtDNA heteroplasmy in a large cohort. Furthermore, the very high coverage achieved following de-duplication is a core benefit of smMIPs sequencing, which allows for more sensitive mutation detection. Since no mitochondrial purification steps are performed prior to the library preparations, the homology between some mtDNA and nuclear DNA (i.e. nuclear mt DNA; NUMTs [55]) remains a complicating factor, which may result in difficulty in alignment and variant calling. Furthermore, mitochondrial haplogroup variation is associated with AMD [25,26], thus mitochondrial analysis will also be performed in ongoing MD-smMIPs analysis to obtain mitochondrial haplogroup genotypes and their associations with AMD. Notably, low sequencing coverage in mtDNA will not affect the determination of mitochondrial haplogroups, since the SNPs forming these haplogroups are expected to be homoplasmic.

### Advantages and disadvantages

The MD-smMIPs panel approach holds many advantages. Firstly, Molecular Loop Biosciences Inc.’s smMIPs design incorporates a 225-nt capture region; double that of previous MIP/smMIPs designs reported in the literature [13,15,16]. An additional advantage of the Molecular Loop smMIPs design is that, unlike for previous MIP/smMIPs studies [13,15,16], no rebalancing of the smMIPs is required, since the design is already sufficiently densely tiled to ensure problematic regions are covered. If a smMIP fails to capture a target due to allelic drop-outs, the chance that additional smMIPs are present and ensure successful capture are higher in the MD-smMIPs panel. Next, the Molecular Loop Biosciences protocol requires a small amount of DNA, which is advantageous when using older degraded DNA samples. Furthermore, the laboratory workflow (library-free target enrichment) is simpler than hybridization based approaches, such as WES. Last, but by no means least, the high-throughput nature of a smMIPs approach is a core benefit, which enables hundreds of samples to be sequenced simultaneously at an affordable cost per sample.

The use of the MD-smMIPs panel is ideal in a research context: informed consent is more simple to gather for gene-panel sequencing as opposed to a whole genome or exome approach and data storage costs and processing are reduced. Although our MD-smMIPs panel demonstrates uniform read coverage across the majority of targets, problematic regions, including *RPGR*-*ORF15*, still proved challenging during variant calling. For regions such as *RPGR-ORF15*, which already pose challenges for other short-read sequencing methods such as WES and WGS, long-read sequencing methods (for instance PacBio sequencing) should be considered when a proband remains unsolved. In addition, CNVs or structural variants with breakpoints in non-coding regions, along with non-coding variation in general, were not captured using our MD-smMIPs panel approach. Such variation could be further investigated in downstream analysis of genetically unexplained probands using optical genome mapping approaches and/or long-read sequencing technologies. Finally, the MD-smMIPs panel requires a specialized bioinformatics pipeline to be in place in order to perform variant analysis; a feature not all laboratories will have at their disposal.

All variants were classified using the ACMG classifications, as obtained using the Franklin platform. In some instances, variants we suspected would be class 5 (e.g. protein-truncating variants) obtained a class 4 ACMG classification. For consistency, we took the ACMG classification for our final verdicts, however further analysis on individual variants of this nature, and segregation analysis, would be beneficial in order to aid this variant classification system for variants that may currently stand between two ACMG classes (class 3-4/5) based on the criteria

### Cost considerations

High-throughput sequencing approaches, including WES and WGS, have proven successful in the identification of pathogenic variants to provide a molecular diagnosis for IRD patients, however the costs implicated are still relatively high when considering large cohorts, particularly in a research context. Our MD-smMIPs panel comprising 17,394 smMIPs targets 1,632 exons and 80 alternate exons of 105 genes, with capture and sequencing costs of $30 per sample (list prices excluding personnel costs and smMIPs design/synthesis costs), which is similar to that of bi-directional Sanger sequencing of one amplicon or one exon ($25 sequencing costs per amplicon/exon). This translates to $0.29 per sample per gene. A previous MIP study [13] reported the cost of capture and sequencing using their MIP approach of $9.80 per sample and $0.33 per sample per gene. However, the number of probes in both studies varied considerably – a total of 2,044 MIPs targeting 144 kb of sequence [13] versus 17,394 smMIPs in the current study targeting ∼453 kb. We utilized the NovaSeq 6000 platform with the SP reagent kit, enabling MD-associated targets to be sequenced in up to 384 samples simultaneously. The SP reagent kit has a maximum capacity of 1.6 billion paired-end reads per flow cell. Higher output kits are available for the NovaSeq 6000 instrument, which would allow more samples to be sequenced in one sequencing run. For example, the S1 kit sequencing capacity is two times that of the SP kit, effectively enabling the number of samples to be doubled whilst achieving similar coverage. However, scaling up to a kit with higher sequencing capacity would not reduce the cost per sample significantly (sequencing costs approximately $12 using the S1 kit vs. $10 using the SP kit, excluding capture, personnel costs and smMIPs design/synthesis costs) and would significantly increase the cost risk should a sequencing run fail. Moreover, the number of samples sequenced per series could theoretically be increased from 384 to 768 samples using the NovaSeq 6000 S1 reagent kit, whereby the coverage would decrease 2-fold. By simulating this coverage decrease in our dataset by halving the total read counts for each variant, we achieve 95.6% sensitivity and specificity, whereby 43 of 45 variants that are considered to solve probands would still be identified using our prioritization methods (i.e. ≥10 reads are present).

Although several of the patients in our study had undergone previous screening methods, with probands previously screened for at least *ABCA4* using a smMIPs approach, this offered a group of probands with genotype-phenotype correlations ideal for the sequencing of MD-associated genes as a first approach rather than other sequencing methods, such as WES. Implementing a smMIPs-based approach can significantly decrease sequencing costs per proband, and by implementing this for a highly heterogeneous subgroup of IRDs allows for a genotype-first approach. A lower cost per sample also makes this an attractive and feasible approach for low-income countries, where genomic sequencing facilities can be limited and low-cost analysis is vital, which in turn may hinder genetic diagnoses for patients.

## Conclusions

The low costs of our MD-smMIPs high-throughput sequencing approach allows us to screen thousands of iMD and AMD samples to better understand the genetics and potential overlap between inherited and multifactorial MD. This will further expand our knowledge regarding genetic and non-genetic factors that influence the severity of MDs. Knowledge of genetic variants in MD-associated genes will enable genetic reclassification of iMD and AMD probands, which will improve the diagnosis and, in some cases, treatment options for patients. Ultimately, this may offer patients improved prognoses and health outcomes, and may aid people to make lifestyle and dietary changes, thereby improving long-term vision prospects.

## Supporting information

Supplemental Figure S1 [[supplements/272534_file02.pdf]](pending:yes)

Supplemental Figure S2 [[supplements/272534_file03.pdf]](pending:yes)

Supplemental Figure S3 [[supplements/272534_file04.pdf]](pending:yes)

Supplemental Figure S4 [[supplements/272534_file05.pdf]](pending:yes)

Supplemental Figure S5 [[supplements/272534_file06.pdf]](pending:yes)

Supplemental Figure S6 [[supplements/272534_file07.pdf]](pending:yes)

Supplemental Table S1 [[supplements/272534_file08.xlsx]](pending:yes)

Supplemental Table S2 [[supplements/272534_file09.xlsx]](pending:yes)

Supplemental Table S3 [[supplements/272534_file10.xlsx]](pending:yes)

Supplemental Table S4 [[supplements/272534_file11.xlsx]](pending:yes)

Supplemental Table S5 [[supplements/272534_file12.xlsx]](pending:yes)

Supplemental Table S6 [[supplements/272534_file13.xlsx]](pending:yes)

Supplemental Table S7 [[supplements/272534_file14.xlsx]](pending:yes)

Supplemental Table S8 [[supplements/272534_file15.xlsx]](pending:yes)

## Data Availability

All data produced in the present study are available upon reasonable request to the authors. 

## Data Availability

All data produced in the present study are available upon reasonable request to the authors. 

## Funding

This work was supported by the HRCI HRB Joint Funding Scheme 2020-007, Stichting Oogfonds Nederland (UZ 2020-17), Pro Retina Deutschland, Stichting tot Verbetering van het Lot der Blinden, Stichting voor Ooglijders and the Stichting Blindenhulp.

## Author Contributions

Conceptualization, Claire-Marie Dhaenens, Anneke den Hollander, G. Jane Farrar, Alexander Hoischen, Susanne Roosing and Frans Cremers; Data curation, Christian Gilissen and Maartje van de Vorst; Formal analysis, Patrick Saunders; Funding acquisition, Claire-Marie Dhaenens, Anneke den Hollander, G. Jane Farrar, Susanne Roosing and Frans Cremers; Investigation, Daan Panneman and Erica Boonen; Methodology, Claire-Marie Dhaenens, Daan Panneman, Zelia Corradi, Mubeen Khan, Anneke den Hollander, Alexander Hoischen, Femke Bults, Susanne Roosing and Frans Cremers; Project administration, Susanne Roosing and Frans Cremers; Resources, Claire-Marie Dhaenens, Christian Gilissen, Maartje van de Vorst and Patrick Saunders; Software, Christian Gilissen and Maartje van de Vorst; Supervision, Claire-Marie Dhaenens, Anneke den Hollander, Alexander Hoischen, Susanne Roosing and Frans Cremers; Validation, Susanne Roosing and Frans Cremers; Visualization, Susanne Roosing and Frans Cremers; Writing – review & editing, Claire-Marie Dhaenens, Daan Panneman, Zelia Corradi, Anneke den Hollander, Alexander Hoischen, Femke Bults, Erica Boonen, Susanne Roosing and Frans Cremers.

## Conflicts of Interest

Author AIdH is currently an employee of AbbVie. Author PS is currently an employee of Molecular Loop BioSciences Inc.

## Acknowledgements

The authors would like to thank the Australian Inherited Retinal Disease Registry and DNA Bank, Carmen Ayuso, Eyal Banin, Marta del Pozo, Lubica Dudakova, Michael B. Gorin, Carel B. Hoyng, Petra Liskova, Ian M. MacDonald, Anna Matynia, Marcela Mena, Monika Oldak, Osvaldo Podhajcer, Alina Radziwon, Leah Rizel, Manar Salameh, Francesca Simonelli, Heidi Stöhr, Jacek P. Szaflik, Sandra Valeina and Bernard H.F. Weber for their effort in sample collection. We thank Ellen A.W. Blokland, Bart de Koning, Marlie Jacobs-Camps, Anita Roelofs, Michiel Oorsprong, Simon van Reijmersdal, Saskia van der Velde-Visser and Martine van Zweeden and the Radboud Genomics Technology Center for technical assistance. We also thank Greg Porreca and Eric Boyden at Molecular Loop Biosciences Inc. for their contribution and expertise. The authors would also like to acknowledge additional funding resources, including Retina Australia, Australian National Health and Medical Research Council (MRF1142962, FKC), Retina South Africa, the South African Medical Research Council, SOLVE-RET, support by the University of Campania ‘Luigi Vanvitelli’ under the research program “VALERE: VAnviteLli pEr la RicErca” (DisHetGeD). This work is also generated within the European Reference Network for Rare Eye Diseases (ERN-EYE).

## Footnotes

*   † MD Study Group authors:

*   1 Alaa AlTalbishi; St John of Jerusalem Eye Hospital Group, East Jerusalem, Palestine.

*   2 Sandro Banfi; Department of Precision Medicine, University of Campania ‘Luigi Vanvitelli’, Naples and Telethon Institute of Genetics and Medicine (TIGEM), Pozzuoli, Italy.

*   3 Tamar Ben-Yosef; Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel.

*   4 John N. de Roach; Australian Inherited Retinal Disease Registry and DNA Bank, Department of Medical Technology and Physics, Sir Charles Gairdner Hospital, Nedlands, Western Australia, Australia.

*   5 Tina M. Lamey; Australian Inherited Retinal Disease Registry and DNA Bank, Department of Medical Technology and Physics, Sir Charles Gairdner Hospital, Nedlands, Western Australia, Australia.

*   6 Terri L. McLaren; Australian Inherited Retinal Disease Registry and DNA Bank, Department of Medical Technology and Physics, Sir Charles Gairdner Hospital, Nedlands, Western Australia, Australia.

*   7 Jennifer A. Thompson; Australian Inherited Retinal Disease Registry and DNA Bank, Department of Medical Technology and Physics, Sir Charles Gairdner Hospital, Nedlands, Western Australia, Australia.

*   8 Fred K. Chen; Centre for Ophthalmology and Visual Science, the University of Western Australia, Nedlands, Western Australia, Australia; Ocular Tissue Engineering Laboratory, Lions Eye Institute, Nedlands, Western Australia, Australia.

*   9 Lonneke Haer-Wigman; Department of Precision Medicine, University of Campania ‘Luigi Vanvitelli’, Naples and Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, University of Campania ‘Luigi Vanvitelli’, Naples, Italy.

*   10 Marianthi Karali; Department of Precision Medicine, University of Campania ‘Luigi Vanvitelli’, Naples and Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, University of Campania ‘Luigi Vanvitelli’, Naples, Italy.

*   11 Marcel Nelen; Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.

*   12 Lisa Roberts; University of Cape Town/MRC Precision and Genomic Medicine Research Unit, Division of Human Genetics, Department of Pathology, Institute of Infectious Disease and Molecular Medicine (IDM), Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.

*   13 Raj Ramesar; University of Cape Town/MRC Precision and Genomic Medicine Research Unit, Division of Human Genetics, Department of Pathology, Institute of Infectious Disease and Molecular Medicine (IDM), Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.

*   14 Dror Sharon; Department of Ophthalmology, Hadassah Medical Center, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel.

*   15 Andrea L. Vincent; Department of Ophthalmology, New Zealand National Eye Centre, Faculty of Medical and Health Sciences, The University of Auckland, Grafton, Auckland, New Zealand; Eye Department, Greenlane Clinical Centre, Auckland District Health Board, Auckland, New Zealand.

*   Title of Table 2b corrected.

*   Received March 17, 2022.
*   Revision received March 23, 2022.
*   Accepted March 24, 2022.


*   © 2022, Posted by Cold Spring Harbor Laboratory

The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.

## References

1.  1.Colijn, J.M.; Buitendijk, G.H.S.; Prokofyeva, E.; Alves, D.; Cachulo, M.L.; Khawaja, A.P.; Cougnard-Gregoire, A.; Merle, B.M.J.; Korb, C.; Erke, M.G.; et al. Prevalence of Age-Related Macular Degeneration in Europe. Ophthalmology 2017, 124, 1753–1763, doi:10.1016/j.ophtha.2017.05.035.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ophtha.2017.05.035&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

2.  2.Ebenezer, N.D.; Michaelides, M.; Jenkins, S.A.; Audo, I.; Webster, A.R.; Cheetham, M.E.; Stockman, A.; Maher, E.R.; Ainsworth, J.R.; Yates, J.R.; et al. Identification of novel RPGR ORF15 mutations in X-linked progressive cone-rod dystrophy (XLCORD) families. Invest Ophthalmol Vis Sci 2005, 46, 1891–1898, doi:10.1167/iovs.04-1482.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiaW92cyI7czo1OiJyZXNpZCI7czo5OiI0Ni82LzE4OTEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMy8yNC8yMDIyLjAzLjE3LjIyMjcyNTM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

3.  3.Nguyen, X.T.; Talib, M.; van Schooneveld, M.J.; Brinks, J.; Ten Brink, J.; Florijn, R.J.; Wijnholds, J.; Verdijk, R.M.; Bergen, A.A.; Boon, C.J.F. RPGR-Associated Dystrophies: Clinical, Genetic, and Histopathological Features. Int J Mol Sci 2020, 21, doi:10.3390/ijms21030835.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijms21030835&link_type=DOI) 

4.  4.Talib, M.; van Schooneveld, M.J.; Thiadens, A.A.; Fiocco, M.; Wijnholds, J.; Florijn, R.J.; Schalij-Delfos, N.E.; van Genderen, M.M.; Putter, H.; Cremers, F.P.M.; et al. CLINICAL AND GENETIC CHARACTERISTICS OF MALE PATIENTS WITH RPGR-ASSOCIATED RETINAL DYSTROPHIES: A Long-Term Follow-up Study. Retina 2019, 39, 1186–1199, doi:10.1097/iae.0000000000002125.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/iae.0000000000002125&link_type=DOI) 

5.  5.Consugar, M.B.; Navarro-Gomez, D.; Place, E.M.; Bujakowska, K.M.; Sousa, M.E.; Fonseca-Kelly, Z.D.; Taub, D.G.; Janessian, M.; Wang, D.Y.; Au, E.D.; et al. Panel-based genetic diagnostic testing for inherited eye diseases is highly accurate and reproducible, and more sensitive for variant detection, than exome sequencing. Genet Med 2015, 17, 253–261, doi:10.1038/gim.2014.172.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/gim.2014.172&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25412400&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

6.  6.García Bohórquez, B.; Aller, E.; Rodríguez Muñoz, A.; Jaijo, T.; García García, G.; Millán, J.M. Updating the Genetic Landscape of Inherited Retinal Dystrophies. Frontiers in cell and developmental biology 2021, 9, 645600–645600, doi:10.3389/fcell.2021.645600.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fcell.2021.645600&link_type=DOI) 

7.  7.Haer-Wigman, L.; van Zelst-Stams, W.A.; Pfundt, R.; van den Born, L.I.; Klaver, C.C.; Verheij, J.B.; Hoyng, C.B.; Breuning, M.H.; Boon, C.J.; Kievit, A.J.; et al. Diagnostic exome sequencing in 266 Dutch patients with visual impairment. European journal of human genetics: EJHG 2017, 25, 591–599, doi:10.1038/ejhg.2017.9.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2017.9&link_type=DOI) 

8.  8.Borooah, S.; Stanton, C.M.; Marsh, J.; Carss, K.J.; Waseem, N.; Biswas, P.; Agorogiannis, G.; Raymond, L.; Arno, G.; Webster, A.R. Whole genome sequencing reveals novel mutations causing autosomal dominant inherited macular degeneration. Ophthalmic Genet 2018, 39, 763–770, doi:10.1080/13816810.2018.1546406.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/13816810.2018.1546406&link_type=DOI) 

9.  9.Fadaie, Z.; Whelan, L.; Ben-Yosef, T.; Dockery, A.; Corradi, Z.; Gilissen, C.; Haer-Wigman, L.; Corominas, J.; Astuti, G.D.N.; de Rooij, L.; et al. Whole genome sequencing and in vitro splice assays reveal genetic causes for inherited retinal diseases. NPJ Genom Med 2021, 6, 97, doi:10.1038/s41525-021-00261-1.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41525-021-00261-1&link_type=DOI) 

10. 10.Duncan, J.L.; Pierce, E.A.; Laster, A.M.; Daiger, S.P.; Birch, D.G.; Ash, J.D.; Iannaccone, A.; Flannery, J.G.; Sahel, J.A.; Zack, D.J.; et al. Inherited Retinal Degenerations: Current Landscape and Knowledge Gaps. Translational Vision Science & Technology 2018, 7, 6, doi:10.1167/tvst.7.4.6.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1167/tvst.7.4.6&link_type=DOI) 

11. 11.Weisschuh, N.; Feldhaus, B.; Khan, M.I.; Cremers, F.P.M.; Kohl, S.; Wissinger, B.; Zobor, D. Molecular and clinical analysis of 27 German patients with Leber congenital amaurosis. PLoS One 2018, 13, e0205380, doi:10.1371/journal.pone.0205380.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0205380&link_type=DOI) 

12. 12.Tracewska, A.M.; Kocyła-Karczmarewicz, B.; Rafalska, A.; Murawska, J.; Jakubaszko-Jabłónska, J.; Rydzanicz, M.; Stawiński, P.; Ciara, E.; Lipska-Ziętkiewicz, B.S.; Khan, M.I.; et al. Non-syndromic inherited retinal diseases in Poland: Genes, mutations, and phenotypes. Mol Vis 2021, 27, 457–465.
    
    
13. 13.O’Roak, B.J.; Vives, L.; Fu, W.; Egertson, J.D.; Stanaway, I.B.; Phelps, I.G.; Carvill, G.; Kumar, A.; Lee, C.; Ankenman, K.; et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 2012, 338, 1619–1622, doi:10.1126/science.1227764.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzMzgvNjExNC8xNjE5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjQvMjAyMi4wMy4xNy4yMjI3MjUzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

14. 14.Cantsilieris, S.; Stessman, H.A.; Shendure, J.; Eichler, E.E. Targeted Capture and High-Throughput Sequencing Using Molecular Inversion Probes (MIPs). Methods in molecular biology (Clifton, N.J.) 2017, 1492, 95–106, doi:10.1007/978-1-4939-6442-0_6.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-1-4939-6442-0_6&link_type=DOI) 

15. 15.Neveling, K.; Mensenkamp, A.R.; Derks, R.; Kwint, M.; Ouchene, H.; Steehouwer, M.; van Lier, B.; Bosgoed, E.; Rikken, A.; Tychon, M.; et al. BRCA Testing by Single-Molecule Molecular Inversion Probes. Clin Chem 2017, 63, 503–512, doi:10.1373/clinchem.2016.263897.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2xpbmNoZW0iO3M6NToicmVzaWQiO3M6ODoiNjMvMi81MDMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMy8yNC8yMDIyLjAzLjE3LjIyMjcyNTM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

16. 16.Khan, M.; Cornelis, S.S.; Khan, M.I.; Elmelik, D.; Manders, E.; Bakker, S.; Derks, R.; Neveling, K.; van de Vorst, M.; Gilissen, C.; et al. Cost-effective molecular inversion probe-based ABCA4 sequencing reveals deep-intronic variants in Stargardt disease. Hum Mutat 2019, 40, 1749–1759, doi:10.1002/humu.23787.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.23787&link_type=DOI) 

17. 17.Khan, M.; Cornelis, S.S.; Pozo-Valero, M.D.; Whelan, L.; Runhart, E.H.; Mishra, K.; Bults, F.; AlSwaiti, Y.; AlTalbishi, A.; De Baere, E.; et al. Resolving the dark matter of ABCA4 for 1054 Stargardt disease probands through integrated genomics and transcriptomics. Genet Med 2020, 22, 1235–1246, doi:10.1038/s41436-020-0787-4.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41436-020-0787-4&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

18. 18.Meienberg, J.; Zerjavic, K.; Keller, I.; Okoniewski, M.; Patrignani, A.; Ludin, K.; Xu, Z.; Steinmann, B.; Carrel, T.; Röthlisberger, B.; et al. New insights into the performance of human whole-exome capture platforms. Nucleic Acids Res 2015, 43, e76, doi:10.1093/nar/gkv216.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkv216&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25820422&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

19. 19.Barbitoff, Y.A.; Polev, D.E.; Glotov, A.S.; Serebryakova, E.A.; Shcherbakova, I.V.; Kiselev, A.M.; Kostareva, A.A.; Glotov, O.S.; Predeus, A.V. Systematic dissection of biases in whole-exome and whole-genome sequencing reveals major determinants of coding sequence coverage. Sci Rep 2020, 10, 2057, doi:10.1038/s41598-020-59026-y.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-020-59026-y&link_type=DOI) 

20. 20.Karolchik, D.; Hinrichs, A.S.; Furey, T.S.; Roskin, K.M.; Sugnet, C.W.; Haussler, D.; Kent, W.J. The UCSC Table Browser data retrieval tool. Nucleic Acids Res 2004, 32, D493–496, doi:10.1093/nar/gkh103.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkh103&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14681465&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188079000117&link_type=ISI) 

21. 21.Haeussler, M.; Zweig, A.S.; Tyner, C.; Speir, M.L.; Rosenbloom, K.R.; Raney, B.J.; Lee, C.M.; Lee, B.T.; Hinrichs, A.S.; Gonzalez, J.N.; et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res 2019, 47, D853–d858, doi:10.1093/nar/gky1095.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky1095&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30407534&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

22. 22.Yates, A.D.; Achuthan, P.; Akanni, W.; Allen, J.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Azov, A.G.; Bennett, R.; et al. Ensembl 2020. Nucleic Acids Res 2020, 48, D682–D688, doi:10.1093/nar/gkz966.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz966&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31691826&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

23. 23.Fritsche, L.G.; Igl, W.; Bailey, J.N.; Grassmann, F.; Sengupta, S.; Bragg-Gresham, J.L.; Burdon, K.P.; Hebbring, S.J.; Wen, C.; Gorski, M.; et al. A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants. Nat Genet 2016, 48, 134–143, doi:10.1038/ng.3448.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3448&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26691988&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

24. 24.Strunz, T.; Lauwen, S.; Kiel, C.; International, A.M.D.G.C.; Hollander, A.D.; Weber, B.H.F. A transcriptome-wide association study based on 27 tissues identifies 106 genes potentially relevant for disease pathology in age-related macular degeneration. Sci Rep 2020, 10, 1584, doi:10.1038/s41598-020-58510-9.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-020-58510-9&link_type=DOI) 

25. 25.de Laat, P.; Smeitink, J.A.M.; Janssen, M.C.H.; Keunen, J.E.E.; Boon, C.J.F. Mitochondrial retinal dystrophy associated with the m.3243A>G mutation. Ophthalmology 2013, 120, 2684–2696, doi:10.1016/j.ophtha.2013.05.013.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ophtha.2013.05.013&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23806424&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000327249600056&link_type=ISI) 

26. 26.Udar, N.; Atilano, S.R.; Memarzadeh, M.; Boyer, D.S.; Chwa, M.; Lu, S.; Maguen, B.; Langberg, J.; Coskun, P.; Wallace, D.C.; et al. Mitochondrial DNA haplogroups associated with age-related macular degeneration. Invest Ophthalmol Vis Sci 2009, 50, 2966–2974, doi:10.1167/iovs.08-2646.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiaW92cyI7czo1OiJyZXNpZCI7czo5OiI1MC82LzI5NjYiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMy8yNC8yMDIyLjAzLjE3LjIyMjcyNTM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

27. 27.Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; Genome Project Data Processing, S. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079, doi:10.1093/bioinformatics/btp352 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btp352&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19505943&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268808600014&link_type=ISI) 

28. 28.Maugeri, A.; van Driel, M.A.; van de Pol, D.J.; Klevering, B.J.; van Haren, F.J.; Tijmes, N.; Bergen, A.A.; Rohrschneider, K.; Blankenagel, A.; Pinckers, A.J.; et al. The 2588G-->C mutation in the ABCR gene is a mild frequent founder mutation in the Western European population and allows the classification of ABCR mutations in patients with Stargardt disease. Am J Hum Genet 1999, 64, 1024–1035, doi:10.1086/302323.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/302323&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10090887&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000079632900013&link_type=ISI) 

29. 29.Cornelis, S.S.; Bax, N.M.; Zernant, J.; Allikmets, R.; Fritsche, L.G.; den Dunnen, J.T.; Ajmal, M.; Hoyng, C.B.; Cremers, F.P. In Silico Functional Meta-Analysis of 5,962 ABCA4 Variants in 3,928 Retinal Dystrophy Cases. Hum Mutat 2017, 38, 400–408, doi:10.1002/humu.23165.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.23165&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28044389&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

30. 30.Zernant, J.; Lee, W.; Collison, F.T.; Fishman, G.A.; Sergeev, Y.V.; Schuerch, K.; Sparrow, J.R.; Tsang, S.H.; Allikmets, R. Frequent hypomorphic alleles account for a significant fraction of ABCA4 disease and distinguish it from age-related macular degeneration. J Med Genet 2017, 54, 404–412, doi:10.1136/jmedgenet-2017-104540.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjg6IjU0LzYvNDA0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjQvMjAyMi4wMy4xNy4yMjI3MjUzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

31. 31.Pollard, K.S.; Hubisz, M.J.; Rosenbloom, K.R.; Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res 2010, 20, 110–121, doi:10.1101/gr.097857.109.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjg6IjIwLzEvMTEwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjQvMjAyMi4wMy4xNy4yMjI3MjUzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

32. 32.Kircher, M.; Witten, D.M.; Jain, P.; O’Roak, B.J.; Cooper, G.M.; Shendure, J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet 2014, 46, 310–315, doi:10.1038/ng.2892.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2892&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24487276&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

33. 33.Grantham, R. Amino acid difference formula to help explain protein evolution. Science 1974, 185, 862–864, doi:10.1126/science.185.4154.862.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIxODUvNDE1NC84NjIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMy8yNC8yMDIyLjAzLjE3LjIyMjcyNTM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

34. 34.Zhang, M.Q. Statistical features of human exons and their flanking regions. Hum Mol Genet 1998, 7, 919–932, doi:10.1093/hmg/7.5.919.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/7.5.919&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9536098&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000073657000020&link_type=ISI) 

35. 35.Yeo, G.; Burge, C.B. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol 2004, 11, 377–394, doi:10.1089/1066527041410418.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/1066527041410418&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15285897&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222588300010&link_type=ISI) 

36. 36.Reese, M.G.; Eeckman, F.H.; Kulp, D.; Haussler, D. Improved splice site detection in Genie. J Comput Biol 1997, 4, 311–323, doi:10.1089/cmb.1997.4.311.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/cmb.1997.4.311&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9278062&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997XR80900008&link_type=ISI) 

37. 37.Pertea, M.; Lin, X.; Salzberg, S.L. GeneSplicer: a new computational method for splice site prediction. Nucleic Acids Res 2001, 29, 1185–1190, doi:10.1093/nar/29.5.1185.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/29.5.1185&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11222768&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000167240500020&link_type=ISI) 

38. 38.Fadaie, Z.; Khan, M.; Del Pozo-Valero, M.; Cornelis, S.S.; Ayuso, C.; Cremers, F.P.M.; Roosing, S.; The Abca4 Study, G. Identification of splice defects due to noncanonical splice site or deep-intronic variants in ABCA4. Human Mutation 2019, doi:10.1002/humu.23890.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.23890&link_type=DOI) 

39. 39.Cartegni, L.; Wang, J.; Zhu, Z.; Zhang, M.Q.; Krainer, A.R. ESEfinder: A web resource to identify exonic splicing enhancers. Nucleic Acids Res 2003, 31, 3568–3571, doi:10.1093/nar/gkg616.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkg616&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12824367&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183832900059&link_type=ISI) 

40. 40.Jaganathan, K.; Kyriazopoulou Panagiotopoulou, S.; McRae, J.F.; Darbandi, S.F.; Knowles, D.; Li, Y.I.; Kosmicki, J.A.; Arbelaez, J.; Cui, W.; Schwartz, G.B.; et al. Predicting Splicing from Primary Sequence with Deep Learning. Cell 2019, 176, 535–548 e524, doi:10.1016/j.cell.2018.12.015.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2018.12.015&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30661751&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

41. 41.Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 2015, 17, 405–424, doi:10.1038/gim.2015.30.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/gim.2015.30&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25741868&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

42. 42.Sim, N.L.; Kumar, P.; Hu, J.; Henikoff, S.; Schneider, G.; Ng, P.C. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res 2012, 40, W452–457, doi:10.1093/nar/gks539 10.1093/nar/gks539. Epub 2012 Jun 11.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gks539&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22689647&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000306670900074&link_type=ISI) 

43. 43.Schwarz, J.M.; Cooper, D.N.; Schuelke, M.; Seelow, D. MutationTaster2: mutation prediction for the deep-sequencing age. Nat Methods 2014, 11, 361–362, doi:10.1038/nmeth.2890 10.1038/nmeth.2890.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth.2890&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24681721&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000333749900007&link_type=ISI) 

44. 44.Iwanami, M.; Oshikawa, M.; Nishida, T.; Nakadomari, S.; Kato, S. High prevalence of mutations in the EYS gene in Japanese patients with autosomal recessive retinitis pigmentosa. Invest Ophthalmol Vis Sci 2012, 53, 1033–1040, doi:10.1167/iovs.11-9048.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiaW92cyI7czo1OiJyZXNpZCI7czo5OiI1My8yLzEwMzMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMy8yNC8yMDIyLjAzLjE3LjIyMjcyNTM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

45. 45.Eisenberger, T.; Neuhaus, C.; Khan, A.O.; Decker, C.; Preising, M.N.; Friedburg, C.; Bieg, A.; Gliem, M.; Charbel Issa, P.; Holz, F.G.; et al. Increasing the yield in targeted next-generation sequencing by implicating CNV analysis, non-coding exons and the overall variant load: the example of retinal dystrophies. PLoS One 2013, 8, e78496, doi:10.1371/journal.pone.0078496.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0078496&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24265693&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

46. 46.Burkard, M.; Kohl, S.; Kratzig, T.; Tanimoto, N.; Brennenstuhl, C.; Bausch, A.E.; Junger, K.; Reuter, P.; Sothilingam, V.; Beck, S.C.; et al. Accessory heterozygous mutations in cone photoreceptor CNGA3 exacerbate CNG channel-associated retinopathy. J Clin Invest 2018, 128, 5663–5675, doi:10.1172/JCI96098.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1172/JCI96098&link_type=DOI) 

47. 47.Dryja, T.P.; Hahn, L.B.; Kajiwara, K.; Berson, E.L. Dominant and digenic mutations in the peripherin/RDS and ROM1 genes in retinitis pigmentosa. Invest Ophthalmol Vis Sci 1997, 38, 1972–1982.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiaW92cyI7czo1OiJyZXNpZCI7czoxMDoiMzgvMTAvMTk3MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI0LzIwMjIuMDMuMTcuMjIyNzI1MzQuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

48. 48.Martinez-Mir, A.; Vilela, C.; Bayes, M.; Valverde, D.; Dain, L.; Beneyto, M.; Marco, M.; Baiget, M.; Grinberg, D.; Balcells, S.; et al. Putative association of a mutant ROM1 allele with retinitis pigmentosa. Hum Genet 1997, 99, 827–830, doi:10.1007/s004390050456.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s004390050456&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9187681&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997XB30500022&link_type=ISI) 

49. 49.Reig, C.; Martinez-Gimeno, M.; Carballo, M. A heterozygous novel C253Y mutation in the highly conserved cysteine residues of ROM1 gene is the cause of retinitis pigmentosa in a Spanish family? Hum Mutat 2000, 16, 278, doi:10.1002/1098-1004(200009)16:3<278::AID-HUMU29>3.0.CO;2-G.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/1098-1004(200009)16:3<278::AID-HUMU29>3.0.CO;2-G&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10980552&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

50. 50.Boulanger-Scemama, E.; El Shamieh, S.; Démontant, V.; Condroyer, C.; Antonio, A.; Michiels, C.; Boyard, F.; Saraiva, J.P.; Letexier, M.; Souied, E.; et al. Next-generation sequencing applied to a large French cone and cone-rod dystrophy cohort: mutation spectrum and new genotype-phenotype correlation. Orphanet J Rare Dis 2015, 10, 85, doi:10.1186/s13023-015-0300-3.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13023-015-0300-3&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26103963&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 

51. 51.Cornelis, S.S.; Runhart, E.H.; Bauwens, M.; Corradi, Z.; De Baere, E.; Roosing, S.; Haer-Wigman, L.; Dhaenens, C.M.; Vulto-van Silfhout, A.T.; Cremers, F.P.M. Personalized genetic counseling for Stargardt disease: Offspring risk estimates based on variant severity. Am J Hum Genet 2022, doi:10.1016/j.ajhg.2022.01.008.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2022.01.008&link_type=DOI) 

52. 52.Jarrett, S.G.; Lewin, A.S.; Boulton, M.E. The importance of mitochondria in age-related and inherited eye disorders. Ophthalmic research 2010, 44, 179–190, doi:10.1159/000316480.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000316480&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20829642&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F24%2F2022.03.17.22272534.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000281972300006&link_type=ISI) 

53. 53.Seneca, S.; Vancampenhout, K.; Van Coster, R.; Smet, J.; Lissens, W.; Vanlander, A.; De Paepe, B.; Jonckheere, A.; Stouffs, K.; De Meirleir, L. Analysis of the whole mitochondrial genome: translation of the Ion Torrent Personal Genome Machine system to the diagnostic bench? European Journal of Human Genetics 2015, 23, 41–48, doi:10.1038/ejhg.2014.49.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2014.49&link_type=DOI) 

54. 54.Atilano, S.R.; Udar, N.; Satalich, T.A.; Udar, V.; Chwa, M.; Kenney, M.C. Low frequency mitochondrial DNA heteroplasmy SNPs in blood, retina, and [RPE+choroid] of age-related macular degeneration subjects. PLoS One 2021, 16, e0246114, doi:10.1371/journal.pone.0246114.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0246114&link_type=DOI)