Abstract
Adeno-associated viruses (AAVs) are the most commonly used vectors in gene therapy but can frequently cause liver complications in patients. The mechanisms underlying AAV-related liver toxicity remain poorly understood, posing challenges for effective prevention and intervention. We undertook long and short read metagenomic sequencing of liver tissue from a child with spinal muscular atrophy type 1 experiencing significant hepatitis after receiving onasemnogene abeparvovec. We identified manufacturing plasmid sequences, with evidence of complex structures and recombination. Vector genomes had extensive disruption and concatemerisation. We also identified the presence of human betaherpesvirus 6B in the liver. It is possible that the presence of the manufacturing plasmid sequences or helper viruses allows replication of the vector within cells, contributing to the development of complex concatemeric structures and associated hepatitis.
Main
AAV gene therapies show promise for treating a wide variety of serious genetic conditions, such as haemophilia1–3, muscular dystrophies4 and spinal muscular atrophy5. As of 2024, there were seven AAV gene therapies approved by the FDA6, with many more in clinical trials. The most common adverse effect of intravenously-administered AAV gene therapies is hepatotoxicity, with most patients experiencing a rise in serum liver enzymes, which is routinely treated with high dose steroids. Occasionally, liver toxicity is severe and some patients have experienced fulminant liver failure7–10. Hepatotoxicity tends to be more severe in older patients with a higher body weight, who receive higher vector doses, and who may require additional immunosuppression11,12. Pre-existing liver disease appears to be a significant risk factor7.
The mechanisms underlying hepatotoxicity are complex and incompletely understood. Hepatitis has been postulated to be caused by innate, humoral and cellular immune responses to the vector capsid, genome, or transgene product13–15, or by impurities within the vector preparation16,17, or from a direct toxic effect18,19. Acute sinusoidal endothelial injury resembling capillary leak syndrome has also been well documented in non-human primates using both empty capsids and therapeutic transgenes20.
Onasemnogene abeparvovec (Zolgensma®, OA) is an AAV-vectored gene therapy for spinal muscular atrophy (SMA), a neurodegenerative disease caused by deleterious variants in the SMN1 gene, which encodes the survival motor neuron protein (SMN)21. OA is manufactured using three plasmids (Figure 1): the vector plasmid (pSMN), which contains SMN and a promoter region between two AAV inverted terminal repeats (ITRs); pAAV2/9, which contains AAV2 rep and AAV9 cap genes; and pHelper, which contains the human adenovirus (HAdV) genes necessary for AAV replication22,23. The resultant vector preparation contains therapeutic recombinant AAV (rAAV) particles, which have an outer AAV9 capsid containing a human SMN coding sequence between AAV ITRs in a self-complementary structure. Apart from this desirable construct, there are also manufacturing process-related impurities including empty capsids, reverse packaged manufacturing plasmids, genome fragments, and recombined products24,25. Transcriptionally-active sequences from a Rep-Cap manufacturing plasmid have been identified in mouse liver after systemic administration of a good manufacturing practice (GMP) produced and purified rAAV preparation25. These manufacturing issues are complex to study and resolve, and the FDA has released guidance on reporting and validating the steps in the manufacturing process26.
Figure 1. OA is produced by transfection of HEK293 cells with a vector plasmid (pSMN), containing SMN between AAV inverted terminal repeats (ITRs), an AAV plasmid containing AAV2 rep and AAV9 cap genes (pAAV2/9), and a helper plasmid containing HAdV genes such as E2A, E4 and VA RNA genes (pHelper)22,23. SMN, survival motor neuron; AAV, adeno-associated virus; HAdV, human adenovirus; ITR, inverted terminal repeat; ssDNA, single-stranded DNA; dsDNA, double-stranded DNA. Produced using biorender.com.
We investigated a patient treated with OA for type 1 SMA who experienced significant symptomatic hepatitis after infusion. [Potentially identifying clinical details withheld in preprint]
Histology of the liver biopsy showed a marked periportal and lobular lymphocytic infiltrate with interface inflammation, patchy hepatocyte necrosis and ballooning degeneration, with relative sparing of bile ducts (Figure 2A, B). This histological pattern is similar to that seen in children with hepatitis associated with wild-type AAV2 infection27,28 and in “indeterminate” paediatric acute liver failure29. In situ hybridisation for SMN confirmed hepatocyte vector transduction in the patient liver (Figure 2C, D).
Figure 2. A) Liver biopsy of the patient shows marked periportal and lobular inflammation as well as interface inflammation (*); numerous hepatocytes with ballooning degeneration are present (**). Box in A is magnified in B. B) High magnification of ballooning hepatocytes highlighting the swollen cytoplasm. C) RNA in situ hybridisation detecting the SMN1 gene shows a strong positive red signal in the nucleus of ballooning hepatocytes separated by areas with severe immune cell infiltration (*). Box in C is magnified in D. D) High magnification of ballooning hepatocytes shows the positive signal in the nucleus and a mild to moderate, punctate signal within the cytoplasm of hepatocytes and of the immune cells (*). Inflammation in the liver is shown by immunohistochemistry detecting CD4 (E), CD8 (F) and CD20 (G). Bars: A and C, 400 micrometres; B, 60 micrometres; D, 100 micrometres; E-G, 300 micrometres.
We conducted untargeted short-read metagenomic sequencing of DNA and RNA from the residual patient liver sample. Analysis identified multiple serotypes of AAV, human mastadenovirus C (HAdV-C) and human betaherpesvirus 6 type B (HHV-6B) in the DNA-seq, while RNA-seq did not identify any pathogen transcripts, consistent with lack of viral replication (Table 1). For AAV2 and HAdV-C, the sequencing reads only aligned to the sections of the viral genomes that are part of the OA manufacturing plasmids, suggesting presence of plasmids in the liver tissue rather than wild-type virus infection (Figure 3A). In contrast, HHV-6B reads covered the breadth of the HHV-6B genome (Figure 3B), suggesting natural HHV-6B infection. Accordingly, a specific PCR for HAdV (targeting a region of the genome that is not present in pHelper) was negative, and a specific PCR for HHV-6 was positive (Ct value = 26.2). Aligning reads to the manufacturing plasmid sequences, we found good coverage of the vector genome, but also of the other regions of the pSMN manufacturing plasmid and of pAAV2/9, with some reads mapping to pHelper (Figure 3C).
Figure 3. A) Genome coverage of wild-type (WT) AAV2 and HAdV-C from Illumina sequencing reads. Approximate locations of the genes present in the manufacturing plasmids are marked along the x-axis. AAV2 alignment uses more stringent mapping parameters to more clearly differentiate between any AAV2 and AAV9 derived sequences (see Methods). B) Alignment of Illumina sequencing reads to the HHV-6B genome shows reads cover the breadth of the genome. C) Alignment of Illumina sequencing reads to approximate manufacturing plasmid sequences shows presence of plasmid sequences. NB: in the negative control, 10 reads aligned to the pSMN sequence and no reads aligned to the pAAV2/9 or pHelper sequences.
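The distinction drawn above, between reads confined to plasmid-derived regions and reads spanning a whole viral genome, rests on breadth of coverage. The following is a minimal sketch of that calculation, not the analysis code used in this study. The function names, the half-open interval convention and the toy coordinates are all assumptions made for illustration.

```python
def merge_intervals(intervals):
    """Merge overlapping or touching half-open [start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def breadth_of_coverage(intervals, genome_length):
    """Fraction of reference positions covered by at least one alignment."""
    covered = sum(end - start for start, end in merge_intervals(intervals))
    return covered / genome_length


# Hypothetical toy reference of 1,000 positions (not real genome coordinates).
GENOME_LENGTH = 1000
confined = [(100, 250), (200, 400)]   # reads restricted to one region
spanning = [(0, 500), (450, 1000)]    # reads tiling the whole reference

confined_breadth = breadth_of_coverage(confined, GENOME_LENGTH)
spanning_breadth = breadth_of_coverage(spanning, GENOME_LENGTH)
print(confined_breadth, spanning_breadth)
```

In this toy case, a low breadth with reads clustered in one region would be the pattern expected for plasmid-derived sequence, whereas a breadth close to 1 would be more consistent with a complete viral genome.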
To further confirm the presence of the manufacturing plasmids in the liver, we performed untargeted long read metagenomic sequencing of the liver sample (Oxford Nanopore Technologies). Again, we found reads aligning to AAVs, HHV-6B and HAdV-C (Table 1), along with the gene therapy vector sequence. Reads mapping to all three of the manufacturing plasmids were also found, primarily to pSMN (including the region of the plasmid that is not included in the therapeutic vector genome), and pAAV2/9 (Table 2). Sequence analysis of individual reads showed high levels of vector genome concatemerisation and complex genome structures with rearrangements (Figure 4A-D, Table 2). The concatemeric patterns observed show similarities to those seen in replicating AAVs using rolling hairpin and rolling circle amplification30. Many of the structures observed were longer than the maximum packaging length of an AAV vector (approximately 4.7kb), suggesting that recombination may have occurred in vivo. Alternatively, these sequences may represent plasmid contaminants from manufacture. The majority of pAAV2/9 reads also contained regions of the other OA manufacturing plasmids, indicating recombination between plasmid sequences (Figure 4, Table 2). Most of the complex structures and recombination involved the region between the ITRs of pSMN, the rep/cap region of pAAV2/9, and the region of pHelper containing the HAdV-derived genes (Figure 4, Table 2).
Figure 4. Alignment dot plots showing individual nanopore reads (x axis) aligning to representative sequences of the OA manufacturing plasmids (y axis). Red dots show alignment to the forward strand and blue to the reverse. A) Alignment against vector region of pSMN plasmid. B) Alignment against entire pSMN plasmid. C) Alignment to pAAV2/9 plasmid. D) Alignment to regions of all three plasmids: the vector region of pSMN, AAV rep and cap within pAAV2/9 and the HAdV gene region within pHelper. Representative images were selected; numbers of reads belonging to each category can be found in Table 2.
Complex AAV vector genome structures with rearrangements and concatemers have previously been demonstrated in macaque liver after treatment with rAAVs31,32 and in human hepatocytes in a humanised mouse model33. Similar complex concatemeric structures have been noted in liver samples from children with hepatitis associated with wild-type AAV2 infection27. To our knowledge, this is the first metagenomic analysis of liver tissue from an rAAV-treated patient to comprehensively elucidate the vector concatemer structures and complex genome rearrangements, which also included some manufacturing plasmid sequences.
Both short and long read metagenomic sequencing approaches in this study provide evidence that sequences from all three manufacturing plasmids were present in the liver of a patient with hepatitis after treatment with OA. Long read sequencing revealed large quantities of complexed plasmid DNA and many reads contained elements of multiple plasmids. It is possible that recombination events between plasmids took place during the manufacturing process, resulting in transduction of the AAV rep and cap genes and the HAdV helper genes as contaminants in the therapeutic vector preparation. Theoretically, this could allow replication competence of the vector genome in liver cells, thus explaining the replication-associated vector genome complexes we observed. The contribution of these genomic structures to the pathogenesis of rAAV hepatitis, and the putative role of contaminating plasmids and their potential immunogenicity, remain to be determined. It would be important to ascertain whether these genomic structures are also present in rAAV-treated patients without hepatitis.
Due to insufficient liver material, we were unable to assess whether the rearranged vector genomes were chromosomally integrated or episomal. Random, low frequency integration of various rAAV vectors in patient tissue is now well recognised34–37. AAV integrants in complex concatemers containing mixtures of rearranged and truncated vector genomes have been demonstrated in liver tissue of non-human primates after intravenous administration of rAAV8 vectors31. The significance of HHV-6B in the liver is unclear from this single case description. HHV-6 can act as a “helper” virus in wild-type AAV2 replication, and its genome contains a homologue of the AAV rep gene38. HHV-6 has also been found in liver tissue in a proportion of children with hepatitis associated with wild type AAV2 infection, although also sometimes in controls27,28, and in children with acute liver failure of unknown cause39,40.
Overall, our metagenomic analysis of liver tissue from a patient with hepatitis after OA treatment has revealed extensive disruption and concatemerisation of vector genomes, alongside numerous contaminating manufacturing plasmid sequences, with evidence of complex genomic structures and recombination events. We also identified HHV-6B in the liver. We postulate that the presence of certain manufacturing plasmid sequences or helper viruses may allow replication of the vector genome within cells, giving rise to complex concatemeric structures. Future work is needed to determine the frequency and pathological significance of complex DNA structures in patient liver cells after rAAV gene therapy, whether they are episomal or integrated into the host genome, and how this may relate to the known hepatotoxicity of rAAV gene therapies.
Data Availability
Data produced in the present study are available upon reasonable request to the authors.
Declaration of competing interests
GB is PI of clinical trials sponsored by Roche, Novartis, Sarepta, Pfizer, NS Pharma, Reveragen, Percheron, Biomarin, Scholar Rock, and has received speaker and/or consulting fees from Sarepta, PTC Therapeutics, Entrada Therapeutics, Pfizer, Biogen, Novartis Gene Therapies, Inc. (AveXis), and Roche, and grants from Sarepta, Roche and Novartis Gene Therapies. UCL has received funding from Sarepta, Roche, Pfizer, Italfarmaco, Santhera.
FM is the PI of the Novartis sponsored trials in which OA was studied in the UK, and is also involved in clinical trials sponsored by Biogen, Roche, Sarepta Therapeutics, Genethon, PTC therapeutics and Solid Bioscience. He has received consulting fees from Pfizer, Sarepta, Roche, Biogen, Novartis, Solid, Dyne Therapeutics, Entrada, PTC and Edgewise.
MS is the sub-I of the Novartis sponsored trials in which OA was studied in the UK, and is also involved in clinical trials sponsored by Biogen, Roche, Dyne. She has received consulting fees from Roche, Biogen and Novartis.
Methods
Ethics
The liver biopsy procedure was performed for diagnostic purposes. Residual material was analysed in this study with informed consent for additional research under the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) WHO Clinical Characterisation Protocol UK (CCP-UK) (ISRCTN 66726260). Ethical approval for the ISARIC CCP-UK study was given by the South Central–Oxford Research Ethics Committee in England (13/SC/0149), the Scotland A Research Ethics Committee (20/SS/0028) and the WHO Ethics Review Committee (RPC571 and RPC572).
Short read metagenomic sequencing
Untargeted Illumina metagenomic sequencing of the liver biopsy was carried out by the clinical metagenomics service at Great Ormond Street Hospital, according to the protocol previously described27,41. A negative control sample consisting of human DNA and RNA spiked with positive controls (cowpox DNA and FCV RNA) was run in parallel. Viruses were identified from the metagenomics data using Kraken242 and Bracken43 run through Taxprofiler44, as well as metaMix45.
Human-filtered reads from the metaMix pipeline (other than for alignment to pSMN, where raw reads were used) were aligned using Bowtie246 in very sensitive mode (apart from WT AAV2, where the parameters --score-min L,0,-0.1 -N 0 -L 22 --mp 6,2 --rdg 5,3 --rfg 5,3 were used) to genome sequences of AAV2 (NC_001401), HHV-6B (NC_000898) and HAdV-C (NC_001405) obtained from RefSeq, as well as representative sequences of the plasmids used in OA manufacture (pSMN47, pAAV2/948, pHelper (pHGTI-Adeno1)49). The sequence of the AMR gene region in the pSMN plasmid did not match what was observed in the patient, so this region was reconstructed using the long read sequencing data and the modified pSMN sequence was used in all alignments. PCR duplicates were removed from the resulting alignments using samtools markdup50 and alignments were plotted using a custom R script.
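To illustrate why the WT AAV2 parameters above are more stringent, the sketch below evaluates the linear minimum-score function that Bowtie2 applies in end-to-end mode, where `L,a,b` means a minimum alignment score of a + b × read length. The read length of 150 bp is an assumption made purely for illustration, as read length is not stated here. The comparison threshold `L,-0.6,-0.6` is the documented Bowtie2 end-to-end default, and the function names are hypothetical.

```python
import math


def min_alignment_score(read_length, intercept, slope):
    """Bowtie2 linear function 'L,intercept,slope': intercept + slope * length."""
    return intercept + slope * read_length


def max_full_penalty_mismatches(read_length, intercept, slope, max_penalty=6):
    """Upper bound on mismatches when each costs the maximum penalty.

    max_penalty=6 corresponds to the first value of '--mp 6,2'. Gap
    penalties are ignored here, so this is only a rough bound.
    """
    budget = -min_alignment_score(read_length, intercept, slope)
    return math.floor(budget / max_penalty)


READ_LENGTH = 150  # assumed read length, for illustration only

# Threshold used for the WT AAV2 alignment: --score-min L,0,-0.1
strict = max_full_penalty_mismatches(READ_LENGTH, 0, -0.1)

# Bowtie2 end-to-end default: --score-min L,-0.6,-0.6
default = max_full_penalty_mismatches(READ_LENGTH, -0.6, -0.6)

print(strict, default)
```

Under these assumptions the stricter threshold tolerates far fewer high-quality mismatches per read than the default, which is what allows closely related AAV2 and AAV9 derived sequences to be separated.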
Long read metagenomic sequencing
DNA from approximately 3 mg of liver was purified using the Qiagen DNeasy Blood & Tissue kit as per the manufacturer’s instructions. DNA was fragmented to an average size of 10 kb using a Megaruptor 3 (Diagenode) to reach an optimal molar concentration for library preparation. Quality control was performed using a Femto Pulse System (Agilent Technologies) and a Qubit fluorometer (Invitrogen). Samples were prepared for nanopore sequencing using the ligation sequencing kit SQK-LSK110. DNA was sequenced on a PromethION using R9.4.1 flowcells (Oxford Nanopore Technologies). Samples were run for 72 hours.
All library preparation and sequencing were performed by the UCL Long Read Sequencing facility.
Reads were trimmed using porechop with an adaptor threshold of 85 and were mapped to the human genome (Ensembl GRCh38 v107) using minimap251 in map-ont mode. Unaligned reads were then aligned to the regions of the plasmids shown in the figures using minimap2, and the aligned reads extracted using samtools50. A custom R script was used to filter reads that were over 1000 bp in length, had a total alignment length of at least 80% of the total read length across all alignments and had a continuous stretch of matches/mismatches with no insertions or deletions of at least 100 bp. Alignment dot plots for these reads were created using redotable52 with a window size of 20.
Representative examples are shown in the figures. Viruses were identified from the metagenomics data using Kraken2 and Bracken run through Taxprofiler.
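The three read-level filters described above can be sketched as follows. This is an illustrative Python reimplementation under stated assumptions, not the custom R script used in the study. It assumes that each alignment is summarised by its SAM CIGAR string, that "alignment length" counts read bases in M, =, X and I operations, and that the indel-free stretch may come from any single alignment of the read. All function names are hypothetical.

```python
import re

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_cigar(cigar):
    """Split a SAM CIGAR string into (length, operation) pairs."""
    return [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]


def aligned_read_bases(cigar):
    """Read bases inside the alignment (assumed here: M, =, X and I)."""
    return sum(n for n, op in parse_cigar(cigar) if op in "M=XI")


def longest_indel_free_run(cigar):
    """Longest run of match/mismatch bases not interrupted by an indel."""
    longest = current = 0
    for n, op in parse_cigar(cigar):
        if op in "M=X":
            current += n
            longest = max(longest, current)
        else:
            current = 0
    return longest


def passes_filters(read_length, cigars, min_length=1000,
                   min_aligned_fraction=0.8, min_run=100):
    """Apply the length, aligned-fraction and indel-free-run filters."""
    if read_length <= min_length:
        return False
    total_aligned = sum(aligned_read_bases(c) for c in cigars)
    if total_aligned < min_aligned_fraction * read_length:
        return False
    return any(longest_indel_free_run(c) >= min_run for c in cigars)


# Hypothetical 5,000-base read with two alignments to plasmid regions.
example = passes_filters(5000, ["200S2400M2I98M2300S", "2500S2300M200S"])
print(example)
```

Summing over alignments in this way would double count read bases if two alignments overlapped on the read; a stricter version could merge read intervals before computing the aligned fraction.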
RNAscope in situ hybridisation
Formalin-fixed paraffin-embedded liver sections were cut at 2-3 µm thickness and mounted on glass slides. RNAscope was performed according to the manufacturer’s instructions, with protease treatment and simmering in target solution (product codes: 322360 and 322331, ACDBio), to detect the SMN gene (product code: 553631, ACDBio, RNAscope® Probe - Hs-SMN1-CDS - Homo sapiens survival of motor neuron 1 telomeric (SMN1) transcript variant d mRNA). A probe detecting ubiquitin (product code: 310041, ACDBio) was used as a positive control and a probe for DapB (product code: 310043, ACDBio) as a negative control. Haematoxylin was used as a counterstain and slides were digitised using the Leica Aperio 8 slide scanner.
Immunohistochemistry
Immunohistochemistry was performed on formalin-fixed paraffin-embedded tissue cut at a thickness of 3 µm, using the Ventana Benchmark ULTRA staining platform and Optiview DAB Detection kit. The positive control was tonsil. The following antibodies were used: anti-CD4 (clone SP35, Roche, 790-4423), anti-CD8 (clone SP239, Roche, 790-7176) and anti-CD20 (clone L26, Dako (Agilent), M0755).
For all three antibodies, a HIER pre-treatment and a haematoxylin counterstain were used.
Acknowledgements
SB, OMT and SM are funded by the National Institute for Health Research (NIHR) Blood and Transplant Research Unit for Genomics to Enhance Microbiology Screening (NIHR203338). LB is funded by the NIHR Great Ormond Street Biomedical Research Centre (BRC). JB receives funding from the NIHR UCL/UCLH BRC. JB is an NIHR Senior Investigator. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.
RK is funded by LifeArc P2020-0008 and P2023-0011, Great Ormond Street Hospital Children's Charity and Dravet Syndrome UK Charity V4720 and V4919 and Therapeutic Acceleration Support (TAS), UCL.
This work was supported by grants CRUSH MC_UU_00034/9 and Wellcome Trust 226141/Z/22/Z.
The support of the GOSH and UCLH/ Institute of Neurology BRC to the Dubowitz Neuromuscular Centre Biobank is gratefully acknowledged.
The authors thank the team of the Histology Research Service, University of Glasgow, for the excellent technical support.