1 Response to neoadjuvant chemoradiotherapy in rectal cancer is associated with RAS/AKT pathway dysregulation and high tumour mutational burden.

Purpose: Pathological complete response (pathCR) in rectal cancer, seen in examination of the pathological specimen post-surgery is the phenomenon whereby a tumour completely regresses under treatment with chemoradiotherapy. This is beneficial as up to 75% of patients do not experience regrowth of the primary tumour, allowing organ preservation and is poorly understood. We aimed to characterise the processes involved in pathCR. Materials & Methods: Two groups of patients were identified with either complete response (pathCR group) or no response (poor response group) and biopsy and/or resection specimen blocks were retrieved. These underwent high read depth amplicon sequencing, exome sequencing, methylation arrays and immunohistochemistry for DNA repair pathway proteins. Sequencing data underwent analysis and the two cohorts were compared. Results: Seven patients who underwent pathological complete response and twenty four who underwent poor response (to act as opposite “extreme phenotypes”) underwent molecular characterisation. Patients in the complete response group had significantly higher tumour mutational burden, neoantigen load and enrichments for mutations in the PI3K/AKT/mTOR signalling pathway as well as significantly lower numbers of structural variants. There were no differences in copy number variants or tumour heterogeneity. Methylation analysis demonstrated enrichment for changes in the PI3K/AKT/mTOR signalling pathway. Conclusions: The phenomenon of pathCR in rectal cancer appears to be related to immunovisibility caused by a high tumour mutational burden phenotype. Resistance both the 3D accurate as well and Our findings suggest a number of new therapeutic avenues for increasing responsiveness to chemoradiotherapy in rectal cancer. We plan to further study these in complex 3D models such as organoids in order to understand whether they will increase response rates when appropriately targeted.

4 mechanisms seem to involve the PI3K/AKT/mTOR signalling pathway and tumour heterogeneity does not seem to play a role in resistance.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

INTRODUCTION
Rectal cancer is a common malignancy (1), with approximately 11,000 case per year in the UK (2). Treatment typically consists of excisional surgery (3) with neoadjuvant therapy if the cancer is locally advanced, consisting of either short course radiotherapy (4) (25 Gy in 5 fractions over 1 week) with surgery on the following week, or long course neo-adjuvant chemoradiotherapy (nCRT) (5) (45-50 Gy in 25 fractions over 5 weeks, with synchronous iv 5-FU or oral capecitabine) with surgery 6-10 weeks later. This latter regimen leads to significant tumour shrinkage and downstaging, with pathological complete response (pathCR, defined as complete regression of tumour in the resection specimen) observed in approximately 10-15% of patients (6). Multiple investigators have shown higher rates of response (7) (8) with higher radiotherapy doses, with pCR rates of 25-30%.
Clinical complete response (CCR) is defined as the absence of tumour on imaging and/or clinical examination, but does not definitely exclude residual tumour, which requires a resection specimen to confirm. If high rates of CCR can be associated with pathCR, this may allow the use of watch and rate to occur routinely.
Pathological CR is associated with several potential benefits., either effecting potential cure (9) (defined as >5 years recurrence free) or either delaying the need for excisional surgery (10) or allowing potential organ preservation surgery (11) such as TEMS or TAMIS (12). It is unclear as to what the molecular drivers of pathCR are (13, 14), but they are thought to relate to the factors that promote radiotherapy and chemotherapy related tumour killing. Rectal cancers exist within a low oxygen tension environment (15) leading to an intrinsic resistance. As radiotherapy induces the formation of oxygen derived free radicals, which cause tumour cell death by . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.01.20019794 doi: medRxiv preprint 6 direct DNA damage, a low oxygen environment leads to fewer available oxygen free radicals, leading to lower cell death.
Another hypothesised mechanism is around DNA repair. Ionising radiation (16) induces a DNA double strand break (DSB), which is then repaired either by one of two mechanisms, homologous recombination (HR, where a sister chromatid is used to repair the defect) or non-homologous end joining (NHEJ), where a complex repair mechanism including BRCA, ATM, ERCC5 and others contribute to a DNA repair complex that re-joins the damaged segments of DNA. There is evidence from previous studies that there is aberrant functioning of the NHEJ pathway is associated with response in radiotherapy (17).
Tumour heterogeneity undoubtedly plays a role (13, 18), as does immunogenicity caused by the formation of clonal neoantigens which determines the immune microenvironment. As the rates of microsatellite instability in rectal cancer are low (2-3% (19)), there is little hypermutation & therefore rates of immunovisibility in rectal cancer are low. However Akiyoshi et al (1) have recently shown that levels of clonal neoantigens are higher in patients undergoing a good response to neoadjuvant chemoradiotherapy, suggesting that hypermutation plays a role as this is the primary mechanism by which neoantigens are produced. A previous integrated molecular analysis of rectal cancer (13) focusing on resistance to rectal cancer found no key genomic features that correlated with resistance, in opposition to that found in Akiyoshi's study.
Because of the uncertainty surrounding the precise mechanisms of sensitivity of rectal cancer to chemoradiotherapy, we aimed to study the phenomenon of pathCR in rectal cancer by carrying out a comprehensive molecular analysis of both pre-. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.01.20019794 doi: medRxiv preprint treatment and post-treatment rectal cancer specimens in order to elucidate potential mechanisms of sensitivity.

MATERIALS & METHODS
Patients: A prospective database of all patients undergoing neoadjuvant chemoradiotherapy for rectal cancer was used to identify patients. Ethical approval was obtained from the NW Research Ethics committee (15/NW/0079) Patients who underwent long course chemoradiotherapy and achieved pathCR were identified as determined by complete regression of the tumour on histopathological examination of the specimen by a Consultant Histopathologist with absolutely no tumour cells remaining (Mandard grade 1), as opposed to minimal residual disease where several cells were allowable. The histopathology archives were then searched to find the pre-treatment endoscopic biopsies of these patients for downstream analysis. This was defined as the "pathCR cohort". A randomly selected second cohort of patients were identified, who had poor response to treatment or progressed whilst on treatment, as defined both by post-treatment MRI and Mandard regression grading of either grade 4 or grade 5. This was defined as the "non-responder cohort".
Representative normal tissue from the proximal resection margin was obtained for genomic analysis.
Samples: Formalin fixed blocks were retrieved for these patients and cut into 4uM sections on frosted glass slides for needle macrodissection and immunohistochemistry. A representative H&E section was used to target tumour cells on needle macrodissection, which was then used to maximise tumour content for DNA extraction. DNA extraction was performed using a modified protocol of the . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Immunohistochemistry (IHC):
This was performed on 4uM slides as previously described using a Leica BondMax autostainer. IHC was performed against yH2AX (ab26350), ATM (ab36810), Ku70/80 (ab53126), MLH1 (ab92312), MSH2 (ab52266) and MSH6 (ab14204) and slides were then scanned onto a Leica slide imaging platform. IHC was scored by the QuPath system (20). Briefly, a single section was zoomed to 10x view of a representative area of epithelium/tumour, calibration was performed, either nuclei or membranous counting was set depending on the antibody and auto counting of DAB stained positive/negative cells was carried out. Staining was reported as a percentage of positive/negative stained cells.
Genomics: Extracted genomic DNA was used for downstream analysis. Targeted amplicon resequencing was performed using the Fluidigm 48x48 Access array. PCR amplicons covering APC, KRAS, BRAF, NRAS, FBXW7, SOX7 (primer sequences available on request) were designed using Primer3 (21). Primer specificity was checked using PrimerBLAST and UCSC in-silico PCR. Briefly, 20ng of FFPE DNA was injected into the Access Array system and thermal cycled according to manufacturer's specifications. Amplicons were then ligated to Illumina sequencing indexes & adapters and pooled and sequenced on an Illumina MiSeq to an average read depth of > 1000X using a 100bp PE sequencing strategy. For FFPE exome sequencing, a custom modification of the Illumina TruSeq Exome hybridisation kit was used. Approximately 300ng of FFPE-derived DNA was prepared with the following modifications: Firstly, no size selection was performed after end repair and DNA fragments were amplified with 12 cycles of PCR. Enrichment was performed using a bead ratio of 0.8, then samples were combined into pools of 3 plex for coding . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. Bioinformatics: All bioinformatics analysis was carried out on the a high performance computing service. For the amplicon resequencing, FASTQ files were trimmed (Trimgalore) and aligned to the hg19 reference genome using a standard GATK pipeline using the BWA aligner (22). Mutation calling was performed using FreeBayes (23). For the exome sequencing analysis, FASTQ files were trimmed and aligned to the hg19 reference genome using an exome sequencing pipeline Isaac v4 aligner (24), Manta SV caller (25) and Canvas CNV (26)  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Clonality was determined by running superFreq (32), a cancer specific tumour exome caller and river plots were producing from this package. For neoantigen calls, tumour-normal subtracted VCF files produced with Strelka, GT fields were added via conversion of the SGT field (using a custom script), annotated with the variant effect predictor (VEP), filtered for indels and then analysed using PVacTools against MHC Class I binding predictions (33). Neoantigens with a "best" median IC of < 50nmol.L-1 were counted as binding neoantigens. HLA typing for each patient was determined with HLA-LA on germline exome sequencing data (34).
For methylation, IDAT files were imported into R/Bioconductor and analysed on the ChAMP pipeline (35) using standard settings. iDAT files were imported, standardised to beta methylation values, then underwent SWAN normalisation. Normalised betavalues underwent differential methylation analysis using limma, DMR were called using dmrLasso. Pathway analyses were conducted in GProfiler2 (36).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) 1 1

Samples
In the pre-treatment group, 7 biopsies were only available in isolation (because of selecting complete pathological response). In the post-treatment group, there were 24 samples that had had poor response to chemoradiotherapy as defined by a low histological tumour regression grade.

Sequencing metrics
In total 31 samples successfully underwent amplicon sequencing and 31 samples underwent whole exome sequencing. For the amplicon sequencing average read depth was 1020x (range 357-5210x). For the whole exome sequencing the average read depth was 99x for tumour samples and 55x for control normal. For methylation arrays all samples hybridised successfully.
The top ranked signatures were signatures three, five and thirty. Signature three is due to defective base repair due to faulty homologous recombination, signature five is due to the effects of transcription coupled nucleotide repair and signature 30 is due to a defect in base-excision repair due to inactivating mutations in NTHL1 (supplementary tables).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020.  Figure 1). Interestingly, in samples where a pre-treatment biopsy was available with a post-treatment specimen, TMB dropped to very low levels (median 5.89, range 1.08-8.65, Figure 1) in the post-treatment specimen.

Clonal evolution analysis
In the seven samples (out of twenty four) where triplet normal, pre-treatment biopsy and post-treatment specimen were available, clonal evolution analysis with superFreq was carried out. This revealed that subsequent to radiation therapy there was an increase in clonal diversity (Figure 4) across all samples with pre-treatment . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. Renal Agenesis (p=2.82x10-2) and the GO:MF term Class II MHC binding (p=1.77x10-2). Differential block analysis revealed that the top differential methylated block was located at chr1:96420019-96881853 (p valuearea =0.03) which is . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Neoantigen prediction
Neoantigen prediction using pVacTools revealed a median of 78 neoantigens per sample (range 9-383). The hypermutant samples unsurprisingly had the highest neoantigen load as well as the best response to radiotherapy, with a median 91 neoantigens in the responder group and 78 in the non-responder group (Mann-Whitney p=0.0342).

Immunohistochemical analysis of DSB pathway and mismatch repair deficiency
Semi-quantititative analysis using QuPath was carried out on white light DAB stained images of the expression of yH2AX, Ku70/80, ATM, MLH1, MSH2 and MSH6 respectively.
For the expression of DNA repair genes, significant differential expression was seen (Table 2) in yH2AX, Ku70/80, ATM, MLH1 and MSH6 expression when compared to response/no response status. However, especially for the mismatch repair proteins, despite there being statistically significant differences, these were only a few percentage points different and little complete loss of expression was seen.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020.

CONCLUSIONS
In this study, we have demonstrated a number of interesting findings comparing pretreatment biopsies and post treatment specimens in patients undergoing (nCRT) for rectal cancer.
The most important finding is that in patients achieving a complete or near complete response to nCRT is that the pre-treatment tumour has a high tumour mutational burden (37). This is because hypermutation within the tumour due to an intrinsic DNA repair defect and leads to increased presentation of neoantigens because of indel and frameshift mutations (38). This increases immunovisibility and we hypothesise that the increased immunovisibility, coupled with activation of the immune system by irradiation, causes cGAS/STING activation which has been shown to lead to a Type I interferon response (39) leading to migration of immune cells and enhanced regression. Also, it suggests that these patients may benefit from anti-PD1/PD-L1 or anti-CTLA4 immunotherapy as high TMB (defined as > 10 muts/mb) has been shown to be correlated with responsiveness to these agents(1).
Another intriguing possibility is that genotoxic therapy, coupled with radiation therapy could be delivered as part of neoadjuvant treatment in the tumour, possibly increasing neoantigen burden and making more patients suitable for immunotherapy (40). Our sample size of pathological complete responders is relatively small, however we deliberately chose tumour regression where absolutely no cells remained, which is a very rare phenomenon, compared to the phenomenon of "minimal residual disease", but we believed that this would give a stronger biological signal.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.01.20019794 doi: medRxiv preprint 1 7 We have also demonstrated that there is enrichment for mutations in the mTOR/AKT signalling pathways, specifically PIK3CA but also as a general trend towards mutations and epigenetic changes in this pathway. PIK3CA is a member of the mTOR signalling pathway and makes up the alpha subunit of the PI3K protein.
mTOR signalling has previously been highlighted as being of possible relevance in radiosensitivity (41) as a cellular marker of stress in the tumours of patients undergoing chemoradiotherapy. PIK3CA signals through the mTORC1/mTORC2 and exerts its downstream effect on AKT (42). Targeted agents for PIK3CA (apitosilib (43)), mTORC1/2 (visusertib (44)) and AKT (MK2206 (45)) exist and are at various stages of clinical development. We suggest that these agents may be utilised as part of a neoadjuvant therapy strategy in order to increase response rates. Tumour heterogeneity is a significant problem across all tumours due to drivers that may cause a differential response to therapy because of mutational clonality. Our results show, unsurprisingly, that treatment of samples with neoadjuvant chemoradiotherapy causes an increase in clonal diversity, which may be a driver in the phenomenon of radiation resistance. However, we did not demonstrate any . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.01.20019794 doi: medRxiv preprint 1 8 difference between responders and non-responders in terms of MATH score as a measure of tumour heterogeneity, unlike other papers, however the accuracy of MATH score as a measurement of heterogeneity has been called into question because of differences in variant callers (49).
In the epigenetic analysis, the microRNA pathway Has-mir-21-5p was significantly differentially methylated. This microRNA is of significant interest (50) in response to therapy as it represents a key driver in the cellular response to hypoxia. The immunohistochemical analysis of tumour samples showed increased yH2AX and ATM expression was associated with response, as well as increased expression of the mismatch repair proteins MLH1 & MSH6. We found this puzzling, as we would have expected the opposite to be true, i.e. loss of expression was needed for response, especially in ATM and yH2AX. However we hypothesise that these samples may have had regions of loss of normal response and these disappeared as a consequence of response to radiotherapy.
Therefore we suggest that based on these findings, a number of factors contribute to the response to neoadjuvant chemoradiotherapy: hypermutation leading to increased neoantigen presentation; enrichment in defects in the mTOR signalling pathway; hypoxia regulated by miR-21-5p and an increase in clonal diversity. Our findings agree with those of Akiyoshi et al (1) in that increase in neoantigen diversity correlated with response. Kamran (13) however, found no clear molecular defects that predisposed to radioresistance. This could be due to the fact that their analysis was geared towards pathways of treatment resistance rather than those that lead to radiosensitivity.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. Clearly, the best way to understand these defects and investigate them further would be to build a cellular model of rectal cancer in order to modulate these pathways in order to measure responsiveness (51). Current cell lines have a bias towards their micro environment and although provide reasonable models of single pathway alterations lack the fidelity to measure therapy response when modulated. The ideal model would be an organoid based rectal cancer therapy model as this provides both the 3D structure (enabling cell/cell communication a more representative element of intra-tumoural hypoxia) and more accurate response to therapy, as well as the ability to evolve and resist therapy.
Our findings suggest a number of new therapeutic avenues for increasing responsiveness to chemoradiotherapy in rectal cancer. We plan to further study these in complex 3D models such as organoids in order to understand whether they will increase response rates when appropriately targeted.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.01.20019794 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.  . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 4, 2020.  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.