KLF3 and PAX6 are candidate driver genes in late-stage, MSI-hypermutated endometrioid endometrial carcinomas

Meghan L. Rudd; Nancy F. Hansen; Xiaolu Zhang; Mary Ellen Urick; Suiyuan Zhang; Maria J. Merino; National Institutes of Health Intramural Sequencing Center Comparative Sequencing Program; James C. Mullikin; Lawrence C. Brody; Daphne W. Bell

doi:10.1101/2021.04.26.21256125

Abstract

Endometrioid endometrial carcinomas (EECs) are the most common histological subtype of uterine cancer. Late-stage disease is an adverse prognosticator for EEC. The purpose of this study was to analyze EEC exome mutation data to identify late-stage-specific statistically significantly mutated genes (SMGs), which represent candidate driver genes potentially associated with disease progression. We exome sequenced 15 late-stage (stage III or IV) non-ultramutated EECs and paired non-tumor DNAs; somatic variants were called using Strelka, Shimmer, Somatic Sniper and MuTect. Additionally, somatic mutation calls were extracted from The Cancer Genome Atlas (TCGA) data for 66 late-stage and 270 early-stage (stage I or II) non-ultramutated EECs. MutSigCV (v1.4) was used to annotate SMGs in the two late-stage cohorts and to derive p-values for all mutated genes in the early-stage cohort. To test whether late-stage SMGs are statistically significantly mutated in early-stage tumors, q-values for late-stage SMGs were re-calculated from the MutSigCV (v1.4) early-stage p-values, adjusting for the number of late-stage SMGs tested. We identified 14 SMGs in the combined late-stage EEC cohorts. When the 14 late-stage SMGs were examined in the TCGA early-stage data, only KLF3 and PAX6 failed to reach significance as early-stage SMGs, despite the inclusion of enough early-stage cases to ensure adequate statistical power. Within TCGA, nonsynonymous mutations in KLF3 and PAX6 were, respectively, exclusive or nearly exclusive to the microsatellite instability (MSI)-hypermutated molecular subgroup and were dominated by insertions-deletions at homopolymer tracts. In conclusion, our findings are hypothesis-generating and suggest that KLF3 and PAX6, which encode transcription factors, are MSI target genes and late-stage-specific SMGs in EEC.

Introduction

Endometrial carcinoma (EC) exacts a significant toll on women’s health. It resulted in 89,929 deaths globally in 2018 [1], and is projected to cause 12,940 deaths within the United States in 2021 [2]. Importantly, EC incidence is increasing annually in the US and many other countries [3]. This phenomenon is likely partly due to increasing rates of obesity [4], a well-recognized epidemiological risk factor for endometrioid endometrial carcinomas (EECs) that make up 75%-80% of all newly diagnosed endometrial tumors. EECs most often present as low-grade, early-stage (FIGO (International Federation of Obstetricians and Gynecologists) stage I or II) tumors, that are confined within the uterus [5]. Five-year survival rates for patients with low-grade, early-stage disease are high because surgery is often curative for this patient population, due to the limited extent of disease [5]. In contrast, patients with late-stage EEC have relatively poor outcomes [6], despite more aggressive treatment approaches of surgery with adjuvant chemotherapy or radiotherapy [7-9]. Thus, increasing tumor stage is an adverse prognosticator for EEC that is used in the clinical setting, as are high tumor grade (Grade 3; G3), and extent of lymphovascular space invasion [10]. The prognostic utility of molecular classification, according to POLE, microsatellite instability (MSI), and TP53/p53 status, is an area of active exploration originating from The Cancer Genome Atlas (TCGA) discovery that EECs can be subclassified into four molecular subgroups associated with distinct clinical outcomes [11](and reviewed in [12]).

TCGA’s initial comprehensive molecular characterization of primary endometrial carcinomas included exome sequencing of 200 EECs [11]; an expanded analysis that included 188 additional EECs was subsequently reported [13]. These studies confirmed prior findings that EEC exhibits high frequencies of somatic alterations resulting in activation of the PI3-kinase pathway, the RAS-RAF-MEK-ERK pathway, and the WNT/β-catenin pathway, frequent mutations in ARID1A (BAF250A) tumor suppressor, and mismatch repair (MMR) defects resulting in MSI [11, 13-15]. Moreover, many additional “significantly mutated genes” (SMGs), which represent candidate pathogenic driver genes, were annotated in EECs by TCGA using statistical approaches [11].

Given the dynamic nature of tumor genomes during disease initiation and progression, it is conceivable that the repertoire of pathogenic driver genes may differ in late-stage compared to early-stage EEC. However, the annotation of candidate driver genes (i.e., SMGs) in primary EEC exomes by TCGA, was performed in a stage-agnostic manner [11, 13]. An improved understanding of the molecular etiology of late-stage EEC may provide novel insights into disease pathogenesis and progression. The aim of this study was to delineate SMGs in late-stage EEC exomes, and to determine whether these genes are also significantly mutated in early-stage disease. To this end, we exome sequenced 15 “in-house” late-stage EECs (National Human Genome Research Institute (NHGRI) cohort) and reanalyzed somatic mutation calls from 66 late-stage and 270 early-stage non-ultramutated EECs within TCGA. Collectively, we identified 14 SMGs in 81 late-stage tumors. KLF3 (Krüppel-like factor 3) and PAX6 (Paired box 6), which encode transcription factors, were SMGs in late-stage tumors, but were not statistically significantly mutated in early-stage tumors. All KLF3 mutations, and almost all PAX6 mutations, were in the MSI-hypermutated EEC subgroup; within this subgroup, KLF3 and PAX6 mutations were more frequent in late-stage than early-stage tumors. The mutation spectrum of both genes included recurrent insertions-deletions (indels) at homopolymer tracts, consistent with strand slippage resulting from MMR defects, and suggesting that PAX6 and KLF3 are likely MSI target genes.

Materials and Methods

NHGRI clinical specimens

For 15 cases in the NHGRI cohort, anonymized, fresh-frozen endometrioid endometrial tumors and matched non-tumor (normal) samples were obtained from the Cooperative Human Tissue Network (CHTN) (S1 Table). The National Institutes of Health Office of Human Subjects Research Protections determined that this research was not human subject research, per the Common Rule (45 CFR 46). For each tumor sample, an H&E stained section was reviewed by an experienced gynecologic pathologist to identify regions containing ≥ 70% neoplastic cellularity; accompanying surgical pathology reports were retrospectively evaluated by the same gynecologic pathologist to annotate tumor stage using the FIGO (International Federation of Gynecology and Obstetrics) 2009 classification (S1 Table).

Genomic DNA preparation and next-generation sequencing

Genomic DNA extraction, identity testing and MSI analysis of tumor and normal samples in the NHGRI cohort were performed as previously described [16]. DNA was purified by phenol-chloroform extraction prior to library preparation. DNA libraries were prepared using the SeqCap EZ Exome + UTR capture kit (Roche) and sequenced with the Illumina HiSeq 2000 platform (Illumina).

Alignment and variant calling

Short sequence reads from NHGRI cohort exomes were aligned to the hg19 human reference sequence using NovoAlign version 2.08.02 (University of California at Santa Cruz). Four somatic mutation detection algorithms, Strelka [17], Shimmer [18], SomaticSniper [19], and MuTect [20], were used to call potential somatic variants. Insertions and deletions (indels) were identified by Shimmer and Strelka, while single nucleotide variants (SNVs) were identified by all four somatic algorithms. Strelka workflow version 1.0.14 (https://doi.org/10.1093/bioinformatics/bts271) was run with default parameters. Shimmer version 0.2 (https://github.com/nhansen/shimmer) was run with –min_som_reads=6 and -- minqual=20 [18]. SomaticSniper version 1.0.5 was run with options -Q 40 -G -L, followed by the “standard somatic detection filters” described in Larsen et al [19]. MuTect version 1.1.5 was run with default parameters, and data were then filtered to include only calls designated as “KEEP” in the program’s output [20]. Following analysis with each algorithm, a VarSifter-formatted file was generated containing the somatic variant allele frequencies observed in each tumor and matched normal sample for every called variant [21]. ANNOVAR (downloaded on August 12, 2014) was used to annotate all variants using the UCSC “known genes” gene structures [22].

Variant filtering

Coding, splicing, and non-coding (intronic, 3’ or 5’ untranslated region (UTR), and 1kb upstream of the transcription start or downstream of the transcription end site) somatic variant calls in the NHGRI cohort were displayed using VarSifter [21]. A minimum of 14 reads covering a site in the tumor and 8 in the normal were required for mutation calling [23, 24]; potential germline variants (those with a variant allele frequency (VAF) of greater than 3% in matched normal samples) were excluded. Coding and splice-site single nucleotide variants (SNVs) were annotated against dbSNP Build 135 and nonpathogenic single nucleotide polymorphisms (SNPs) with a minor allele frequency (MAF) greater than 5% were excluded. Indel variants that were present in dbSNP Build 135 were excluded without further evaluation of MAF. SNVs called by all four algorithms and indels called by either Strelka or Shimmer were retained and further annotated against GENCODE hg19 using Oncotator (v1.5.3.0) (http://www.broadinstitute.org/oncotator) [25]; noncoding variants, those with a variant classification of UTR, Flank, lincRNA, RNA, Intron, or De novo start were excluded.

TCGA data analysis

A subset of TCGA Uterine Corpus Endometrial Carcinoma (UCEC) somatic mutation data (TCGA UCEC PanCancer Atlas [13]) was extracted from the MC3 Public MAF file (mc3.v0.2.8.PUBLIC.maf.gz, https://gdc.cancer.gov/about-data/publications/mc3-2017) [26].

Briefly, the MC3 Public MAF file was filtered to include somatic variants from 336 EECs from the MSI-hypermutated (n=141), copy number-low/MSS (n=140) or copy number-high (n=55) molecular subgroups; variants from EECs within the ultramutated-POLE molecular subgroup or those without a molecular subgroup assignment were excluded (S2 Table). Molecular subtype annotation for each sample was obtained from the cBioPortal for Cancer Genomics [27, 28].

Variants with a PASS, WGA, or Native_WGA_mix designation as described by [26] were retained and further filtered to include SNVs called by MuTect and Indels called by Indelocator [13]. The final set of selected variants was annotated against GENCODE hg19 using Oncotator (v1.5.3.0) (http://www.broadinstitute.org/oncotator) [25]; noncoding variants, those with a variant classification of UTR, Flank, lincRNA, RNA, Intron, or De novo start were excluded. Additional clinicopathologic information for each tumor, including histology, stage, and grade, was obtained from Berger et al [13], and the cBioPortal for Cancer Genomics (URL: https://www.cbioportal.org/) [27, 28] (S2 Table). Early-stage tumors were defined herein as stage I or II; late-stage tumors were defined as stage III or IV.

Annotation of statistically significantly mutated genes

Statistically significantly mutated genes (SMGs) were annotated using MutSigCV (v1.4). Briefly, MutSigCV (v1.4) was run on the NIH high-performance computing Biowulf cluster (http://hpc.nih.gov) using the coverage, covariate, and mutation type dictionary files provided by the Broad Institute. Filtered somatic variants for each data set were annotated against GENCODE hg19 using Oncotator (http://www.broadinstitute.org/oncotator) [25], noncoding variants were excluded in accordance with a published approach [29], and the resulting coding mutation annotation format (maf) files were uploaded to the Biowulf cluster. Somatically mutated genes with a false discovery rate (q-value) ≤0.10 were defined as SMGs in accordance with a published approach [30].

Power analysis

MutSigCV’s statistical power to detect SMGs was estimated using the binomial model described in [30]. Briefly, the probability of obtaining a p-value <= 0.1/14 (for 14 tests) was calculated assuming a background mutation rate of p₀ = 1 − (1 − µf_g)^3/4L, where µ is the background mutation rate, and f_g=3.9 and L=1500 are the 90^th percentile gene-specific mutation rate factor and gene length, respectively. We also assumed a signal mutation rate of p₁ = p₀ + r(1 – m), where r is the frequency of non-silent mutations in tumor samples and m=0.1 is the mis-detection rate. Power estimates were performed and plotted for a range of mutation rates and frequencies (S1 Figure) using an R script available at https://github.com/nhansen/LateStageEECs.

Determining whether late-stage SMGs are statistically significantly mutated in early-stage tumors

MutSigCV (v1.4) was run as described above on the set of filtered somatic variants from the 270 early-stage EECs to obtain p-values for all mutated genes. For all genes annotated as SMGs in late-stage tumors, q-values were re-calculated from the MutSigCV (v1.4) p-values assigned to the early-stage data, adjusting for 14 tests (reflecting the total number of SMGs identified in late-stage tumors).

In silico prediction of functional consequences for somatic variants

MutationAssessor [31], PROVEAN (Protein Variation Effect Analyzer) [32], SIFT (Sorting Intolerant From Tolerant) [33], and PolyPhen-2 (Polymorphism Phenotyping v2) [34], were used to predict the effects of missense mutations on protein function. For each algorithm, the following descriptors were considered as impacting protein function: “high” (MutationAssessor), “deleterious” (PROVEAN), “damaging” (SIFT), and “probably-damaging” (PolyPhen-2).

Agreement across at least three of the four prediction methods was required to assign an overall determination of “functional impact” for a missense mutation.

Survival analyses

We utilized the cBioPortal for Cancer Genomics (https://www.cbioportal.org/) to query the relationship between SMG mutation status and survival (overall-, disease-free-, progression-free-, and disease-specific-survival) stratifying cases by stage (all stages, early-stage, late-stage) and molecular subgroup (MSI-hypermutated, CN-low, CN-high, all non-ultramutated), and applying a Bonferroni correction (number of SMGs multiplied by 48 (3 stage subgroups/4 molecular subgroups/4 survival subgroups)) to account for multiple testing.

Results

Identification of SMGs among late-stage EECs

For the NHGRI late-stage cohort (n=15), the average depth of coverage within regions targeted by the capture kit for tumor and normal samples was 67.2x and 65.5x, respectively; 90.87% of targeted bases for each tumor/normal pair had sufficient coverage for variant calling (S3 Table). Using a combination of somatic variant calling algorithms and stringent filtering parameters, we identified 2,879 high-confidence coding and splice-site somatic variants (consisting of 2,214 nonsynonymous (1,405 SNVs, 809 indels), 92 splice-site, and 573 synonymous variants) (S4 Table). Combined, the 2,306 nonsynonymous and splice-site variants affected 1,968 protein-coding genes and averaged 153.7 variants per tumor (range 9-542 per tumor) (S4 and S5 Tables). For the TCGA late-stage cohort (n=66), we extracted a total of 28,996 somatic coding and splice-site variants distributed among 10,504 protein-encoding genes (S6 and S7 Tables). Using MutSigCV (v1.4), we identified a total of 14 unique late-stage SMGs (Fig 1), representing 6 SMGs (q-value ≤0.1) in the NHGRI (Table 1) and 12 SMGs in the TCGA late-stage EEC cohorts (Table 2).

View this table:

Table 1. Statistically significantly mutated genes (q≤0.10) identified within the NHGRI cohort of 15 late-stage EEC exomes

View this table:

Table 2. Statistically significantly mutated genes (q≤0.10) identified among 66 late-stage TCGA EECs

Figure 1. Statistically significantly mutated genes (SMGs) in late-stage and early-stage EEC cohorts.

Venn diagram showing unique and shared SMGs identified by MutSigCV (v1.4) analysis of 15 NHGRI late-stage EECs and 66 TCGA late-stage EECs.

KLF3 and PAX6 are SMGs in late-stage but not early-stage EEC

To test whether each of the 14 late-stage SMGs are also statistically significantly mutated in the TCGA early-stage EECs (n=270), we first estimated MutSigCV’s power to detect genes as significantly mutated in the early-stage cohort. Estimating power using a binomial model as described in [35], we determined that the data from 270 tumors, when tested on 14 genes, yields >95% power to detect genes as significantly mutated across a wide range of background mutation rates when at least 10% of the 270 tumors are mutated in that gene (S1 Figure). Next, we obtained somatic variants for the cohort of non-ultramutated TCGA early-stage EECs; there were 162,763 somatic coding- and splice-site variants affecting 17,435 protein-encoding genes (S8 and S9 Tables). To determine whether any of the 14 late-stage SMGs were significantly mutated in this dataset, p-values for all somatically mutated genes in early-stage tumors were calculated and used to determine q-values adjusting for 14 tests (reflecting the 14 late-stage SMGs queried) using the Benjamini-Hochberg procedure [36] (Table 3). Results showed that 12 of 14 late-stage SMGs were statistically significantly mutated (q-value <0.1) in early-stage EECs whereas two late-stage SMGs, KLF3 (Krüppel Like Factor 3) and PAX6 (Paired Box 6), were not (Table 3). Somatic mutations were more frequent among late-stage tumors than early-stage tumors for both KLF3 (10.6% (7 of 66) late-stage vs 4.8% (13 of 270) early-stage) and PAX6 (10.6% (7 of 66) late-stage vs 1.9% (5 of 270) early-stage) (Table 4).

View this table:

Table 3. PAX6 and KLF3 are the only late-stage EEC SMGs (q-value ≤0.1) that are not statistically significantly mutated in early-stage EEC

View this table:

Table 4. Frequency of non-silent KLF3 and PAX6 mutations in non-ultramutated EECs, according to molecular subgroup

Late-stage-specific EEC SMG (KLF3 and PAX6) mutations occur in MSI-hypermutated EEC and are predicted to affect protein function

For the TCGA cohorts, we evaluated the distribution of KLF3 and PAX6 mutations across the MSI-hypermutated (n=141 cases), CN-low (n=140 cases), and CN-high (n=55 cases) molecular subgroups (Table 4). KLF3 mutations occurred exclusively in the MSI-hypermutated subgroup at an overall frequency of 14.2% (20 of 141 cases), which was significantly higher than the occurrence of KLF3 mutations among the combined CN-high and CN-low subgroups (0 of 195 cases) (p-value < 0.0001 2-tailed Fisher’s exact test). Within the MSI-hypermutated subgroup, KLF3 was mutated in 25.9% (7 of 27) of late-stage tumors versus 11.4% (13 of 114) of early-stage tumors. There were no statistically significant differences in KLF3 mutation frequency according to tumor grade; mutations were present in 14.2% of grade 1 (4 of 28), 8.1% of grade 2 (3 of 37), and 13.2% of grade 3 (11 of 83) MSI tumors (S10 Table).

All but one (11 of 12) of PAX6 mutations were in the MSI subgroup; the PAX6^X306_splice mutation was present in a CN-low tumor (Table 4). The higher frequency of PAX6 mutations in the MSI-hypermutated subgroup compared to other subgroups was statistically significant (p-value = 0.0004, 2-tailed Fisher’s exact test). Within the MSI-hypermutated subgroup, PAX6 was mutated in 7.8% (11 of 141) of tumors; mutations in late-stage tumors were more frequent compared to early-stage tumors (25.9% (7 of 27) versus 3.5% (4 of 114)). There was no significant difference in the frequency of PAX6 mutations between tumors of differing grade; PAX6 mutations were present in 3.6% of grade 1 (1 of 28), 13.5% of grade 2 (5 of 37) and 7.9% of grade 3 (6 of 76) MSI-hypermutated tumors (S10 Table). We observed no statistically significant differences in KLF3 or PAX6 mutation frequencies between POLE/POLD1-mutated and POLE/POLD1-wildtype cases within the MSI-hypermutated subgroup (S11 Table).

A majority of KLF3 and PAX6 mutations were indels within homopolymer tracts, resulting in frameshifts; the KLF3^K106Nfs*21, KLF3^P226Rfs*52, KLF3^Q227Afs*37, and PAX6^P375Hfs*7 frameshift mutations were recurrent (Fig 2). Six of 21 (28.6%) KLF3 mutations and 3 of 11 (27.3%) PAX6 mutations were missense mutations; KLF3^R257W, KLF3^R261G and PAX6^A33T were predicted to affect protein function by 3 of 4 in silico algorithms (S12 Table).

Figure 2. Spectrum of KLF3 and PAX6 somatic mutations in late-stage and early-stage non-ultramutated EECs.

Lollipop plots showing the positions of somatic mutations in (A) KLF3 and (B) PAX6 relative to protein domains. Mutations in NHGRI and TCGA late-stage EEC (red and orange) cohorts and the TCGA early-stage (blue) EEC cohort are distinguished.

Survival analysis

We utilized the cBioPortal for Cancer Genomics (https://www.cbioportal.org/) to query the relationship between patient survival and somatic mutation status of all 14 late-stage SMGs identified herein, applying a Bonferroni correction to account for multiple testing (672 tests). With respect to KLF3 and PAX6 in the MSI-hypermutated subgroup, no significant differences in overall survival (OS), progression-free survival (PFS), disease-free survival (DFS) or disease-specific survival (DSS) were observed between mutated and non-mutated tumors when all stages were combined or when early- and late-stage tumors were considered separately (S13 Table). For the remaining 12 SMGs, there were no statistically significant differences in survival for any stage or molecular subgrouping (data not shown).

Discussion

The mutational landscape of EEC was reported by TCGA in an initial 2013 study and a subsequent “pan-gyn” study which included the 2013 EEC cohort and additional cases. Both studies performed in silico annotation of SMGs, which represent candidate driver genes, in a stage-agnostic manner. However, cancer genomes are dynamic and the mutational repertoire of tumors can evolve during progression and metastasis [37]. Recent comparisons of primary and metastatic endometrial cancer genomes have demonstrated diverge sease, raising the possibility that KLF3 and PAX6 mutations undergo positive selection during tumor progression.

KLF3 encodes a zinc nce in their mutational landscapes [38-40]. But exome-wide comparisons of late-stage and early-stage primary tumors are lacking. Here, our stage-specific analysis of TCGA mutation data for non-ultramutated EECs showed that KLF3 and PAX6 are SMGs in late-stage (III/IV) but not early-stage (I/II) di finger transcription factor with roles in adipogenesis, erythroid maturation, B-cell differentiation, and cardiovascular development (reviewed in [41]). The encoded protein includes an N-terminal CtBP-binding motif, three C-terminal Cys2His2 zinc finger domains, and a primary phosphorylation site at serine-249 that is important for DNA binding and enhancing transcriptional repression [41]. In our analysis of NHGRI EEC exomes and TCGA mutation data, the majority of KLF3 mutations, including three mutation hotspots, were frameshift mutations that occur N-terminal to the zinc finger domains and to serine-249. It is likely that these mutations result in nonsense-mediated decay and haploinsufficiency because the associated premature stop codons are located more than 50-55 nucleotides upstream of the final exon-exon junction [42]. In addition, in silico analyses predicted deleterious effects for the KLF3^R257W and KLF3^R261G missense mutants that occur in EEC; KLF3^R257W also occurs somatically in 2 colorectal cancers (1 MSI-high/CIMP (CpG island methylator phenotype)-low; 1 CIN (chromosome instability)-subgroup) [43, 44].

The fact that KLF3 mutations in EEC occur predominantly at homopolymer tracts, were restricted to the MSI-hypermutated EEC subgroup, and are more frequently mutated in late-stage than early-stage MSI-hypermutated tumors (25.9% versus 11.4%, respectively), indicate that KLF3 is an MSI target gene that may be involved in the etiology and progression of a subset of hypermutated EECs. Consistent with the idea that KLF3 is an MSI target gene, frameshift mutations at codons 106 and 227, which are recurrent in MSI-EECs, are also recurrent in the colorectal MSI-colorectal and MSI-stomach TCGA molecular subgroups [27, 28, 44, 45].

Studies in other tumor types have reported KLF3 alterations as adverse prognosticators. For example, decreased KLF3 expression in colorectal and cervical cancers is associated with lymph node positivity and poorer outcomes [46, 47]. Conflicting data exist regarding the occurrence and effects of reduced KLF3 levels in lung cancer. However, one study reported lower levels of KLF3 mRNA and protein expression in lung adenocarcinomas compared with adjacent normal tissues and more frequent loss of KLF3 expression in late-versus early-stage disease [48]. Although we found KLF3 is a late-stage-specific SMG in EEC, there was no significant association between KLF3 mutation status and survival for EEC patients, possibly reflecting tissue-specific differences in KLF3 association with outcome, and/or outcome differences between mutation and reduced expression of KLF3.

The second late-stage-specific SMG identified in our study was PAX6 (paired box protein Pax-6). PAX6 encodes a highly conserved paired box transcription factor that includes paired box and homeobox DNA-binding domains and a C-terminal transactivation domain (TAD); the final 40 residues of the TAD influence homeobox-DNA binding [49]. PAX6 has important roles in the development of several tissue types, including the eye (reviewed in [50]). Inherited and de novo nonsense and frameshift mutations in PAX6 cause the autosomal dominant eye disorder aniridia 1, whereas germline missense mutations are associated with attenuated ocular phenotypes [51]. Dysregulation of PAX6 expression has been implicated in a variety of human cancers, resulting in tumor suppressive or oncogenic phenotypes depending on the cellular context [52-64]. A recent study reported a potential role for epigenetic silencing of PAX6 in EC progression based on hypermethylation of PAX6 in primary EC versus endometrial hyperplasia, and in metastatic EC versus primary EC [65]. Our analysis of TCGA mutation data found that PAX6 mutations almost exclusively occur in MSI-hypermutated tumors. This observation, coupled with the fact that PAX6 mutations were more frequent among late-stage than early-stage MSI-hypermutated tumors (25.9% versus 3.5%, respectively), raise the possibility that, like KLF3 mutations, PAX6 mutations may be pathogenic drivers of tumor progression in the context of MSI-hypermutated EECs.

Most PAX6 mutations in TCGA MSI-hypermutated EECs were the recurrent PAX6^P375Hfs*7 frameshift mutation in the transactivation domain [11, 13]. We predict that PAX6^P375Hfs*7 and an adjacent PAX6^H376Tfs*36 frameshift mutation encode truncated proteins with reduced transactivation capacity, because the associated premature stop codons are located within 50 nucleotides of the penultimate exon-exon junction and are located proximal to a synthetic nonsense mutation (PAX6^Q422X) that exhibits reduced transactivation capacity in vitro [66]. Moreover, the fact that the PAX6^P375Q aniridia-associated missense mutation results in attenuated DNA binding affinity in vitro [66], raises the possibility that the recurrent PAX6^P375Hfs*7 mutant also may have attenuated DNA binding. Similar to KLF3 frameshift mutations, the PAX6^P375Hfs*7 and PAX6^H376Tfs*36 frameshift mutations in EEC both arise within a (C)₇ homopolymer tract indicating that PAX6 is an MSI target gene. Consistent with this idea is the fact that PAX6 frameshift mutations originating at codon 375 and/or codon 376 are also recurrent in MSI-stomach cancer and MSI-colorectal carcinoma [27, 28, 45].

Compared to frameshift mutations, PAX6 missense mutations are relatively rare in the non-ultramutated TCGA cohort, occurring in three cases. The PAX6^A33T EC-mutant occurs in the N-terminal paired box domain at a residue highly conserved across paired domains in Pax family members and other proteins and is predicted to impact function [67]. A different substitution at this residue (PAX6^A33P) exhibits altered transactivation activity in vitro and is a germline variant associated with partial aniridia [67, 68]. The other two PAX6 missense mutations in EC (PAX6^E220G and PAX6^G141S) were not uniformly predicted to be functionally significant in our analysis and, to our knowledge, are not pathogenic variants for ocular phenotypes.

In conclusion, our findings indicate that KLF3 and PAX6 are candidate driver genes in a subset of late-stage hypermutated EECs and are MSI target genes. Despite sufficient power, neither KLF3 nor PAX6 were detected as candidate driver genes in early-stage EECs. To our knowledge, this is the first study to annotate KLF3 and PAX6 as late stage-specific SMGs in EEC. Our findings warrant future studies to independently validate the enrichment of PAX6 and KLF3 mutations in late-stage, MSI-hypermutated EECs and to determine the functional effects of recurrent frameshift mutations in these genes particularly in regard to phenotypic properties associated with tumor progression.

Data Availability

Exome sequencing data for the NHGRI tumor-normal cohort have been deposited in dbGAP under controlled access

https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001153.v1.p1

References

1.↵
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424. doi: 10.3322/caac.21492
OpenUrl CrossRef PubMed
2.↵
American Cancer Society. Cancer facts & figures 2021. Atlanta: American Cancer Society;2021:1–67.
3.↵
Lortet-Tieulent J, Ferlay J, Bray F, Jemal A. International patterns and trends in endometrial cancer incidence, 1978-2013. J Natl Cancer Inst. 2018;110(4):354–61. doi: 10.1093/jnci/djx214
OpenUrl CrossRef PubMed
4.↵
Smrz SA, Calo C, Fisher JL, Salani R. An ecological evaluation of the increasing incidence of endometrial cancer and the obesity epidemic. Am J Obstet Gynecol. 2020. doi: 10.1016/j.ajog.2020.10.042
OpenUrl CrossRef
5.↵
Creasman WT, Odicino F, Maisonneuve P, Quinn MA, Beller U, Benedet JL, et al. Carcinoma of the corpus uteri. FIGO 26th annual report on the results of treatment in gynecological cancer. Int J Gynaecol Obstet. 2006;95 Suppl 1:S105–43. doi: 10.1016/S0020-7292(06)60031-3
OpenUrl CrossRef PubMed
6.↵
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70(1):7–30. doi: 10.3322/caac.21590
OpenUrl CrossRef PubMed
7.↵
Gadducci A, Cosio S, Genazzani AR. Old and new perspectives in the pharmacological treatment of advanced or recurrent endometrial cancer: Hormonal therapy, chemotherapy and molecularly targeted therapies. Crit Rev Oncol Hematol. 2006;58(3):242–56. doi: 10.1016/j.critrevonc.2005.11.002
OpenUrl CrossRef PubMed
8.
Bakkum-Gamez JN, Gonzalez-Bosquet J, Laack NN, Mariani A, Dowdy SC. Current issues in the management of endometrial cancer. Mayo Clin Proc. 2008;83(1):97–112. doi: 10.4065/83.1.97
OpenUrl CrossRef PubMed
9.↵
Fleming GF. Systemic chemotherapy for uterine carcinoma: Metastatic and adjuvant. J Clin Oncol. 2007;25(20):2983–90. doi: 10.1200/JCO.2007.10.8431
OpenUrl FREE Full Text
10.↵
Singh N, Hirschowitz L, Zaino R, Alvarado-Cabrero I, Duggan MA, Ali-Fehmi R, et al. Pathologic prognostic factors in endometrial carcinoma (other than tumor type and grade). Int J Gynecol Pathol. 2019;38 Suppl 1:S93–S113. doi: 10.1097/PGP.0000000000000524
OpenUrl CrossRef PubMed
11.↵
Cancer Genome Atlas Research Network, Kandoth C, Schultz N, Cherniack AD, Akbani R, Liu Y, et al. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497(7447):67–73. doi: 10.1038/nature12113
OpenUrl CrossRef PubMed Web of Science
12.↵
Urick ME, Bell DW. Clinical actionability of molecular targets in endometrial cancer. Nat Rev Cancer. 2019;19(9):510–21. doi: 10.1038/s41568-019-0177-x
OpenUrl CrossRef
13.↵
Berger AC, Korkut A, Kanchi RS, Hegde AM, Lenoir W, Liu W, et al. A comprehensive pan-cancer molecular study of gynecologic and breast cancers. Cancer Cell. 2018;33(4):690–705 e9. doi: 10.1016/j.ccell.2018.03.014
OpenUrl CrossRef
14.
Byron SA, Gartside M, Powell MA, Wellens CL, Gao F, Mutch DG, et al. FGFR2 point mutations in 466 endometrioid endometrial tumors: Relationship with MSI, KRAS, PIK3CA, CTNNB1 mutations and clinicopathological features. PLoS One. 2012;7(2):e30801. doi: 10.1371/journal.pone.0030801
OpenUrl CrossRef PubMed
15.↵
McMeekin DS, Tritchler DL, Cohn DE, Mutch DG, Lankes HA, Geller MA, et al. Clinicopathologic significance of mismatch repair defects in endometrial cancer: An NRG oncology/gynecologic oncology group study. J Clin Oncol. 2016;34(25):3062–8. doi: 10.1200/JCO.2016.67.8722
OpenUrl Abstract/FREE Full Text
16.↵
Le Gallo M, O’Hara AJ, Rudd ML, Urick ME, Hansen NF, O’Neil NJ, et al. Exome sequencing of serous endometrial tumors identifies recurrent somatic mutations in chromatin-remodeling and ubiquitin ligase complex genes. Nat Genet. 2012;44(12):1310–5. doi: 10.1038/ng.2455
OpenUrl CrossRef PubMed
17.↵
Saunders CT, Wong WS, Swamy S, Becq J, Murray LJ, Cheetham RK. Strelka: Accurate somatic small-variant calling from sequenced tumor-normal sample pairs. Bioinformatics. 2012;28(14):1811–7. doi: 10.1093/bioinformatics/bts271
OpenUrl CrossRef PubMed Web of Science
18.↵
Hansen NF, Gartner JJ, Mei L, Samuels Y, Mullikin JC. Shimmer: Detection of genetic alterations in tumors using next-generation sequence data. Bioinformatics. 2013;29(12):1498–503. doi: 10.1093/bioinformatics/btt183
OpenUrl CrossRef PubMed
19.↵
Larson DE, Harris CC, Chen K, Koboldt DC, Abbott TE, Dooling DJ, et al. SomaticSniper: Identification of somatic point mutations in whole genome sequencing data. Bioinformatics. 2012;28(3):311–7. doi: 10.1093/bioinformatics/btr665
OpenUrl CrossRef PubMed Web of Science
20.↵
Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol. 2013;31(3):213–9. doi: 10.1038/nbt.2514
OpenUrl CrossRef PubMed
21.↵
Teer JK, Green ED, Mullikin JC, Biesecker LG. Varsifter: Visualizing and analyzing exome-scale sequence variation data on a desktop computer. Bioinformatics. 2012;28(4):599–600. doi: 10.1093/bioinformatics/btr711
OpenUrl CrossRef PubMed Web of Science
22.↵
Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. doi: 10.1093/nar/gkq603
OpenUrl CrossRef PubMed
23.↵
Barbieri CE, Baca SC, Lawrence MS, Demichelis F, Blattner M, Theurillat JP, et al. Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer. Nat Genet. 2012;44(6):685–9. doi: 10.1038/ng.2279
OpenUrl CrossRef PubMed
24.↵
Witkiewicz AK, McMillan EA, Balaji U, Baek G, Lin WC, Mansour J, et al. Whole-exome sequencing of pancreatic cancer defines genetic diversity and therapeutic targets. Nat Commun. 2015;6:6744. doi: 10.1038/ncomms7744
OpenUrl CrossRef PubMed
25.↵
Ramos AH, Lichtenstein L, Gupta M, Lawrence MS, Pugh TJ, Saksena G, et al. Oncotator: cancer variant annotation tool. Hum Mutat. 2015;36(4):E2423–9. doi: 10.1002/humu.22771
OpenUrl CrossRef PubMed
26.↵
Ellrott K, Bailey MH, Saksena G, Covington KR, Kandoth C, Stewart C, et al. Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines. Cell Syst. 2018;6(3):271–81 e7. doi: 10.1016/j.cels.2018.03.002
OpenUrl CrossRef PubMed
27.↵
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio cancer genomics portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2(5):401–4. doi: 10.1158/2159-8290.CD-12-0095
OpenUrl Abstract/FREE Full Text
28.↵
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6(269):pl1. doi: 10.1126/scisignal.2004088
OpenUrl Abstract/FREE Full Text
29.↵
Tokheim CJ, Papadopoulos N, Kinzler KW, Vogelstein B, Karchin R. Evaluating the evaluation of cancer driver genes. Proc Natl Acad Sci U S A. 2016;113(50):14330–5. doi: 10.1073/pnas.1616440113
OpenUrl Abstract/FREE Full Text
30.↵
Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499(7457):214–8. doi: 10.1038/nature12213
OpenUrl CrossRef PubMed Web of Science
31.↵
Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011;39(17):e118. doi: 10.1093/nar/gkr407
OpenUrl CrossRef PubMed Web of Science
32.↵
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688. doi: 10.1371/journal.pone.0046688
OpenUrl CrossRef PubMed
33.↵
Ng PC, Henikoff S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31(13):3812–4. doi: 10.1093/nar/gkg509
OpenUrl CrossRef PubMed Web of Science
34.↵
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9. doi: 10.1038/nmeth0410-248
OpenUrl CrossRef PubMed Web of Science
35.↵
Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. 2014;505(7484):495–501. doi: 10.1038/nature12912
OpenUrl CrossRef PubMed Web of Science
36.↵
Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x
OpenUrl CrossRef PubMed
37.↵
Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. 2009;458(7239):719–24. doi: 10.1038/nature07943
OpenUrl CrossRef PubMed Web of Science
38.↵
Gibson WJ, Hoivik EA, Halle MK, Taylor-Weiner A, Cherniack AD, Berg A, et al. The genomic landscape and evolution of endometrial carcinoma progression and abdominopelvic metastasis. Nat Genet. 2016;48(8):848–55. doi: 10.1038/ng.3602
OpenUrl CrossRef PubMed
39.
Soumerai TE, Donoghue MTA, Bandlamudi C, Srinivasan P, Chang MT, Zamarin D, et al. Clinical utility of prospective molecular characterization in advanced endometrial cancer. Clin Cancer Res. 2018;24(23):5939–47. doi: 10.1158/1078-0432.CCR-18-0412
OpenUrl Abstract/FREE Full Text
40.↵
Ashley CW, Da Cruz Paula A, Kumar R, Mandelker D, Pei X, Riaz N, et al. Analysis of mutational signatures in primary and metastatic endometrial cancer reveals distinct patterns of DNA repair defects and shifts during tumor progression. Gynecol Oncol. 2019;152(1):11–9. doi: 10.1016/j.ygyno.2018.10.032
OpenUrl CrossRef PubMed
41.↵
Pearson RC, Funnell AP, Crossley M. The mammalian zinc finger transcription factor kruppel-like factor 3 (KLF3/BKLF). IUBMB Life. 2011;63(2):86–93. doi: 10.1002/iub.4222
OpenUrl CrossRef PubMed Web of Science
42.↵
Popp MW, Maquat LE. Leveraging rules of nonsense-mediated mRNA decay for genome engineering and personalized medicine. Cell. 2016;165(6):1319–22.doi: 10.1016/j.cell.2016.05.053
OpenUrl CrossRef PubMed
43.↵
Giannakis M, Mu XJ, Shukla SA, Qian ZR, Cohen O, Nishihara R, et al. Genomic correlates of immune-cell infiltrates in colorectal carcinoma. Cell Rep. 2016;15(4):857–65. doi: 10.1016/j.celrep.2016.03.075
OpenUrl CrossRef PubMed
44.↵
The Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487(7407):330–7. doi: 10.1038/nature11252
OpenUrl CrossRef PubMed Web of Science
45.↵
Liu Y, Sethi NS, Hinoue T, Schneider BG, Cherniack AD, Sanchez-Vega F, et al. Comparative molecular analysis of gastrointestinal adenocarcinomas. Cancer Cell. 2018;33(4):721–35 e8.doi: 10.1016/j.ccell.2018.03.010
OpenUrl CrossRef PubMed
46.↵
Lyng H, Brovig RS, Svendsrud DH, Holm R, Kaalhus O, Knutstad K, et al. Gene expressions and copy numbers associated with metastatic phenotypes of uterine cervical cancer. BMC Genomics. 2006;7:268. doi: 10.1186/1471-2164-7-268
OpenUrl CrossRef PubMed
47.↵
Wang X, Jiang Z, Zhang Y, Wang X, Liu L, Fan Z. RNA sequencing analysis reveals protective role of kruppel-like factor 3 in colorectal cancer. Oncotarget. 2017;8(13):21984–93. doi: 10.18632/oncotarget.15766
OpenUrl CrossRef
48.↵
Sun W, Hu S, Zu Y, Deng Y. KLF3 is a crucial regulator of metastasis by controlling STAT3 expression in lung cancer. Mol Carcinog. 2019;58(11):1933-45.doi:10.1002/mc.23072
OpenUrl CrossRef
49.↵
Shukla S, Mishra R. Predictions on impact of missense mutations on structure function relationship of PAX6 and its alternatively spliced isoform PAX6(5a). Interdiscip Sci. 2012;4(1):54–73. doi: 10.1007/s12539-012-0114-0
OpenUrl CrossRef PubMed
50.↵
Cvekl A, Callaerts P. PAX6: 25th anniversary and more to learn. Exp Eye Res. 2017;156:10–21. doi: 10.1016/j.exer.2016.04.017
OpenUrl CrossRef
51.↵
Lima Cunha D, Arno G, Corton M, Moosajee M. The spectrum of PAX6 mutations and genotype-phenotype correlations in the eye. Genes (Basel). 2019;10(12).1050. doi: 10.3390/genes10121050
OpenUrl CrossRef
52.↵
Li Y, Li Y, Liu Y, Xie P, Li F, Li G. Pax6, a novel target of microRNA-7, promotes cellular proliferation and invasion in human colorectal cancer cells. Dig Dis Sci. 2014;59(3):598-606.doi:10.1007/s10620-013-2929-x
OpenUrl CrossRef PubMed
53.
Zong X, Yang H, Yu Y, Zou D, Ling Z, He X, et al. Possible role of PAX-6 in promoting breast cancer cell proliferation and tumorigenesis. BMB Rep. 2011;44(9):595–600. doi: 10.5483/bmbrep.2011.44.9.595
OpenUrl CrossRef PubMed
54.
Kiselev Y, Andersen S, Johannessen C, Fjukstad B, Standahl Olsen K, Stenvold H, et al. Transcription factor PAX6 as a novel prognostic factor and putative tumour suppressor in non-small cell lung cancer. Sci Rep. 2018;8(1):5059. doi: 10.1038/s41598-018-23417-z
OpenUrl CrossRef
55.
Mayes DA, Hu Y, Teng Y, Siegel E, Wu X, Panda K, et al. PAX6 suppresses the invasiveness of glioblastoma cells and the expression of the matrix metalloproteinase-2 gene. Cancer Res. 2006;66(20):9809–17. doi: 10.1158/0008-5472.CAN-05-3877
OpenUrl Abstract/FREE Full Text
56.
Shyr CR, Tsai MY, Yeh S, Kang HY, Chang YC, Wong PL, et al. Tumor suppressor PAX6 functions as androgen receptor co-repressor to inhibit prostate cancer growth. Prostate. 2010;70(2):190–9. doi: 10.1002/pros.21052
OpenUrl CrossRef PubMed Web of Science
57.
Zhou YH, Wu X, Tan F, Shi YX, Glass T, Liu TJ, et al. PAX6 suppresses growth of human glioblastoma cells. J Neurooncol. 2005;71(3):223–9. doi: 10.1007/s11060-004-1720-4
OpenUrl CrossRef PubMed
58.
Hegge B, Sjottem E, Mikkola I. Generation of a PAX6 knockout glioblastoma cell line with changes in cell cycle distribution and sensitivity to oxidative stress. BMC Cancer. 2018;18(1):496. doi: 10.1186/s12885-018-4394-6
OpenUrl CrossRef
59.
Huang BS, Luo QZ, Han Y, Li XB, Cao LJ, Wu LX. MicroRNA-223 promotes the growth and invasion of glioblastoma cells by targeting tumor suppressor PAX6. Oncol Rep. 2013;30(5):2263–9. doi: 10.3892/or.2013.2683
OpenUrl CrossRef
60.
Maulbecker CC, Gruss P. The oncogenic potential of PAX genes. EMBO J. 1993;12(6):2361–7. doi: 10.1002/j.1460-2075.1993.tb05890.x
OpenUrl CrossRef PubMed Web of Science
61.
Ooki A, Dinalankara W, Marchionni L, Tsay JJ, Goparaju C, Maleki Z, et al. Epigenetically regulated PAX6 drives cancer cells toward a stem-like state via GLI-SOX2 signaling axis in lung adenocarcinoma. Oncogene. 2018;37(45):5967–81. doi: 10.1038/s41388-018-0373-2
OpenUrl CrossRef
62.
Wu DM, Zhang T, Liu YB, Deng SH, Han R, Liu T, et al. The PAX6-ZEB2 axis promotes metastasis and cisplatin resistance in non-small cell lung cancer through PI3K/AKT signaling. Cell Death Dis. 2019;10(5):349. doi: 10.1038/s41419-019-1591-4
OpenUrl CrossRef
63.
Jin M, Gao D, Wang R, Sik A, Liu K. Possible involvement of TGF--SMAD epithelial-mesenchymal transition in pro-metastatic property of PAX6. Oncol Rep. 2020;44(2):555–64. doi: 10.3892/or.2020.7644
OpenUrl CrossRef
64.↵
Urrutia G, Laurito S, Campoy E, Nasif D, Branham MT, Roque M. PAX6 promoter methylation correlates with MDA-MB-231 cell migration, and expression of MMP2 and MMP9. Asian Pac J Cancer Prev 2018;19(10):2859–66. doi: 10.22034/APJCP.2018.19.10.2859
OpenUrl CrossRef
65.↵
Wu X, Miao J, Jiang J, Liu F. Analysis of methylation profiling data of hyperplasia and primary and metastatic endometrial cancers. Eur J Obstet Gynecol Reprod Biol. 2017;217:161–6. doi: 10.1016/j.ejogrb.2017.08.036
OpenUrl CrossRef
66.↵
Singh S, Chao LY, Mishra R, Davies J, Saunders GF. Missense mutation at the c-terminus of PAX6 negatively modulates homeodomain function. Hum Mol Genet. 2001;10(9):911–8. doi: 10.1093/hmg/10.9.911
OpenUrl CrossRef PubMed Web of Science
67.↵
Hanson I, Churchill A, Love J, Axton R, Moore T, Clarke M, et al. Missense mutations in the most ancient residues of the PAX6 paired domain underlie a spectrum of human congenital eye malformations. Hum Mol Genet. 1999;8(2):165–72. doi: 10.1093/hmg/8.2.165
OpenUrl CrossRef PubMed Web of Science
68.↵
Chauhan BK, Yang Y, Cveklova K, Cvekl A. Functional properties of natural human PAX6 and PAX6(5a) mutants. Invest Ophthalmol Vis Sci. 2004;45(2):385–92. doi: 10.1167/iovs.03-0968
OpenUrl Abstract/FREE Full Text

View the discussion thread.

Posted April 30, 2021.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Oncology

Subject Areas

All Articles

Addiction Medicine (354)
Allergy and Immunology (680)
Anesthesia (182)
Cardiovascular Medicine (2692)
Dentistry and Oral Medicine (318)
Dermatology (227)
Emergency Medicine (404)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (956)
Epidemiology (12326)
Forensic Medicine (10)
Gastroenterology (771)
Genetic and Genomic Medicine (4164)
Geriatric Medicine (392)
Health Economics (688)
Health Informatics (2702)
Health Policy (1010)
Health Systems and Quality Improvement (1004)
Hematology (366)
HIV/AIDS (864)
Infectious Diseases (except HIV/AIDS) (13771)
Intensive Care and Critical Care Medicine (805)
Medical Education (401)
Medical Ethics (110)
Nephrology (446)
Neurology (3947)
Nursing (214)
Nutrition (587)
Obstetrics and Gynecology (751)
Occupational and Environmental Health (704)
Oncology (2076)
Ophthalmology (596)
Orthopedics (244)
Otolaryngology (308)
Pain Medicine (253)
Palliative Medicine (75)
Pathology (474)
Pediatrics (1133)
Pharmacology and Therapeutics (473)
Primary Care Research (462)
Psychiatry and Clinical Psychology (3495)
Public and Global Health (6590)
Radiology and Imaging (1428)
Rehabilitation Medicine and Physical Therapy (835)
Respiratory Medicine (877)
Rheumatology (415)
Sexual and Reproductive Health (414)
Sports Medicine (345)
Surgery (455)
Toxicology (55)
Transplantation (191)
Urology (170)

[1] 1.↵
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424. doi: 10.3322/caac.21492
OpenUrl CrossRef PubMed

[2] 2.↵
American Cancer Society. Cancer facts & figures 2021. Atlanta: American Cancer Society;2021:1–67.

[3] 3.↵
Lortet-Tieulent J, Ferlay J, Bray F, Jemal A. International patterns and trends in endometrial cancer incidence, 1978-2013. J Natl Cancer Inst. 2018;110(4):354–61. doi: 10.1093/jnci/djx214
OpenUrl CrossRef PubMed

[4] 4.↵
Smrz SA, Calo C, Fisher JL, Salani R. An ecological evaluation of the increasing incidence of endometrial cancer and the obesity epidemic. Am J Obstet Gynecol. 2020. doi: 10.1016/j.ajog.2020.10.042
OpenUrl CrossRef

[5] 5.↵
Creasman WT, Odicino F, Maisonneuve P, Quinn MA, Beller U, Benedet JL, et al. Carcinoma of the corpus uteri. FIGO 26th annual report on the results of treatment in gynecological cancer. Int J Gynaecol Obstet. 2006;95 Suppl 1:S105–43. doi: 10.1016/S0020-7292(06)60031-3
OpenUrl CrossRef PubMed

[6] 6.↵
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70(1):7–30. doi: 10.3322/caac.21590
OpenUrl CrossRef PubMed

[7] 7.↵
Gadducci A, Cosio S, Genazzani AR. Old and new perspectives in the pharmacological treatment of advanced or recurrent endometrial cancer: Hormonal therapy, chemotherapy and molecularly targeted therapies. Crit Rev Oncol Hematol. 2006;58(3):242–56. doi: 10.1016/j.critrevonc.2005.11.002
OpenUrl CrossRef PubMed

[8] 8.
Bakkum-Gamez JN, Gonzalez-Bosquet J, Laack NN, Mariani A, Dowdy SC. Current issues in the management of endometrial cancer. Mayo Clin Proc. 2008;83(1):97–112. doi: 10.4065/83.1.97
OpenUrl CrossRef PubMed

[9] 9.↵
Fleming GF. Systemic chemotherapy for uterine carcinoma: Metastatic and adjuvant. J Clin Oncol. 2007;25(20):2983–90. doi: 10.1200/JCO.2007.10.8431
OpenUrl FREE Full Text

[10] 10.↵
Singh N, Hirschowitz L, Zaino R, Alvarado-Cabrero I, Duggan MA, Ali-Fehmi R, et al. Pathologic prognostic factors in endometrial carcinoma (other than tumor type and grade). Int J Gynecol Pathol. 2019;38 Suppl 1:S93–S113. doi: 10.1097/PGP.0000000000000524
OpenUrl CrossRef PubMed

[11] 11.↵
Cancer Genome Atlas Research Network, Kandoth C, Schultz N, Cherniack AD, Akbani R, Liu Y, et al. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497(7447):67–73. doi: 10.1038/nature12113
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Urick ME, Bell DW. Clinical actionability of molecular targets in endometrial cancer. Nat Rev Cancer. 2019;19(9):510–21. doi: 10.1038/s41568-019-0177-x
OpenUrl CrossRef

[13] 13.↵
Berger AC, Korkut A, Kanchi RS, Hegde AM, Lenoir W, Liu W, et al. A comprehensive pan-cancer molecular study of gynecologic and breast cancers. Cancer Cell. 2018;33(4):690–705 e9. doi: 10.1016/j.ccell.2018.03.014
OpenUrl CrossRef

[14] 14.
Byron SA, Gartside M, Powell MA, Wellens CL, Gao F, Mutch DG, et al. FGFR2 point mutations in 466 endometrioid endometrial tumors: Relationship with MSI, KRAS, PIK3CA, CTNNB1 mutations and clinicopathological features. PLoS One. 2012;7(2):e30801. doi: 10.1371/journal.pone.0030801
OpenUrl CrossRef PubMed

[15] 15.↵
McMeekin DS, Tritchler DL, Cohn DE, Mutch DG, Lankes HA, Geller MA, et al. Clinicopathologic significance of mismatch repair defects in endometrial cancer: An NRG oncology/gynecologic oncology group study. J Clin Oncol. 2016;34(25):3062–8. doi: 10.1200/JCO.2016.67.8722
OpenUrl Abstract/FREE Full Text

[16] 16.↵
Le Gallo M, O’Hara AJ, Rudd ML, Urick ME, Hansen NF, O’Neil NJ, et al. Exome sequencing of serous endometrial tumors identifies recurrent somatic mutations in chromatin-remodeling and ubiquitin ligase complex genes. Nat Genet. 2012;44(12):1310–5. doi: 10.1038/ng.2455
OpenUrl CrossRef PubMed

[17] 17.↵
Saunders CT, Wong WS, Swamy S, Becq J, Murray LJ, Cheetham RK. Strelka: Accurate somatic small-variant calling from sequenced tumor-normal sample pairs. Bioinformatics. 2012;28(14):1811–7. doi: 10.1093/bioinformatics/bts271
OpenUrl CrossRef PubMed Web of Science

[18] 18.↵
Hansen NF, Gartner JJ, Mei L, Samuels Y, Mullikin JC. Shimmer: Detection of genetic alterations in tumors using next-generation sequence data. Bioinformatics. 2013;29(12):1498–503. doi: 10.1093/bioinformatics/btt183
OpenUrl CrossRef PubMed

[19] 19.↵
Larson DE, Harris CC, Chen K, Koboldt DC, Abbott TE, Dooling DJ, et al. SomaticSniper: Identification of somatic point mutations in whole genome sequencing data. Bioinformatics. 2012;28(3):311–7. doi: 10.1093/bioinformatics/btr665
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol. 2013;31(3):213–9. doi: 10.1038/nbt.2514
OpenUrl CrossRef PubMed

[21] 21.↵
Teer JK, Green ED, Mullikin JC, Biesecker LG. Varsifter: Visualizing and analyzing exome-scale sequence variation data on a desktop computer. Bioinformatics. 2012;28(4):599–600. doi: 10.1093/bioinformatics/btr711
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. doi: 10.1093/nar/gkq603
OpenUrl CrossRef PubMed

[23] 23.↵
Barbieri CE, Baca SC, Lawrence MS, Demichelis F, Blattner M, Theurillat JP, et al. Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer. Nat Genet. 2012;44(6):685–9. doi: 10.1038/ng.2279
OpenUrl CrossRef PubMed

[24] 24.↵
Witkiewicz AK, McMillan EA, Balaji U, Baek G, Lin WC, Mansour J, et al. Whole-exome sequencing of pancreatic cancer defines genetic diversity and therapeutic targets. Nat Commun. 2015;6:6744. doi: 10.1038/ncomms7744
OpenUrl CrossRef PubMed

[25] 25.↵
Ramos AH, Lichtenstein L, Gupta M, Lawrence MS, Pugh TJ, Saksena G, et al. Oncotator: cancer variant annotation tool. Hum Mutat. 2015;36(4):E2423–9. doi: 10.1002/humu.22771
OpenUrl CrossRef PubMed

[26] 26.↵
Ellrott K, Bailey MH, Saksena G, Covington KR, Kandoth C, Stewart C, et al. Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines. Cell Syst. 2018;6(3):271–81 e7. doi: 10.1016/j.cels.2018.03.002
OpenUrl CrossRef PubMed

[27] 27.↵
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio cancer genomics portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2(5):401–4. doi: 10.1158/2159-8290.CD-12-0095
OpenUrl Abstract/FREE Full Text

[28] 28.↵
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6(269):pl1. doi: 10.1126/scisignal.2004088
OpenUrl Abstract/FREE Full Text

[29] 29.↵
Tokheim CJ, Papadopoulos N, Kinzler KW, Vogelstein B, Karchin R. Evaluating the evaluation of cancer driver genes. Proc Natl Acad Sci U S A. 2016;113(50):14330–5. doi: 10.1073/pnas.1616440113
OpenUrl Abstract/FREE Full Text

[30] 30.↵
Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499(7457):214–8. doi: 10.1038/nature12213
OpenUrl CrossRef PubMed Web of Science

[31] 31.↵
Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011;39(17):e118. doi: 10.1093/nar/gkr407
OpenUrl CrossRef PubMed Web of Science

[32] 32.↵
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688. doi: 10.1371/journal.pone.0046688
OpenUrl CrossRef PubMed

[33] 33.↵
Ng PC, Henikoff S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31(13):3812–4. doi: 10.1093/nar/gkg509
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9. doi: 10.1038/nmeth0410-248
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. 2014;505(7484):495–501. doi: 10.1038/nature12912
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x
OpenUrl CrossRef PubMed

[37] 37.↵
Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. 2009;458(7239):719–24. doi: 10.1038/nature07943
OpenUrl CrossRef PubMed Web of Science

[38] 38.↵
Gibson WJ, Hoivik EA, Halle MK, Taylor-Weiner A, Cherniack AD, Berg A, et al. The genomic landscape and evolution of endometrial carcinoma progression and abdominopelvic metastasis. Nat Genet. 2016;48(8):848–55. doi: 10.1038/ng.3602
OpenUrl CrossRef PubMed

[39] 39.
Soumerai TE, Donoghue MTA, Bandlamudi C, Srinivasan P, Chang MT, Zamarin D, et al. Clinical utility of prospective molecular characterization in advanced endometrial cancer. Clin Cancer Res. 2018;24(23):5939–47. doi: 10.1158/1078-0432.CCR-18-0412
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Ashley CW, Da Cruz Paula A, Kumar R, Mandelker D, Pei X, Riaz N, et al. Analysis of mutational signatures in primary and metastatic endometrial cancer reveals distinct patterns of DNA repair defects and shifts during tumor progression. Gynecol Oncol. 2019;152(1):11–9. doi: 10.1016/j.ygyno.2018.10.032
OpenUrl CrossRef PubMed

[41] 41.↵
Pearson RC, Funnell AP, Crossley M. The mammalian zinc finger transcription factor kruppel-like factor 3 (KLF3/BKLF). IUBMB Life. 2011;63(2):86–93. doi: 10.1002/iub.4222
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Popp MW, Maquat LE. Leveraging rules of nonsense-mediated mRNA decay for genome engineering and personalized medicine. Cell. 2016;165(6):1319–22.doi: 10.1016/j.cell.2016.05.053
OpenUrl CrossRef PubMed

[43] 43.↵
Giannakis M, Mu XJ, Shukla SA, Qian ZR, Cohen O, Nishihara R, et al. Genomic correlates of immune-cell infiltrates in colorectal carcinoma. Cell Rep. 2016;15(4):857–65. doi: 10.1016/j.celrep.2016.03.075
OpenUrl CrossRef PubMed

[44] 44.↵
The Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487(7407):330–7. doi: 10.1038/nature11252
OpenUrl CrossRef PubMed Web of Science

[45] 45.↵
Liu Y, Sethi NS, Hinoue T, Schneider BG, Cherniack AD, Sanchez-Vega F, et al. Comparative molecular analysis of gastrointestinal adenocarcinomas. Cancer Cell. 2018;33(4):721–35 e8.doi: 10.1016/j.ccell.2018.03.010
OpenUrl CrossRef PubMed

[46] 46.↵
Lyng H, Brovig RS, Svendsrud DH, Holm R, Kaalhus O, Knutstad K, et al. Gene expressions and copy numbers associated with metastatic phenotypes of uterine cervical cancer. BMC Genomics. 2006;7:268. doi: 10.1186/1471-2164-7-268
OpenUrl CrossRef PubMed

[47] 47.↵
Wang X, Jiang Z, Zhang Y, Wang X, Liu L, Fan Z. RNA sequencing analysis reveals protective role of kruppel-like factor 3 in colorectal cancer. Oncotarget. 2017;8(13):21984–93. doi: 10.18632/oncotarget.15766
OpenUrl CrossRef

[48] 48.↵
Sun W, Hu S, Zu Y, Deng Y. KLF3 is a crucial regulator of metastasis by controlling STAT3 expression in lung cancer. Mol Carcinog. 2019;58(11):1933-45.doi:10.1002/mc.23072
OpenUrl CrossRef

[49] 49.↵
Shukla S, Mishra R. Predictions on impact of missense mutations on structure function relationship of PAX6 and its alternatively spliced isoform PAX6(5a). Interdiscip Sci. 2012;4(1):54–73. doi: 10.1007/s12539-012-0114-0
OpenUrl CrossRef PubMed

[50] 50.↵
Cvekl A, Callaerts P. PAX6: 25th anniversary and more to learn. Exp Eye Res. 2017;156:10–21. doi: 10.1016/j.exer.2016.04.017
OpenUrl CrossRef

[51] 51.↵
Lima Cunha D, Arno G, Corton M, Moosajee M. The spectrum of PAX6 mutations and genotype-phenotype correlations in the eye. Genes (Basel). 2019;10(12).1050. doi: 10.3390/genes10121050
OpenUrl CrossRef

[52] 52.↵
Li Y, Li Y, Liu Y, Xie P, Li F, Li G. Pax6, a novel target of microRNA-7, promotes cellular proliferation and invasion in human colorectal cancer cells. Dig Dis Sci. 2014;59(3):598-606.doi:10.1007/s10620-013-2929-x
OpenUrl CrossRef PubMed

[53] 53.
Zong X, Yang H, Yu Y, Zou D, Ling Z, He X, et al. Possible role of PAX-6 in promoting breast cancer cell proliferation and tumorigenesis. BMB Rep. 2011;44(9):595–600. doi: 10.5483/bmbrep.2011.44.9.595
OpenUrl CrossRef PubMed

[54] 54.
Kiselev Y, Andersen S, Johannessen C, Fjukstad B, Standahl Olsen K, Stenvold H, et al. Transcription factor PAX6 as a novel prognostic factor and putative tumour suppressor in non-small cell lung cancer. Sci Rep. 2018;8(1):5059. doi: 10.1038/s41598-018-23417-z
OpenUrl CrossRef

[55] 55.
Mayes DA, Hu Y, Teng Y, Siegel E, Wu X, Panda K, et al. PAX6 suppresses the invasiveness of glioblastoma cells and the expression of the matrix metalloproteinase-2 gene. Cancer Res. 2006;66(20):9809–17. doi: 10.1158/0008-5472.CAN-05-3877
OpenUrl Abstract/FREE Full Text

[56] 56.
Shyr CR, Tsai MY, Yeh S, Kang HY, Chang YC, Wong PL, et al. Tumor suppressor PAX6 functions as androgen receptor co-repressor to inhibit prostate cancer growth. Prostate. 2010;70(2):190–9. doi: 10.1002/pros.21052
OpenUrl CrossRef PubMed Web of Science

[57] 57.
Zhou YH, Wu X, Tan F, Shi YX, Glass T, Liu TJ, et al. PAX6 suppresses growth of human glioblastoma cells. J Neurooncol. 2005;71(3):223–9. doi: 10.1007/s11060-004-1720-4
OpenUrl CrossRef PubMed

[58] 58.
Hegge B, Sjottem E, Mikkola I. Generation of a PAX6 knockout glioblastoma cell line with changes in cell cycle distribution and sensitivity to oxidative stress. BMC Cancer. 2018;18(1):496. doi: 10.1186/s12885-018-4394-6
OpenUrl CrossRef

[59] 59.
Huang BS, Luo QZ, Han Y, Li XB, Cao LJ, Wu LX. MicroRNA-223 promotes the growth and invasion of glioblastoma cells by targeting tumor suppressor PAX6. Oncol Rep. 2013;30(5):2263–9. doi: 10.3892/or.2013.2683
OpenUrl CrossRef

[60] 60.
Maulbecker CC, Gruss P. The oncogenic potential of PAX genes. EMBO J. 1993;12(6):2361–7. doi: 10.1002/j.1460-2075.1993.tb05890.x
OpenUrl CrossRef PubMed Web of Science

[61] 61.
Ooki A, Dinalankara W, Marchionni L, Tsay JJ, Goparaju C, Maleki Z, et al. Epigenetically regulated PAX6 drives cancer cells toward a stem-like state via GLI-SOX2 signaling axis in lung adenocarcinoma. Oncogene. 2018;37(45):5967–81. doi: 10.1038/s41388-018-0373-2
OpenUrl CrossRef

[62] 62.
Wu DM, Zhang T, Liu YB, Deng SH, Han R, Liu T, et al. The PAX6-ZEB2 axis promotes metastasis and cisplatin resistance in non-small cell lung cancer through PI3K/AKT signaling. Cell Death Dis. 2019;10(5):349. doi: 10.1038/s41419-019-1591-4
OpenUrl CrossRef

[63] 63.
Jin M, Gao D, Wang R, Sik A, Liu K. Possible involvement of TGF--SMAD epithelial-mesenchymal transition in pro-metastatic property of PAX6. Oncol Rep. 2020;44(2):555–64. doi: 10.3892/or.2020.7644
OpenUrl CrossRef

[64] 64.↵
Urrutia G, Laurito S, Campoy E, Nasif D, Branham MT, Roque M. PAX6 promoter methylation correlates with MDA-MB-231 cell migration, and expression of MMP2 and MMP9. Asian Pac J Cancer Prev 2018;19(10):2859–66. doi: 10.22034/APJCP.2018.19.10.2859
OpenUrl CrossRef

[65] 65.↵
Wu X, Miao J, Jiang J, Liu F. Analysis of methylation profiling data of hyperplasia and primary and metastatic endometrial cancers. Eur J Obstet Gynecol Reprod Biol. 2017;217:161–6. doi: 10.1016/j.ejogrb.2017.08.036
OpenUrl CrossRef

[66] 66.↵
Singh S, Chao LY, Mishra R, Davies J, Saunders GF. Missense mutation at the c-terminus of PAX6 negatively modulates homeodomain function. Hum Mol Genet. 2001;10(9):911–8. doi: 10.1093/hmg/10.9.911
OpenUrl CrossRef PubMed Web of Science

[67] 67.↵
Hanson I, Churchill A, Love J, Axton R, Moore T, Clarke M, et al. Missense mutations in the most ancient residues of the PAX6 paired domain underlie a spectrum of human congenital eye malformations. Hum Mol Genet. 1999;8(2):165–72. doi: 10.1093/hmg/8.2.165
OpenUrl CrossRef PubMed Web of Science

[68] 68.↵
Chauhan BK, Yang Y, Cveklova K, Cvekl A. Functional properties of natural human PAX6 and PAX6(5a) mutants. Invest Ophthalmol Vis Sci. 2004;45(2):385–92. doi: 10.1167/iovs.03-0968
OpenUrl Abstract/FREE Full Text

KLF3 and PAX6 are candidate driver genes in late-stage, MSI-hypermutated endometrioid endometrial carcinomas

Abstract

Introduction

Materials and Methods

NHGRI clinical specimens

Genomic DNA preparation and next-generation sequencing

Alignment and variant calling

Variant filtering

TCGA data analysis

Annotation of statistically significantly mutated genes

Power analysis

Determining whether late-stage SMGs are statistically significantly mutated in early-stage tumors

In silico prediction of functional consequences for somatic variants

Survival analyses

Results

Identification of SMGs among late-stage EECs

KLF3 and PAX6 are SMGs in late-stage but not early-stage EEC

Late-stage-specific EEC SMG (KLF3 and PAX6) mutations occur in MSI-hypermutated EEC and are predicted to affect protein function

Survival analysis

Discussion

Data Availability

References

Citation Manager Formats

Subject Area