ABSTRACT
Background Aberrant methylation of DNA acts epigenetically to skew the gene transcription rate up or down. In this study, we have developed a comprehensive computational framework for the stage-differentiated modelling of DNA methylation landscapes in colorectal cancer. Methods: The methylation β - matrix was derived from the public-domain TCGA data, converted into M-value matrix, annotated with sample stages, and analysed for stage-salient genes using multiple approaches involving stage-differentiated linear modelling of methylation patterns and/or expression patterns. Differentially methylated genes (DMGs) were identified using a contrast against control samples (adjusted p-value <0.001 and |log fold-change of M-value| >2). These results were filtered using a series of all possible pairwise stage contrasts (p-value <0.05) to obtain stage-salient DMGs. These were then subjected to a consensus analysis, followed by Kaplan–Meier survival analysis to explore the relationship between methylation and prognosis for the consensus stage-salient biomarkers.
Results We found significant genome-wide changes in methylation patterns in cancer samples relative to controls agnostic of stage. Our stage-differentiated analysis yielded the following stage-salient genes: one stage-I gene (FBN1), one stage-II gene (FOXG1), one stage-III gene (HCN1) and four stage-IV genes (NELL1, ZNF135, FAM123A, LAMA1). All the biomarkers were hypermethylated, indicating down-regulation and signifying a CpG island Methylator Phenotype (CIMP) manifestation. A prognostic signature consisting of FBN1 and FOXG1was significantly associated with patient survival (p-value < 0.01) and could be used as a biomarker panel for early-stage CRC prognosis.
Conclusion Our workflow for stage-differentiated consensus analysis has yielded stage-salient diagnostic biomarkers as well as an early-stage prognostic biomarker panel. In addition, our studies have affirmed a novel CIMP-like signature in colorectal cancer, urging clinical validation.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
Publicly available TCGA and GDC datasets used in modelling and analysis
Funding Statement
DST-SERB EMR/2017/000470/BBM, Govt of India.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Since the study uses publicly available, anonymized TCGA data, no IRB approval is required.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Revised extensively for clarity and in-depth analysis.
Data Availability
All data and results are available as available as supplementary information: https://doi.org/10.6084/m9.figshare.13013852