Abstract
Numerous potential susceptibility genes have been identified for colorectal cancer (CRC). However, it remains unclear which genes have a causal role in CRC risk, whether these genes are associated with specific types of CRC, and if they have potential for therapeutic targeting. We performed a multi-tissue transcriptome-wide association study (TWAS) across six relevant normal tissues (n=187-670) and applied a causal framework (involving Mendelian randomisation and genetic colocalisation) to prioritise causal associations between gene expression or splicing events and CRC risk (52,775 cases; 45,940 controls), incorporating sex- and anatomical subsite-specific analyses. We identified 35 genes with robust evidence for a potential causal role in CRC, including ten genes not previously identified by TWAS. Among these genes, SEMA4D emerged as a significant discovery; it is not located at any established CRC genome-wide association study (GWAS) risk locus and its encoded protein is targeted by an antibody currently being clinically studied for CRC treatment. Several genes showed increased expression associated with CRC risk and evidence of CRC cell dependency in CRISPR screen analyses, highlighting their potential as targets for therapeutic inhibition. A female-specific association with CRC risk was observed for CCM2 expression, which is involved in progesterone signalling pathways. Subsite-specific associations were also found, including a link between rectal cancer risk and expression of LAMC1, which encodes a target for a clinically approved drug. Additionally, we performed a focused analysis of established drug targets to further identify potential therapies for CRC, revealing PDCD1, the product of which (PD-1) is targeted by a clinically approved CRC immunotherapy. In summary, our comprehensive analysis provides valuable insights into the molecular underpinnings of CRC and supports promising avenues for therapeutic intervention.
Competing Interest Statement
Tom G Richardson is employed full-time by GlaxoSmithKline outside of the research presented in this manuscript. Where authors are identified as personnel of the International Agency for Research on Cancer / World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer / World Health Organization. This article is the result of the scientific work of Dr Murphy while he was affiliated at IARC.
Funding Statement
Funding and acknowledgements: EH is supported by a Cancer Research UK Population Research Committee Studentship (C18281/A30905), the CRUK Integrative Cancer Epidemiology Programme (C18281/A29019), the Harold Hyam Wingate Foundation, the European Cancer Prevention (ECP) organisation, the European Association for Cancer Research (EACR), and is part of the Medical Research Council Integrative Epidemiology Unit at the University of Bristol which is supported by the Medical Research Council (MC_UU_00032/03) and the University of Bristol. TAOM is supported by a National Health and Medical Research Council of Australia Investigator Fellowship (Emerging Leadership 2; APP1179170). The CRC GWAS used in this analysis was provided by: GECCO, which acknowledges funding from the National Cancer Institute, National Institutes of Health, U.S. Department of Health and Human Services (U01 CA137088, R01 CA059045, R01 201407), with genotyping/sequencing services provided by the Center for Inherited Disease Research (CIDR) contract number HHSN268201700006I and HHSN268201200008I, and was funded in part through the NIH/NCI Cancer Center Support Grant P30 CA015704, with scientific Computing Infrastructure at Fred Hutch funded by ORIP grant S10OD028685; the CORECT study which was supported by the National Cancer Institute, National Institutes of Health (NCI/NIH), U.S. Department of Health and Human Services (grant numbers U19 CA148107, R01 CA081488, P30 CA014089, R01 CA197350; P01 CA196569; R01 CA201407; R01 CA242218), National Institutes of Environmental Health Sciences, National Institutes of Health (grant number T32 ES013678), and a generous gift from Daniel and Maryann Fong; and the Colon Cancer Family Registry (CCFR, https://www.coloncfr.org/), which is supported in part by funding from the National Cancer Institute (NCI), National Institutes of Health (NIH) (award U01 CA167551), with support for case ascertainment provided in part from the Surveillance, Epidemiology, and End Results (SEER) Program and the following US state cancer registries: AZ, CO, MN, NC, NH; and by the Victoria Cancer Registry (Australia) and Ontario Cancer Registry (Canada). The CCFR Set-1 (Illumina 1M/1M-Duo) and Set-2 (Illumina Omni1-Quad) scans were supported by NIH awards U01 CA122839 and R01 CA143237 (to GC). The CCFR Set-3 (Affymetrix Axiom CORECT Set array) was supported by NIH award U19 CA148107 and R01 CA81488 (to SBG). The CCFR Set-4 (Illumina OncoArray 600K genetic variant array) was supported by NIH award U19 CA148107 (to SBG) and by the Center for Inherited Disease Research (CIDR), which is funded by the NIH to the Johns Hopkins University, contract number HHSN268201200008I. Additional funding for the OFCCR/ARCTIC was through award GL201-043 from the Ontario Research Fund (to BWZ), award 112746 from the Canadian Institutes of Health Research (to TJH), through a Cancer Risk Evaluation (CaRE) Program grant from the Canadian Cancer Society (to SG), and through generous support from the Ontario Ministry of Research and Innovation. The SFCCR Illumina HumanCytoSNP array was supported in part through NCI/NIH awards U01/U24 CA074794 and R01 CA076366 (to PAN). The content of this manuscript does not necessarily reflect the views or policies of the NCI, NIH or any of the collaborating centres in the Colon Cancer Family Registry (CCFR), nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government, any cancer registry, or the CCFR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee of QIMR Berghofer Medical Research Institute gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data and code availability
All data generated by this analysis can be found within the manuscript and supporting information. The CRC GWAS data used in this analysis can be accessed through the GECCO data access process (outlined at https://share.fhcrc.org/sites/gecco/Pages/paper-proposals.aspx).