Abstract
Genome-wide association studies have discovered numerous common variants associated with human cancers. However, the contribution of exome-wide rare variants to cancers remains largely unexplored, especially for the protein-coding variants. The UK Biobank provides detailed cancer follow-up information linked to whole-exome sequencing (WES) for approximately 450,000 participants, offering an unprecedented opportunity to evaluate the effect of exome variation on pan-cancer. Here, we performed exome-wide association studies (ExWAS) based on single variant levels and gene levels to detect their associations across 20 primary cancer types in the discovery set (WES-300k, N = 284,456) and replication set (WES-150k, N = 143,478), separately. The ExWAS detected 143 independent variants at variant-level and 49 genes at gene-level, while nine variants and eight genes were shared across cancers. In the cross-trait meta-analysis, we identified 239 additional independent pleiotropic variants, mapping to the genes which were functional through trans-omics analyses in transcriptomics and proteomics. Further, we developed exome-wide risk scores (ERS) to identify high-risk populations based on rare variants with minor allele frequency (MAF) < 0.05. The ERS had satisfactory performance in cancer risk stratification, especially for the extremely high-risk persons (top 5% ERS) that were frequently risk allele carriers. The ERS (median C-index (IQR): 0.655 (0.636-0.667)) outperforms the traditional polygenic risk score (PRS) (median C-index (IQR): 0.585 (0.572-0.614)) for discrimination in the replication set. Our findings offer further insight into the genetic architecture of human exomes for cancer susceptibility.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was supported by the National Natural Science Foundation of China (82103946 to S.S., 82173620 to Y.Z., 81530088 to F.C.), National Key Research and Development Program of China (2016YFE0204900 to F.C.), Natural Science Foundation of the Jiangsu Higher Education Institutions of China (21KJB330004 to S.S.).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Consent for publication All authors have reviewed and approved this manuscript.
Conflicts of interest statements/Financial Disclosure statement: The authors report no conflicts of interest.
Funding This study was supported by the National Natural Science Foundation of China (82103946 to S.S., 82173620 to Y.Z., 81530088 to F.C.), National Key Research and Development Program of China (2016YFE0204900 to F.C.), Natural Science Foundation of the Jiangsu Higher Education Institutions of China (21KJB330004 to S.S.).
Data Availability
UK Biobank data is available from https://www.ukbiobank.ac.uk/. TCGA data is available from https://portal.gdc.cancer.gov/. GTEx data is available from https://www.gtexportal.org/home/. CPTAC data is available from https://pdc.esacinc.com/pdc/pdc.