Summary
Lung cancer is the most frequent cause of cancer death worldwide1. It is male predominant and for reasons that are unknown also associated with significantly worse outcomes in men2. Here we compared gene co-expression networks in affected and unaffected pulmonary tissue derived from 126 patients with Stage IA–IV lung cancer. We observed marked degradation of a sex-associated gene co-expression network in tumour tissue. The disturbance was linked to fractional loss of the Y chromosome and was detected in 28% of male tumours in the discovery dataset and 27% of male tumours in a 123 sample replication dataset. Depression of Y chromosome expression was accompanied by extensive autosomal DNA hypomethylation. The male specific H3K4 demethylase, KDM5D, was identified as an apex hub within this co-expression network. Male patients exhibiting relative tumour KDM5D deficiency had an increased risk of death in the discovery dataset (Hazard Ratio [HR] 3.80, 95% CI 1.40 – 10.3, P=0.009) and in an independent sample of 1,100 male lung tumours (HR 1.67, 95% CI 1.4-2.0, P=1.2⨯10−10). Our findings identify tumour-specific weakening of male-specific expression, in particular deficiency of KDM5D, as a common replicable prognostic marker and credible mechanism underlying sex disparity in cancer.
Competing Interest Statement
AN reports personal fees from Merck, Boehringer Ingelheim, Novartis, Astra Zeneca, Bristol Myer Squib, Roche, Abbvie and Oncologica, as well as grants and personal fees from Pfizer outside the submitted work. EL reports personal fees from Glaxo Smith Kline, Pfizer, Novartis, Covidien, Roche, Lily Oncology, Boehringer Ingelheim, Medela, Astra Zeneca and Ethicon; grants and personal fees from ScreenCell; grants from Clearbridge Biomedics, Illumina and Guardant Health, outside the submitted work. In addition, EL has patents P52435GB and P57988GB issued to Imperial Innovations, is the Director of lung screening at the Cromwell Hospital, and is CI for both VIOLET NIHR HTA (13/04/03) and MARS 2 NIHR HTA (15/188/31). SP reports personal fees from BMS, Roche, Takeda, AstraZeneca, Pfizer, MSD, EMD, Serono, Guardant Health, Abbvie, Boehringer Ingelheim, OncLive, Medscape, Incyte, Paradox Pharmaceuticals and Eli Lilly outside the submitted work.
Funding Statement
This study was supported by the Asmarley Trust and by the Wellcome Trust. Sample collection for this study was supported by the NIHR Respiratory Disease Biomedical Research Unit at the Royal Brompton and Harefield NHS Foundation Trust. Y.R. is a research scholar of the Fonds de recherche du Quebec - Sante (FRQS). The funding sources had no involvement in study design, collection, analysis or interpretation of the data, or in submission for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study methodologies followed the standards set by the Declaration of Helsinki and were conducted under approval by the Royal Brompton and Harefield Research Ethics Committee (RBH) NIHR BRU Advanced Lung Disease Biobank (NRES reference 10/H0504/9) and Brompton and Harefield NHS Trust Diagnostic Tissue Bank (NRES reference 10/H0504/29) [Discovery], and the Royal Brompton and Harefield Ethics Committee (REC reference number LREC 02-261) [Replication].
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Gene expression data have been deposited in NCBI's Gene Expression Omnibus and will be accessible through GEO SuperSeries accession number GSE151103 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE151103); comprising SubSeries GSE151101 (discovery) and GSE151102 (replication) upon acceptance for peer-reviewed publication. Sequence data are available upon request.