Abstract
Background Blood DNA methylation can inform us about the biological mechanisms that underlie common disease states. Previous epigenome-wide analyses of common diseases often focus solely on the prevalence or incidence of individual conditions and rely on small sample sizes, which may limit power to discover disease-associated loci.
Results We conduct blood-based epigenome-wide association studies on the prevalence of 14 common disease states in Generation Scotland (nindividuals≤18,413, nCpGs=752,722). We also utilise health record linkage to perform epigenome-wide analyses on the incidence of 19 disease states. We present a structured literature review on existing epigenome-wide analyses for all 19 disease states to assess the degree of replication within the existing literature and the novelty of the present findings.
We identify 69 associations between CpGs and the prevalence of four disease states at baseline, of which 58 are novel. We also uncover 64 CpGs that associate with the incidence of two disease states (COPD and type 2 diabetes), of which 56 are novel. These associations were independent from common lifestyle risk factors. We highlight poor replication across the existing literature. Here, replication was defined by the reporting of at least one common gene in >2 studies examining the same disease state. Existing blood-based epigenome-wide analyses showed evidence of replication for only 4/19 disease states (with up-to-15% of unique genes replicated for lung cancer).
Conclusions Our summary data and structured review of the literature provide an important platform to guide future studies that examine the role of blood DNA methylation in complex disease states.
Competing Interest Statement
R.F.H has received speaker fees from Illumina and acts as a scientific consultant to Optima Partners. D.A.G acts as a scientific consultant for Optima Partners. L.M has received speaker and consultancy fees from Illumina. A.M.M has received research support from Eli Lilly, Janssen, and the Sackler Foundation. A.M.M has also received speaker fees from Illumina and Janssen and consulting fees. R.E.M has received a speaker fee from Illumina and is an advisor to the Epigenetic Clock Development Foundation and Optima Partners.
Funding Statement
This research was funded in whole, or in part, by the Wellcome Trust (104036/Z/14/Z, 108890/Z/15/Z, 221890/Z/20/Z, 220857/Z/20/Z and 218493/Z/19/Z). For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. GS received core support from the Chief Scientist Office of the Scottish Government Health Directorates (CZD/16/6) and the Scottish Funding Council (HR03006). DNA methylation profiling of the GS samples was carried out by the Genetics Core Laboratory at the Edinburgh Clinical Research Facility, Edinburgh, Scotland, and was funded by the Medical Research Council UK and Wellcome (Wellcome Trust Strategic Award STratifying Resilience and Depression Longitudinally (STRADL; Reference 104036/Z/14/Z)). DNA methylation data for Generation Scotland was also funded by a 2018 NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation (Ref: 27404; awardee: Dr David M Howard) and by a John, Margaret, Alfred and Stewart Sim Fellowship from the Royal College of Physicians of Edinburgh (Awardee: Dr Heather C Whalley). A.M.M is supported by Wellcome (104036/Z/14/Z, 216767/Z/19/Z, 220857/Z/20/Z (which also supported DNAm measurement)), United Kingdom Research and Innovation (UKRI) MRC (MC_PC_17209, MR/W014386/1 and MR/S035818/1) and the European Union H2020 (SEP-210574971). Y.C. is supported by the University of Edinburgh and University of Helsinki joint PhD program in Human Genomics. A.D.C is supported by a Medical Research Council PhD Studentship in Precision Medicine with funding by the Medical Research Council Doctoral Training Programme and the University of Edinburgh College of Medicine and Veterinary Medicine. D.A.G and H.M.S are supported by funding from the Translational Neuroscience PhD Programme (108890/Z/15/Z, 218493/Z/19/Z). C.H. is supported by an MRC Human Genetics Unit programme grant ‘Quantitative traits in health and disease’ (U. MC_UU_00007/10). D.L.Mc.C. and R.E.M. are supported by Alzheimer’s Research UK major project grant ARUK-PG2017B−10. E.B and R.E.M. are supported by Alzheimer’s Society major project grant AS-PG-19b-010. R.F.H is supported by a MRC IEU Fellowship.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
All components of GS received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89). GS has also been granted Research Tissue Bank status by the East of Scotland Research Ethics Service (REC Reference Number: 20–ES–0021), providing generic ethical approval for a wide range of uses within medical research.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
According to the terms of consent for Generation Scotland participants, access to data must be reviewed by the Generation Scotland Access Committee. Applications should be made to access@generationscotland.org. All code associated with this manuscript is available open access at the following GitHub repository: https://github.com/marioni-group/EWAS-of-common-disease-Generation-Scotland EWAS summary statistics will be submitted to the EWAS Catalog and Edinburgh DataShare upon publication.