%0 Journal Article %A Luke M. Kraven %A Adam R. Taylor %A Philip L. Molyneaux %A Toby M. Maher %A John E. McDonough %A Marco Mura %A Ivana V. Yang %A David A. Shwartz %A Yong Huang %A Imre Noth %A Shwu-Fan Ma %A Astrid J. Yeo %A William A. Fahy %A R. Gisli Jenkins %A Louise V. Wain %T Cluster analysis of transcriptomic datasets to identify endotypes of Idiopathic Pulmonary Fibrosis %D 2021 %R 10.1101/2021.07.16.21260633 %J medRxiv %P 2021.07.16.21260633 %X Rationale Considerable clinical heterogeneity in Idiopathic Pulmonary Fibrosis (IPF) suggests the existence of multiple disease endotypes. Identifying these endotypes could allow for a biomarker-driven personalised medicine approach in IPF.Objectives To improve our understanding of the pathogenesis of IPF by identifying clinically distinct groups of patients with IPF that could represent distinct disease endotypes.Methods We co-normalised, pooled and clustered three publicly available blood transcriptomic datasets (total 220 IPF cases). We compared clinical traits across clusters and used gene enrichment analysis to identify biological pathways and processes that were over-represented among the genes that were differentially expressed across clusters. A gene-based classifier was developed and validated using three additional independent datasets (total 194 IPF cases).Measurements and main results We identified three clusters of IPF patients with statistically significant differences in lung function (P=0.009) and mortality (P=0.009) between groups. Gene enrichment analysis implicated dysregulation of mitochondrial homeostasis, apoptosis, cell cycle and innate and adaptive immunity in the pathogenesis underlying these groups. We developed and validated a 13-gene cluster classifier that predicted mortality in IPF (high-risk clusters vs low-risk cluster: hazard ratio= 4.25, 95% confidence interval= [2.14, 8.46], P=3.7×10−5).Conclusions We have identified blood gene expression signatures capable of discerning groups of IPF patients with significant differences in survival. These clusters could be representative of distinct pathophysiological states, which would support the theory of multiple endotypes of IPF. Although more work must be done to confirm the existence of these endotypes, our classifier could be a useful tool in patient stratification and outcome prediction in IPF.Competing Interest StatementThe authors have declared no competing interest.Funding StatementL.V.W. holds a GSK/British Lung Foundation Chair in Respiratory Research (C17-1). R.G.J. is supported by a National Institute for Health Research (NIHR) Research Professorship (NIHR reference RP-2017-08-ST2-014). P.L.M. is supported by an Action for Pulmonary Fibrosis Mike Bray fellowship. T.M. Maher is supported by a National Institute for Health Research Clinician Scientist Fellowship (CS-2013-13-017) and a British Lung Foundation Chair in Respiratory Research (C17-3). I.N. is supported by a National Heart, Lung, and Blood Institute (NHLBI) grant (R01HL145266). D.A.S. is supported by NHLBI grants (UG3HL151865, R01HL097163, P01HL092870, X01HL134585 and UH3HL123442) and a United States Department of Defense grant (W81XWH-17-1-0597). The study with Gene Expression Omnibus accession code GSE110147 was supported by the Roche Multi Organ Transplant Academic Enrichment Fund, Lawson Research Institute Internal Research Fund and Western Strategic Support for CIHR Success, Seed Grant. The research was partially supported by the NIHR Leicester Biomedical Research Centre; the views expressed are those of the author(s) and not necessarily those of the National Health Service (NHS), the NIHR, or the Department of Health.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Individuals used in this study come from a previously reported study with appropriate institutional ethics approval at each study centre. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data that supports the findings in this study are openly available via the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/). https://www.ncbi.nlm.nih.gov/geo/ https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE38958 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE33566 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE93606 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132607 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE27957 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE28042 %U https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.16.21260633.full.pdf