RT Journal Article SR Electronic T1 DNA methylation biomarker selected by an ensemble machine learning approach predicts mortality risk in an HIV-positive veteran population JF medRxiv FD Cold Spring Harbor Laboratory Press SP 19010272 DO 10.1101/19010272 A1 Chang Shu A1 Amy C. Justice A1 Xinyu Zhang A1 Vincent C. Marconi A1 Dana B. Hancock A1 Eric O. Johnson A1 Ke Xu YR 2020 UL http://medrxiv.org/content/early/2020/09/08/19010272.abstract AB Background With the improved life expectancy of people living with HIV (PLWH), identifying vulnerable subpopulations at high risk of mortality is important for clinical care. Evidences showed that DNA methylation (DNAm) is associated with aging and mortality in non-HIV populations. Here, we aimed to establish a panel of DNAm biomarkers that can predict mortality risk among PLWH.Methods 1,081 HIV-positive participants from the Veterans Aging Cohort Study (VACS) were divided into training (N=460), validation (N=114), and testing (N=507) sets. VACS index was used as a measure of mortality risk among PLWH. Model training and fine-tuning were conducted using the ensemble method in the training and validation sets and prediction performance was assessed in the testing set. The survival analysis comparing the predicted high and low mortality risk groups was conducted. The Gene Ontology enrichment analysis of the predictive CpG sites was performed.Results We selected a panel of 393 CpGs for the ensemble prediction model. The prediction model showed excellent performance in predicting high mortality risk with an auROC of 0.809 (95%CI: 0.767-0.851) and a balanced accuracy of 0.653 (95%CI: 0.611, 0.693) in the testing set. The predicted high mortality risk group was significantly associated with 10-year mortality (hazard ratio=1.79, p=4E-05) compared with low mortality risk group. These 393 CpGs were located in 280 genes enriched in immune and inflammation responses pathways.Conclusions We identified a panel of DNAm features associated with mortality risk in PLWH. These DNAm features may serve as predictive biomarkers for mortality risk among PLWH.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNational Institute on Drug AbuseAuthor DeclarationsAll relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesAny clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.Not ApplicableI have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.YesData has been deposited in GEO