PT - JOURNAL ARTICLE AU - Chang Shu AU - Amy C. Justice AU - Xinyu Zhang AU - Vincent C. Marconi AU - Dana B. Hancock AU - Eric O. Johnson AU - Ke Xu TI - DNA methylation biomarker selected by an ensemble machine learning approach predicts frailty in an HIV-positive veteran population AID - 10.1101/19010272 DP - 2020 Jan 01 TA - medRxiv PG - 19010272 4099 - http://medrxiv.org/content/early/2020/05/13/19010272.short 4100 - http://medrxiv.org/content/early/2020/05/13/19010272.full AB - Background With the improved life expectancy in people living with HIV, predicting individual frailty is important for clinical care. DNA methylation (DNAm) has been previously linked to aging and mortality in non-HIV populations. We aim to establish a panel of DNAm biomarkers to predict frailty in HIV-positive population.Methods Our samples consist of HIV-positive participants (Ntotal=1,081) from the Veterans Aging Cohort Study (VACS) and were divided into the training set (Ntraining=460), the validation set (Nvalidation=114), and the testing set (Ntesting=507). We used a well-established score, VACS Index, as a measure of frailty. Model training and fine-tuning were conducted using the ensemble method that aggregated prediction results of four base models. The final number of features were determined in the validation set and the model performance was assessed in the testing set. We conducted a survival analysis to assess whether the predicted frailty was associated with 10-year mortality and gene ontology enrichment analysis of the predictive CpG sites was performed.Results We selected a panel of 393 CpGs to build the ensemble machine learning model. The prediction model showed an excellent performance on predicting frailty with Area Under Curve of 0.809 (95%CI: 0.767-0.851) and balanced accuracy of 0.653 in the testing set. The predicted frailty based on our model was significantly associated with 10-year mortality (hazard ratio=1.79, p=4E-05). These 393 CpGs were located within or near 280 genes that were enriched in biological pathways including immune and inflammation response.Conclusions We identified a panel of predictive DNAm features associated with frailty and mortality among HIV-positive population. These DNAm features may serve as biomarkers to predict frailty people who live with HIV.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNational Institute on Drug AbuseAuthor DeclarationsAll relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesAny clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.Not ApplicableI have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.YesData has been deposited in GEO