RT Journal Article SR Electronic T1 A Machine Learning Approach to Predict 5-year Risk for Complications in Type 1 Diabetes JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.09.28.21264161 DO 10.1101/2021.09.28.21264161 A1 Naba Al-Sari A1 Svetlana Kutuzova A1 Tommi Suvitaival A1 Peter Henriksen A1 Flemming Pociot A1 Peter Rossing A1 Douglas McCloskey A1 Cristina Legido-Quigley YR 2021 UL http://medrxiv.org/content/early/2021/10/08/2021.09.28.21264161.abstract AB OBJECTIVE Our aim was to apply state-of-the-art machine learning algorithms to predict the risk of future progression to diabetes complications, including diabetic kidney disease (≥30% decline in eGFR) and diabetic retinopathy (mild, moderate or severe).RESEARCH DESIGN AND METHODS Using data in a cohort of 537 adults with type 1 diabetes we predicted diabetes complications emerging during a median follow-up of 5.4 years. Prediction models were computed first with clinical risk factors at baseline (17 measures) and then with clinical risk factors and blood-derived molecular data (965 molecules) at baseline. Participants were classified into different groups: type 1 diabetes stable (n=195), type 1 diabetes with progression to diabetic kidney disease (≥30% decline in eGFR; n=79) or diabetic retinopathy (mild, moderate or severe; n=111). Progression of diabetic kidney disease and diabetic retinopathy were predicted in two complication-specific models. Area under the receiver operating characteristic curves (AUROCs) were validated by both k-fold and Monte-Carlo simulation for each model. Clinical risk factor and molecule importance were explained with the SHAP algorithm. Accuracy, precision, recall, and F-score were used to evaluate clinical utility.RESULTS During a median follow-up of 5.4 years, 79 (21 %) of the participants (mean±SD: age 54.8 ± 13.7 years) progressed in diabetic kidney disease and 111 (29 %) of the participants progressed to diabetic retinopathy. The two predictive models for diabetic kidney disease progression were highly accurate with 15 clinical risk factors: AUROCs of 0.92±0.01 and 0.96±0.25 (k-fold and Monte Carlo simulation values respectively) and when six omics predictors were included: AUROC of 0.99±0.001 and 0.96±0.06. The top three important features were albuminuria, estimated glomerular filtration rate and retinopathy status at baseline for both models.Models for diabetic retinopathy progression with 12 clinical risk predictors achieved an AUROC of 0.81±0.10 and 0.75±0.14 and with additional seven omics included an AUROC of 0.87±0.001 and 0.79±0.16. The top features were hemoglobin A1c, albuminuria and retinopathy status at baseline.CONCLUSIONS application of machine learning effectively predicted five-year progression of diabetes complications, in particular diabetic kidney disease. Further replication of machine learning tools in a real-world context will facilitate implementation in the clinic.Competing Interest StatementThe authors declare no potential conflicts of interests relevant to this manuscript. Outside this manuscript PR reports consultancy and/or speaking fees to Steno Diabetes Center Copenhagen from Astellas, AstraZeneca, Bayer, Boehringer Ingelheim, Gilead, Eli Lilly, MSD, Novo Nordisk Vifor, and Sanofi Aventis and research grants from AstraZeneca and Novo Nordisk.Funding StatementThis project was funded by the Novo Nordisk Foundation grant NNF14OC0013659 (PROTON Personalizing treatment of diabetic nephropathy). Internal funding was provided by Steno Diabetes Center Copenhagen, Gentofte, Denmark.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study involving human participants were approved by The Ethics Committee E, Region Hovedstaden, Denmark. The participants /patients provided their written informed consent to participate in this study.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData are available on request for researchers who have acquired the required legal permissions from the Danish data protection agency. Requests to access the datasets should be directed to PR, peter.rossing@regionh.dk.