PT - JOURNAL ARTICLE AU - Jae-Seung Yun AU - Jaesik Kim AU - Sang-Hyuk Jung AU - Seon-Ah Cha AU - Seung-Hyun Ko AU - Yu-Bae Ahn AU - Hong-Hee Won AU - Kyung-Ah Sohn AU - Dokyoon Kim TI - A Deep Learning Model for Screening Type 2 Diabetes from Retinal Photographs AID - 10.1101/2021.06.29.21259606 DP - 2021 Jan 01 TA - medRxiv PG - 2021.06.29.21259606 4099 - http://medrxiv.org/content/early/2021/07/03/2021.06.29.21259606.short 4100 - http://medrxiv.org/content/early/2021/07/03/2021.06.29.21259606.full AB - Objective We aimed to develop and evaluate a non-invasive deep learning algorithm for screening type 2 diabetes in UK Biobank participants using retinal images.Research Design and Methods The deep learning model for prediction of type 2 diabetes was trained on retinal images from 50,077 UK Biobank participants and tested on 12,185 participants. We evaluated its performance in terms of predicting traditional risk factors (TRFs) and genetic risk for diabetes. Next, we compared the performance of three models in predicting type 2 diabetes using 1) an image-only deep learning algorithm, 2) TRFs, 3) the combination of the algorithm and TRFs. Assessing net reclassification improvement (NRI) allowed quantification of the improvement afforded by adding the algorithm to the TRF model.Results When predicting TRFs with the deep learning algorithm, the areas under the curve (AUCs) obtained with the validation set for age, sex, and HbA1c status were 0.931 (0.928-0.934), 0.933 (0.929-0.936), and 0.734 (0.715-0.752), respectively. When predicting type 2 diabetes, the AUC of the composite logistic model using non-invasive TRFs was 0.810 (0.790-0.830), and that for the deep learning model using only fundus images was 0.731 (0.707-0.756). Upon addition of TRFs to the deep learning algorithm, discriminative performance was improved to 0.844 (0.826-0.861). The addition of the algorithm to the TRFs model improved risk stratification with an overall NRI of 50.8%.Conclusions Our results demonstrate that this deep learning algorithm can be a useful tool for stratifying individuals at high risk of type 2 diabetes in the general population.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was supported by the National Research Foundation of Korea Grant funded by the Korean Government (NRF-2016R1C1B1009262) and the National Research Foundation of Korea Grant (NRF-2019R1A2C1006608) funded by the Korea government. This work was also supported by NLM R01 NL012535 and NIGMS R01 GM138597.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The UK Biobank has ethical approval from the National Research Ethics Committee (June 17, 2011 [RES reference 11/NW/0382]), which was further extended (May 10, 2016 [RES reference 16/NW/0274]). Use of the UK Biobank Resource in the current study was approved under Application Number 67855.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesWe used the UK Biobank dataset to develop and validate a deep learning algorithm for prediction of type 2 diabetes using retinal fundus photographs. The UK Biobank project is a prospective observational study that recruited 505,025 UK participants, aged 40-69 years at baseline, between 2006 and 2010. Each participant provided informed consent, completed a touchscreen and in-person interview with trained staff, and underwent a series of physical examinations. Extensive information was collected, including lifestyle, sociodemographic factors, medical history, biologic samples, imaging, and genome-wide genotype data. Detailed protocols for obtaining the data are available on the UK Biobank website at www.ukbiobank.ac.uk.AIartificial intelligenceAUCarea under the curveCECross-entropyCVDCardiovascular diseaseNPVnegative predictive valuePPVpositive predictive valueR2R-squaredTRFtraditional risk factor