PT - JOURNAL ARTICLE AU - Nita Vangeepuram AU - Bian Liu AU - Po-hsiang Chiu AU - Linhua Wang AU - Gaurav Pandey TI - Predicting youth diabetes risk using NHANES data and machine learning AID - 10.1101/19007872 DP - 2019 Jan 01 TA - medRxiv PG - 19007872 4099 - http://medrxiv.org/content/early/2019/10/05/19007872.short 4100 - http://medrxiv.org/content/early/2019/10/05/19007872.full AB - Type 2 diabetes has become alarmingly prevalent among youth in recent years. However, simple questionnaire-based screening tools to reliably identify diabetes risk and prevent the adverse effects of this serious disease are only available for adults, not for youth. As a first step in developing such a tool, we used a large-scale dataset from the National Health and Nutritional Examination Survey (NHANES), to examine the performance of a well-known adult diabetes risk self-assessment screener and published pediatric clinical screening guidelines in identifying youth with pre- diabetes/diabetes (pre-DM/DM) based on American Diabetes Association diagnostic biomarkers. We assessed the agreement between the adult screener/pediatric screening guidelines and biomarker diagnostic criteria by conducting comparisons using the overall data set and sub-datasets stratified by sex, race/ethnicity, and age. While the pediatric guidelines performed better than the adult screener in identifying youth with pre-DM/DM (sensitivity 43.1% vs 7.2%), both are inadequate for general deployment among youth. There were also notable differences in the performance of the pediatric guidelines across subgroups based on age, sex and race/ethnicity. In an effort to improve pre-DM/DM screening, we also evaluated data-driven machine learning-based classification algorithms, several of which performed slightly but statistically significantly better than the pediatric screening guidelines.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThe research presented in this manuscript was supported by a National Institutes of Health grant (R01GM114434) and an IBM Faculty award to author GP and by a Cigna Foundation grant (10005177) awarded to author NV.Author DeclarationsAll relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesAny clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.YesI have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.YesOnly publicly available NHANES data were used in this study. These data are available from https://wwwn.cdc.gov/nchs/nhanes/. https://wwwn.cdc.gov/nchs/nhanes/