Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Prediction of type 2 diabetes mellitus onset using logistic regression-based scoreboards

Yochai Edlitz, Eran Segal
doi: https://doi.org/10.1101/2020.08.02.20165092
Yochai Edlitz
1Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
2Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eran Segal
1Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
2Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: eran.segal@weizmann.ac.il
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Type 2 diabetes mellitus (T2DM) accounts for ∼90% of all cases of diabetes which are estimated with an annual world death rate of 1.6 million in 2016. Early detection of T2D high-risk patients can reduce the incidence of the disease through a change in lifestyle, diet, or medication. Since populations of lower socio-demographic status are more susceptible to T2D and might have limited resources for laboratory testing, there is a need for accurate yet accessible prediction models based on non-laboratory parameters. This paper introduces one non-laboratory model which is highly accessible to the general population and one highly precise yet simple laboratory model. Both models are provided as an accessible scoreboard form and also as a logistic regression model. We based the models on data from 44,879 non-diabetic, UK Biobank participants, aged 40-65, predicting the risk of T2D onset within the next 7.3 years (SD 2.3). The non-laboratory prediction model for T2DM onset probability incorporated the following covariates: sex, age, weight, height, waist, hips-circumferences, waist-to-hip Ratio (WHR) and Body-Mass Index (BMI). This logistic regression model achieved an Area Under the Receiver Operating Curve (auROC) of 0.82 (0.79-0.85 95% CI) and an odds ratio (OR) between the upper and lower prevalence deciles of x77 (28-98). We further analysed the contribution of laboratory-based parameters and devised a blood-test model based on just five blood tests. In this model, we included age, sex, Glycated Hemoglobin (HbA1c%), reticulocyte count, Gamma Glutamyl-Transferase, Triglycerides, and HDL cholesterol to predict T2D onset. This logistic-regression model achieved an auROC of 0.89 (0.86-0.91) and a deciles’ OR of x87 (27-152). Using the scoreboard results, the Anthropometrics model classified three risk groups, a group with 1%(1-2%); a group with 9% (7-11%) probability, and a group with a 15% (7-23%) risk of developing T2D. The Five blood tests scoreboard model, further classified into four risk groups: 0.9% (0.7%-1%); 8%(6-11%); 18%(14-22%) and a high risk group of 38%(23-54%) of developing T2D. We analysed several more comprehensive models which included genotyping data and other environmental factors and found that it did not provide cost efficient benefits over the five blood tests model. The Five blood tests and anthropometric models, both in their logistic regression form and scoreboard form, outperform the commonly used non-laboratory models, the Finnish Diabetes Risk Score (FINDRISC) and the German Diabetes Risk Score (GDRS). When trained using our data, the FINDRISC achieved an auROC of 0.75 (0.71-0.78), and the GDRS auROC resulted in 0.58 (0.54-0.62), respectively.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

E.S. is supported by the Crown Human Genome Center, by D. L. Schwarz, J. N. Halpern and L. Steinberg, and by grants funded by the European Research Council and the Israel Science Foundation.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

01/03/2020 Title of the project: COPY from protocol: 149-1 - Development of models relating genetics, diet, lifestyle, and clinical outcomes (945-1) Submitted by: Eran Segal on 01/03/2020 Dear Eran Segal, This is to certify that Weizmann Institutional Review Board (IRB) reviewed the above research project and approved it for execution at the Weizmann Institute of Science. The certificate of approval expires on 01/03/2023 Sincerely yours Prof. Yosef Shaul, Head Weizmann IRB

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • This version updated to include scoreboards models based on the anthropometric and five blood tests logistic regression models. We identified six participants with HbA1C%=6.5 and dropped them from the cohort of the final models. The results and figures were updated accordingly. Figures of the Scoreboards were added, including figures of the type 2 diabetes prevalence within each score group.

Data Availability

The UKB data are available through the UK Biobank Access Management System https://www.ukbiobank.ac.uk/

https://www.ukbiobank.ac.uk/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted July 03, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Prediction of type 2 diabetes mellitus onset using logistic regression-based scoreboards
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Prediction of type 2 diabetes mellitus onset using logistic regression-based scoreboards
Yochai Edlitz, Eran Segal
medRxiv 2020.08.02.20165092; doi: https://doi.org/10.1101/2020.08.02.20165092
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Prediction of type 2 diabetes mellitus onset using logistic regression-based scoreboards
Yochai Edlitz, Eran Segal
medRxiv 2020.08.02.20165092; doi: https://doi.org/10.1101/2020.08.02.20165092

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Endocrinology (including Diabetes Mellitus and Metabolic Disease)
Subject Areas
All Articles
  • Addiction Medicine (238)
  • Allergy and Immunology (520)
  • Anesthesia (124)
  • Cardiovascular Medicine (1417)
  • Dentistry and Oral Medicine (217)
  • Dermatology (158)
  • Emergency Medicine (291)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (582)
  • Epidemiology (10280)
  • Forensic Medicine (6)
  • Gastroenterology (527)
  • Genetic and Genomic Medicine (2622)
  • Geriatric Medicine (254)
  • Health Economics (496)
  • Health Informatics (1728)
  • Health Policy (788)
  • Health Systems and Quality Improvement (671)
  • Hematology (266)
  • HIV/AIDS (564)
  • Infectious Diseases (except HIV/AIDS) (12080)
  • Intensive Care and Critical Care Medicine (647)
  • Medical Education (273)
  • Medical Ethics (83)
  • Nephrology (288)
  • Neurology (2453)
  • Nursing (144)
  • Nutrition (377)
  • Obstetrics and Gynecology (489)
  • Occupational and Environmental Health (566)
  • Oncology (1318)
  • Ophthalmology (400)
  • Orthopedics (145)
  • Otolaryngology (235)
  • Pain Medicine (168)
  • Palliative Medicine (51)
  • Pathology (342)
  • Pediatrics (776)
  • Pharmacology and Therapeutics (329)
  • Primary Care Research (296)
  • Psychiatry and Clinical Psychology (2393)
  • Public and Global Health (4996)
  • Radiology and Imaging (892)
  • Rehabilitation Medicine and Physical Therapy (523)
  • Respiratory Medicine (681)
  • Rheumatology (309)
  • Sexual and Reproductive Health (255)
  • Sports Medicine (244)
  • Surgery (297)
  • Toxicology (45)
  • Transplantation (140)
  • Urology (108)