Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Machine learning algorithm to predict fragility fractures and identification of important features – an explainable approach

View ORCID ProfileSayem Borhan, Alexandra Papaioannou, Jonathan Adachi, Shrey Acharya, Suzanne N. Morin, David Goltzman, David A. Hanley, Claudie Berger, Lehana Thabane, Parminder Raina
doi: https://doi.org/10.1101/2025.07.10.25331257
Sayem Borhan
1Department of Health Research Methods, Evidence, and Impact, McMaster University, Canada
2Research Methodology Centre, Research Institute of St Joseph’s Healthcare Hamilton, Canada
3McMaster Institute for Research on Aging (MIRA), McMaster University, Canada
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sayem Borhan
  • For correspondence: borhana{at}mcmaster.ca
Alexandra Papaioannou
3McMaster Institute for Research on Aging (MIRA), McMaster University, Canada
4Department of Medicine, McMaster University, Canada
5GERAS centre, Hamilton, ON, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan Adachi
4Department of Medicine, McMaster University, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shrey Acharya
6Michael DeGroote School of Medicine, McMaster University, Hamilton, ON, Canada
BSc, MD
Roles: Candidate
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Suzanne N. Morin
7Department of Medicine, McGill University, Montreal, QC, Canada
8Research Institute of McGill University Health Centre, McGill University, Montreal, QC, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Goltzman
7Department of Medicine, McGill University, Montreal, QC, Canada
8Research Institute of McGill University Health Centre, McGill University, Montreal, QC, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David A. Hanley
9Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claudie Berger
9Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lehana Thabane
1Department of Health Research Methods, Evidence, and Impact, McMaster University, Canada
2Research Methodology Centre, Research Institute of St Joseph’s Healthcare Hamilton, Canada
3McMaster Institute for Research on Aging (MIRA), McMaster University, Canada
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Parminder Raina
1Department of Health Research Methods, Evidence, and Impact, McMaster University, Canada
3McMaster Institute for Research on Aging (MIRA), McMaster University, Canada
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

In this study, we developed ML algorithms to predict fragility fractures, considering the occurrence of fractures at different skeletal sites. We investigated seven ML algorithms (LASSO, Elastic Net, Random Forest, Decision Tree, Neural Network, XGBoost and Logistic Regression) using the data from the Canadian Multicentre Osteoporosis Study (CaMos) with participants aged 50 years or older. We considered 73 baseline features, including age, sex, menopause status, and bone mineral density (BMD), and the outcome was the first incidence of fracture at any of the following sites: hip, spine, pelvis, ribs, shoulder, and forearm, over a 19-year follow-up period. Data were divided into training (70%) and testing (30%) datasets. The ML algorithms were trained on the training dataset and evaluated on the test dataset in terms of the ROC_AUC. SHapley Additive exPlanations (SHAP) analysis was performed to identify the important features that contribute to the prediction of fracture, and to investigate the interaction among these features.

In total, 7,753 subjects were included in the study. Approximately 72% were female, and the average age was 67 years. We found that the XGBoost algorithm had a slightly better ROC_AUC (0.70; 95% CI: 0.67, 0.73). From the SHAP analysis, we found that BMD was the most important feature that contributed to the prediction. The other important features include age, previous fracture, osteoporosis and menopausal status. Total hip BMD interacted the most with femoral neck BMD, lumbar spine BMD interacted the most with weight, previous fracture status interacted the most with femoral neck BMD, and age interacted the most with lumbar spine BMD.

This study demonstrated that XGBoost was the most effective algorithm for predicting fragility fractures. In addition, we identified important features that contribute to the prediction of fragility fractures. Intervention focusing on these features will help to prevent the incidence of these fractures.

Lay summaries We developed machine learning (ML) algorithms to predict fragility fractures, considering the incidence of fractures at different skeletal sites, including the hip, spine, pelvis, ribs, shoulder, or forearm, using 19 years of follow-up data from the Canadian Multicentre Osteoporosis Study (CaMos). We investigated seven ML algorithms and found that XGBoost had slightly better performance compared to other algorithms. We identified important factors that increase the risk of fractures, including BMD, age, and previous fracture. We also demonstrated how the interaction between these factors increases the risk of fractures. The intervention focusing on these factors will help to prevent fragility fractures.

Competing Interest Statement

Acharya and Drs. Borhan, Thabane, Hanely, Berger, and Morin declared no conflict of interest. Dr. Papaioannou reported receiving honoria from Amgen and funding from Osteoporosis Canada. Dr. Goltzman reported receiving funding from the Canadian Institute of Health Research (CIHR), one-time royalties from UpToDate, one consulting fee from Biosyent, patents: 2457928(Canada), 60/384122(USA); 2343713(Canada) issued to the McGill University, and provided clinical expert assessment of Burosumab for treatment of X-linked Hypophosphatemia(XLH). Dr. Adachi reported receiving funding from CIHR, Eli Lily, Merck, Procter & Gamble, Sanofi, Amgen, consulting fees and honoraria from Amgen. Dr. Raina reported receiving funding from the Canadian Institute of Health Research (CIHR) and the Canada Foundation for Innovation and being involved with the WHO working group on life course.

Funding Statement

Dr. Borhan received partial funding through the OC-CaMos fellowship from Osteoporosis Canada to conduct this study.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Hamilton Integrated Research Ethics Board (HiREB) gave ethical approval of this work.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • Conflict of interest disclosure: Acharya and Drs. Borhan, Thabane, Hanely, Berger, and Morin declared no conflict of interest. Dr. Papaioannou reported receiving honoria from Amgen and funding from Osteoporosis Canada. Dr. Goltzman reported receiving funding from the Canadian Institute of Health Research (CIHR), one-time royalties from UpToDate, one consulting fee from Biosyent, patents: 2457928(Canada), 60/384122(USA); 2343713(Canada) issued to the McGill University, and provided clinical expert assessment of Burosumab for treatment of X-linked Hypophosphatemia(XLH). Dr. Adachi reported receiving funding from CIHR, Eli Lily, Merck, Procter & Gamble, Sanofi, Amgen, consulting fees and honoraria from Amgen. Dr. Raina reported receiving funding from the Canadian Institute of Health Research (CIHR) and the Canada Foundation for Innovation and being involved with the WHO working group on life course.

  • Funding: Dr. Borhan received partial funding through the OC-CaMos fellowship from Osteoporosis Canada to conduct this study.

  • Data availability statement: Data are not available to share.

  • The manuscript has been formatted for another journal.

Data Availability

Data are not available to share

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted August 30, 2025.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Machine learning algorithm to predict fragility fractures and identification of important features – an explainable approach
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Machine learning algorithm to predict fragility fractures and identification of important features – an explainable approach
Sayem Borhan, Alexandra Papaioannou, Jonathan Adachi, Shrey Acharya, Suzanne N. Morin, David Goltzman, David A. Hanley, Claudie Berger, Lehana Thabane, Parminder Raina
medRxiv 2025.07.10.25331257; doi: https://doi.org/10.1101/2025.07.10.25331257
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Machine learning algorithm to predict fragility fractures and identification of important features – an explainable approach
Sayem Borhan, Alexandra Papaioannou, Jonathan Adachi, Shrey Acharya, Suzanne N. Morin, David Goltzman, David A. Hanley, Claudie Berger, Lehana Thabane, Parminder Raina
medRxiv 2025.07.10.25331257; doi: https://doi.org/10.1101/2025.07.10.25331257

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Public and Global Health
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (867)
  • Anesthesia (306)
  • Cardiovascular Medicine (4480)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (614)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15276)
  • Forensic Medicine (31)
  • Gastroenterology (1133)
  • Genetic and Genomic Medicine (6644)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4603)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1623)
  • Hematology (544)
  • HIV/AIDS (1275)
  • Infectious Diseases (except HIV/AIDS) (15960)
  • Intensive Care and Critical Care Medicine (1111)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (674)
  • Neurology (6693)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1152)
  • Occupational and Environmental Health (961)
  • Oncology (3369)
  • Ophthalmology (988)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (668)
  • Pediatrics (1703)
  • Pharmacology and Therapeutics (699)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5494)
  • Public and Global Health (9285)
  • Radiology and Imaging (2223)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1201)
  • Rheumatology (598)
  • Sexual and Reproductive Health (720)
  • Sports Medicine (535)
  • Surgery (720)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (267)