RT Journal Article SR Electronic T1 Development and Validation of Sex-Specific Hip Fracture Prediction Models using Electronic Health Records JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2022.10.26.22281584 DO 10.1101/2022.10.26.22281584 A1 Li, Gloria Hoi-Yee A1 Cheung, Ching-Lung A1 Tan, Kathryn Choon-Beng A1 Kung, Annie Wai-Chee A1 Kwok, Timothy Chi-Yui A1 Lau, Wallis Cheuk-Yin A1 Wong, Janus Siu-Him A1 Hsu, Warrington W.Q. A1 Fang, Christian A1 Wong, Ian Chi-Kei YR 2022 UL http://medrxiv.org/content/early/2022/10/27/2022.10.26.22281584.abstract AB Background Hip fracture is associated with immobility, morbidity, mortality, and high medical cost. Due to limited availability of dual-energy X-ray absorptiometry (DXA), hip fracture prediction models without using bone mineral density (BMD) data are essential. We aimed to develop and validate 10-year sex-specific hip fracture prediction models using electronic health records (EHR) without BMD. Methods In this population-based study, the derivation cohort comprised 161,051 public healthcare service users (91,926 female; 69,125 male) in Hong Kong aged ≥60. Sex-stratified derivation cohort was randomly split to 80% training and 20% internal testing datasets. An external validation cohort comprised 3,046 community-dwelling participants. With 395 potential predictors (age, diagnosis and drug prescription records from EHR), 10-year sex-specific hip fracture prediction models were developed using stepwise selection by logistic regression (LR) and four machine learning (ML) algorithms (gradient boosting machine, random forest, eXtreme gradient boosting, and single-layer neural networks) in the training cohort. Model performance was evaluated in both internal and external validation cohorts. Findings In female, the LR model had the highest AUC (0.815) and adequate calibration in internal validation. Reclassification metrics showed ML algorithms could not further improve the performance of the LR model. 
Similar performance was attained by the LR model in external validation, with high AUC (0.841) comparable to other ML algorithms. In internal validation for male, LR model had high AUC (0.818) and it outperformed all ML models as indicated by reclassification metrics, with adequate calibration. In external validation, the LR model had high AUC (0.898) comparable to ML algorithms. Reclassification metrics demonstrated that LR model had the best discrimination performance. Interpretation Even without using BMD data, the 10-year hip fracture prediction models developed by conventional LR had better discrimination performance than the models developed by ML algorithms. Upon further validation in independent cohorts, the LR models could be integrated into the routine clinical workflow, aiding the identification of people at high risk for DXA scan. Funding This study was funded by the Health and Medical Research Fund, Food and Health Bureau, Hong Kong SAR Government (reference: 17181381). Competing Interest Statement CLC reports grants and personal fees from Amgen outside the submitted work. 
The other authors have nothing to declare. Funding Statement The study is supported by the Health and Medical Research Fund, Food and Health Bureau, Hong Kong SAR Government (reference: 17181381) granted to GHL. Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The study protocol was approved by the institutional review board of the University of Hong Kong and the HA Hong Kong West Cluster (reference: UW 19-798), and the Hong Kong Polytechnic University (reference: HSEARS20201109004). I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. This study is conducted based on the anonymised dataset from the CDARS. We are unable to share the CDARS data used in this study since the data custodian, the Hong Kong Hospital Authority, has not provided us the permission. 
Nevertheless, CDARS data can be accessed via the Hospital Authority Data Sharing Portal for research purpose (https://www3.ha.org.hk/data).