Abstract
Background Predicting accurately the frequent Emergency Department (ED) visitors is critical for hospitals because they often consume significant ED resources, including staff time, equipment, and medical supplies. Furthermore, frequent ED visitors may contribute to increased wait times for all patients. Therefore, by accurately predicting and identifying these individuals, hospitals can help reduce the burden on the ED and decrease wait times for all patients, improving the overall quality of care.
Objective This study proposed an effective and adaptive ensemble learning prediction model to identify frequent visitors in the emergency department.
Methods This was a retrospective population-based study of patients and utilised medical and administrative databases at Canberra Hospital, a tertiary public hospital in ACT, Australia, between January 1997 and December 2022. The study focuses on a wide age range of the population with 20 viral chronic diseases. The definition of frequent ED use is considered as having at least three visits within a year. This study developed an Adaptive ensemble learning–based prediction model and compared the performance with 16 popular machine learning models. In addition, three techniques are compared to handle the imbalanced data issue, and we also proposed a hybrid feature selection composed of Elastic-Net and local search to find the best combination of features. In order to hyper-parameter tuning, two techniques were compared: a population-based evolutionary algorithm and a local search.
Results The study included 535,474 patient visits and 1.6 million episodes, with 25% overall frequent visitors. We compared the performance of the proposed prediction model with that of the other 16 popular classifiers. According to the prediction results, the proposed model considerably outperformed other models in terms of five metrics: accuracy, Recall, F1-score, Area under the ROC curve (AUC), and Log loss at 0.78 (95% CI 0.78-0.79), 0.68 (95% CI 0.68-0.68), 0.68 (95% CI 0.68-0.69), 0.69 (95% CI 0.69-0.70), and 7.4 (95% CI 7.2-7.5), respectively.
Conclusions We proposed an adaptive ensemble learning model combining XGBoost Elastic-net with local search and Differential evolution to address the imbalanced nature of the frequent ED visitors’ data. Our approach aimed to enhance the prediction capability of the classifier substantially. To tackle the class imbalance, we employed both under-sampling and adjusted weights for the positive class. Through extensive testing and evaluation, we demonstrated that these strategies effectively improved the model’s performance. Further, we emphasised the importance of employing a robust feature selection method and a fast hyper-parameter optimiser. These elements were essential for enhancing the identification of frequent ED visitors. By incorporating these techniques, our study contributes to developing more accurate and reliable models for predicting frequent ED visitors, thereby assisting hospitals in resource allocation and patient care management.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study did not receive any external funding.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study has received ethical approval from The Australian Capital Territory Health Human Research Ethics Committee (Approval No. 2024/ETH01037).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
mehdi.neshat{at}ieee.org, michael.phipps{at}act.gov.au
nikhil.jha{at}act.gov.au
chris.browne{at}anu.edu.au
Walter.P.Abhayaratna{at}act.gov.au
Walter.P.Abhayaratna{at}anu.edu.au
↵6 https://github.com/scipy/scipy/blob/main/scipy/optimize/differentialevolution.py
Data Availability
All data produced in the present study are available upon reasonable request to Michael Phipps (michael.phipps{at}act.gov.au), the esteemed Senior Director of the Canberra Health Service.