Abstract
Background Bladder cancer is the most common cancer of the urinary system among the American population and it is the fourth most common cause of cancer morbidity and the eight most common cause of cancer mortality among men. Using machine learning algorithms, we predict the five-year survival among bladder cancer patients and deploy the best performing algorithm as a web application for survival prediction.
Methods Microscopically confirmed adult bladder cancer patients were included from the Surveillance Epidemiology and End Results (SEER) database (2000-2017) and randomly split into training and test datasets (70/30 ratio). Five machine learning algorithms (logistic regression, support vector machine, gradient boosting, random forest, and K nearest neighbor) were trained on features to predict five-year survival. The algorithms were compared with performance metrics and the best performing algorithm was deployed as a web application.
Results A total of 52,529 patients were included in our study. The gradient boosting algorithm was the best performer in terms of predictive ability and discrimination. It was deployed as the survival prediction web application named BlaCaSurv (https://blacasurv.herokuapp.com/).
Conclusions We tested several machine learning algorithms and developed a web application for predicting five-year survival for bladder cancer patients. This application can be used as a supplementary prognostic tool to clinical decision making.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
No funding received for this study.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Study uses anonymized data from a publicly available database.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Paper uses SEER database