Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Systematic Review of Supervised Machine Learning Models in Prediction of Medical Conditions

View ORCID ProfileBranimir Ljubic, View ORCID ProfileMartin Pavlovski, Avrum Gillespie, View ORCID ProfileDaniel Rubin, View ORCID ProfileGalen Collier, Zoran Obradovic
doi: https://doi.org/10.1101/2022.04.22.22274183
Branimir Ljubic
1Temple University, Center for Data Analytics and Biomedical Informatics (DABI), Philadelphia, PA 19122, U.S.A.
2Rutgers University, Office of Advanced Research Computing (OARC), Piscataway, NJ 08854, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Branimir Ljubic
Martin Pavlovski
1Temple University, Center for Data Analytics and Biomedical Informatics (DABI), Philadelphia, PA 19122, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Martin Pavlovski
Avrum Gillespie
3Lewis Katz School of Medicine, Temple University, Philadelphia PA 19140, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Rubin
3Lewis Katz School of Medicine, Temple University, Philadelphia PA 19140, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Daniel Rubin
Galen Collier
2Rutgers University, Office of Advanced Research Computing (OARC), Piscataway, NJ 08854, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Galen Collier
Zoran Obradovic
1Temple University, Center for Data Analytics and Biomedical Informatics (DABI), Philadelphia, PA 19122, U.S.A.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: zoran.obradovic@temple.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Machine learning (ML) models for analyzing medical data are critical for both accelerating development of novel diagnostic and treatment strategies and improving the accuracy of medical care delivery. Our objective was to comprehensively review supervised ML models for diagnosis or treatment prediction. Publications indexed in PubMed were reviewed to identify articles utilizing supervised predictive ML models in medicine. Articles published between 01/01/2020–01/01/2022 were included in this review. Initially, PubMed was searched using MeSH major terms, and if more extensive search results were needed, a broader search was applied (titles/abstracts).

PubMed indexed 21,268 published articles (MeSH Major topic) describing ML methods implemented in medicine. Of those, 11,726 articles were published within the last 2 years. Most of the published ML models in medicine in the last two years were different types of deep learning models (about 75%). Fifty articles were included in this review.

Almost all categories of disease were subjects of ML predictions. Positive and negative factors in each of the scenarios need to be evaluated before the most optimal ML model is selected. Domain knowledge and collaborations between physicians and ML experts can improve the selection and prediction performance of ML models in medicine and facilitate implementation in clinical practice. Predictive ML models could provide recommendations to recruit suitable patients for clinical trials. Prediction ML models may contribute to development of more effective diagnostic and therapeutic choices, founded on evidence-based medicine. A broad range of methodological approaches have been taken toward this goal, and those approaches are presented here with their various advantages and disadvantages.

AUTHOR SUMMARY Over the last decade, there has been rapid development of Machine learning (ML) methods to analyze Big Data in medicine. ML is aimed to make the computer learn from past experiences and make predictions by recognizing patterns in medical data. We performed a comprehensive systematic literature review of recent publications (last two years), indexed in PubMed/MEDLINE that have described either traditional or deep supervised prediction ML models in medicine. We identified 21,268 articles describing ML implementation in medicine. 11,726 articles were published in the last 2 years. We presented the number of publications describing each of the most often ML methods to show current trends in development of these models. Most of the recently published ML models in medicine were deep learning models. We found that the understanding of disease is likely to lead to more accurate prediction. An important dilemma is the selection of optimal ML models for a specific task, considering amount and type of available data. Domain knowledge and collaborations between physicians and ML experts can improve the prediction performance of ML models, which could help clinicians to select the most effective diagnostic and therapeutic choices available and decrease medical errors.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This research was supported by the National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health under Award Number R01DK122073. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The research was also supported by Clinical and Translational Science Award (CTSA) Program grants under Award numbers: UL1TR003017, KL2TR003018 and TL1TR003019.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Not Applicable

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

N/A

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Not Applicable

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Not Applicable

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Not Applicable

Data Availability

All data used in the manuscript are provided as part of the submitted article. Data are extracted from searching the PubMed, publicly available database.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted April 27, 2022.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Systematic Review of Supervised Machine Learning Models in Prediction of Medical Conditions
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Systematic Review of Supervised Machine Learning Models in Prediction of Medical Conditions
Branimir Ljubic, Martin Pavlovski, Avrum Gillespie, Daniel Rubin, Galen Collier, Zoran Obradovic
medRxiv 2022.04.22.22274183; doi: https://doi.org/10.1101/2022.04.22.22274183
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Systematic Review of Supervised Machine Learning Models in Prediction of Medical Conditions
Branimir Ljubic, Martin Pavlovski, Avrum Gillespie, Daniel Rubin, Galen Collier, Zoran Obradovic
medRxiv 2022.04.22.22274183; doi: https://doi.org/10.1101/2022.04.22.22274183

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (230)
  • Allergy and Immunology (507)
  • Anesthesia (111)
  • Cardiovascular Medicine (1262)
  • Dentistry and Oral Medicine (207)
  • Dermatology (148)
  • Emergency Medicine (283)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (538)
  • Epidemiology (10055)
  • Forensic Medicine (5)
  • Gastroenterology (502)
  • Genetic and Genomic Medicine (2486)
  • Geriatric Medicine (240)
  • Health Economics (482)
  • Health Informatics (1653)
  • Health Policy (757)
  • Health Systems and Quality Improvement (638)
  • Hematology (250)
  • HIV/AIDS (538)
  • Infectious Diseases (except HIV/AIDS) (11896)
  • Intensive Care and Critical Care Medicine (627)
  • Medical Education (255)
  • Medical Ethics (75)
  • Nephrology (269)
  • Neurology (2304)
  • Nursing (140)
  • Nutrition (354)
  • Obstetrics and Gynecology (458)
  • Occupational and Environmental Health (537)
  • Oncology (1259)
  • Ophthalmology (377)
  • Orthopedics (134)
  • Otolaryngology (226)
  • Pain Medicine (158)
  • Palliative Medicine (50)
  • Pathology (326)
  • Pediatrics (737)
  • Pharmacology and Therapeutics (315)
  • Primary Care Research (282)
  • Psychiatry and Clinical Psychology (2295)
  • Public and Global Health (4850)
  • Radiology and Imaging (846)
  • Rehabilitation Medicine and Physical Therapy (493)
  • Respiratory Medicine (657)
  • Rheumatology (289)
  • Sexual and Reproductive Health (241)
  • Sports Medicine (228)
  • Surgery (273)
  • Toxicology (44)
  • Transplantation (131)
  • Urology (100)