Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Prediction of intensive care unit mortality based on missing events

View ORCID ProfileTatsuma Shoji, Hiroshi Yonekura, Sato Yoshiharu, Yohei Kawasaki
doi: https://doi.org/10.1101/2021.02.28.21252249
Tatsuma Shoji
1DNA Chip Research Inc. 1-15-1 Kaigan, Suzue Baydium 5F, Minato-ku, Tokyo 105-0022, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tatsuma Shoji
  • For correspondence: tatsumashoji@bioinforest.com
Hiroshi Yonekura
2Department of Anesthesiology and Pain Medicine, Fujita Health University Bantane Hospital 2-6-10 Otoubashi, Nakagawa-ku, Nagoya City, Aichi, 454-8509, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sato Yoshiharu
1DNA Chip Research Inc. 1-15-1 Kaigan, Suzue Baydium 5F, Minato-ku, Tokyo 105-0022, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yohei Kawasaki
3Faculty of Nursing, Japan Red Cross College of Nursing, Tokyo, Japan 4-1-3 Hiroo, Shibuya-ku, Tokyo 150-0012, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background The increasing availability of electronic health records has made it possible to construct and implement models for predicting intensive care unit (ICU) mortality using machine learning. However, the algorithms used are not clearly described, and the performance of the model remains low owing to several missing values, which is unavoidable in big databases.

Methods We developed an algorithm for subgrouping patients based on missing event patterns using the Philips eICU Research Institute (eRI) database as an example. The eRI database contains data associated with 200,859 ICU admissions from many hospitals (>400) and is freely available. We then constructed a model for each subgroup using random forest classifiers and integrated the models. Finally, we compared the performance of the integrated model with the Acute Physiology and Chronic Health Evaluation (APACHE) scoring system, one of the best known predictors of patient mortality, and the imputation approach-based model.

Results Subgrouping and patient mortality prediction were separately performed on two groups: the sepsis group (the ICU admission diagnosis of which is sepsis) and the non-sepsis group (a complementary subset of the sepsis group). The subgrouping algorithm identified a unique, clinically interpretable missing event patterns and divided the sepsis and non-sepsis groups into five and seven subgroups, respectively. The integrated model, which comprises five models for the sepsis group or seven models for the non-sepsis group, greatly outperformed the APACHE IV or IVa, with an area under the receiver operating characteristic (AUROC) of 0.91 (95% confidence interval 0.89–0.92) compared with 0.79 (0.76–0.81) for the APACHE system in the sepsis group and an AUROC of 0.90 (0.89–0.91) compared with 0.86 (0.85–0.87) in the non-sepsis group. Moreover, our model outperformed the imputation approach-based model, which had an AUROC of 0.85 (0.83–0.87) and 0.87 (0.86–0.88) in the sepsis and non-sepsis groups, respectively.

Conclusions We developed a method to predict patient mortality based on missing event patterns. Our method more accurately predicts patient mortality than others. Our results indicate that subgrouping, based on missing event patterns, instead of imputation is essential and effective for machine learning against patient heterogeneity.

Trial registration Not applicable.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by JSPS KAKENHI Grant Number JP 20K17834.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Prior to requesting access to Philips eICU Research Institute (eRI) database, researchers are required to complete the CITI Data or Specimens Only Research course. We have completed the course and received the approval for the use of 31 csv files in the eRI database. Regarding the statement about the ethics oversight body that gave ethical approval for the collection of the original data, the original database is released under the Health Insurance Portability and Accountability Act (HIPAA) safe harbor provision. The re-identification risk was certified as meeting safe harbor standards by Privacert (Cambridge, MA) (HIPAA Certification no. 1031219-2).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • E-mail: Shoji Tatsuma: t-shoji{at}dna-chip.co.jp, Yonekura Hiroshi: hiroshi.yonekura{at}fujita-hu.ac.jp, Sato Yoshiharu: yo-sato{at}dna-chip.co.jp, Kawashiki yohei: ykawasaki{at}chiba-u.jp

Data Availability

The datasets generated and/or analyzed during the current study are available in the eICU repository.

https://eicu-crd.mit.edu/gettingstarted/access/

  • List of abbreviations

    APACHE
    Acute Physiology and Chronic Health Evaluation
    ICU
    Intensive Care Unit
    eRI
    eICU Research Institute
    APS
    Acute Physiology Score
    ROC
    Receiver Operating Characteristic
    AUROC
    Area Under the ROC
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted March 02, 2021.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Prediction of intensive care unit mortality based on missing events
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Prediction of intensive care unit mortality based on missing events
    Tatsuma Shoji, Hiroshi Yonekura, Sato Yoshiharu, Yohei Kawasaki
    medRxiv 2021.02.28.21252249; doi: https://doi.org/10.1101/2021.02.28.21252249
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Prediction of intensive care unit mortality based on missing events
    Tatsuma Shoji, Hiroshi Yonekura, Sato Yoshiharu, Yohei Kawasaki
    medRxiv 2021.02.28.21252249; doi: https://doi.org/10.1101/2021.02.28.21252249

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Intensive Care and Critical Care Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (179)
    • Allergy and Immunology (434)
    • Anesthesia (99)
    • Cardiovascular Medicine (948)
    • Dentistry and Oral Medicine (178)
    • Dermatology (110)
    • Emergency Medicine (260)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (422)
    • Epidemiology (8987)
    • Forensic Medicine (4)
    • Gastroenterology (420)
    • Genetic and Genomic Medicine (1958)
    • Geriatric Medicine (190)
    • Health Economics (402)
    • Health Informatics (1329)
    • Health Policy (660)
    • Health Systems and Quality Improvement (519)
    • Hematology (212)
    • HIV/AIDS (420)
    • Infectious Diseases (except HIV/AIDS) (10808)
    • Intensive Care and Critical Care Medicine (575)
    • Medical Education (200)
    • Medical Ethics (54)
    • Nephrology (222)
    • Neurology (1830)
    • Nursing (110)
    • Nutrition (274)
    • Obstetrics and Gynecology (353)
    • Occupational and Environmental Health (470)
    • Oncology (999)
    • Ophthalmology (298)
    • Orthopedics (111)
    • Otolaryngology (182)
    • Pain Medicine (126)
    • Palliative Medicine (44)
    • Pathology (265)
    • Pediatrics (580)
    • Pharmacology and Therapeutics (276)
    • Primary Care Research (234)
    • Psychiatry and Clinical Psychology (1903)
    • Public and Global Health (4123)
    • Radiology and Imaging (676)
    • Rehabilitation Medicine and Physical Therapy (368)
    • Respiratory Medicine (549)
    • Rheumatology (225)
    • Sexual and Reproductive Health (191)
    • Sports Medicine (177)
    • Surgery (207)
    • Toxicology (39)
    • Transplantation (109)
    • Urology (81)