Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Multitask learning from clinical text and acute physiological conditions differentially improve the prediction of mortality and diagnosis at the ICU

L.G. Reichmann, G. Valdes, Romain Pirrachio, View ORCID ProfileY. Interian
doi: https://doi.org/10.1101/2020.06.30.20143677
L.G. Reichmann
1Data Science Program, University of San Francisco, San Francisco, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
G. Valdes
2Department of Radiation Oncology, University of California, San Francisco, San Francisco, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Romain Pirrachio
3Department of Anesthesia and Perioperative Medicine, Zuckerberg San Francisco General Hospital, University of California San Francisco, San Francisco, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Y. Interian
1Data Science Program, University of San Francisco, San Francisco, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Y. Interian
  • For correspondence: yinterian@usfca.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The prediction of mortality of critically ill patients has stimulated the development of many severity scoring algorithms. The majority of the models use physiological measurements obtained during the first hours of admission (i.e., heart rate, arterial blood pressure, or respiratory rate). In this study, we propose to improve the performance of current scoring system by including free text from patient’s medical history. Although the primary outcome was in-hospital mortality, we chose a model architecture to provide simultaneous assessment of ICD-9 codes and groupings. We hypothesized that including patients’ medical history with a multitask learning approach would improve model performance. We compared the predictive performance obtained with our approach to the best models previously proposed in the literature (baseline models). We used the MIMIC publicly available database which includes > 60,000 ICU admissions between 2001 and 2012. The patients’ condition at admission was accounted for by the preliminary diagnosis at admission and the medical history extracted from the discharge summaries notes. Unstructured data was processed through a Gated Recurrent Units layer with pre-trained word embeddings, and the hidden states were concatenated to the remaining structured-tabular data. Baseline models achieved similar results than in previously published work, but our artificial neural networks models showed significant improvement towards classification of mortality (AUC-ROC = 0.90). Including the medical history improved all tasks but relatively more the ICD-9 codes prediction than the mortality. The clinical prediction model presented here could be used to identify patients’ risk groups, which would improve the quality of ICU care, and further help to efficiently allocate hospital resources.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

Lara Reichmann had partial funding by https://wamri.ai/.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study uses the MIMIC dataset. We are using the MIMIC IRB. This study was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the study did not impact clinical care and all protected health information was de-identified. De-identification was performed in compliance with Health Insurance Portability and Accountability Act (HIPAA) standards in order to facilitate public access to MIMIC-II. Deletion of protected health information (PHI) from structured data sources (e.g., database fields that provide patient name or date of birth) was straightforward. Additionally, PHI were removed from the discharge summaries and diagnostic reports as well as the approximately 700,000 free-text nursing and respiratory notes in MIMIC-II using an automated algorithm previously shown to out perform clinicians in detecting PHI.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

We used publically available data from MIMIC dataset.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted July 08, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Multitask learning from clinical text and acute physiological conditions differentially improve the prediction of mortality and diagnosis at the ICU
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Multitask learning from clinical text and acute physiological conditions differentially improve the prediction of mortality and diagnosis at the ICU
L.G. Reichmann, G. Valdes, Romain Pirrachio, Y. Interian
medRxiv 2020.06.30.20143677; doi: https://doi.org/10.1101/2020.06.30.20143677
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Multitask learning from clinical text and acute physiological conditions differentially improve the prediction of mortality and diagnosis at the ICU
L.G. Reichmann, G. Valdes, Romain Pirrachio, Y. Interian
medRxiv 2020.06.30.20143677; doi: https://doi.org/10.1101/2020.06.30.20143677

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Intensive Care and Critical Care Medicine
Subject Areas
All Articles
  • Addiction Medicine (175)
  • Allergy and Immunology (423)
  • Anesthesia (97)
  • Cardiovascular Medicine (902)
  • Dentistry and Oral Medicine (171)
  • Dermatology (102)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (408)
  • Epidemiology (8810)
  • Forensic Medicine (4)
  • Gastroenterology (405)
  • Genetic and Genomic Medicine (1871)
  • Geriatric Medicine (179)
  • Health Economics (389)
  • Health Informatics (1294)
  • Health Policy (644)
  • Health Systems and Quality Improvement (495)
  • Hematology (207)
  • HIV/AIDS (397)
  • Infectious Diseases (except HIV/AIDS) (10591)
  • Intensive Care and Critical Care Medicine (565)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (218)
  • Neurology (1770)
  • Nursing (105)
  • Nutrition (267)
  • Obstetrics and Gynecology (344)
  • Occupational and Environmental Health (461)
  • Oncology (968)
  • Ophthalmology (285)
  • Orthopedics (107)
  • Otolaryngology (177)
  • Pain Medicine (118)
  • Palliative Medicine (43)
  • Pathology (265)
  • Pediatrics (560)
  • Pharmacology and Therapeutics (266)
  • Primary Care Research (221)
  • Psychiatry and Clinical Psychology (1847)
  • Public and Global Health (3998)
  • Radiology and Imaging (657)
  • Rehabilitation Medicine and Physical Therapy (344)
  • Respiratory Medicine (537)
  • Rheumatology (216)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (167)
  • Surgery (199)
  • Toxicology (37)
  • Transplantation (107)
  • Urology (80)