%0 Journal Article %A Akhil Vaid %A Sulaiman Somani %A Adam J Russak %A Jessica K De Freitas %A Fayzan F Chaudhry %A Ishan Paranjpe %A Kipp W Johnson %A Samuel J Lee %A Riccardo Miotto %A Shan Zhao %A Noam D Beckmann %A Nidhi Naik %A Kodi Arfer %A Arash Kia %A Prem Timsina %A Anuradha Lala %A Manish Paranjpe %A Patricia Glowe %A Eddye Golden %A Matteo Danieletto %A Manbir Singh %A Dara Meyer %A Paul F O’Reilly %A Laura H Huckins %A Patricia Kovatch %A Joseph Finkelstein %A Robert M Freeman %A Edgar Argulian %A Andrew Kasarskis %A Bethany Percha %A Judith A Aberg %A Emilia Bagiella %A Carol R Horowitz %A Barbara Murphy %A Eric J Nestler %A Eric E Schadt %A Judy H Cho %A Carlos Cordon-Cardo %A Valentin Fuster %A Dennis S Charney %A David L Reich %A Erwin P Bottinger %A Matthew A Levin %A Jagat Narula %A Zahi A Fayad %A Allan C Just %A Alexander W Charney %A Girish N Nadkarni %A Benjamin S Glicksberg %A on behalf of the Mount Sinai Covid Informatics Center (MSCIC). %T Machine Learning to Predict Mortality and Critical Events in COVID-19 Positive New York City Patients %D 2020 %R 10.1101/2020.04.26.20073411 %J medRxiv %P 2020.04.26.20073411 %X Coronavirus 2019 (COVID-19), caused by the SARS-CoV-2 virus, has become the deadliest pandemic in modern history, reaching nearly every country worldwide and overwhelming healthcare institutions. As of April 20, there have been more than 2.4 million confirmed cases with over 160,000 deaths. Extreme case surges coupled with challenges in forecasting the clinical course of affected patients have necessitated thoughtful resource allocation and early identification of high-risk patients. However, effective methods for achieving this are lacking. In this paper, we use electronic health records from over 3,055 New York City confirmed COVID-19 positive patients across five hospitals in the Mount Sinai Health System and present a decision tree-based machine learning model for predicting in-hospital mortality and critical events. This model is first trained on patients from a single hospital and then externally validated on patients from four other hospitals. We achieve strong performance, notably predicting mortality at 1 week with an AUC-ROC of 0.84. Finally, we establish model interpretability by calculating SHAP scores to identify decisive features, including age, inflammatory markers (procalcitonin and LDH), and coagulation parameters (PT, PTT, D-Dimer). To our knowledge, this is one of the first models with external validation to both predict outcomes in COVID-19 patients with strong validation performance and identify key contributors in outcome prediction that may assist clinicians in making effective patient management decisions.One-Sentence Summary We identify clinical features that robustly predict mortality and critical events in a large cohort of COVID-19 positive patients in New York City.Competing Interest StatementThe authors have declared no competing interest.Funding Statementn/aAuthor DeclarationsAll relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.Yesn/a %U https://www.medrxiv.org/content/medrxiv/early/2020/04/28/2020.04.26.20073411.full.pdf