Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Integrating Machine Learning-Based Variable Selection into Heat Vulnerability Index Design

View ORCID ProfileShuyue Qu, View ORCID ProfileJana Sillmann, View ORCID ProfileBenjamin W. Barrett, View ORCID ProfilePeter M. Graffy, View ORCID ProfileBenjamin Poschlod, View ORCID ProfileLukas Brunner, View ORCID ProfileRaed Mansour, View ORCID ProfileMalte von Szombathely, View ORCID ProfileFinley Hay-Chapman, View ORCID ProfileTeresa H. Horton, Jennifer Chan, View ORCID ProfileSheetal Khedkar Rao, Kyra Woods, View ORCID ProfileAbel N Kho, View ORCID ProfileDaniel E. Horton
doi: https://doi.org/10.64898/2026.03.29.26349672
Shuyue Qu
1Research Unit Sustainability and Climate Risk, Earth and Society Research Hub (ESRAH), University of Hamburg, Hamburg, Germany
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shuyue Qu
  • For correspondence: shuyue.qu{at}uni-hamburg.de shuyue.qu{at}northwestern.edu
Jana Sillmann
1Research Unit Sustainability and Climate Risk, Earth and Society Research Hub (ESRAH), University of Hamburg, Hamburg, Germany
3Center of International Climate Research (CICERO), Oslo, Norway
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jana Sillmann
Benjamin W. Barrett
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
4Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin W. Barrett
Peter M. Graffy
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
4Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Peter M. Graffy
Benjamin Poschlod
1Research Unit Sustainability and Climate Risk, Earth and Society Research Hub (ESRAH), University of Hamburg, Hamburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin Poschlod
Lukas Brunner
1Research Unit Sustainability and Climate Risk, Earth and Society Research Hub (ESRAH), University of Hamburg, Hamburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lukas Brunner
Raed Mansour
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
5Metropolitan Planning Council, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Raed Mansour
Malte von Szombathely
1Research Unit Sustainability and Climate Risk, Earth and Society Research Hub (ESRAH), University of Hamburg, Hamburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Malte von Szombathely
Finley Hay-Chapman
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
9Department of Earth, Environmental, and Planetary Science, Northwestern University, Evanston, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Finley Hay-Chapman
Teresa H. Horton
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
6Department of Anthropology, Northwestern University, Evanston, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Teresa H. Horton
Jennifer Chan
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
4Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sheetal Khedkar Rao
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
7Department of Medicine, University of Illinois, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sheetal Khedkar Rao
Kyra Woods
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
8Sustain Our Future Foundation, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Abel N Kho
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
4Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Abel N Kho
Daniel E. Horton
2Defusing Disasters Working Group, Buffett Institute for Global Affairs, Northwestern University, Evanston, IL, USA
9Department of Earth, Environmental, and Planetary Science, Northwestern University, Evanston, IL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Daniel E. Horton
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

As climate change intensifies, health risks from extreme heat are rising. Accurate assessment of heat vulnerability at high spatial resolution is crucial for developing effective adaptation strategies, particularly in socioeconomically heterogeneous urban settings. However, the identification of key indicators underlying heat vulnerability remains challenging. Using Chicago, Illinois (USA) as a case study, we systematically compare different variable selection strategies in community-level heat vulnerability assessments. We take the conventional unsupervised principal component analysis (PCA)-based Heat Vulnerability Index (HVI) as a baseline, and compare it with supervised approaches that incorporate variable selection, including machine learning algorithms (Lasso regression, Random Forest, and XGBoost) as well as traditional statistical methods (simple linear regression and polynomial regression). Using the vulnerability indicator subsets identified by each variable selection method, we construct multiple HVIs and evaluate their performance against heat-related excess mortality. Our work indicates that supervised variable selection improves the performance of HVIs in capturing heat-related health risks. Among all methods, the Random Forest-based variable selection algorithm achieves the best overall results, highlighting the potential of machine learning to enhance heat vulnerability assessment tools. Our results demonstrate that poverty rate, lack of air conditioning, and proportion of residents aged 65 and above are robust determinants of heat vulnerability in Chicago.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study was funded by the School of Integrated Climate and Earth System Sciences at University of Hamburg; the Office of Global Initiatives at the McCormick School of Engineering and the Paula M. Trienens Institute for Sustainability and Energy at Northwestern University; and the Buffett Institute for Global Affairs Defusing Disasters Working Group. JS, BP, LB and MvS acknowledge funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy - EXC 2037 "CLICCS - Climate, Climatic Change, and Society" - Project Number 390683824.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Institutional Review Board of Northwestern University gave ethical approval for this work (protocol number STU00219292).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

Meteorological data and socio-demographic data are publicly available from relevant data providers. Mortality data were obtained through institutional access and are not publicly available. Access to mortality data may be requested from the original data provider, subject to their approval.

https://www.census.gov/programs-surveys/acs.html

https://daymet.ornl.gov/

https://data.cityofchicago.org/

https://www.usgs.gov/centers/eros/science/national-land-cover-database

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted March 31, 2026.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Integrating Machine Learning-Based Variable Selection into Heat Vulnerability Index Design
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Integrating Machine Learning-Based Variable Selection into Heat Vulnerability Index Design
Shuyue Qu, Jana Sillmann, Benjamin W. Barrett, Peter M. Graffy, Benjamin Poschlod, Lukas Brunner, Raed Mansour, Malte von Szombathely, Finley Hay-Chapman, Teresa H. Horton, Jennifer Chan, Sheetal Khedkar Rao, Kyra Woods, Abel N Kho, Daniel E. Horton
medRxiv 2026.03.29.26349672; doi: https://doi.org/10.64898/2026.03.29.26349672
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Integrating Machine Learning-Based Variable Selection into Heat Vulnerability Index Design
Shuyue Qu, Jana Sillmann, Benjamin W. Barrett, Peter M. Graffy, Benjamin Poschlod, Lukas Brunner, Raed Mansour, Malte von Szombathely, Finley Hay-Chapman, Teresa H. Horton, Jennifer Chan, Sheetal Khedkar Rao, Kyra Woods, Abel N Kho, Daniel E. Horton
medRxiv 2026.03.29.26349672; doi: https://doi.org/10.64898/2026.03.29.26349672

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Public and Global Health
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (867)
  • Anesthesia (306)
  • Cardiovascular Medicine (4480)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (614)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15276)
  • Forensic Medicine (31)
  • Gastroenterology (1133)
  • Genetic and Genomic Medicine (6643)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4603)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1622)
  • Hematology (544)
  • HIV/AIDS (1275)
  • Infectious Diseases (except HIV/AIDS) (15960)
  • Intensive Care and Critical Care Medicine (1110)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (674)
  • Neurology (6693)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1152)
  • Occupational and Environmental Health (961)
  • Oncology (3369)
  • Ophthalmology (988)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (668)
  • Pediatrics (1703)
  • Pharmacology and Therapeutics (699)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5494)
  • Public and Global Health (9284)
  • Radiology and Imaging (2223)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1201)
  • Rheumatology (598)
  • Sexual and Reproductive Health (720)
  • Sports Medicine (535)
  • Surgery (720)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (266)