<?xml version='1.0' encoding='UTF-8'?><xml><records><record><source-app name="HighWire" version="7.x">Drupal-HighWire</source-app><ref-type name="Journal Article">17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Qu, Shuyue</style></author><author><style face="normal" font="default" size="100%">Sillmann, Jana</style></author><author><style face="normal" font="default" size="100%">Barrett, Benjamin W.</style></author><author><style face="normal" font="default" size="100%">Graffy, Peter M.</style></author><author><style face="normal" font="default" size="100%">Poschlod, Benjamin</style></author><author><style face="normal" font="default" size="100%">Brunner, Lukas</style></author><author><style face="normal" font="default" size="100%">Mansour, Raed</style></author><author><style face="normal" font="default" size="100%">Szombathely, Malte von</style></author><author><style face="normal" font="default" size="100%">Hay-Chapman, Finley</style></author><author><style face="normal" font="default" size="100%">Horton, Teresa H.</style></author><author><style face="normal" font="default" size="100%">Chan, Jennifer</style></author><author><style face="normal" font="default" size="100%">Rao, Sheetal Khedkar</style></author><author><style face="normal" font="default" size="100%">Woods, Kyra</style></author><author><style face="normal" font="default" size="100%">Kho, Abel N</style></author><author><style face="normal" font="default" size="100%">Horton, Daniel E.</style></author></authors><secondary-authors></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">Integrating Machine Learning-Based Variable Selection into Heat Vulnerability Index Design</style></title><secondary-title><style face="normal" font="default" size="100%">medRxiv</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2026</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2026-01-01 00:00:00</style></date></pub-dates></dates><elocation-id><style  face="normal" font="default" size="100%">2026.03.29.26349672</style></elocation-id><doi><style  face="normal" font="default" size="100%">10.64898/2026.03.29.26349672</style></doi><volume><style face="normal" font="default" size="100%"></style></volume><issue><style face="normal" font="default" size="100%"></style></issue><abstract><style  face="normal" font="default" size="100%">As climate change intensifies, health risks from extreme heat are rising. Accurate assessment of heat vulnerability at high spatial resolution is crucial for developing effective adaptation strategies, particularly in socioeconomically heterogeneous urban settings. However, the identification of key indicators underlying heat vulnerability remains challenging. Using Chicago, Illinois (USA) as a case study, we systematically compare different variable selection strategies in community-level heat vulnerability assessments. We take the conventional unsupervised principal component analysis (PCA)-based Heat Vulnerability Index (HVI) as a baseline, and compare it with supervised approaches that incorporate variable selection, including machine learning algorithms (Lasso regression, Random Forest, and XGBoost) as well as traditional statistical methods (simple linear regression and polynomial regression). Using the vulnerability indicator subsets identified by each variable selection method, we construct multiple HVIs and evaluate their performance against heat-related excess mortality. Our work indicates that supervised variable selection improves the performance of HVIs in capturing heat-related health risks. Among all methods, the Random Forest-based variable selection algorithm achieves the best overall results, highlighting the potential of machine learning to enhance heat vulnerability assessment tools. Our results demonstrate that poverty rate, lack of air conditioning, and proportion of residents aged 65 and above are robust determinants of heat vulnerability in Chicago.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was funded by the School of Integrated Climate and Earth System Sciences at University of Hamburg; the Office of Global Initiatives at the McCormick School of Engineering and the Paula M. Trienens Institute for Sustainability and Energy at Northwestern University; and the Buffett Institute for Global Affairs Defusing Disasters Working Group. JS, BP, LB and MvS acknowledge funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy - EXC 2037 "CLICCS - Climate, Climatic Change, and Society" - Project Number 390683824.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Institutional Review Board of Northwestern University gave ethical approval for this work (protocol number STU00219292).I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesMeteorological data and socio-demographic data are publicly available from relevant data providers. Mortality data were obtained through institutional access and are not publicly available. Access to mortality data may be requested from the original data provider, subject to their approval. https://www.census.gov/programs-surveys/acs.html https://daymet.ornl.gov/ https://data.cityofchicago.org/ https://www.usgs.gov/centers/eros/science/national-land-cover-database</style></abstract></record></records></xml>