Abstract
BACKGROUND Sleep disorders pose a major global health burden and are associated with a wide range of adverse health outcomes. Polysomnography (PSG) is the gold standard for sleep assessment, but it is impractical for home-based or long-term monitoring. We investigated whether zero-burden cardiorespiratory signals, when harnessed through foundation model approaches, can enable accurate sleep assessment at scale and capture broader dimensions of multi-organ health.
METHODS We present SleepFounder, a foundation model for zero-burden sleep monitoring built upon cardiorespiratory signals. SleepFounder was developed on the largest curated multi-ethnic sleep dataset to date, comprising over 800,000 hours of recordings from 35 cohorts across the United States and China. We evaluated SleepFounder across downstream tasks ranging from conventional sleep analysis to emerging applications, including demographic profiling and multi-organ disease detection and prediction. We further conducted a real-world study using multi-center ballistocardiography (BCG) data collected with a custom-developed sleep mat system for external validation.
RESULTS SleepFounder achieved strong performance across diverse downstream tasks and consistently outperformed baseline models, obtaining the best results in 14 out of 17 dataset-task pairs. For conventional sleep analysis and demographic profiling, averaged across external datasets, it achieved a Cohen’s Kappa of 0.671 (0.668-0.673) for five-class sleep staging, an area under the receiver operating characteristic curve (AUROC) of 0.917 (0.912-0.922) for moderate-to-severe obstructive sleep apnea detection, a mean absolute error of 6.727 (6.684-6.771) years for age prediction, and an AUROC of 0.865 (0.860-0.870) for sex classification. In multi-organ disease detection, representative AUROCs reached 0.943 (0.917-0.966) for Parkinson’s disease, 0.886 (0.841-0.928) for gastroesophageal reflux disease, and 0.881 (0.831-0.922) for heart failure. Additional conditions, including high cholesterol, coronary heart disease (CHD), bipolar disorder, and chronic pain, achieved AUROCs ranging from 0.811 to 0.830 in the held-out test set, with results further validated across five external cohorts. For future disease prediction, concordance indices reached 0.838 (0.797-0.873) for CHD death and 0.837 (0.806-0.865) for cardiovascular disease death, with corresponding metrics of 0.734-0.781 for congestive heart failure, stroke, and angina. In the real-world BCG study, SleepFounder maintained 94% of its performance on average relative to prior external validations conducted on PSG-based datasets.
CONCLUSIONS SleepFounder establishes a foundation model that learns from cardiorespiratory signals to enable accurate, scalable, and zero-burden sleep assessment. By linking sleep physiology with multi-organ health, it bridges clinical and home settings and demonstrates that signals traditionally used for sleep monitoring can serve as powerful biomarkers of systemic function and disease risk. These findings highlight a new paradigm for zero-burden sleep and health monitoring in real-world settings.
Competing Interest Statement
Westover is a co-founder, serves as a scientific advisor and consultant to, and has a personal equity interest in Beacon Biosignals. Thomas discloses: 1) patent and license/royalties from MyCardio, LLC, for the ECG-spectrogram; 2) patent and license/royalties from DeVilbiss-Drive for an auto-CPAP algorithm; 3) consulting for Jazz Pharmaceuticals, Guidepoint Global and GLG Councils.
Funding Statement
The authors gratefully acknowledge the National Sleep Research Resource (NSRR; sleepdata.org), funded by the National Heart, Lung, and Blood Institute (NHLBI), for providing access to de-identified clinical data that supported the development and evaluation of our work. We also thank the Brain Data Science Platform (BDSP) for access to the Human Sleep Project (HSP) dataset, a growing collection of clinical polysomnography recordings from Massachusetts General Hospital. This work is supported by the Ministry of Science and Technology of China STI2030-Major Projects (No.2021ZD0201900, 2021ZD0201902), and the National Natural Science Foundation of China (62102008).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study used both public and private datasets and was conducted in accordance with the principles of the Declaration of Helsinki. Ethical approval for the overall project was obtained from the Biomedical Ethics Review Committee of Peking University (IRB00001052-23207). In addition, local institutional review board approvals were obtained from all participating clinical centers, including the Affiliated Hospital of Gansu University of Chinese Medicine (AHGUCM), Beijing Huilongguan Hospital (BHH), Inner Mongolia Mental Health Center (IMMHC), Shenzhen Hospital of Southern Medical University (SHSMU), Beijing Tongren Hospital (BTH affiliated to Capital Medical University), Sir Run Run Shaw Hospital, (SRRSH affiliated to Zhejiang University School of Medicine), Shanghai Sixth People's Hospital (SSPH affiliated to Shanghai Jiao Tong University School of Medicine), the Sleep Medicine Center of West China Hospital (WCH affiliated to Sichuan University), the Second Affiliated Hospital of Soochow University (SAHSU), Ruijin Hospital (RH affiliated to Shanghai Jiao Tong University School of Medicine), the Affiliated Brain Hospital of Guangzhou Medical University (ABHGMU), and the Chinese University of Hong Kong Medical Center (CUHKMC). As only de-identified retrospective data were used, the requirement for informed consent was waived by each institutional review board.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
In this revised version of the manuscript, we have updated several components to improve the clarity, rigor, and overall quality of the work. Specifically, we refined multiple sections of the text to correct ambiguous phrasing and to provide more precise descriptions of the methodology and analytical procedures. We also updated several results based on additional analyses and improved data processing, ensuring that the findings presented more accurately reflect the underlying experimental observations. Furthermore, we reorganized parts of the introduction and discussion to better contextualize the contributions of our study and to strengthen the scientific narrative. Additional explanations and justifications were incorporated to enhance transparency and reproducibility. Several figures and tables were revised to improve readability. Overall, these revisions substantially strengthen the manuscript by improving clarity of presentation, updating the reported results, and reinforcing the interpretability and scientific impact of the study.
Data Availability
The Human Sleep Project (HSP) and the National Sleep Research Resource (NSRR) datasets analyzed in this study are publicly available. The datasets collected in China are not publicly available due to privacy and regulatory restrictions but may be made available from the corresponding author upon reasonable request.





