Abstract
Background Sepsis therapy is still limited to treatment of the underlying infection and supportive measures. To date, various sepsis subtypes were proposed, but therapeutic options addressing the molecular changes of sepsis were not identified. With the aim of a future individualized therapy, we used machine learning (ML) to identify clinical phenotypes and their temporal development in a prospective, multicenter sepsis cohort and characterized them using plasma proteomics.
Methods Routine clinical data and blood samples were collected from 384 patients. Sepsis phenotypes were identified based on clinical measurements and plasma samples from 301 patients were analyzed using mass spectrometry. The obtained data were evaluated in relation to the phenotypes, and supervised ML models were developed enabling prospective phenotype classification and determination of key features distinguishing the phenotypes.
Results Three phenotypes and their progression across four time points in sepsis were identified. Cluster C was characterized by the highest disease severity and multi-organ failure with leading liver failure. Cluster B showed relevant organ failure, with renal damage being particularly prominent in comparison to cluster A. Time course analysis showed a strong association of cluster C with mortality and dynamic properties of cluster B. The plasma proteome reflected the clinical features of the phenotypes and revealed excessive consumption of complement and coagulation factors in severe sepsis. Supervised ML models allow the assignment of patients based on only seven widely available features.
Conclusions The identified clinical phenotypes reflected varying degrees of sepsis severity and were mirrored in the plasma proteome. Proteomic profiling offered novel insights into the molecular mechanisms underlying sepsis and enabled a deeper characterization of the identified phenotypes. This integrative approach may serve as a blueprint for uncovering molecular signatures of sepsis subgroups and holds promise for the development of future targeted therapies.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The SepsisDataNet.NRW research group was funded by the European Regional Development Fund of the European Union (EFRE.NRW, reference number LS-1-2-012). The CovidDataNet.NRW study was funded by the, Germany. ME and MW have been partly funded by the Federal Ministry of Education and Research in the frame of de.NBI/ELIXIR-DE (W-de.NBI-005).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by the Ethics Committee of the Medical Faculty of Ruhr-University Bochum (Registration No. 504714 and 1966066BR, respectively) or the responsible ethics committee of each respective study center and conducted in accordance with the revised Declaration of Helsinki. Written informed consent was obtained from the patients or a legal representative.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD058562.
List of abbreviations
- AKI
- Acute kidney injury
- ALT
- Alanine transaminase
- AST
- Aspartate transaminase
- AP
- Alternative pathway of complement activation
- aPTT
- Activated partial thromboplastin time
- COVID-19
- Coronavirus disease 2019
- FDR
- False discovery rate
- ICU
- Intensive care unit
- INR
- International normalized ratio
- LC-MS/MS
- Liquid chromatography-coupled tandem mass spectrometry
- MCCV
- Monte Carlo cross validation
- ML
- Machine learning
- PCA
- Principal Component Analysis
- SARS-CoV-2
- Severe acute respiratory syndrome coronavirus type 2
- SOFA
- Sepsis-related organ failure assessment score