Identification of Acute Respiratory Distress Syndrome subphenotypes denovo using routine clinical data: a retrospective analysis of ARDS clinical trials

Abhijit Duggal; Rachel Kast; Emily Van Ark; Lucas Bulgarelli; Matthew T. Siuba; Jeff Osborn; Diego Rey; Fernando G Zampieri; Alexandre B Cavalcanti; Israel S Maia; Denise M Paisani; Ligia N Laranjeira; Ary Serpa Neto; Rodrigo Octávio Deliberato

doi:10.1101/2021.07.08.21260102

ABSTRACT

Rationale The acute respiratory distress syndrome (ARDS) is a heterogenous condition, and identification of subphenotypes may help in better risk stratification.

Objectives Identify ARDS subphenotypes using new simpler methodology and readily available clinical variables.

Design Retrospective Cohort Study of ARDS trials.

Setting Data from the U.S. ARDSNet trials and from the international ART trial.

Participants 3763 patients from ARDSNet datasets and 1010 patients from the ART dataset.

Primary and secondary outcome measures The primary outcome was 60-day or 28-day mortality, depending on what was reported in the original trial. K-means cluster analysis was performed to identify subgroups. For feature selection, sets. Sets of candidate variables were tested to assess their ability to produce different probabilities for mortality in each cluster. Clusters were compared to biomarker data, allowing identification of subphenotypes.

Results Data from 4,773 patients was analyzed. Two subphenotypes (A and B) resulted in optimal separation in the final model, which included nine routinely collected clinical variables, namely: heart rate, mean arterial pressure, respiratory rate, bilirubin, bicarbonate, creatinine, PaO₂, arterial pH, and FiO₂. Participants in subphenotype B showed increased levels of pro-inflammatory markers, had consistently higher mortality, lower number of ventilator-free days at day 28, and longer duration of ventilation compared to patients in the subphenotype A.

Conclusions Routinely available clinical data can successfully identify two distinct subphenotypes in adult ARDS patients. This work may facilitate implementation of precision therapy in ARDS clinical trials.

Strengths and limitations of this study

Largest cohort of patients used to identify subphenotypes of ARDS patients.
Subphenotypes were validated in the population of a large international ARDS randomized controlled trial.
Subphenotypes were identified by using only routinely collected clinical data.
Our use of data exclusively from randomized controlled trials does not prove generalizability to unselected ARDS populations.
The clinical utility of the subphenotypes have to be validated in a prospective study.

INTRODUCTION

The Berlin definition of acute respiratory distress syndrome (ARDS) encompasses acute hypoxemic respiratory failure due to a wide variety of etiologies [1]. Due to this inclusion of heterogeneous conditions within the syndrome, there are significant clinical and biological differences that makes ARDS challenging to treat [2,3]. These differences amongst ARDS patients are associated with variation in risk of disease development and progression [3,4], potentially generating differential responses to treatments and interventions [5–10]. In spite of those evidences, clinical risk stratification of ARDS patients still solely depends on PaO₂/FiO₂ ratios [11,12], possibly misleading the interpretation of results in clinical trials and clinicians when evaluating treatment options for patients [13].

Therefore, identifying groups of patients who have similar clinical, physiologic, or biomarker traits becomes relevant [6,14] as it can help with stratification of patients producing better targeted therapies and interventions [15]. These different groups can be defined as ARDS subphenotypes [4,14]. Two ARDS subphenotypes have been consistently identified in previous studies [6–10,16–18]. However, these models are complex, and significant barriers exist in their implementation and use in clinical practice. Existing models use up to 40 predictor variables, including biomarkers and other variables that are not readily available at the bedside [6–10,16–18]. These limitations explain the current status quo of ARDS care, where clinicians must depend on the limited prognostic value of PaO₂/FiO₂ ratios instead of biologically distinct subphenotypes.

We hypothesized that the use of a simpler methodology and a small number of easily available clinical variables could identify new ARDS subphenotypes and thus provide the means to allow future implementation of bedside stratification.

METHODS

Data source and participants

We performed a retrospective study using a de-identified dataset pooling data from six randomized clinical trials in patients with ARDS, namely: ARMA, ALVEOLI, FACTT, EDEN, SAILS, and ART [19–24]. Patients in ARMA, ALVEOLI, FACTT, EDEN and SAILS trials were eligible if they met the American-European consensus for ARDS, including patients with a PaO₂ / FiO₂ ratio < 300 up to 48 hours before enrollment. From 1996 to 2013, these trials enrolled 902, 549, 1000, 1000, and 745 patients, respectively, and tested a variety of interventions [19–23]. Between 2011 and 2017 the international ART study enrolled 1010 adult patients diagnosed with moderate to severe ARDS according to the Berlin definition (PaO₂ / FiO₂ ratio < 200) for less than 72 hours of duration and assessed two different ventilatory strategies [24]. To avoid biases due to high mortality in the high tidal volume group of the ARMA study [19], which has not been standard of care since the beginning of 2000, only 473 patients receiving low tidal volume in that study were included.

Predictors

Six clinical trials were assessed to identify a set of clinical variables recorded closest to time of randomization which were most commonly available across all datasets. The list of potential candidates was then further refined to include only those that are frequently observed in the routine care of ARDS patients at the time of its diagnosis. In order to develop a clustering algorithm for potential rapid translation into clinical use, elements which would not be commonly found in the electronic health records (EHR) at the time of ARDS diagnosis, such as biomarker levels, ARDS risk factors, organ support apart from mechanical ventilation settings, and severity scores, were excluded from model development. The treatment assignment in the original trials, and clinical outcomes were not considered in the model development.

After all assessment, 16 variables that are routinely collected as part of the usual care and which were uniformly present in all the trials were considered, including: age, gender, arterial pH, PaO₂, PaCO₂, bicarbonate, creatinine, bilirubin, platelets, heart rate, respiratory rate, mean arterial pressure, positive end-expiratory pressure (PEEP), plateau pressure, FiO₂, and tidal volume adjusted for predicted body weight (mL/kg PBW). The PBW was calculated as equal to 50 + 0.91 (centimeters of height – 152.4) in males, and 45.5 + 0.91 (centimeters of height – 152.4) in females [18]. These variables were grouped into five domains named demographics, arterial blood gases, laboratory values, vital signs, and ventilatory variables. Plateau pressure was excluded due to a high rate of missingness across the trials included in the training set. Amount of missing data in the training datasets is reported in eTable 1.

Outcomes

The primary outcome was 60-day mortality for all ARDSnet trials, and 28-day mortality for ART trial. Secondary outcomes included 90-day mortality, number of ventilator free days at day 28 [25], and the duration of mechanical ventilation in survivors within the first 28 days post enrollment.

Data preparation

Data preprocessing was performed before modeling, and the pooled dataset was assessed for completeness and consistency. Patients with values out of the plausible physiological range for a specific variable were excluded from the final analysis (described in eTable 2). The training dataset was constructed using data from the two largest ARDSnet trials, EDEN and FACTT. The validation dataset was sourced from the four remaining trials: ALVEOLI, ARMA, SAILS, and ART. Means and standard deviations for z-scoring variables were calculated from the training dataset and subsequently applied to the validation data.

Statistical analysis

Baseline and outcome data were presented according to the assigned cluster. Continuous variables were presented as medians with their interquartile ranges and categorical variables as total number and percentage. Proportions were compared using Fisher exact tests and continuous variables were compared using the Wilcoxon rank-sum test. Study outcomes were further compared using the median and mean absolute differences for continuous and categorical values, respectively.

Model development and validation

For the model development, the K-means clustering algorithm was used. K-means is one of the simplest and most commonly used classes of clustering algorithms. In critical care research, unsupervised machine learning techniques have already been used in several studies, attempting to find homogeneous subgroups within a broad heterogeneous population [26]. This specific algorithm identifies a K number of clusters in a dataset by finding K centroids within the n-dimensional space of clinical features [26].

For feature selection, different sets of candidate variables were tested to assess their ability to produce significantly different mortality probabilities in each cluster using the minimum amount of readily available clinical data. For each set of candidate variables, the optimal number of clusters was determined by comparing models with between 2 and 5 clusters, using the Elbow method [27] and the Calinski-Harabasz index [28]. Information about the methods for selecting number of clusters are provided in the supplemental material.

The following steps were performed for the final model selection: 1) all predictors were assessed for correlation (eTable 3); and 2) ten different combinations of the proposed variables were investigated. These combinations were developed based on the perceived clinical importance of each variable and its combinations. All 10 models were tested for the optimal number of clusters and based on both the Elbow method and the Calinski-Harabasz index, as described above. The models were then compared, aiming for the minimum set of variables with high 60-day mortality separation. The description of each model is show in eTable 4.

Biological and clinical characteristics of the clusters were evaluated using clinical, laboratory, and (when available) biomarker data to establish subphenotypes [4]. All iterations in model development were done on the training set and the generalizability of the final model was assessed using the validation dataset. K-means clustering analysis is structured to ignore cases with missing data. No assumption was made for missingness and we therefore conducted a complete case analysis. Model development and evaluation was performed using Python version 3.8 and scikit-learn 0.23.1.

Data availability

Data from the ARDSnet studies (EDEN, FACTT, ARMA, ALVEOLI and SAILS) is publicly available from the NHLBI ARDS Network and data from the ART trial can be requested from study authors.

RESULTS

Participants

Data from 4777 clinical trial patients were considered for inclusion. In total, 4 patients were excluded for having clinical measurements outside plausible range. The remaining 1998 patients from EDEN and FACTT trials were included in the training set, while the 2775 patients from ARMA, ALVEOLI, SAILS, and ART were included in the validation cohort.

Baseline characteristics of the patients in the training and validation sets are presented in Table 1. Pneumonia was the prevailing etiology followed by sepsis and aspiration in all trials. Between 29.3% to 72.7% of the patients were receiving vasopressors at the time of randomization. At randomization, PaO₂ / FiO₂ ratio ranged from 112 (75 - 158) to 134 (96 - 185) mmHg, and PEEP from 8 (5 - 10) to 12 (10 - 14) cmH₂O across trials. Mortality at 60 days for the ARDSnet trials ranged from 22.7% to 30.1%, while in the ART trial mortality at 28 days was 58.8%.

View this table:

Table 1 Baseline Characteristics and Clinical Outcomes in the Included Trials

Predictor variables and model selection

The correlation between the 15 variables selected for clustering is shown in eTable 3. The strongest correlation was between PEEP and FiO₂ (r = 0.49). The comparison of the 10 models regarding the optimal number of clusters based on both the Elbow method and the Calinski-Harabasz index is shown in eFigure 1. In all models and methods, two clusters were a better fit than a higher number of clusters.

Across the ten models, absolute mortality difference between cluster 1 and cluster 2 ranged from 3.9% to 13.1% for the FACTT study and between 0.1% to 8.1% for EDEN (Table 2). The models with the highest 60-day absolute mortality separation between the clusters for each of the two trials in the training set were then further evaluated. Models 6, 5, and 8 were consistently amongst the models with highest separation (Table 2). Model 8 was selected for further investigation, as it the fewest variables (eTable 4).

View this table:

Table 2 Absolute 60-day Mortality Difference Among Clusters per Trial and Model

Clinical characteristics of each cluster

Based on model 8, only nine clinical and laboratory variables were needed to identify the two distinct clusters in ARDS patients, namely: heart rate, mean arterial pressure, respiratory rate, bilirubin, bicarbonate, creatinine, PaO₂, arterial pH, and FiO₂. For each variable in the model, opposing measurements could be observed for each cluster (Figure 1 and eFigure 2). For the ARDSnet trials, the incidence of cluster 1 patients varied from 57.8% (EDEN) to 73.6% (ARMA), and 41.5% of ART patients were part of cluster 1. Across all trials, patients in cluster 2 had higher severity of illness, rate of vasopressor, heart rate, respiratory rate, creatinine, and bilirubin, as well as lower platelets, pH, BUN, and bicarbonate compared to patients in cluster 1 (eTable 5, 6 and 7). In addition, 28-, 60-, and 90-day mortality rate was higher in patients in cluster 2 in all trials (Table 3). Likewise, for each trial, ventilator-free days at day 28 was lower in patients in cluster 2 compared to cluster 1, and duration of ventilation in survivors was longer in cluster 1.

Figure 1 Differences of the Variables Included in the Cluster Algorithm Among Clusters

Square symbols represent the study with the highest mean z score for each phenotype; Circles represent the study with the lowest mean z score for each phenotype. The colored bands are exclusively to help visualize the opposite trends of the variables on the different clusters; Art.pH: arterial pH; Bicarb: bicarbonate; MAP: mean arterial pressure; Creat: creatinine; Resp.Rate: respiratory rate

View this table:

Table 3 Clinical Outcomes According to Clusters in Each Trial

Identification of Subphenotypes

After comparing the clinical characteristics of the clusters, each cluster was assigned to represent a distinct subphenotype of ARDS, with patients in cluster 1 assigned to subphenotype A, and patients in cluster 2 assigned to subphenotype B. Using blood biomarker information available for a subset of patients from both ARMA and ALVEOLI, subphenotype B showed increased levels of pro-inflammatory markers when compared to subphenotype A (Figure 2 and eTables 8 and 9).

Figure 2 Heat Map of the Biomarkers Available for the ARMA and ALVEOLI Trials

For better visualization and due to difference in scales, the values were log-normalized and z-scored. Subphenotypes A and B are shown separately to highlight their differences.

DISCUSSION

This study successfully demonstrated that nine easily obtainable clinical variables can identify two distinct ARDS subphenotypes with different clinical and biologic characteristics as well as outcomes across the test and validation cohorts. There was good generalizability amongst diverse populations from multiple validation datasets with temporal and geographical differences.

It is understandable that researchers feel compelled to use as much information as possible to build robust models. This is supportable for two main reasons: (1) the well-known heterogeneity of complex syndromes such as ARDS and (2) the abundance of highly granular clinical data generated by electronic health records (EHRs). It is anticipated that analyzing this vast amount of data will provide new knowledge regarding disease mechanisms by enabling researchers to find plausible hidden patterns within the data [29]. However, this data-heavy approach has the potential drawback of using predictors which are not generally obtained in a time window prior to intervention, or worse yet, using variables that are not part of the routine standard of care for patients. The rationale of using fewer and easy to collect clinical variables is not new in the field of critical care. Prognostic models have already shown that it is indeed feasible to create meaningful models using fewer predictors [30,31].

Our initial choices to define variables commonly found in the EHR at ARDS diagnosis was inspired by a recent report from the World Health Organization (WHO) which showed an enormous discrepancy of medical devices availability in a survey across 135 countries [29]. Recognizing this inconsistency is essential for widespread implementation of machine learning models regardless of varying availability of resources across countries and health systems [29]. The aim is to provide clinically relevant information within a defined and short time period that might impact the delivery of effective interventions to the right patient population and to as many patients as possible [29].

Recently, Sinha et al. developed supervised-learning gradient boosted classifier models trained using 24 or 14 readily available clinical data elements to reproduce biomarker-derived subphenotypes which were previously identified by Calfee et al. [17]. Unlike Sinha et al., who predicted previously identified subphenotypes, our study has identified two subphenotypes de novo using a small set of clinical variables.

Although the subphenotypes that we have identified and those that have been previously published look similar, our work is distinct from previous studies in several ways. We employed different training and validation datasets and also utilized a different and well-established unsupervised learning technique. Moreover, we utilized a process for selecting predictors which is not comparable to previous studies. Acknowledging these differences is crucial. It would not be unexpected to assume that these deviations would be relevant enough to produce different subphenotypes [32]. However, the clinical, laboratory characteristics, and the clinical outcomes of our subphenotypes show that they are remarkably similar to subphenotypes found in previous papers, regardless of methodological differences.

At this point it is not possible to go beyond this comparative analysis, as there is no gold standard definition of ARDS subphenotypes [32]. Nonetheless, our work does provide robust evidence that ARDS does indeed have two subphenotypes that can be systematically identified, despite major differences in population assessed and methodological approach used compared with previous studies. It also reinforces that we should continue to explore the underlying biological pathways of such subphenotypes to find responders to new or previously tested therapies.

Our study has several strengths. First, it is the largest cohort of patients that has been studied to develop distinct subphenotypes of ARDS patients. Moreover, our validation cohort included patients from the ART trial, allowing us to validate our model in the contemporaneous population of a large international randomized clinical trial in addition to the ARDSnet studies used in other subphenotyping studies. Second, our subphenotyping model was developed exclusively on the training set and then validated across multiple separate datasets. Nevertheless, similar separation in mortality was seen between the two subphenotypes across all trials. Third, we used the K-means algorithm to identify our subphenotypes, and the results obtained with this technique can be easily interpreted by clinicians and implemented in clinical practice. Lastly, this is the first phenotyping study that has used easily available clinical variables to identify ARDS phenotypes de novo, which allows for early identification of these patients in the clinical care at the bedside. Using this algorithm with a small number of routinely collected variables could enable our model to be applied in trials that either retrospectively or prospectively assess interventions targeted to each subphenotype. Future work should analyze previous trials to identify possible differential treatment response for the subphenotypes of ARDS patients identified in this study.

This study also has limitations. First, we have developed our models exclusively on patients enrolled in clinical trials. Due to the strict inclusion and exclusion criteria of these clinical trials, the generalizability of these results needs to be evaluated in unselected ARDS populations. Although there are clear clinical and biomarker differences between the identified subphenotypes, the model’s clinical utility needs to be prospectively validated and further investigated. Additionally, our biomarker analysis is limited to those patients in which the data was made publicly available by the study authors. Lastly, K-means clustering does not handle missing data, and no approach was used to impute missing values. However, the extremely low rate of missingness in our study makes this issue less relevant.

CONCLUSIONS

This study confirms the existence of two distinct subphenotypes in ARDS patients using a novel clustering model on routinely collected clinical data. This work may allow for easier identification of ARDS subphenotypes to facilitate implementation of precision clinical trial enrollment and development of targeted therapies in a variety of settings without the added burdens of biomarker evaluation.

Data Availability

Data from the ARDSnet studies (EDEN, FACTT, ARMA, ALVEOLI) is publicly available from the NHLBI ARDS Network (NHLBI ARDS Network) and data from the ART trial can be requested from study authors.

DECLARATIONS

Funding

This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.

Competing Interest

AD, MS, FGZ, ABC, ISM, DMP, LNL declare no relevant financial conflicts of interest. RK, EVA, LB, JO, DR and ROD are employees of Endpoint Health, Inc. ASN reported receiving personal fees from Dräger unrelated to the submitted work.

Ethics Approval

All patients provided informed consent in the original trials. This secondary analysis study was determined to be not human subject research and was this exempt from IRB review by WIRB. WIRB Work Order #1-1228617-1

Availability of data and material

Data from the ARDSnet studies (EDEN, FACTT, ARMA, ALVEOLI) is publicly available from the NHLBI ARDS Network (NHLBI ARDS Network) and data from the ART trial can be requested from study authors.

Author Contributions

AD, RK, EVA, LB participated in study design and analysis, drafted, and revised the manuscript, and are the guarantor of the document. MS, DR, JO, FGZ, ABC, ISM, DMP, LNL, and ASN participated in interpretation of data analysis, drafted the manuscript, and revised it for critically important intellectual content. ROD participated in the study design, analysis, interpretation of data analysis, and final revision of the manuscript content.

Twitter

@msiuba, @f_g_zampieri, @rod_deliberato, @a_serpaneto, @l_bulgarelli, @endpointhealth

REFERENCES

↵
ARDS Definition Task Force, Ranieri VM, Rubenfeld GD, et al. Acute respiratory distress syndrome: the Berlin Definition. JAMA 2012;307:2526–33. doi:10.1001/jama.2012.5669
OpenUrl CrossRef PubMed Web of Science
↵
Thille AW, Esteban A, Fernández-Segoviano P, et al. Comparison of the Berlin Definition for Acute Respiratory Distress Syndrome with Autopsy. Am J Respir Crit Care Med 2013;187:761–7. doi:10.1164/rccm.201211-1981OC
OpenUrl CrossRef PubMed
↵
Reilly J, Calfee C, Christie J. Acute Respiratory Distress Syndrome Phenotypes. Semin Respir Crit Care Med 2019;40:019–30. doi:10.1055/s-0039-1684049
OpenUrl CrossRef
↵
Reddy K, Sinha P, O’Kane CM, et al. Subphenotypes in critical care: translation into clinical practice. The Lancet Respiratory Medicine 2020;8:631–43. doi:10.1016/S2213-2600(20)30124-7
OpenUrl CrossRef
↵
Shankar-Hari M, Fan E, Ferguson ND. Acute respiratory distress syndrome (ARDS) phenotyping. Intensive Care Med 2019;45:516–9. doi:10.1007/s00134-018-5480-6
OpenUrl CrossRef
↵
Calfee CS, Delucchi K, Parsons PE, et al. Subphenotypes in acute respiratory distress syndrome: latent class analysis of data from two randomised controlled trials. The Lancet Respiratory Medicine 2014;2:611–20. doi:10.1016/S2213-2600(14)70097-9
OpenUrl CrossRef PubMed
Famous KR, Delucchi K, Ware LB, et al. Acute Respiratory Distress Syndrome Subphenotypes Respond Differently to Randomized Fluid Management Strategy. Am J Respir Crit Care Med 2017;195:331–8. doi:10.1164/rccm.201603-0645OC
OpenUrl CrossRef PubMed
Calfee CS, Delucchi KL, Sinha P, et al. Acute respiratory distress syndrome subphenotypes and differential response to simvastatin: secondary analysis of a randomised controlled trial. The Lancet Respiratory Medicine 2018;6:691–8. doi:10.1016/S2213-2600(18)30177-2
OpenUrl CrossRef
for the NHLBI ARDS Network, Sinha P, Delucchi KL, et al. Latent class analysis of ARDS subphenotypes: a secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med 2018;44:1859–69. doi:10.1007/s00134-018-5378-3
OpenUrl CrossRef PubMed
↵
Bos LD, Schouten LR, van Vught LA, et al. Identification and validation of distinct biological phenotypes in patients with acute respiratory distress syndrome by cluster analysis. Thorax 2017;72:876–83. doi:10.1136/thoraxjnl-2016-209719
OpenUrl Abstract/FREE Full Text
↵
Ferguson ND, Fan E, Camporota L, et al. The Berlin definition of ARDS: an expanded rationale, justification, and supplementary material. Intensive Care Med 2012;38:1573–82. doi:10.1007/s00134-012-2682-1
OpenUrl CrossRef PubMed Web of Science
↵
Bellani G, Laffey JG, Pham T, et al. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in Intensive Care Units in 50 Countries. JAMA 2016;315:788. doi:10.1001/jama.2016.0291
OpenUrl CrossRef PubMed
↵
Gattinoni L, Vassalli F, Romitti F. Benefits and risks of the P/F approach. Intensive Care Med 2018;44:2245–7. doi:10.1007/s00134-018-5413-4
OpenUrl CrossRef
↵
Matthay MA, Arabi YM, Siegel ER, et al. Phenotypes and personalized medicine in the acute respiratory distress syndrome. Intensive Care Med 2020;46:2136–52. doi:10.1007/s00134-020-06296-9
OpenUrl CrossRef PubMed
↵
Shankar-Hari M, Rubenfeld GD. Population enrichment for critical care trials: phenotypes and differential outcomes. Current Opinion in Critical Care 2019;25:489–97. doi:10.1097/MCC.0000000000000641
OpenUrl CrossRef
↵
Sinha P, Delucchi KL, McAuley DF, et al. Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials. The Lancet Respiratory Medicine 2020;8:247–57. doi:10.1016/S2213-2600(19)30369-8
OpenUrl CrossRef
↵
Sinha P, Churpek MM, Calfee CS. Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data. Am J Respir Crit Care Med 2020;202:996–1004. doi:10.1164/rccm.202002-0347OC
OpenUrl CrossRef
↵
Kitsios GD, Yang L, Manatakis DV, et al. Host-Response Subphenotypes Offer Prognostic Enrichment in Patients With or at Risk for Acute Respiratory Distress Syndrome*: Critical Care Medicine 2019;47:1724–34. doi:10.1097/CCM.0000000000004018
OpenUrl CrossRef
↵
The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Ventilation with Lower Tidal Volumes as Compared with Traditional Tidal Volumes for Acute Lung Injury and the Acute Respiratory Distress Syndrome. N Engl J Med 2000;342:1301–8. doi:10.1056/NEJM200005043421801
OpenUrl CrossRef PubMed Web of Science
The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Higher versus Lower Positive End-Expiratory Pressures in Patients with the Acute Respiratory Distress Syndrome. N Engl J Med 2004;351:327–36. doi:10.1056/NEJMoa032193
OpenUrl CrossRef PubMed Web of Science
The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Pulmonary-Artery versus Central Venous Catheter to Guide Treatment of Acute Lung Injury. N Engl J Med 2006;354:2213–24. doi:10.1056/NEJMoa061895
OpenUrl CrossRef PubMed Web of Science
The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network, Rice TW, Wheeler AP, et al. Initial Trophic vs Full Enteral Feeding in Patients With Acute Lung Injury: The EDEN Randomized Trial. JAMA: The Journal of the American Medical Association 2012;307:795–803. doi:10.1001/jama.2012.137
OpenUrl CrossRef PubMed Web of Science
↵
The National Heart, Lung, and Blood Institute ARDS Clinical Trials Network. Rosuvastatin for Sepsis-Associated Acute Respiratory Distress Syndrome. N Engl J Med 2014;370:2191–200. doi:10.1056/NEJMoa1401520
OpenUrl CrossRef PubMed Web of Science
↵
Writing Group for the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial (ART) Investigators, Cavalcanti AB, Suzumura ÉA, et al. Effect of Lung Recruitment and Titrated Positive End-Expiratory Pressure (PEEP) vs Low PEEP on Mortality in Patients With Acute Respiratory Distress Syndrome: A Randomized Clinical Trial. JAMA 2017;318:1335. doi:10.1001/jama.2017.14171
OpenUrl CrossRef PubMed
↵
Yehya N, Harhay MO, Curley MAQ, et al. Reappraisal of Ventilator-Free Days in Critical Care Research. Am J Respir Crit Care Med 2019;200:828–36. doi:10.1164/rccm.201810-2050CP
OpenUrl CrossRef
↵
Castela Forte J, Perner A, van der Horst ICC. The use of clustering algorithms in critical care research to unravel patient heterogeneity. Intensive Care Med 2019;45:1025–8. doi:10.1007/s00134-019-05631-z
OpenUrl CrossRef
↵
Ketchen DJ, Shook CL. The Application of Cluster Analysis in Strategic Management Research: An Analysis and Critique. Strategic Management Journal 1996;17:441–58. doi:https://doi.org/10.1002/(SICI)1097-0266(199606)17:6<441::AID-SMJ819>3.0.CO;2-G
OpenUrl CrossRef
↵
Caliński T, Harabasz J. A dendrite method for cluster analysis. Communications in Statistics 1974;3:1–27. doi:10.1080/03610927408827101
OpenUrl CrossRef
↵
Bulgarelli L, Deliberato RO, Johnson AEW. Prediction on critically ill patients: The role of “big data.” Journal of Critical Care 2020;60:64–8. doi:10.1016/j.jcrc.2020.07.017
OpenUrl CrossRef
↵
Johnson AEW, Kramer AA, Clifford GD. A New Severity of Illness Scale Using a Subset of Acute Physiology and Chronic Health Evaluation Data Elements Shows Comparable Predictive Accuracy*: Critical Care Medicine 2013;41:1711–8. doi:10.1097/CCM.0b013e31828a24fe
OpenUrl CrossRef
↵
Deliberato RO, Escudero GG, Bulgarelli L, et al. SEVERITAS: An externally validated mortality prediction for critically ill patients in low and middle-income countries. International Journal of Medical Informatics 2019;131:103959. doi:10.1016/j.ijmedinf.2019.103959
OpenUrl CrossRef
↵
DeMerle KM, Angus DC, Baillie JK, et al. Sepsis Subclasses: A Framework for Development and Interpretation. Crit Care Med Published Online First: 15 February 2021. doi:10.1097/CCM.0000000000004842
OpenUrl CrossRef

View the discussion thread.

Posted July 08, 2021.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Intensive Care and Critical Care Medicine

Subject Areas

All Articles

Addiction Medicine (317)
Allergy and Immunology (621)
Anesthesia (162)
Cardiovascular Medicine (2300)
Dentistry and Oral Medicine (280)
Dermatology (205)
Emergency Medicine (372)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (820)
Epidemiology (11633)
Forensic Medicine (10)
Gastroenterology (685)
Genetic and Genomic Medicine (3638)
Geriatric Medicine (342)
Health Economics (624)
Health Informatics (2335)
Health Policy (921)
Health Systems and Quality Improvement (872)
Hematology (336)
HIV/AIDS (761)
Infectious Diseases (except HIV/AIDS) (13210)
Intensive Care and Critical Care Medicine (761)
Medical Education (361)
Medical Ethics (101)
Nephrology (394)
Neurology (3393)
Nursing (193)
Nutrition (512)
Obstetrics and Gynecology (655)
Occupational and Environmental Health (655)
Oncology (1783)
Ophthalmology (527)
Orthopedics (211)
Otolaryngology (284)
Pain Medicine (226)
Palliative Medicine (66)
Pathology (441)
Pediatrics (1014)
Pharmacology and Therapeutics (423)
Primary Care Research (411)
Psychiatry and Clinical Psychology (3113)
Public and Global Health (6031)
Radiology and Imaging (1238)
Rehabilitation Medicine and Physical Therapy (721)
Respiratory Medicine (814)
Rheumatology (370)
Sexual and Reproductive Health (360)
Sports Medicine (319)
Surgery (390)
Toxicology (50)
Transplantation (171)
Urology (144)

[1] ↵
ARDS Definition Task Force, Ranieri VM, Rubenfeld GD, et al. Acute respiratory distress syndrome: the Berlin Definition. JAMA 2012;307:2526–33. doi:10.1001/jama.2012.5669
OpenUrl CrossRef PubMed Web of Science

[2] ↵
Thille AW, Esteban A, Fernández-Segoviano P, et al. Comparison of the Berlin Definition for Acute Respiratory Distress Syndrome with Autopsy. Am J Respir Crit Care Med 2013;187:761–7. doi:10.1164/rccm.201211-1981OC
OpenUrl CrossRef PubMed

[3] ↵
Reilly J, Calfee C, Christie J. Acute Respiratory Distress Syndrome Phenotypes. Semin Respir Crit Care Med 2019;40:019–30. doi:10.1055/s-0039-1684049
OpenUrl CrossRef

[4] ↵
Reddy K, Sinha P, O’Kane CM, et al. Subphenotypes in critical care: translation into clinical practice. The Lancet Respiratory Medicine 2020;8:631–43. doi:10.1016/S2213-2600(20)30124-7
OpenUrl CrossRef

[5] ↵
Shankar-Hari M, Fan E, Ferguson ND. Acute respiratory distress syndrome (ARDS) phenotyping. Intensive Care Med 2019;45:516–9. doi:10.1007/s00134-018-5480-6
OpenUrl CrossRef

[6] ↵
Calfee CS, Delucchi K, Parsons PE, et al. Subphenotypes in acute respiratory distress syndrome: latent class analysis of data from two randomised controlled trials. The Lancet Respiratory Medicine 2014;2:611–20. doi:10.1016/S2213-2600(14)70097-9
OpenUrl CrossRef PubMed

[7] Famous KR, Delucchi K, Ware LB, et al. Acute Respiratory Distress Syndrome Subphenotypes Respond Differently to Randomized Fluid Management Strategy. Am J Respir Crit Care Med 2017;195:331–8. doi:10.1164/rccm.201603-0645OC
OpenUrl CrossRef PubMed

[8] Calfee CS, Delucchi KL, Sinha P, et al. Acute respiratory distress syndrome subphenotypes and differential response to simvastatin: secondary analysis of a randomised controlled trial. The Lancet Respiratory Medicine 2018;6:691–8. doi:10.1016/S2213-2600(18)30177-2
OpenUrl CrossRef

[9] for the NHLBI ARDS Network, Sinha P, Delucchi KL, et al. Latent class analysis of ARDS subphenotypes: a secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med 2018;44:1859–69. doi:10.1007/s00134-018-5378-3
OpenUrl CrossRef PubMed

[10] ↵
Bos LD, Schouten LR, van Vught LA, et al. Identification and validation of distinct biological phenotypes in patients with acute respiratory distress syndrome by cluster analysis. Thorax 2017;72:876–83. doi:10.1136/thoraxjnl-2016-209719
OpenUrl Abstract/FREE Full Text

[11] ↵
Ferguson ND, Fan E, Camporota L, et al. The Berlin definition of ARDS: an expanded rationale, justification, and supplementary material. Intensive Care Med 2012;38:1573–82. doi:10.1007/s00134-012-2682-1
OpenUrl CrossRef PubMed Web of Science

[12] ↵
Bellani G, Laffey JG, Pham T, et al. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in Intensive Care Units in 50 Countries. JAMA 2016;315:788. doi:10.1001/jama.2016.0291
OpenUrl CrossRef PubMed

[13] ↵
Gattinoni L, Vassalli F, Romitti F. Benefits and risks of the P/F approach. Intensive Care Med 2018;44:2245–7. doi:10.1007/s00134-018-5413-4
OpenUrl CrossRef

[14] ↵
Matthay MA, Arabi YM, Siegel ER, et al. Phenotypes and personalized medicine in the acute respiratory distress syndrome. Intensive Care Med 2020;46:2136–52. doi:10.1007/s00134-020-06296-9
OpenUrl CrossRef PubMed

[15] ↵
Shankar-Hari M, Rubenfeld GD. Population enrichment for critical care trials: phenotypes and differential outcomes. Current Opinion in Critical Care 2019;25:489–97. doi:10.1097/MCC.0000000000000641
OpenUrl CrossRef

[16] ↵
Sinha P, Delucchi KL, McAuley DF, et al. Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials. The Lancet Respiratory Medicine 2020;8:247–57. doi:10.1016/S2213-2600(19)30369-8
OpenUrl CrossRef

[17] ↵
Sinha P, Churpek MM, Calfee CS. Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data. Am J Respir Crit Care Med 2020;202:996–1004. doi:10.1164/rccm.202002-0347OC
OpenUrl CrossRef

[18] ↵
Kitsios GD, Yang L, Manatakis DV, et al. Host-Response Subphenotypes Offer Prognostic Enrichment in Patients With or at Risk for Acute Respiratory Distress Syndrome*: Critical Care Medicine 2019;47:1724–34. doi:10.1097/CCM.0000000000004018
OpenUrl CrossRef

[19] ↵
The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Ventilation with Lower Tidal Volumes as Compared with Traditional Tidal Volumes for Acute Lung Injury and the Acute Respiratory Distress Syndrome. N Engl J Med 2000;342:1301–8. doi:10.1056/NEJM200005043421801
OpenUrl CrossRef PubMed Web of Science

[20] The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Higher versus Lower Positive End-Expiratory Pressures in Patients with the Acute Respiratory Distress Syndrome. N Engl J Med 2004;351:327–36. doi:10.1056/NEJMoa032193
OpenUrl CrossRef PubMed Web of Science

[21] The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network. Pulmonary-Artery versus Central Venous Catheter to Guide Treatment of Acute Lung Injury. N Engl J Med 2006;354:2213–24. doi:10.1056/NEJMoa061895
OpenUrl CrossRef PubMed Web of Science

[22] The National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome (ARDS) Clinical Trials Network, Rice TW, Wheeler AP, et al. Initial Trophic vs Full Enteral Feeding in Patients With Acute Lung Injury: The EDEN Randomized Trial. JAMA: The Journal of the American Medical Association 2012;307:795–803. doi:10.1001/jama.2012.137
OpenUrl CrossRef PubMed Web of Science

[23] ↵
The National Heart, Lung, and Blood Institute ARDS Clinical Trials Network. Rosuvastatin for Sepsis-Associated Acute Respiratory Distress Syndrome. N Engl J Med 2014;370:2191–200. doi:10.1056/NEJMoa1401520
OpenUrl CrossRef PubMed Web of Science

[24] ↵
Writing Group for the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial (ART) Investigators, Cavalcanti AB, Suzumura ÉA, et al. Effect of Lung Recruitment and Titrated Positive End-Expiratory Pressure (PEEP) vs Low PEEP on Mortality in Patients With Acute Respiratory Distress Syndrome: A Randomized Clinical Trial. JAMA 2017;318:1335. doi:10.1001/jama.2017.14171
OpenUrl CrossRef PubMed

[25] ↵
Yehya N, Harhay MO, Curley MAQ, et al. Reappraisal of Ventilator-Free Days in Critical Care Research. Am J Respir Crit Care Med 2019;200:828–36. doi:10.1164/rccm.201810-2050CP
OpenUrl CrossRef

[26] ↵
Castela Forte J, Perner A, van der Horst ICC. The use of clustering algorithms in critical care research to unravel patient heterogeneity. Intensive Care Med 2019;45:1025–8. doi:10.1007/s00134-019-05631-z
OpenUrl CrossRef

[27] ↵
Ketchen DJ, Shook CL. The Application of Cluster Analysis in Strategic Management Research: An Analysis and Critique. Strategic Management Journal 1996;17:441–58. doi:https://doi.org/10.1002/(SICI)1097-0266(199606)17:6<441::AID-SMJ819>3.0.CO;2-G
OpenUrl CrossRef

[28] ↵
Caliński T, Harabasz J. A dendrite method for cluster analysis. Communications in Statistics 1974;3:1–27. doi:10.1080/03610927408827101
OpenUrl CrossRef

[29] ↵
Bulgarelli L, Deliberato RO, Johnson AEW. Prediction on critically ill patients: The role of “big data.” Journal of Critical Care 2020;60:64–8. doi:10.1016/j.jcrc.2020.07.017
OpenUrl CrossRef

[30] ↵
Johnson AEW, Kramer AA, Clifford GD. A New Severity of Illness Scale Using a Subset of Acute Physiology and Chronic Health Evaluation Data Elements Shows Comparable Predictive Accuracy*: Critical Care Medicine 2013;41:1711–8. doi:10.1097/CCM.0b013e31828a24fe
OpenUrl CrossRef

[31] ↵
Deliberato RO, Escudero GG, Bulgarelli L, et al. SEVERITAS: An externally validated mortality prediction for critically ill patients in low and middle-income countries. International Journal of Medical Informatics 2019;131:103959. doi:10.1016/j.ijmedinf.2019.103959
OpenUrl CrossRef

[32] ↵
DeMerle KM, Angus DC, Baillie JK, et al. Sepsis Subclasses: A Framework for Development and Interpretation. Crit Care Med Published Online First: 15 February 2021. doi:10.1097/CCM.0000000000004842
OpenUrl CrossRef

Identification of Acute Respiratory Distress Syndrome subphenotypes denovo using routine clinical data: a retrospective analysis of ARDS clinical trials

ABSTRACT

INTRODUCTION

METHODS

Data source and participants

Predictors

Outcomes

Data preparation

Statistical analysis

Model development and validation

Data availability

RESULTS

Participants

Predictor variables and model selection

Clinical characteristics of each cluster

Identification of Subphenotypes

DISCUSSION

CONCLUSIONS

Data Availability

DECLARATIONS

Funding

Competing Interest

Ethics Approval

Availability of data and material

Author Contributions

Twitter

REFERENCES

Citation Manager Formats

Subject Area