Disentangling Predictors of COPD Mortality with Probabilistic Graphical Models

Tyler C. Lovelace; Min Hyung Ryu; Minxue Jia; Peter Castaldi; Frank C. Sciurba; Craig P. Hersh; Panayiotis V. Benos

doi:10.1101/2024.01.31.24301705

Abstract

Background-Research question Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of mortality. Predicting mortality risk in COPD patients can be important for disease management strategies. Although scores for all-cause mortality have been developed previously, there is limited research on factors that may directly affect COPD-specific mortality.

Study design-Methods used probabilistic (causal) graphs to analyze clinical baseline COPDGene data, including demographics, spirometry, quantitative chest imaging, and symptom features, as well as gene expression data (from year-5).

Results We identified factors linked to all-cause and COPD-specific mortality. Although many were similar, there were differences in certain comorbidities (all-cause mortality model only) and forced vital capacity (COPD-specific mortality model only). Using our results, we developed VAPORED, a 7-variable COPD-specific mortality risk score, which we validated using the ECLIPSE 3-yr mortality data. We showed that the new model is more accurate than the existing ADO, BODE, and updated BODE indices. Additionally, we identified biological signatures linked to all-cause mortality, including a plasma cell mediated component. Finally, we developed a web page to help clinicians calculate mortality risk using VAPORED, ADO, and BODE indices.

Interpretation Given the importance of predicting COPD-specific and all-cause mortality risk in COPD patients, we showed that probabilistic graphs can identify the features most directly affecting them, and be used to build new, more accurate models of mortality risk. Novel biological features affecting mortality were also identified. This is an important step towards improving our identification of high-risk patients and potential biological mechanisms that drive COPD mortality.

Introduction

Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of mortality worldwide¹. Predictive models of mortality in COPD can be used to identify high-risk individuals who may benefit from earlier or targeted interventions. As such, predictive models have been developed to predict all-cause mortality risk in individuals with COPD. Examples include the BODE (BMI, airflow obstruction, dyspnea, exercise capacity) index² and its updated³ or expanded variants⁴, the ADO (age, dyspnea, airflow obstruction) index³, and the DOSE (dyspnea, airflow obstruction, smoking status, exacerbation frequency) index⁵. Of these, the ADO and updated BODE indices were found to perform best in a large-scale meta-analysis in external validation cohorts⁶. In addition to these simple clinical predictors, more complex machine-learning approaches that incorporate clinical, demographic, and imaging features have demonstrated improved prediction of all-cause mortality^7,8.

These approaches have two key limitations which we seek to address. First, traditional machine learning methods, such as regression models^9,10 and random survival forests (RFs)¹¹, identify purely associative predictors and cannot provide insights into the possible causality of the observed interactions. In contrast, probabilistic graphs (also referred to as “causal graphs”)¹² seek to learn potential cause-effect relationships in observational datasets¹³, by considering and factoring out any confounders. In biomedical settings, such approaches have been applied to identify direct effectors of an outcome and to develop efficient predictors^14,15 or predict effects of interventions^16-18. Here, we use a recently developed algorithm (Supplementary Methods) to construct probabilistic graph models from multi-modal data (i.e., demographic, clinical, spirometry, chest CT scan, and biological features) to identify predictors that provide independent information for COPD-specific mortality.

Second, existing mortality predictors are trained, calibrated, and validated on all-cause rather than COPD-specific mortality of COPD patients. This introduces the possibility of incorrectly estimating the degree to which general risk factors contribute to COPD-specific mortality; or may introduce spurious associations due to the presence of comorbidities that act independently of the COPD-specific risk. Graph models, by construction, consider all confounder combinations and factor out the indirect effects and simple correlations. Here, we construct and compare separate graph models of all-cause and COPD-specific mortality. This allows us to disentangle features that are strongly independently informative to COPD-specific mortality from features informative of all-cause mortality. Furthermore, direct effectors identified by the graph models can be used to construct robust predictors of COPD-specific mortality risk. To further investigate blood-derived molecular signatures affecting mortality in COPD patients, we additionally construct graph models of all-cause mortality utilizing features derived from whole blood samples.

Methods

Study Population and Features

The discovery cohorts were derived from the COPD Genetic Epidemiology (COPDGene) Study, which recruited 10,198 current and former smokers, aged 45-80, from across US¹⁹. Various demographic, clinical, spirometry, and chest CT scan features were collected. Additionally, all-cause and COPD-specific mortality were recorded. A death was attributed to COPD if its cause was adjudicated to be COPD-related by the COPDGene criteria²⁰ adapted from the TORCH UCD²¹. The features included in our mortality graph model analyses are key demographics (e.g., age, sex, race), clinical measures (e.g., BMI, resting oxygen saturation (SaO2)), measures of COPD-related physiology, and symptoms), and quantitative chest CT scan features (e.g., %emphysema of the lungs²², segmental airway wall thickness (AWT)²²). Relevant medical history and comorbidities (smoking status, pneumonia, diabetes) were also included. Our final COPDGene Phase 1 study cohort consisted of the 8,610 participants who had no missing values in the set of selected baseline features and longitudinal follow up data for all-cause and COPD-specific mortality.

COPDGene conducted a 5-yr follow-up study, during which, blood-derived biological features were measured (hemoglobin levels, platelet counts, white blood cell differential percentages, whole blood gene expression)²³. To identify biological signatures linked to all-cause mortality in COPD, we constructed the COPDGene Phase 2 study cohort, that contains these blood-derived features in addition to the Phase 1 COPDGene measurements. The RNA-seq data was processed using CIBERSORTx²⁴ and the leukocyte signature matrix (LM22)²⁵ to infer more detailed cell type proportions. VIPER²⁶ was used to infer transcription factor activity. This resulted in 3,182 participants with longitudinal follow up data for all-cause mortality, RNA-seq gene expression profiles, and no missing values in the set of selected features.

The ECLIPSE study²⁷ was used for external validation of our findings. ECLIPSE has 2,312 COPD subjects or smoker controls that have 3-year mortality data, and no missing values among our selected features. Table 1 presents the main characteristics of the cohorts, while Suppl Table S1 presents the differences in characteristics between the two COPDGene cohorts.

View this table:

Table 1:

Clinical characteristics of the COPDGene Phase1 and Phase 2 studies used for the construction of graphical models of all-cause and COPD-specific mortality;, and the ECLIPSE 3-year all-cause mortality cohort, used for external validation.

Construction of Graph-based Predictive Models

Directed probabilistic graph models (also referred to as “causal graph models”) for the COPDGene cohorts were constructed using the recently developed CausalCoxMGM method (Supplementary Methods). This method uses a two-step procedure to learn directed graphs from observational data in the presence of latent confounders. The resulting graphs can provide insights into potential mechanisms, generate hypotheses, and select minimal sets of the most relevant predictors (the direct neighbors only or the Markov blanket-MB)²⁸. By construction, the MB variables of an outcome provide independent information to it.

We used both the direct neighbors and the full MB in the graphs to develop predictive Cox regression models⁹ of all-cause and COPD-specific mortality. These models were compared to the ADO and Updated BODE indices³, and to models learned with standard machine learning approaches: LASSO Cox Regression²⁹ and RF¹¹. Performance of these models was assessed through 5-fold cross-validation, with each model (except ADO and the updated BODE indices) trained on 80% of the data to predict the held-out 20% in each fold. Model performance was scored using the Harrell’s concordance index³⁰, which represents the probability of a higher risk individual dying before a lower risk one. For models that perform feature selection (direct neighbors, MB, and LASSO Cox regression) we also compared the number of features each predictive model selected. For further details on causal graph discovery, model selection, and graph-based feature selection, see Supplementary Methods.

Results

Characteristics of Discovery and Validation Cohorts

As the Phase 2 study comprised from a subset of the Phase 1 study participants, there are some expected significant changes to clinical covariates (Suppl Table S1), such as an approximately five-year increase in age (p<0.001) and a statistically significant reduction of patients in more severe GOLD categories (p<0.001). We also observe significant increases in comorbidities’ incidence, such as cardiovascular disease (CVD; defined in ⁸) and diabetes (p<0.001). Among the standard mortality risk indices, there is a significant decrease in the BODE index² and a significant increase in the ADO index³ of participants of Phase 2 compared to Phase 1 (p<0.001). Of particular interest is the distribution of all-cause mortality in the two phases. We see that there is not a statistically significant difference in the survival functions of the two phases, nor in the proportion of deaths at time points shared by both groups (3, 5, and 8 years). This is important as it suggests that, despite the older participants in Phase 2 and the exclusion of individuals who died before the five-year follow up, the all-cause mortality does not differ significantly between the two phases.

The ECLIPSE study²⁷ differs significantly in its patient population compared to COPDGene (Table 1). It has significantly more male participants (p<0.001) and is less racially diverse (98.3% white). Additionally, ECLIPSE is enriched for more severe cases of COPD (GOLD Stages 2-4; p<0.001). This is also reflected in significant increases in the ADO, BODE, and updated BODE indices compared to COPDGene. Finally, likely due to the difference in COPD severity in the patient population, we see a significant difference in all-cause mortality between the two studies (p<0.001). This is also reflected in the number of deaths observed in the first three years, which is significantly higher in the ECLIPSE study (p<0.001).

Comparison of All-Cause and COPD-specific Mortality Models Identifies Features Directly Linked to COPD Mortality

The MB variables (see Supplementary Methods) of all-cause mortality and COPD-specific mortality overlap substantially in COPDGene Phase 1. Overlapping features include classical predictors of mortality in COPD, such as age, 6MWD, mMRC Dyspnea score, BMI, and forced expiratory volume in 1s (FEV1)/forced vital capacity (FVC) ratio^3,31 (Figure 1, Supplementary Table S2). Additionally, prior diagnosis of pneumonia, participant’s resting SaO2, AWT, history of severe exacerbations, and high blood pressure were linked to both all-cause and COPD-specific mortality. However, there were differences in the two models. FVC %predicted is strongly associated with COPD-specific mortality only, while FEV1 %predicted independently informs all-cause mortality only. Also, as expected, comorbidities such as CVD and diabetes were associated with all-cause mortality only; same applies to risk factors such as smoking status (at the visit), pack years, and gender. Finally, heart rate and the coughing up phlegm symptom are also linked to all-cause mortality only.

Figure 1:

Graph models of all-cause (A, left) and COPD-specific (A, right) mortality in the COPDGene Phase 1 study. Only the Markov blanket (MB) mortality features are shown. Node boundaries are colored to represent whether a feature is present in the MB of all-cause mortality (yellow), COPD-specific mortality (red), or both (orange). Adjacencies in the graphical model represent a direct interaction between two variables, while edge orientations represent the type of interaction inferred by CausalCoxMGM. Undirected edges (X --- Y) represent a direct interaction between X and Y, while bidirected edges (X <-> Y) mean that some unobserved confounder affects X and Y. Edge color designates whether higher values of this MB variable represent lower (blue) or (red) higher mortality risk. The standardized hazard ratios of the direct neighbors all-cause (B, left) and COPD-specific mortality (B, right) display the direction, relative effect size, and significance of each covariates’ association with mortality. Hazard ratios for numeric features represent the change in hazard for a 1 SD increase. Although all depicted MB variables contribute independent information to each mortality variable, we observed no significant decrease in prediction accuracy in models with only the direct neighbors.

Looking only at the direct neighbors of COPD-specific mortality provides some additional insights. BMI, although it provides some independent information for COPD-specific mortality, is not directly linked to it, suggesting that its association with mortality in COPD is through its interactions with FEV1/FVC, FVC %predicted, 6MWD, mMRC Dyspnea score, and resting SaO2 (Figure 1A, right). However, BMI is directly linked to all-cause mortality. Additionally, we observe a direct effect of FVC %predicted to COPD-specific but not to all-cause mortality. Figure 1B shows the relative importance (hazard ratios) of each direct neighbor to mortality in the two graphs, which are all independently significant.

Predictive Models of COPD Mortality

In addition to providing insights on direct interactions in observational datasets, graph models are useful tools for constructing robust, parsimonious, and powerful predictive models. We found that the graph-based predictive models (Neighbors, MB) significantly outperformed the ADO and updated BODE indices for both all-cause and COPD-specific mortality (Figure 2A). The models based on direct neighbors only (i.e., excluding “spouses” from the MB) performed similarly to models constructed from the full MB, LASSO Cox regression, and RF, but consistently selected significantly fewer features. The predictive models based on the full MB performed similarly to LASSO Cox regression and RF, while still being significantly more parsimonious than LASSO.

Figure 2:

Performance of predictive models of all-cause and COPD-specific models assessed through 5-fold cross-validation (A). Models constructed using the direct neighbors of mortality (Neighbors) and all Markov blanket variables (full MB) were compared against the ADO and updated BODE indices, as well as LASSO Cox regression and random survival forests (RF) models trained on the same cross-validation splits. Model performance was assessed via Harrell’s concordance. The number of features for models where feature selection was performed (Neighbors, full MB, LASSO Cox) was also assessed. Error bars denote the 95% confidence intervals of the cross-validation estimates. To demonstrate the ability of the full MB models to stratify patients by risk of all-cause (B) and COPD-specific (C) mortality compared to the updated BODE index, we stratify individuals into four risk groups using the updated BODE index (left) as well as four risk groups of equal size predicted by the full MB models (right). Confidence bands represent the 95% confidence interval.

To demonstrate the ability of our models to stratify patients better than the updated BODE index, we created four risk groups based on the updated BODE score as well as groups of equal size from our full MB model. The Kaplan-Meier survival probability estimates³² were plotted for these four risk groups for both all-cause (Figure 2B) and COPD-specific mortality (Figure 2C). The new full MB risk groups stratify patients into groups with significantly different survival curves for both all-cause and COPD-specific mortality. Additionally, COPD-specific mortality was considerably easier to predict compared to all-cause mortality (Figure 2BC).

External Validation: the ECLIPSE Study

In our model, COPD-specific mortality has seven direct neighbors (Supplementary Table S2, bold), which are easily obtainable clinical measurements (no need for chest CT scans), and they are frequently measured across COPD studies. We trained a predictor model of COPD-specific mortality with these seven variables from COPDGene Phase 1 and constructed an index of COPD mortality risk as in ³ with the categories presented in Table 2. Our model, VAPORED (Vital capacity-FVC %predicted, Age, history of Pneumonia, Oxygen saturation, the FEV1/FVC Ratio, 6-min walk Exercise capacity, Dyspnea), was validated on the 3-year mortality data in the ECLIPSE study and compared to the standard ADO, BODE, and updated BODE indices. We used four measures to assess predictive power: concordance, the concordance probability estimate (CPE), the cumulative/dynamic (C/D) AUC at 3 years, and the integrated C/D AUC. Means and standard errors of these metrics were estimated over 2000 bootstrapped samples. While the ADO, BODE, and Updated BODE indices have similar predictive power across all metrics, VAPORED had consistently the highest predictive power by at least one standard error (Figure 3A) and significantly higher CPE than all other indices (p<0.05). All-cause mortality survival functions for each VAPORED score interval were estimated by training a parametric Cox regression model with a Weibull baseline hazard on COPDGene Phase 1 data¹⁰ (Supplementary Figure S1A; Supplementary Table S3). In Supplementary Figure S1B we see that the predictions of this model are not significantly different than the observed survival probabilities for each calibration plot in 1-year, 2-years and 3-years survival. Patients in the ECLIPSE dataset were also stratified into four risk groups according to the BODE score and four risk groups of equivalent size based on the VAPORED score (Figure 3B). The VAPORED score categories have significantly different survival probability in the ECLIPSE study and VAPORED shows greater separation amongst risk groups than the BODE score.

Figure 3:

Performance of the VAPORED risk score in the ECLIPSE 3-year all-cause mortality external validation cohort. The concordance, concordance probability estimate (CPE), 3-year C/D AUC, and integrated C/D AUC for the VAPORED, ADO, BODE, and updated BODE scores (A). Error bars denote the 95% confidence intervals of the bootstrapped estimates. To demonstrate the ability of the VAPORED risk score to stratify patients by risk of all-cause mortality in the ECLIPSE study compared to the updated BODE index (B), we stratify individuals into four risk groups using the BODE index (left) as well as four risk groups of approximately equal size by VAPORED risk score (right). Confidence bands represent the 95% confidence interval.

View this table:

Table 2:

Scoring criteria for VAPORED mortality score, constructed from the direct neighbors of COPD-specific mortality in the graphical model.

Graph-based Model of All-Cause Mortality in COPD Reveals Biological Signatures of Mortality Risk

In addition to the demographic, clinical, spirometry, and chest CT scan features, COPDGene Phase 2 Study also collected biological data, such as white blood cell differential percentages, hemoglobin levels, and whole blood gene expression. This provides a unique opportunity to investigate how informative the biological features are in the context of the clinical features and comorbidities. We applied our graph-modeling approach to identify features potentially affecting all-cause mortality in this dataset. We found eight of the 18 previously identified clinical features in the MB (age, gender, BMI, 6MWD, mMRC Dyspnea score, FEV1/FVC, heart rate, CVD; Supplementary Table S2). Additional variables linked to all-cause mortality include: a socioeconomic feature, Internet Access, which was not collected during Phase 1, and five biological features (platelets, hemoglobin, SPIB transcription factor activity, and CIBERSORTx inferred proportions of plasma cells and resting memory CD4+ T cells) (Figure 4AB). We used 5-fold cross-validation (internally) to ensure that our graph modeling approach was learning robust sets of features of all-cause mortality. As above, the model based on the identified predictors of all-cause mortality significantly outperformed the ADO and updated BODE indices in terms of concordance, while it is significantly more parsimonious than the other machine learning methods (LASSO Cox regression, RF) without a significant loss in predictive accuracy (Figure 4C). Finally, the COPDGene Phase 2 full MB predictive model can stratify individuals into significantly different mortality risk groups (Figure 4D). These results strengthen the claim that our method derives relevant direct interactions between features from this multi-modal and multi-scale dataset and all-cause mortality.

Figure 4:

Graph models of all-cause mortality in the COPDGene Phase 2 study (A). Only the Markov blanket (MB) of all-cause mortality is shown. Adjacencies in the graphical model have the same notation as in Figure 1. The standardized hazard ratios of the direct neighbors all-cause mortality (B) display the direction, relative effect size, and significance of each covariates’ association with mortality. Hazard ratios for numeric features represent the change in hazard for a 1 SD increase. The performance of predictive models of all-cause and COPD-specific models assessed through 5-fold cross-validation (C). Models constructed using the direct neighbors of mortality (Neighbors) and Markov blanket of mortality (full MB) were compared against the ADO and updated BODE indices, as well as LASSO Cox regression and random survival forests (RF) models trained on the same cross-validation splits. Model performance was assessed via Harrell’s concordance and the number of features for models where feature selection was performed (Neighbors, full MB, LASSO Cox). Error bars denote the 95% confidence intervals of the cross-validation estimates. To demonstrate the ability of the full MB model to stratify patients by risk of all-cause mortality compared to the updated BODE index (D), we stratify individuals into four risk groups using the updated BODE index (left) as well as four risk groups of equal size using full MB model predictions (right). Confidence bands represent the 95% confidence interval.

Web-based tool for all-cause and COPD-specific mortality

To help people further evaluate our VAPORED score predictor, we developed a web-based tool. The web tool allows the user to input the seven VAPORED key variables (FVC %predicted, age, history of pneumonia, SaO2, FEV1/FVC ratio, 6MWD, mMRC Dyspnea score) for an individual and outputs two mortality risk curves (all-cause, COPD-specific) for the next 10 years. In addition, if the user provides values for BMI and FEV1 %predicted, the web tool outputs similar curves for BODE and ADO risk scores (for comparison purposes). The tool is available as a Shiny app from: https://vapored.shinyapps.io/VAPORED/. Some example values have been pre-loaded.

Discussion

In this study, we applied a probabilistic graph modeling approach to distinguish features directly linked to COPD-specific and all-cause mortality from simple correlates. We analyzed demographic, clinical, spirometry, and chest CT scan features from baseline COPDGene measurements. The graphical models, by construction, are considering all possible combinations of covariates and filter out the indirect effects.

Key results – Interpretation

We found known risk factors of all-cause mortality in COPD to inform both models. These include age, mMRC Dyspnea score, 6MWD, and BMI, which are used in BODE and ADO indices. Interestingly, these features remained informative of all-cause mortality even after biological data were added to the model (from COPDGene Phase 2). Other common features of the all-cause and COPD-specific mortality baseline models (history of pneumonia, and resting SaO2) did not appear in the Phase 2 model, probably because their information is superseded by the biological variables of this model (Platelets, Resting Memory CD4+ T Cells, SPIB Activity, and Plasma Cells). Further investigation is needed to determine potential long-term biological effects of pneumonia to COPD patients.

The all-cause and COPD-specific mortality models have some unique characteristics, despite their substantial overlap. FVC %predicted appeared only in the COPD-specific model. This might indicate the effect of hyperinflation (i.e., residual volume) in COPD mortality, which was recently shown to be better represented by FVC %predicted and FEV1/FVC rather than FEV1 %predicted³³. Further, hyperinflation is more strongly linked to mortality than FEV1^34,35. FEV1/FVC ratio, which is independent on race specific reference equations, is a better discriminator of mortality than FEV1 %predicted³⁶. Not surprisingly, certain comorbidities are informative for the all-cause mortality only: diabetes, CVD, heart rate. Pack years, which affects multiple systems, is informative for the all-cause mortality, but not the COPD-specific mortality. This is probably because direct measurement of impacted lung variables has incorporated the smoking information. Finally, the important contribution of pneumonia to our COPD specific model is unique, although not unexpected given the established relationship between pneumonia and mortality in patients with COPD and the potential that pneumonia may cause impairment in lung immunity not reflected in the other metrics^37,38.

The graph models enabled us to develop a new, parsimonious but informative, 7-feature risk score for COPD-specific mortality consisting of easily obtainable characteristics (age, 6MWD, FEV1/FVC, FVC %predicted, mMRC Dyspnea score, history of pneumonia, resting SaO2). We validated this new model in the ECLIPSE 3-year all-cause mortality data, as ECLIPSE has not recorded COPD-specific mortality. Our score consistently outperformed the ADO, BODE, and updated BODE indices across multiple metrics and had a significantly higher mortality CPE.

We also took advantage of COPDGene 5-year follow-up data, which additionally included socioeconomic factors and biological measurements. Our graph approach identified potentially important clinical and biological factors of all-cause mortality in current and former smokers with or at high risk of developing COPD. Age, BMI, FEV1/FVC, mMRC, 6MWD were still connected to all-cause mortality, as well as comorbidities CVD and heart rate. In addition, Internet Access was directly linked to all-cause mortality. Further investigation is needed to determine whether this is a surrogate for income or rural vs urban population characteristics.

Five biological features were linked to all-cause mortality in the Phase 2 model: hemoglobin levels, platelet counts, SPIB transcription factor activity, plasma cell proportions, and resting memory CD4+ T cell proportions. Low hemoglobin levels were associated with an increased risk of all-cause mortality, as has been previously observed in a population-based study of individuals with COPD³⁹ and in patients who were admitted to hospital for COPD⁴⁰. Platelets have also been previously associated with all-cause mortality in COPD, and antiplatelet therapies have been shown to reduce all-cause mortality in individuals with COPD^41,42. This contradicts the hazard ratio learned by our model, which suggests that low platelet levels are associated with increased all-cause mortality. However, previous analysis has found a U-shaped association of platelet counts with all-cause mortality in COPD⁴³. This indicates that both high and low platelet counts are associated with a higher risk of all-cause mortality, and the hazard ratio observed in our model is likely due to limitations of our modeling assumptions, which expect such associations to be monotonic. Even so, our graph discovery algorithm has correctly identified a biological signature that has been previously identified as having a likely causal effect on all-cause mortality in COPD patients^41,42.

Regarding mechanistic insights, our model suggests that the plasma cell proportion and SPIB transcription factor activity are affecting mortality. The two likely interact (our model supports that), as the SPIB transcription factor is a negative regulator of plasma cell differentiation and immunoglobulin production^44,45. Previous studies have linked B cell activity and the humoral immune response with COPD progression⁴⁶. The formation of lymphoid follicles in the lung^46,47 and larger numbers of infiltrating B cells, memory B cells, and plasma cells are associated with COPD severity⁴⁶. Our model provides support for this mechanism of COPD progression, as elevated plasma cells and reduced SPIB transcription factor activity are found to be associated with an increased risk of all-cause mortality.

Our graph model also suggests that lower resting memory CD4+ T cells directly affect increase in all-cause mortality, independently of age or sex. Interestingly, circulating memory CD4+ T cells are known to increase with age⁴⁸, and smoking^49,50, and have been linked to an increase in IL-22 secretion, which has been previously implicated in COPD pathogenesis⁵¹. However, independently of these known risk factors for all-cause mortality in COPD, our results suggest that memory CD4+ T cells play a protective role in patients with COPD. A previous study in patients with COPD found that CD4+ T cell cytokine production in response to stimulus is restricted almost entirely to memory CD4+ T cells⁵². In healthy populations, memory CD4+ T cell populations can respond faster and more efficiently to previously experienced infections^48,53. In both healthy individuals and those with COPD, circulating and tissue resident memory CD4+ T cells that respond to common viral respiratory pathogens were found, without any significant defects between the COPD and control subjects⁵⁴. The association of low levels of resting memory CD4+ T cells with an elevated risk of all-cause mortality observed in our analysis may reflect the importance of memory CD4+ T cell populations in the response to respiratory infections in subjects with COPD.

Limitations

The observational nature of both cohorts makes difficult to establish true causal relationships. Additionally, the COPDGene Study contains older individuals with extensive smoking histories, so the VAPORED score reflects this. Thus, its application should be limited to individuals who have or are at high risk of developing COPD. Finally, the modeling assumptions of our probabilistic graph models (additive monotonic relationships, Markov faithfulness) place limitations on the interactions our models can recover. Despite these limitations, our approach had similar predictive power compared to standard indices or common machine learning methods for mortality prediction.

Conclusions

Our graph-based models can go beyond simple correlates and identify direct effectors of outcomes. It is also important to distinguish COPD-specific from all-cause mortality, a subject that has been understudied in the past. We developed a new COPD-specific mortality risk score (VAPORED), which is significantly better than established risk scores, and we validated our findings in an external cohort. Furthermore, we identified socioeconomic and biological factors that contribute to all-cause mortality in COPD patients. In the future, we plan to extend this study to additional modalities, such as genetic information, blood proteomics or methylomics, and develop truly comprehensive mortality risk scores. Accurate risk stratification of COPD patients can aid in the identification of high-risk individuals who may benefit most from targeted interventions.

Acknowledgement

This work was supported by NHLBI grants R01HL159805 and R01HL157879 (to PVB) and F31LM013966 (to TCL). This work was also supported by NHLBI U01HL089897 and U01HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprised of AstraZeneca, Bayer Pharmaceuticals, Boehringer-Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer and Sunovion.

Footnotes

Authors state the following conflicts of interest: TCL, MHR, MJ, PVB have nothing to disclose. PC received grant support from Bayer. FCS has received grant support and consulting fees from Sanofi/Regeneron, AstraZeneca, Verona Pharma, Nuvaira, Gala Therapeutics, GlaxoSmithKline, Boehringer Ingelheim. CPH has received grant support and consulting fees from Alpha-1 Foundation, Bayer, Boehringer Ingelheim, Vertex, AstraZeneca, Takeda, Sanofi.
Note: This paper has not undergone peer review

Abbreviations

MB: Markov blanket
RF: Random Forest

References

1.↵
Collaborators GBDCRD. Global, regional, and national deaths, prevalence, disability-adjusted life years, and years lived with disability for chronic obstructive pulmonary disease and asthma, 1990-2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet Respir Med. 2017;5(9):691-706.
OpenUrl Google Scholar
2.↵
Celli BR, Cote CG, Marin JM, et al. The Body-Mass Index, Airflow Obstruction, Dyspnea, and Exercise Capacity Index in Chronic Obstructive Pulmonary Disease. N. Engl. J. Med. 2004;350(10):1005–1012.
OpenUrl CrossRef PubMed Web of Science Google Scholar
3.↵
Puhan MA, Garcia-Aymerich J, Frey M, et al. Expansion of the prognostic assessment of patients with chronic obstructive pulmonary disease: the updated BODE index and the ADO index. Lancet. 2009;374(9691):704-711.
OpenUrl CrossRef PubMed Web of Science Google Scholar
4.↵
Soler-Cataluña JJ, Martínez-García MA, Sánchez LS, Tordera MP, Sánchez PR. Severe exacerbations and BODE index: two independent risk factors for death in male COPD patients. Respir. Med. 2009;103(5):692–699.
OpenUrl CrossRef PubMed Web of Science Google Scholar
5.↵
Jones RC, Donaldson GC, Chavannes NH, et al. Derivation and validation of a composite index of severity in chronic obstructive pulmonary disease: the DOSE Index. Am. J. Respir. Crit. Care Med. 2009;180(12):1189–1195.
OpenUrl CrossRef PubMed Web of Science Google Scholar
6.↵
Guerra B, Haile SR, Lamprecht B, et al. Large-scale external validation and comparison of prognostic models: an application to chronic obstructive pulmonary disease. BMC Med. 2018;16(1):33.
OpenUrl CrossRef PubMed Google Scholar
7.↵
Moll M, Qiao D, Regan EA, et al. Machine Learning and Prediction of All-Cause Mortality in COPD. Chest. 2020;158(3):952–964.
OpenUrl PubMed Google Scholar
8.↵
Strand M, Austin E, Moll M, et al. A Risk Prediction Model for Mortality Among Smokers in the COPDGene® Study. Int. J. Chron. Obstruct. Pulmon. Dis. 2020;7(4):346–361.
OpenUrl Google Scholar
9.↵
Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological). 1972;34(2):187–202.
OpenUrl Web of Science Google Scholar
10.↵
Klein JP, Moeschberger ML. Survival analysis: techniques for censored and truncated data. Vol 1230: Springer; 2003.
Google Scholar
11.↵
Ishwaran H, Kogalur UB, Blackstone EH, Lauer MS. Random survival forests. aoas. 2008;2(3):841-860.
OpenUrl Google Scholar
12.↵
Glymour C, Zhang K, Spirtes P. Review of Causal Discovery Methods Based on Graphical Models. Front. Genet. 2019;10:524.
Google Scholar
13.↵
Raghu VK, Poon A, Benos PV. Evaluation of Causal Structure Learning Methods on Mixed Data Types. Proceedings of 2018 ACM SIGKDD Workshop on Causal Disocvery; 2018; Proceedings of Machine Learning Research.
Google Scholar
14.↵
Raghu VK, Zhao W, Pu J, et al. Feasibility of lung cancer prediction from low-dose CT scan and smoking factors using causal models. Thorax. 2019;74(7):643–649.
OpenUrl Abstract/FREE Full Text Google Scholar
15.↵
Abecassis I, Sedgewick AJ, Romkes M, et al. PARP1 rs1805407 Increases Sensitivity to PARP1 Inhibitors in Cancer Cells Suggesting an Improved Therapeutic Strategy. Sci Rep. 2019;9(1):3309.
OpenUrl Google Scholar
16.↵
Maathuis MH, Colombo D, Kalisch M, Bühlmann P. Predicting causal effects in large-scale systems from observational data. Nat. Methods. 2010;7(4):247–248.
OpenUrl CrossRef PubMed Web of Science Google Scholar
17.
Stekhoven DJ, Moraes I, Sveinbjörnsson G, Hennig L, Maathuis MH, Bühlmann P. Causal stability ranking. Bioinformatics. 2012;28(21):2819–2823.
OpenUrl CrossRef PubMed Web of Science Google Scholar
18.↵
Buschur KL, Chikina M, Benos PV. Causal network perturbations for instance-specific analysis of single cell and disease samples. Bioinformatics. 2020;36(8):2515–2521.
OpenUrl Google Scholar
19.↵
Regan EA, Hokanson JE, Murphy JR, et al. Genetic epidemiology of COPD (COPDGene) study design. COPD. 2010;7(1):32–43.
OpenUrl CrossRef PubMed Web of Science Google Scholar
20.↵
Young KA, Regan EA, Han MK, et al. Subtypes of COPD have unique distributions and differential risk of mortality. Chronic Obstructive Pulmonary Diseases: Journal of the COPD Foundation. 2019;6(5):400.
OpenUrl Google Scholar
21.↵
McGarvey LP, John M, Anderson JA, Zvarich M, Wise RA. Ascertainment of cause-specific mortality in COPD: operations of the TORCH Clinical Endpoint Committee. Thorax. 2007;62(5):411–415.
OpenUrl Abstract/FREE Full Text Google Scholar
22.↵
Han MK, Kazerooni EA, Lynch DA, et al. Chronic obstructive pulmonary disease exacerbations in the COPDGene study: associated radiologic phenotypes. Radiology. 2011;261(1):274–282.
OpenUrl CrossRef PubMed Web of Science Google Scholar
23.↵
Ghosh AJ, Saferali A, Lee S, et al. Blood RNA sequencing shows overlapping gene expression across COPD phenotype domains. Thorax. 2022;77(2):115–122.
OpenUrl Abstract/FREE Full Text Google Scholar
24.↵
Newman AM, Steen CB, Liu CL, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 2019;37(7):773–782.
OpenUrl PubMed Google Scholar
25.↵
Newman AM, Liu CL, Green MR, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods. 2015;12(5):453–457.
OpenUrl CrossRef PubMed Google Scholar
26.↵
Alvarez MJ, Shen Y, Giorgi FM, et al. Functional characterization of somatic mutations in cancer using network-based inference of protein activity. Nat. Genet. 2016;48(8):838–847.
OpenUrl CrossRef PubMed Google Scholar
27.↵
Vestbo J, Anderson W, Coxson HO, et al. Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE). Eur. Respir. J. 2008;31(4):869–873.
OpenUrl Abstract/FREE Full Text Google Scholar
28.↵
Aliferis CF. Local causal and Markov blanket induction for causal discovery and feature selection for classification part II: Analysis and extensions. J. Mach. Learn. Res. 2010;11:235–284.
OpenUrl Google Scholar
29.↵
Simon N, Friedman J, Hastie T, Tibshirani R. Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. J. Stat. Softw. 2011;39(5):1–13.
OpenUrl CrossRef PubMed Web of Science Google Scholar
30.↵
Harrell FE, Jr., Califf RM, Pryor DB, Lee KL, Rosati RA. Evaluating the yield of medical tests. JAMA. 1982;247(18):2543–2546.
OpenUrl CrossRef PubMed Web of Science Google Scholar
31.↵
Singh D, Agusti A, Anzueto A, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive lung disease: the GOLD science committee report 2019. European Respiratory Journal. 2019;53(5).
Google Scholar
32.↵
Kaplan EL, Meier P. Nonparametric Estimation from Incomplete Observations. J. Am. Stat. Assoc. 1958;53(282):457–481.
OpenUrl CrossRef Web of Science Google Scholar
33.↵
Evankovich JW, Nouraie SM, Sciurba FC. A Model to Predict Residual Volume from Forced Spirometry Measurements in Chronic Obstructive Pulmonary Disease. Chronic Obstr Pulm Dis. 2023;10(1):55–63.
OpenUrl Google Scholar
34.↵
Casanova C, Cote C, de Torres JP, et al. Inspiratory-to-total lung capacity ratio predicts mortality in patients with chronic obstructive pulmonary disease. American journal of respiratory and critical care medicine. 2005;171(6):591–597.
OpenUrl CrossRef PubMed Web of Science Google Scholar
35.↵
Martinez FJ, Foster G, Curtis JL, et al. Predictors of mortality in patients with emphysema and severe airflow obstruction. American journal of respiratory and critical care medicine. 2006;173(12):1326–1334.
OpenUrl CrossRef PubMed Web of Science Google Scholar
36.↵
Bhatt SP, Nakhmani A, Fortis S, et al. FEV(1)/FVC Severity Stages for Chronic Obstructive Pulmonary Disease. American journal of respiratory and critical care medicine. 2023.
Google Scholar
37.↵
Restrepo MI, Mortensen EM, Pugh JA, Anzueto A. COPD is associated with increased mortality in patients with community-acquired pneumonia. Eur Respir J. 2006;28(2):346–351.
OpenUrl Abstract/FREE Full Text Google Scholar
38.↵
Bhat TA, Panzica L, Kalathil SG, Thanavala Y. Immune Dysfunction in Patients with Chronic Obstructive Pulmonary Disease. Ann Am Thorac Soc. 2015;12 Suppl 2(Suppl 2):S169-175.
OpenUrl Google Scholar
39.↵
Park SC, Kim YS, Kang YA, et al. Hemoglobin and mortality in patients with COPD: a nationwide population-based cohort study. Int. J. Chron. Obstruct. Pulmon. Dis. 2018;13:1599–1605.
OpenUrl Google Scholar
40.↵
Toft-Petersen AP, Torp-Pedersen C, Weinreich UM, Rasmussen BS. Association between hemoglobin and prognosis in patients admitted to hospital for COPD. Int. J. Chron. Obstruct. Pulmon. Dis. 2016;11:2813–2820.
OpenUrl Google Scholar
41.↵
Pavasini R, Biscaglia S, d’Ascenzo F, et al. Antiplatelet Treatment Reduces All-Cause Mortality in COPD Patients: A Systematic Review and Meta-Analysis. COPD. 2016;13(4):509–514.
OpenUrl Google Scholar
42.↵
Mallah H, Ball S, Sekhon J, Parmar K, Nugent K. Platelets in chronic obstructive pulmonary disease: An update on pathophysiology and implications for antiplatelet therapy. Respir. Med. 2020;171:106098.
OpenUrl Google Scholar
43.↵
Fawzy A, Anderson JA, Cowans NJ, et al. Association of platelet count with all-cause mortality and risk of cardiovascular and respiratory morbidity in stable COPD. Respir. Res. 2019;20(1):86.
OpenUrl Google Scholar
44.↵
Schmidlin H, Diehl SA, Nagasawa M, et al. Spi-B inhibits human plasma cell differentiation by repressing BLIMP1 and XBP-1 expression. Blood. 2008;112(5):1804–1812.
OpenUrl Abstract/FREE Full Text Google Scholar
45.↵
Willis SN, Tellier J, Liao Y, et al. Environmental sensing by mature B cells is controlled by the transcription factors PU.1 and SpiB. Nat. Commun. 2017;8(1):1426.
OpenUrl Google Scholar
46.↵
Polverino F, Seys LJM, Bracke KR, Owen CA. B cells in chronic obstructive pulmonary disease: moving to center stage. Am. J. Physiol. Lung Cell. Mol. Physiol. 2016;311(4):L687–L695.
OpenUrl CrossRef PubMed Google Scholar
47.↵
Brusselle GG, Demoor T, Bracke KR, Brandsma CA, Timens W. Lymphoid follicles in (very) severe COPD: beneficial or harmful? Eur. Respir. J. 2009;34(1):219–230.
OpenUrl Abstract/FREE Full Text Google Scholar
48.↵
Farber DL, Yudanin NA, Restifo NP. Human memory T cells: generation, compartmentalization and homeostasis. Nat. Rev. Immunol. 2014;14(1):24–35.
OpenUrl CrossRef PubMed Google Scholar
49.↵
Tanigawa T, Araki S, Nakata A, et al. Increase in memory (CD4+CD29+ and CD4+CD45RO+) T and naive (CD4+CD45RA+) T-cell subpopulations in smokers. Arch. Environ. Health. 1998;53(6):378–383.
OpenUrl CrossRef PubMed Web of Science Google Scholar
50.↵
Nakata A, Takahashi M, Irie M, Fujioka Y, Haratani T, Araki S. Relationship between cumulative effects of smoking and memory CD4+ T lymphocyte subpopulations. Addict. Behav. 2007;32(7):1526–1531.
OpenUrl PubMed Google Scholar
51.↵
Starkey MR, Plank MW, Casolari P, et al. IL-22 and its receptors are increased in human and experimental COPD and contribute to pathogenesis. Eur. Respir. J. 2019;54(1).
Google Scholar
52.↵
Paats MS, Bergen IM, Hoogsteden HC, van der Eerden MM, Hendriks RW. Systemic CD4+ and CD8+ T-cell cytokine profiles correlate with GOLD stage in stable COPD. Eur. Respir. J. 2012;40(2):330–337.
OpenUrl Abstract/FREE Full Text Google Scholar
53.↵
MacLeod MKL, Kappler JW, Marrack P. Memory CD4 T cells: generation, reactivation and re-assignment. Immunology. 2010;130(1):10–15.
OpenUrl CrossRef PubMed Web of Science Google Scholar
54.↵
Daniels H, van Schilfgaarde M, Jansen HM, et al. Characterization of CD4+ Memory T Cell Responses Directed against Common Respiratory Pathogens in Peripheral Blood and Lung. J. Infect. Dis. 2007;195(11):1718–1725.
OpenUrl CrossRef PubMed Web of Science Google Scholar

Posted February 01, 2024.

Download PDF

Author Declarations

Data/Code

Citation Tools

Get QR code

Tweet Widget

Subject Area

Respiratory Medicine

Reviews and Context

Comment

TRIP Peer Reviews

Community Reviews

Automated Services

Blogs/Media

Author Videos

Subject Areas

All Articles

Addiction Medicine (419)
Allergy and Immunology (741)
Anesthesia (217)
Cardiovascular Medicine (3193)
Dentistry and Oral Medicine (355)
Dermatology (270)
Emergency Medicine (473)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1136)
Epidemiology (13180)
Forensic Medicine (19)
Gastroenterology (883)
Genetic and Genomic Medicine (5010)
Geriatric Medicine (467)
Health Economics (767)
Health Informatics (3154)
Health Policy (1119)
Health Systems and Quality Improvement (1161)
Hematology (418)
HIV/AIDS (992)
Infectious Diseases (except HIV/AIDS) (14482)
Intensive Care and Critical Care Medicine (900)
Medical Education (466)
Medical Ethics (123)
Nephrology (512)
Neurology (4762)
Nursing (253)
Nutrition (706)
Obstetrics and Gynecology (865)
Occupational and Environmental Health (775)
Oncology (2452)
Ophthalmology (696)
Orthopedics (274)
Otolaryngology (335)
Pain Medicine (317)
Palliative Medicine (89)
Pathology (525)
Pediatrics (1270)
Pharmacology and Therapeutics (539)
Primary Care Research (542)
Psychiatry and Clinical Psychology (4091)
Public and Global Health (7325)
Radiology and Imaging (1647)
Rehabilitation Medicine and Physical Therapy (978)
Respiratory Medicine (959)
Rheumatology (469)
Sexual and Reproductive Health (486)
Sports Medicine (413)
Surgery (530)
Toxicology (68)
Transplantation (227)
Urology (197)

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Collaborators GBDCRD. Global, regional, and national deaths, prevalence, disability-adjusted life years, and years lived with disability for chronic obstructive pulmonary disease and asthma, 1990-2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet Respir Med. 2017;5(9):691-706.
OpenUrl Google Scholar

[2] 2.↵
Celli BR, Cote CG, Marin JM, et al. The Body-Mass Index, Airflow Obstruction, Dyspnea, and Exercise Capacity Index in Chronic Obstructive Pulmonary Disease. N. Engl. J. Med. 2004;350(10):1005–1012.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[3] 3.↵
Puhan MA, Garcia-Aymerich J, Frey M, et al. Expansion of the prognostic assessment of patients with chronic obstructive pulmonary disease: the updated BODE index and the ADO index. Lancet. 2009;374(9691):704-711.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[4] 4.↵
Soler-Cataluña JJ, Martínez-García MA, Sánchez LS, Tordera MP, Sánchez PR. Severe exacerbations and BODE index: two independent risk factors for death in male COPD patients. Respir. Med. 2009;103(5):692–699.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[5] 5.↵
Jones RC, Donaldson GC, Chavannes NH, et al. Derivation and validation of a composite index of severity in chronic obstructive pulmonary disease: the DOSE Index. Am. J. Respir. Crit. Care Med. 2009;180(12):1189–1195.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[6] 6.↵
Guerra B, Haile SR, Lamprecht B, et al. Large-scale external validation and comparison of prognostic models: an application to chronic obstructive pulmonary disease. BMC Med. 2018;16(1):33.
OpenUrl CrossRef PubMed Google Scholar

[7] 7.↵
Moll M, Qiao D, Regan EA, et al. Machine Learning and Prediction of All-Cause Mortality in COPD. Chest. 2020;158(3):952–964.
OpenUrl PubMed Google Scholar

[8] 8.↵
Strand M, Austin E, Moll M, et al. A Risk Prediction Model for Mortality Among Smokers in the COPDGene® Study. Int. J. Chron. Obstruct. Pulmon. Dis. 2020;7(4):346–361.
OpenUrl Google Scholar

[9] 9.↵
Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological). 1972;34(2):187–202.
OpenUrl Web of Science Google Scholar

[10] 10.↵
Klein JP, Moeschberger ML. Survival analysis: techniques for censored and truncated data. Vol 1230: Springer; 2003.
Google Scholar

[11] 11.↵
Ishwaran H, Kogalur UB, Blackstone EH, Lauer MS. Random survival forests. aoas. 2008;2(3):841-860.
OpenUrl Google Scholar

[12] 12.↵
Glymour C, Zhang K, Spirtes P. Review of Causal Discovery Methods Based on Graphical Models. Front. Genet. 2019;10:524.
Google Scholar

[13] 13.↵
Raghu VK, Poon A, Benos PV. Evaluation of Causal Structure Learning Methods on Mixed Data Types. Proceedings of 2018 ACM SIGKDD Workshop on Causal Disocvery; 2018; Proceedings of Machine Learning Research.
Google Scholar

[14] 14.↵
Raghu VK, Zhao W, Pu J, et al. Feasibility of lung cancer prediction from low-dose CT scan and smoking factors using causal models. Thorax. 2019;74(7):643–649.
OpenUrl Abstract/FREE Full Text Google Scholar

[15] 15.↵
Abecassis I, Sedgewick AJ, Romkes M, et al. PARP1 rs1805407 Increases Sensitivity to PARP1 Inhibitors in Cancer Cells Suggesting an Improved Therapeutic Strategy. Sci Rep. 2019;9(1):3309.
OpenUrl Google Scholar

[16] 16.↵
Maathuis MH, Colombo D, Kalisch M, Bühlmann P. Predicting causal effects in large-scale systems from observational data. Nat. Methods. 2010;7(4):247–248.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[17] 17.
Stekhoven DJ, Moraes I, Sveinbjörnsson G, Hennig L, Maathuis MH, Bühlmann P. Causal stability ranking. Bioinformatics. 2012;28(21):2819–2823.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[18] 18.↵
Buschur KL, Chikina M, Benos PV. Causal network perturbations for instance-specific analysis of single cell and disease samples. Bioinformatics. 2020;36(8):2515–2521.
OpenUrl Google Scholar

[19] 19.↵
Regan EA, Hokanson JE, Murphy JR, et al. Genetic epidemiology of COPD (COPDGene) study design. COPD. 2010;7(1):32–43.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[20] 20.↵
Young KA, Regan EA, Han MK, et al. Subtypes of COPD have unique distributions and differential risk of mortality. Chronic Obstructive Pulmonary Diseases: Journal of the COPD Foundation. 2019;6(5):400.
OpenUrl Google Scholar

[21] 21.↵
McGarvey LP, John M, Anderson JA, Zvarich M, Wise RA. Ascertainment of cause-specific mortality in COPD: operations of the TORCH Clinical Endpoint Committee. Thorax. 2007;62(5):411–415.
OpenUrl Abstract/FREE Full Text Google Scholar

[22] 22.↵
Han MK, Kazerooni EA, Lynch DA, et al. Chronic obstructive pulmonary disease exacerbations in the COPDGene study: associated radiologic phenotypes. Radiology. 2011;261(1):274–282.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[23] 23.↵
Ghosh AJ, Saferali A, Lee S, et al. Blood RNA sequencing shows overlapping gene expression across COPD phenotype domains. Thorax. 2022;77(2):115–122.
OpenUrl Abstract/FREE Full Text Google Scholar

[24] 24.↵
Newman AM, Steen CB, Liu CL, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 2019;37(7):773–782.
OpenUrl PubMed Google Scholar

[25] 25.↵
Newman AM, Liu CL, Green MR, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods. 2015;12(5):453–457.
OpenUrl CrossRef PubMed Google Scholar

[26] 26.↵
Alvarez MJ, Shen Y, Giorgi FM, et al. Functional characterization of somatic mutations in cancer using network-based inference of protein activity. Nat. Genet. 2016;48(8):838–847.
OpenUrl CrossRef PubMed Google Scholar

[27] 27.↵
Vestbo J, Anderson W, Coxson HO, et al. Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points (ECLIPSE). Eur. Respir. J. 2008;31(4):869–873.
OpenUrl Abstract/FREE Full Text Google Scholar

[28] 28.↵
Aliferis CF. Local causal and Markov blanket induction for causal discovery and feature selection for classification part II: Analysis and extensions. J. Mach. Learn. Res. 2010;11:235–284.
OpenUrl Google Scholar

[29] 29.↵
Simon N, Friedman J, Hastie T, Tibshirani R. Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. J. Stat. Softw. 2011;39(5):1–13.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[30] 30.↵
Harrell FE, Jr., Califf RM, Pryor DB, Lee KL, Rosati RA. Evaluating the yield of medical tests. JAMA. 1982;247(18):2543–2546.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[31] 31.↵
Singh D, Agusti A, Anzueto A, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive lung disease: the GOLD science committee report 2019. European Respiratory Journal. 2019;53(5).
Google Scholar

[32] 32.↵
Kaplan EL, Meier P. Nonparametric Estimation from Incomplete Observations. J. Am. Stat. Assoc. 1958;53(282):457–481.
OpenUrl CrossRef Web of Science Google Scholar

[33] 33.↵
Evankovich JW, Nouraie SM, Sciurba FC. A Model to Predict Residual Volume from Forced Spirometry Measurements in Chronic Obstructive Pulmonary Disease. Chronic Obstr Pulm Dis. 2023;10(1):55–63.
OpenUrl Google Scholar

[34] 34.↵
Casanova C, Cote C, de Torres JP, et al. Inspiratory-to-total lung capacity ratio predicts mortality in patients with chronic obstructive pulmonary disease. American journal of respiratory and critical care medicine. 2005;171(6):591–597.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[35] 35.↵
Martinez FJ, Foster G, Curtis JL, et al. Predictors of mortality in patients with emphysema and severe airflow obstruction. American journal of respiratory and critical care medicine. 2006;173(12):1326–1334.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[36] 36.↵
Bhatt SP, Nakhmani A, Fortis S, et al. FEV(1)/FVC Severity Stages for Chronic Obstructive Pulmonary Disease. American journal of respiratory and critical care medicine. 2023.
Google Scholar

[37] 37.↵
Restrepo MI, Mortensen EM, Pugh JA, Anzueto A. COPD is associated with increased mortality in patients with community-acquired pneumonia. Eur Respir J. 2006;28(2):346–351.
OpenUrl Abstract/FREE Full Text Google Scholar

[38] 38.↵
Bhat TA, Panzica L, Kalathil SG, Thanavala Y. Immune Dysfunction in Patients with Chronic Obstructive Pulmonary Disease. Ann Am Thorac Soc. 2015;12 Suppl 2(Suppl 2):S169-175.
OpenUrl Google Scholar

[39] 39.↵
Park SC, Kim YS, Kang YA, et al. Hemoglobin and mortality in patients with COPD: a nationwide population-based cohort study. Int. J. Chron. Obstruct. Pulmon. Dis. 2018;13:1599–1605.
OpenUrl Google Scholar

[40] 40.↵
Toft-Petersen AP, Torp-Pedersen C, Weinreich UM, Rasmussen BS. Association between hemoglobin and prognosis in patients admitted to hospital for COPD. Int. J. Chron. Obstruct. Pulmon. Dis. 2016;11:2813–2820.
OpenUrl Google Scholar

[41] 41.↵
Pavasini R, Biscaglia S, d’Ascenzo F, et al. Antiplatelet Treatment Reduces All-Cause Mortality in COPD Patients: A Systematic Review and Meta-Analysis. COPD. 2016;13(4):509–514.
OpenUrl Google Scholar

[42] 42.↵
Mallah H, Ball S, Sekhon J, Parmar K, Nugent K. Platelets in chronic obstructive pulmonary disease: An update on pathophysiology and implications for antiplatelet therapy. Respir. Med. 2020;171:106098.
OpenUrl Google Scholar

[43] 43.↵
Fawzy A, Anderson JA, Cowans NJ, et al. Association of platelet count with all-cause mortality and risk of cardiovascular and respiratory morbidity in stable COPD. Respir. Res. 2019;20(1):86.
OpenUrl Google Scholar

[44] 44.↵
Schmidlin H, Diehl SA, Nagasawa M, et al. Spi-B inhibits human plasma cell differentiation by repressing BLIMP1 and XBP-1 expression. Blood. 2008;112(5):1804–1812.
OpenUrl Abstract/FREE Full Text Google Scholar

[45] 45.↵
Willis SN, Tellier J, Liao Y, et al. Environmental sensing by mature B cells is controlled by the transcription factors PU.1 and SpiB. Nat. Commun. 2017;8(1):1426.
OpenUrl Google Scholar

[46] 46.↵
Polverino F, Seys LJM, Bracke KR, Owen CA. B cells in chronic obstructive pulmonary disease: moving to center stage. Am. J. Physiol. Lung Cell. Mol. Physiol. 2016;311(4):L687–L695.
OpenUrl CrossRef PubMed Google Scholar

[47] 47.↵
Brusselle GG, Demoor T, Bracke KR, Brandsma CA, Timens W. Lymphoid follicles in (very) severe COPD: beneficial or harmful? Eur. Respir. J. 2009;34(1):219–230.
OpenUrl Abstract/FREE Full Text Google Scholar

[48] 48.↵
Farber DL, Yudanin NA, Restifo NP. Human memory T cells: generation, compartmentalization and homeostasis. Nat. Rev. Immunol. 2014;14(1):24–35.
OpenUrl CrossRef PubMed Google Scholar

[49] 49.↵
Tanigawa T, Araki S, Nakata A, et al. Increase in memory (CD4+CD29+ and CD4+CD45RO+) T and naive (CD4+CD45RA+) T-cell subpopulations in smokers. Arch. Environ. Health. 1998;53(6):378–383.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[50] 50.↵
Nakata A, Takahashi M, Irie M, Fujioka Y, Haratani T, Araki S. Relationship between cumulative effects of smoking and memory CD4+ T lymphocyte subpopulations. Addict. Behav. 2007;32(7):1526–1531.
OpenUrl PubMed Google Scholar

[51] 51.↵
Starkey MR, Plank MW, Casolari P, et al. IL-22 and its receptors are increased in human and experimental COPD and contribute to pathogenesis. Eur. Respir. J. 2019;54(1).
Google Scholar

[52] 52.↵
Paats MS, Bergen IM, Hoogsteden HC, van der Eerden MM, Hendriks RW. Systemic CD4+ and CD8+ T-cell cytokine profiles correlate with GOLD stage in stable COPD. Eur. Respir. J. 2012;40(2):330–337.
OpenUrl Abstract/FREE Full Text Google Scholar

[53] 53.↵
MacLeod MKL, Kappler JW, Marrack P. Memory CD4 T cells: generation, reactivation and re-assignment. Immunology. 2010;130(1):10–15.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[54] 54.↵
Daniels H, van Schilfgaarde M, Jansen HM, et al. Characterization of CD4+ Memory T Cell Responses Directed against Common Respiratory Pathogens in Peripheral Blood and Lung. J. Infect. Dis. 2007;195(11):1718–1725.
OpenUrl CrossRef PubMed Web of Science Google Scholar

Disentangling Predictors of COPD Mortality with Probabilistic Graphical Models

Abstract

Introduction