Clinical risk factors and blood protein biomarkers of 10-year pneumonia risk

Background Chronic inflammation may increase susceptibility to pneumonia. Research question To explore associations between clinical comorbidities, serum protein immunoassays, and long-term pneumonia risk. Methods Framingham Heart Study Offspring Cohort participants ≥65 years were linked to their Centers for Medicare Services claims data. Clinical data and 88 serum protein immunoassays were evaluated for associations with 10-year incident pneumonia risk using Fine-Gray models for competing risks of death and least absolute shrinkage and selection operators for covariate selection. Results We identified 1,370 participants with immunoassays and linkage to Medicare data. During 10 years of follow up, 428 (31%) participants had a pneumonia diagnosis. Chronic pulmonary disease [subdistribution hazard ratio (SHR) 1.87; 95% confidence interval (CI), 1.33–2.61], current smoking (SHR 1.79, CI 1.31–2.45), heart failure (SHR 1.74, CI 1.10–2.74), atrial fibrillation/flutter (SHR 1.43, CI 1.06–1.93), diabetes (SHR 1.36, CI 1.05–1.75), hospitalization within one year (SHR 1.34, CI 1.09–1.65), and age (SHR 1.06 per year, CI 1.04–1.08) were associated with pneumonia. Three baseline serum protein measurements were associated with pneumonia risk independent of measured clinical factors: growth differentiation factor 15 (SHR 1.32; CI 1.02–1.69), C-reactive protein (SHR 1.16, CI 1.06–1.27) and matrix metallopeptidase 8 (SHR 1.14, CI 1.01–1.30). Addition of C-reactive protein to the clinical model improved prediction (Akaike information criterion 4950 from 4960; C-statistic of 0.64 from 0.62). Conclusions Clinical comorbidities and serum immunoassays were predictive of pneumonia risk. C-reactive protein, a routinely-available measure of inflammation, modestly improved pneumonia risk prediction over clinical factors. Our findings support the hypothesis that prior inflammation may increase the risk of pneumonia.


Summary or "take home" message:
Here, we show that long-term pneumonia risks are associated with clinical characteristics and serum markers of inflammation.Our findings offer preliminary support to the hypothesis that chronic inflammation may be a marker of increased pneumonia risk.

INTRODUCTION
Pneumonia is the leading infectious cause of death globally [1] and is a substantial public health issue.In the U.S., pneumonia accounts for an estimated 1.5 million adult hospitalizations annually with 100,000 in-hospital deaths [2].Furthermore, pneumonia survivors experience numerous deleterious consequences, such as reduced pulmonary function, new cardiovascular complications, decline in cognitive function, depression, decreased functional status and quality of life, and shortened lifespans [3].
Prior studies have investigated clinical risk factors for pneumonia and found associations with age (both the youngest and oldest), chronic conditions, socioeconomic status and lifestyle factors such as smoking and alcohol ingestion [4].Given associations between pneumonia, preexisting comorbid conditions, and future health decline, some have hypothesized that pneumonia represents an acute manifestation of a chronic susceptibility state [5].Although the mechanisms of increased pneumonia susceptibility are multi-factorial and incompletely understood, prior studies have posited links between chronic inflammation, immunosenescence and pneumonia risk [6][7][8].However, few studies have evaluated associations between comorbidities, inflammation, and pneumonia in humans.
We explored the associations between clinical comorbidities, serum protein markers of organ function and inflammation, and long-term pneumonia risk in a well-characterized, community-dwelling, prospective cohort.Furthermore, we evaluated whether routinelymeasured serum markers of inflammation such as C-reactive protein may be associated with future risk of pneumonia.Thus, we hypothesize that clinical risk factors and serum markers of inflammation would be associated with long-term risk for the development of pneumonia.

Data Source and Cohort
We conducted a retrospective cohort study including participants in the Framingham Heart Study (FHS) Offspring Cohort with FHS data linked to Centers for Medicare and Medicaid Services (CMS) claims data.The design of the FHS Offspring Cohort has been previously described [9], as well as linkage of FHS to CMS data [10].In brief, FHS Offspring participants underwent standardized examinations and questionnaires with a study physician every 4 to 8 years.We used data from examinations 5 (1991-1995), 6 (1995-1998) and 7 (1998-2001) to characterize patient characteristics potentially associated with pneumonia based upon prior literature reviews [4].Data missing from more recent examinations were carried forward if present at prior examinations.Data were missing if a participant did not answer a particular questionnaire item or the question was not asked at an examination.In addition to clinical characteristics collected during examinations, FHS data contained measurements of 88 serum protein immunoassays drawn at examination 7 as part of the Systems Approach to Biomarker Research in Cardiovascular Disease Initiative, a sub-study of the FHS (see Table E1 in the online data supplement) [11].Samples were obtained between 7 to 9 a.m. after an overnight fast and stored at -80°C after centrifugation and until assay.Luminex™ xMAP multiplex immunoassay technology was used to measure the biomarkers [12].Fourteen proteins were analyzed as binary variables defined as above or below a detection limit due to a high rate of missing data above the detection limit [11].CMS inpatient, outpatient, skilled nursing facility, home health agency, hospice, carrier and durable medical equipment files were used to identify pneumonia diagnoses.Participants were included if they were 65 years old or older, had FHS data linked to CMS data, had no history of pneumonia prior to exam 7, and had a protein immunoassay panel measurement at examination 7 (Figure 1).Subjects were followed for up to 10 years from the time of examination 7 until diagnosis of pneumonia or death.Data for this study was de-identified, with a waiver of informed consent and protocol approval by the Boston University Medical Center Institutional Review Board (H-35870).

Outcomes
The primary outcome was pneumonia defined by International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) codes identified from CMS claims within 10 years of follow up.Based upon prior studies [13], we explored several ICD-9-CM coding algorithms for identifying acute pneumonia episodes from claims data.Table E2 shows that clinical variables associated with pneumonia were similar regardless of the pneumonia definition based on various ICD-9CM codes for pneumonia and in various clinical settings, thus we chose to use the most sensitive definition capturing any pneumonia diagnosis in any clinical setting (definition 1) for all analyses.Death was defined from Medicare denominator files and loss to follow-up from Medicare files was likely rare.

Covariates
We abstracted baseline demographics, clinical comorbidities, vital signs, anthropometric measurements (i.e., body mass index [BMI]), smoking and alcohol use and measures of functional and cognitive status from FHS clinical exam files.Clinical comorbidities included chronic pulmonary disease, hypertension, atrial fibrillation/flutter, valvular disease, myocardial infarction, heart failure, stroke, diabetes, chronic kidney disease, peripheral vascular disease, pulmonary embolus/deep vein thrombosis, cancer, hip fracture and depression as defined by FHS clinician assessments.Measures of functional and cognitive status included the Kannel physical activity index [14], mini-mental state examination [15], Katz index of independence in activities of daily living [16], physical disability [17] and overnight hospitalization in the year prior to the FHS examination.Detailed definitions of the covariates validated by the FHS protocols are provided in Table E3.

Statistical methods
We used a least absolute shrinkage and selection operator (LASSO) machine approach to select model covariates [18].LASSO imposes a constraint to the model that minimizes prediction error and causes some regression coefficients to shrink toward 0, excluding those variables, and thus selects variables most strongly associated with the outcome of interest.
Because of the presence of missing data deemed missing at random, we used a multiple imputation approach to missing data.We imputed missing data across 5 datasets and used LASSO methods to select covariates averaged across the pooled imputed data sets, where the tuning parameter lamba was selected using 5-fold cross-validation.Because of the potential for competing risk of death for pneumonia outcomes, we used Fine-Gray competing risk models to estimate the subdistribution hazard ratio of each selected predictor of pneumonia [19].In order to explore contributions from comorbidities and serum protein markers to pneumonia risk, we  (CRP) (to assess how CRP may add to risk prediction above comorbidities).Models were assessed for fit by measured change in C-statistic (increase) and Akaike Information Criterion (AIC; decrease).Model calibration was assessed by plotting predicted risk of pneumonia against observed frequencies of pneumonia.Because optimal choice of model-building strategy is an area of debate [20] and strongly determines covariates included in the model, we performed sensitivity analyses building clinical models using multiple different strategies (e.g., p<0.2 based selection, use of Cox proportional hazards models that did not account for competing risk of death) to assess the robustness of the multiple imputation LASSO selection approach (Tables E4   and E5).We performed additional sensitivity analyses using the four aforementioned risk models restricted to only pneumonia in the inpatient setting to assess for changes in model performance among patients with more severe pneumonia.All statistical analyses were performed with R software version 3.4.3.

Cohort Characteristics
We identified 1,370 FHS participants aged 65 years or older with linked CMS claims data and protein immunoassay panel measurements from examination 7. Subject characteristics are summarized in Table 1.During the 10 years of follow up, 428 (31%) subjects had a diagnosis of pneumonia.Characteristics of subjects with and without a diagnosis of pneumonia are shown in Table 2. Patients with pneumonia more frequently were older, had higher rates of comorbid conditions, physical disability, and smoking.
Sensitivity analyses of clinical variables in complete case analysis showed similar results to the multiple imputed LASSO approach (Table E6).Sensitivity analyses using the risk models restricted to pneumonia in the inpatient setting did not substantively change model performance  E7).

DISCUSSION
We explored clinical and immunoassay predictors of pneumonia risk in a cohort of community dwelling participants in the FHS.In addition to observing multiple clinical variables associated with pneumonia, we identified 3 baseline serum protein immunoassay measures (GDF-15, MMP-8 and CRP), after adjusting for clinical variables, that were associated with pneumonia events up to 10 years prior to the pneumonia diagnosis.Furthermore, the addition of routinely-available CRP to a model including clinical variables improved model performance for .discrimination of pneumonia events.Models performed similarly when restricted to more severe pneumonia requiring hospitalization.Our findings suggest that long-term pneumonia risks are associated not only with previously described clinical characteristics, but also with serum markers of inflammation.Taken together, our findings provide preliminary support to the hypothesis that prior inflammation may be a marker of increased pneumonia risk and inform future efforts at pneumonia risk prediction.
We caution against interpreting the association between the clinical variables and protein immunoassays and future pneumonia as casual and acknowledge that these predictors may correlate with unmeasured causal variables.In addition, single measurements of protein immunoassays may not represent a chronic inflammatory state.However, our results expand upon prior studies of inflammation, clinical comorbidities, and pneumonia risk.Yende et al. [8] identified associations between increased baseline levels of inflammatory cytokines (IL-6 and tumor necrosis factor-alpha [TNF-]) and future pneumonia risk in a cohort of well-functioning, community-dwelling elderly subjects.Unfortunately, a high rate of missing data in our study restricted analysis of IL-6, and tumor necrosis factor-alpha was not included in the immunoassay panel.However, we found other markers of inflammation independent of clinical comorbidities associated with long-term pneumonia risk, including routinely-available clinical measures such as CRP, an acute-phase protein synthesized by the liver.Although the utility of CRP in the diagnosis of acute pneumonia and as a predictor of more severe disease with poor outcomes has been reported [21], the association of baseline CRP with future pneumonia risk has not been described.
Inflammation may increase pneumonia susceptibility through various biological mechanisms.The highest incidence of pneumonia in adults is amongst the elderly, and aging is .associated with inflammation and immunosenescence [22,23].Aged lungs experience greater pathogen adhesion and host cell invasion, a delayed and diminished immune response during early infection, attenuated phagocytic capabilities of macrophages and neutrophils via Toll-like receptor dysfunction, and a prolonged immune response to infection [6,7,24,25].These preclinical findings suggest age-related chronic inflammation results in immune dysregulation that may contribute to increased pneumonia susceptibility.But, the exact causal pathways between the clinical factors and biomarkers and future pneumonia risk remain unclear.
In addition to CRP, baseline GDF-15 and MMP-8 were also associated with future pneumonia risk independent of clinical comorbidities.The association of these inflammatory markers with future pneumonia risk has not been previously reported.Although our methodological approach was structured to maximize pneumonia risk prediction, and not establish causal pathways, we review below previously described biological roles of the biomarkers associated with pneumonia risk to guide potential future research.GDF-15 is a transforming growth factor-ß superfamily cytokine that is expressed in various cell types, and its expression increases in response to tissue injury and inflammation [26].In subjects with chronic obstructive pulmonary disease (COPD) and in response to cigarette smoke exposure, GDF-15 is upregulated in airway epithelial cells and activates pathways that promote mucin production and cellular senescence, altering mucosal immunity and leading to chronic airway inflammation, respectively.Furthermore, GDF-15 overexpression has been shown to increase susceptibility to and severity of respiratory viral infections via increased viral replication and exaggerated virusinduced inflammation [27].Baseline serum GDF-15 levels have been found to be an independent predictor of adverse outcomes in COPD patients, including accelerated pulmonary function decline, higher yearly exacerbation rates and increased mortality [28].Finally, increased GDF-15 .expression in the muscle of COPD subjects suggests it contributes to muscle wasting in COPD [29].Although these studies report the association of GDF-15 with increased susceptibility to viral infection, chronic inflammation and senescence in the context of COPD and cigarette smoke exposure, they introduce possible mechanisms by which baseline GDF-15 increases future pneumonia risk.MMP-8 is a member of the matrix metalloproteinase family of proteases and is involved in the breakdown of extracellular matrix and cleavage of chemokines [30].
MMPs contribute to the pathogenesis of multiple inflammatory pulmonary conditions such as emphysema, COPD, bronchiectasis and acute respiratory distress syndrome through primarily increased neutrophilic activity [31][32][33][34].Hartog et al. [35] found elevated levels of MMP-8 in the bronchoalveolar lavage fluid of patients with hospital-acquired pneumonia, with levels correlated to clinical severity.Our findings add to Hartog et al.'s by demonstrating that baseline MMP-8 levels may predict future pneumonia.Potential mechanisms linking baseline levels of CRP, GDF-15 and MMP-8 to pneumonia risk require further study.
Our study also examined the association of clinical comorbidities and lifestyle factors with pneumonia susceptibility, and our finding that age, atrial fibrillation, chronic pulmonary disease, diabetes, heart failure, prior overnight hospitalization and smoking were associated with pneumonia risk is similar to that of previous studies [4,23,36] providing face validity to our results.In contrast to prior studies [4,37], we did not observe chronic renal disease, cancer, BMI, alcohol use and nursing home residence to be independently predictive of future pneumonia risk.
The adjustment for additional clinical covariates and potential confounders in our study as compared with prior studies, which may be partially collinear with some comorbidities, as well as the use of competing risk statistical models, may explain some of the differences between our results and prior studies of pneumonia risk factors.Despite studies showing an association between cognitive impairment and pneumonia [4,38], we interestingly did not find mini-mental status scores to be predictive of future pneumonia.Regardless, clinical comorbidities increase pneumonia susceptibility through a variety of mechanisms, in addition to chronic inflammation, that increase pathogen burden and impair immune responses.A more detailed description of links between clinical comorbidities and pneumonia risk can be found in Quinton et al. [3].
Our study has several limitations.Our current prediction model is not intended to be used for clinical decision-making, however it highlights a novel association between inflammatory markers and future pneumonia.We chose to study FHS participants because they are a cohort of well characterized, community-dwelling residents with available serum protein immunoassay measurements and linked CMS data; however, we acknowledge the lack of generalizability of our findings to other dissimilar populations.Thus, our findings of associations between individual immunoassays and pneumonia risk are hypothesis-generating and should be replicated in additional datasets prior to further exploration of potential mechanisms between these biomarkers and pneumonia susceptibility.Furthermore, because we used the most sensitive definition of pneumonia for analysis, we found biomarkers that predicted future pneumonia of various types (i.e., community-acquired, hospital-acquired, bacterial, viral, etc.).We also lacked data on the pneumonia-causing pathogens.Thus, future studies examining more specific definitions of pneumonia and causative pathogens might improve our understanding of the pathogenesis of different types of pneumonia.We also did not include the effect of vaccination, such as vaccination against pneumococcus or influenza, immunosuppressive medications, other infections or time-varying factors on pneumonia risk.In addition, studies have shown that the microbiome plays an important role in host immunity and that dysbiosis leads to impaired immune defenses [39] and potentially chronic inflammation.Future studies should explore .potential relationships between dysbiosis, inflammation and pneumonia risk.Missing data on specific biomarkers (e.g., IL-6), use of multiple imputation and multiple hypothesis tests may increase risk of both systematic and random bias.However, we used methods designed to minimize bias such as multiple imputation, and LASSO covariate selection, and multiple sensitivity analyses demonstrated robustness of our models.Finally, misclassification of pneumonia diagnoses by ICD-9-CM codes may increase risk of bias.
In conclusion, we found clinical and serum protein predictors of pneumonia risk.Our findings showing associations between biomarkers of inflammation and pneumonia support the hypothesis that pneumonia is, in part, a disease of persistent or increased susceptibility.The exact mechanisms by which aging, inflammation and clinical comorbidities increase pneumonia susceptibility remain fertile areas for further investigation and potential targets for novel interventions to reduce pneumonia risk. .

estimated 4
risk models: 1) a model including only clinical variables from the FHS clinical exam data; 2) a model including only serum protein immunoassay measurements; 3) a model including both clinical variables and serum protein immunoassay measurements and 4) an exploratory model including clinical variables and routinely available protein biomarker C-reactive protein

(
C-statistic of the model including only clinical variables was 0.67; C-statistic of the model including only serum protein immunoassays was 0.64; C-statistic of the model including clinical variables and serum protein immunoassays was 0.69; C-statistic of the exploratory model including clinical covariates and C-reactive protein was 0.68 (Table

Table 1 .
Baseline characteristics of participants (N=1,370)Definition of abbreviation: SD = standard deviation.Data presented as mean ± SD or n (%).

Table 2 .
Distribution of baseline characteristics by pneumonia outcome (N=1,370)Definition of abbreviation: SD = standard deviation.Data presented as mean ± SD or n (%).