Abstract
Rationale While determining normal variation of spirometric volumes, the geo-political construct of ‘Indian’ inadequately captures the diverse racial structure and varied lifestyles that exist for 1 in 5 people globally. It is necessary to determine the degree of racial heterogeneity and other underlying factors to know whether lower spirometric volumes of ‘Indians’ is normal or abnormal.
Objectives To investigate the differences in spirometric volumes and their associations with overall health parameters, for adolescent children and young adults, across geo-ethnic regions within India.
Methods Data was analyzed for 2338 healthy subjects aged 9-19 years with acceptable spirometry from SOLID cohort. Associations between lung function and potential determinants (Race, anthropometry, nutrition, family/life history, airway oscillometric parameters and systemic inflammation) were examined by using multiple regression. Latent profile analysis (LPA) was performed for 903 subjects to uncover hidden sub-phenotypes.
Results FEV1 and FVC varied significantly between Indian genetic ancestries, being highest in Tibeto-Burmans and lowest in Dravidians. After statistical adjustments, FEV1 and FVC were positively correlated with waist-height-ratio, shoulder-height-ratio, normalized BMI, and blood hemoglobin. Lower FVC was associated with lower expiratory flows (PEF and FEF25-75), higher lung reactance (X5), higher airway resistance (R10 and R15), gastro-intestinal symptoms, and higher inflammation (IL-8 and IL-17). A sub-phenotype of thinness, higher inflammation (IFN-gamma, IL-17, TNF-alpha, IL-8) and lower FVC was identified on LPA for 35% of the sample.
Conclusions Spirometric volumes in Indians are different between major racial subgroups but may also be abnormally low due to prevalent nutritional or environmental adversity. Care must be taken while establishing normative standards.
INTRODUCTION
Spirometry, the most commonly used pulmonary function test, has been through many standardization guidelines. Spirometry volumes are seen to vary widely between apparently healthy people, even after accounting for obvious anthropometric determinants like height, weight, and gender. Other than the ratio of forced expiratory volume in 1 s to forced vital capacity (FEV1 / FVC), where fixed cutoffs are possible even if not recommended, other parameters are absolute values that need to be interpreted against the predicted normal. This is statistically simple but physiologically complex. Differences in genetic backgrounds, environmental exposures and co-morbid conditions lead to variations that may be as large as those seen in disease. For example, South Asians have 30% lower median FEV1 or FVC than White Europeans or Americans (1), the lowest in the world; well within the range of pathological reduction for a White person. The solution so far has been to adjust the normal based on race, ethnicity or geography; imperfect surrogates for differences in nature and nurture (2). This creates further problems when these stereotypes are poorly thought out, as in the case of ‘Blacks’. Use of wrong normal values has associated problems of bias and justice, ranging from ineligibility for occupations with respiratory health thresholds when normal values are overestimated, to denial of medical care and disability relief when normal values are underestimated (3, 4). Here, we examine the concept of spirometric normal for ‘Indians’ – which is a geo-political construct, used as a general label for 1 in 5 people in the world, but failing to recognize genetic diversity second only to Africa and wide variations in living conditions (5). The Indian Genome Variation Database (IGVDb) has cataloged genetic variation within India and identified regions of relatively distinct genetic ancestries referred to as Indo-European (IE), Dravidian (DR) and Tibeto-Burman (TB), as well as areas of admixture (6). While there are many reports of Southern regions of India (DR ancestry) having lower lung function than Northern regions (IE and TB ancestry), this has never been tested in an equivalent nation-wide group of subjects (7). Heterogeneity of socio-economic backgrounds and environmental exposures further complicates matters (8, 9). It has been previously reported in the journal that children from rural and semi-urban India have poorer lung function than those in cities or their Indian counterparts in UK (10). Similarly, it has been shown that Indian-origin first generation Americans have better lung function than their parents, but worse than that of other Americans (11). To better understand and more finely resolve the heterogeneity of lung function in India, we studied adolescent children from a residential school system across India that caters to local populations and is relatively homogenous from a lower-middle socioeconomic class standpoint. We chose the second decade of life as our target age-group since this is when lung growth is maximal and adult lung function is reached. Also, diseases and co-morbidities other than asthma, which may affect lung function, are uncommon during this period. We focused on identifying ethnic, lifestyle and personal predictors, as well as relating spirometry measurements to lung oscillometry and other non-respiratory measures of health. We find that there is much complexity in ‘normal’ lung function of ‘Indians’ and rational non-discriminatory systems to assess general and respiratory health through spirometry are currently lacking, with ‘Indian’ or ‘South Asian’ being an inadequate but widely used label. Some of the results of these studies have been previously reported in the form of an abstract (12).
METHODS
Study Participants and Setting
Study participants were enrolled from 14 districts from a residential school system (Jawahar Navodaya Vidyalaya) run by an autonomous government organization (Navodaya Vidyalaya Samiti). Children in these boarding schools mostly belong to lower-middle socio-economic background and stay in a similar living arrangement. School visits were conducted to all the 14 sites and participants of Classes VI, IX, and XII (aged 9-19 years) were screened from 2017-2019. Subjects with asthma were removed from the analysis.
2609 subjects (1210 girls, 1399 boys) from 14 Indian districts were examined in SOLID cohort. Data was analyzed for 2338 subjects with acceptable spirometry with no asthma (Table 1). Oscillometry and blood investigations were performed in 1493 and 1582 subjects, respectively (Figure 1).
Approval of the human ethical committee of CSIR-Institute of genomics and integrative biology was obtained for the present study (Ref No: CSIR-IGIB/IHEC/2017-2018). Written Parental consent and verbal assent were taken for all the participants.
Lung Function Assessment and its Definition
Spirometry was done using portable spirometers (EasyOne air, NDD) as per the ATS guidelines (13). Airway oscillometry was performed on 1497 SOLID participants using forced oscillation technique (Pulmoscan, Cognita Labs). A dataset from 51 older subjects who underwent oscillometry on well-established MasterIOS with acceptable spirometry was used to verify oscillometric findings from this study.
Absolute values of spirometric indices were converted to z-scores using the reference equations from GLI (Global Lung Initiative) and Chhabra et al (14, 15). Reference equations were developed internally for the PulmoScan from the SOLID data, to calculate normalized predicted values for comparisons only within this cohort (supplementary S2). R5 measurements were excluded from the association analysis as its reference equations had unacceptably poor fit.
Blood Sample Collection and Biochemical Analysis
About 8 ml of whole blood was collected in vacutainer with no additives (Red top Vacutainers, BD). The Blood was allowed to clot and the supernatant was removed post centrifugation. Both the supernatant and blood pellets were stored at −80 degrees Celsius for further investigation. Cytokine levels were measured in the supernatant using a custom MILLIPLEX MAP Human Cytokine/Chemokine Magnetic Bead Panel (Merck Millipore) of 9 different cytokines. The prescribed standard assay protocol was used and the data was acquired on a magnetic bead compatible assay analyzer (Magpix, Luminex corp.). The data was analyzed using the xPONENT 3.1 software (Luminex corp.). MFI (Mean Fluorescence Intensity) values obtained were transformed to concentration (pg/mL) using a 5-parameter logarithmic standard curve. Cut-off from Mayo clinic laboratories were used to define subjects with “High” and “Normal” levels of these cytokines(16).
Anthropometry and Other Assessments
Height, weight, shoulder width and waist circumference were measured for the anthropometry. Z-scores of BMI was calculated using the World health organization (WHO) growth chart (17). A parental and subject questionnaire was administered having questions related to background, habits and symptoms in the SOLID cohort. Genetic ancestry was mapped at the level of geographic regions that have previously been mapped by the Indian Genome Variation Database (IGVdb) to correspond to distinct or mixed genetic ancestries denoted as Indo-European (IE), Dravidian (DR), Tibeto-Burman (TB), or mixed (6). It is noted that individuals were not mapped to specific ancestries, only the geographic regions. Since geo-ethnic, linguistic and genetic maps in India are broadly concordant, this approach provides useful information in resolving the contributors to variation in lung function across India, without making assumptions about individual participants based on appearance or names.
Statistical Analysis
Associations of lung functions were investigated using multiple (linear) regression models adjusted for covariates (age, sex, height, altitude and ethnicity). Association of each determinant was tested separately and not adjusted for the other potential lung function determinants. Cytokine levels were logarithmically transformed, using base e (2.71828), to measure their association with the lung function. A model-based clustering method (Latent profile analysis) was used to segregate 903 SOLID participants, with numerical data available for WHtR and cytokine levels, into groups based on their anthropometry and inflammatory profile. The method of LPA has already been explained extensively elsewhere(18). Interquartile (IQR) method was used to remove outliers for lung function, anthropometry and natural logarithm of cytokine levels for the cluster analysis. Values were removed if they were 1.5 times IQR below or above the first and third quartile, respectively. Pearson’s correlation coefficient was calculated to define the association amongst the model parameters and with zFVC. Listwise deletion was used to remove the missing values for a multivariate analysis. Latent profile analysis was performed using the “mclust” package (version 5.4.6) implemented in R(19). Bayesian Information Criteria (BIC) was used for the optimal model selection. An ellipsoidal model with equal orientation was selected consisting of two classes (Supplementary: Figure S4). Lung function and model parameters were compared between these two classes by independent t-test. R (version 3.6.1) was used for data analysis and visualization.
RESULTS
Data was analyzed for 2338 (1058 girls, 1280 boys) healthy subjects with no asthma and acceptable spirometry (Figure 1). Within this group, cytokine markers were measured in 1504 subjects and oscillometry data was available for 1202 subjects. As expected, boys had higher absolute values of height, weight, FVC, FEV1, PEF and FEF25-75 and lower BMI, compared to girls (Table 1). Age and height were similar across the groups. There were modest differences in weight and BMI across regions, with IE being the lowest and TB/mixed being the highest.
Variations in lung function based on ethnicity, habits and life-history
Table 2 summarizes the variability of lung function after adjustment for age, sex, height, altitude and ethnicity as covariates. TB group had the highest lung volumes (FVC and FEV1) followed by IE, after adjusting for all remaining covariates. DR group was characterized by the poorest lung function. Birth weight and non-vegetarian diet had no influence over the adolescent/adult lung volumes of the study population. Lower birth weight subjects however had slightly higher FEV1/FVC ratio. Subjects with family history of asthma and allergic diseases had lower FEV1/FVC ratio but similar lung volumes. Type of diet showed no association with the lung function, however skipping meals and chronic digestive health issues (gastrointestinal symptoms) were associated with a reduction in lung volumes.
Association of lung function with markers of growth and nutrition
Waist to height ratio (WHtR), as the marker of central adiposity, showed a positive association with lung volumes after adjustment for ethnicity and other covariates (Table 3). Subjects with broader shoulders proportional to height (SHtR) and a higher z-score of BMI for age showed improved lung volumes, but a lower FEV1/FVC ratio. Hemoglobin levels were also positively associated with the lung volumes, with moderately severe anemia being associated with significantly reduced lung function (Table 2). There were very few subjects with severe anemia but size of effect was similar to that of moderate anemia, although not reaching statistical significance.
Association of lung function with airflow and oscillometric parameters
Peak flow (PEF) and mid-expiratory flow (FEF25-75) was positively associated with both the lung volumes and FEV1/FVC ratio (Table 3). Percent predicted values of reactance (X5 % predicted) calculated from internal reference equations showed a negative association with all the three lung function parameters, such that low FEV1 or FVC or FEV1/FVC ratio corresponded to higher reactance. Similarly, percent predicted values of resistance at different frequencies (10, 15 and 20 Hz) were higher in subjects with lower FEV1, FVC and FEV1/FVC ratio. Since increased reactance and resistance may correspond to sub-clinical airway pathology, we additionally verified the association of lower FEV1 or FVC (normal FEV1/FVC ratio), with markers of sub-clinical airway pathology in PFT lab data from 51 apparently healthy adult Indian subjects who underwent oscillometry (Supplementary: Figure S1).
Systemic inflammation and lung function
In view of data suggesting potential association of lower FEV1 or FVC in these children with poorer general, digestive, or respiratory health, we additionally looked at associations with markers of systemic inflammation. Inflammatory studies showed a very high prevalence of clinically elevated levels of IL-8 and TNF-alpha, based on available normal ranges from western data (Supplementary: Figure S2). Many subjects also had “High” levels of IL-4, IL-17, and IFN-gamma. Prevalence of high IL-13, IL-6 and IL-1beta was very minimal. There are no specific reference values for India. IL-8 and IL-17 showed a negative association with FVC but not with FEV1 and FEV1/FVC.
Global reference equation and latent profile analysis
Given the associations of adiposity, digestive health, nutritional markers, and inflammation, with lung function – particularly FVC – we further determined if there was a profile associated with poorer lung function in India, which operates across ethnicities. Global z-scores of forced vital capacity (zFVC) were calculated for all subjects, independent of ethnicity, from GLI equations for Caucasian ethnic group. zFVCGLI (without any ethnic adjustment) also showed similar associations with central obesity and inflammatory parameters (Supplementary: Figure S3). zFVCGLI was positively associated with WHtR and negatively with IL-8, TNF-alpha, IFN-gamma and IL-17. Subjects with low lung function from Caucasian equation but normal according to Indian standard showed greater inflammation, small airway abnormalities and poor growth (Supplementary: Table S1).
Latent profile analysis was performed to find sub-classes within the zFVCGLI spectrum. Two potential sub-classes (C1 and C2) were identified (Table 4, Figure 2) using central obesity and inflammatory parameters. Subjects belonging to C1 were younger, had lower adiposity (WHtR), greater inflammation and more restrictive lung function. The inflammatory pattern of C1 was most strongly characterized by differences in IFN-gamma (2.6-fold elevation in mean absolute values versus C2), followed by IL-17 (1.8-fold), TNF-alpha (1.6-fold) and IL-8 (1.1-fold).
DISCUSSION
This study unequivocally demonstrates that there is no singular adjustment that can be made to determine normal lung function of Indians from global equations, nor is there a clearly definable ‘normal’ lung function that applies to all members of a geographic region known for racial diversity and extremely heterogenous living standards. To the best of our knowledge this is the first study from India that looks at adolescent to young adult spirometric lung function through a multi-prismatic lens combining anthropometry, ethno-genetics, life-history questionnaire, and general clinical markers of nutritional deficiency and inflammatory state. It is also the first study that looks at a nationally representative, but socio-economically homogenous group– members of a single government boarding school system that caters primarily to the majority lower-middle socioeconomic class. While our data does not reflect the upper socio-economic class, which is a limitation, the average Indian is well represented in this sample.
We show that there are clear racial differences in lung function, corresponding to well-known differences in genetic ancestry, with Dravidian Indians, having lower FEV1 and FVC after adjusting for all co-variates. This difference is most likely genetic and merits further exploration. We specifically note that the anthropometric parameters were better for this group, when compared to Indo-European group that had better lung function. While not included in the regression, this region has better air quality and higher socio-demographic indicators (SDI). We have not used regional SDI here since we selected for a similar socio-economic class. We did not introduce ambient air quality indicators for two reasons. First, air pollution exposure in Indian children of this socio-economic class is primarily through a life-history of indoor air pollution that is not quantifiable by ambient air quality measures or by simple questions. Second, it was clear that the lung function was higher in the Indo-Gangetic region of North India, including Delhi with possibly the worst air globally (not shown). Thus, ambient air quality was neither a good measure, nor a major determinant of lung function variations in our study. The Tibeto-Burman group had the best lung function. The reasons may be both genetic and environmental, since many of the regions with TB ancestry are at higher altitudes and the effects may be non-linear.
That ‘Indian’ normal lung function is different across its racial groups, was expected in view of existing global data as well as smaller normative lung function studies from different parts of India. There were two new hypothesis-generating findings in our study, however, that merit further debate and investigation. First, an association between low FVC and a phenotype of poor nutritional status and high inflammation. Second, an association between low FVC and sub-clinical airway pathology.
After adjusting for likely racial effects, we found that gastrointestinal symptoms, meal skipping, thinness, low BMI, vitamin deficiency symptoms, moderate to severe anemia, and elevated inflammation markers were all associated with lower FEV1 or FVC. A relatively well-defined endotype of poor lung function, low waist-height ratio and elevated IFN-gamma, IL-17, TNF-alpha and IL-8 was seen in one-third of the subjects, with membership across all ethnicities. This suggests that while there is a racial component to lower lung function in some Indians, especially those with Dravidian ancestry, a large fraction of Indians may not be reaching their genetic potential due to nutritional or environmental adversity. Since the study subjects are unlikely to be calorically limited, other possibilities to be investigated are low nutritional quality or gut dysbiosis leading to nutritional steal and enteropathy. We speculate that since the elevated cytokines are well-known mediators of gut immunity, specifically neutrophil mediated killing of microbes, this represents prevalent environmental enteropathy, a poorly recognized condition that is seen in low-middle income tropical countries and is thought to be driven by poor sanitation and recurring feco-oral exposures (20). If so, the route to better lung function for Indians may lie through better sanitation and nutrition. This fits the previous report in the journal that children from rural and semi-urban India have poorer lung function than those in cities or their Indian counterparts in UK (10). This also would explain why Indian-origin first generation Americans have better lung function than their parents (11).
In the SOLID cohort we found that low FEV1 or FVC, with normal FEV1/FVC ratio, was correlated with increased resistance and reactance. These may be signs of early small airway pathology, but these were field studies with a new portable oscillometer in an age-group not expected to have airway disease. Therefore, we decided to examine this further in older adults, using a combination of clinical grade oscillometry and nitrogen breath washout in a PFT lab setting. Lower FVC was associated with statistically and clinically significant elevations of X5 and R5-R20, suggesting that there may be prevalent sub-clinical small airway disease in Indians with spirometric lung function that is apparently normal. This is consistent with growing recognition of Preserved Ratio Impaired Spirometry (PRISm) as a transitional state for some population sub-groups that may foretell progression to clinical airway disease and increased all-cause mortality (21). This also fits the higher age-adjusted prevalence of COPD in India, as described recently in the Global Burden of Disease (GBD) investigator group led study of chronic respiratory diseases and their state level heterogeneity (22). Interestingly, even within India, COPD risk was highest in states with lowest socio-demographic development levels. Strong associations between low lung function and high cardio-respiratory disease risk have been reported starting from the Framingham cohort (23), which may be explained by systemic inflammation as a common link. This also highlights the potential of spirometry as a cheap precision health tool (24).
In summary, we report evidence that there is a racial sub-structure to normal spirometric lung function values in India. For those with Dravidian ancestry, normal values may be lower than for others. While India has not yet seen much genetic mixing between endogamous geographic regions, genetic measures of ancestry would be preferable to ethnic identities going forward. We also report that at least one-third of adolescents and young adults are likely to be not reaching their genetic potential due to adverse nutrition and environment.
CONCLUSION
Spirometric volumes in Indians are different between major racial subgroups but may also be abnormally low due to prevalent nutritional or environmental adversity. Based on our data, respiratory disease would be underdiagnosed or underestimated in Tibeto-Burmans, but over-diagnosed or overestimated in Dravidians by a common yardstick. Care must be taken while establishing normative standards for ‘Indians’ and studies including deep phenotyping and genetic markers are required to dissect this further.
Data Availability
Data can be provided on request on contacting the corresponding author. The data could not be provide publicly since it contain extensive individual level information that could compromise anonymity.