Cluster Analysis in Patients with GOLD 1 Chronic Obstructive Pulmonary Disease

Background We hypothesized that heterogeneity exists within the Global Initiative for Chronic Obstructive Lung Disease (GOLD) 1 spirometric category and that different subgroups could be identified within this GOLD category. Methods Pre-randomization study participants from two clinical trials were symptomatic/asymptomatic GOLD 1 chronic obstructive pulmonary disease (COPD) patients and healthy controls. A hierarchical cluster analysis used pre-randomization demographics, symptom scores, lung function, peak exercise response and daily physical activity levels to derive population subgroups. Results Considerable heterogeneity existed for clinical variables among patients with GOLD 1 COPD. All parameters, except forced expiratory volume in 1 second (FEV1)/forced vital capacity (FVC), had considerable overlap between GOLD 1 COPD and controls. Three-clusters were identified: cluster I (18 [15%] COPD patients; 105 [85%] controls); cluster II (45 [80%] COPD patients; 11 [20%] controls); and cluster III (22 [92%] COPD patients; 2 [8%] controls). Apart from reduced diffusion capacity and lower baseline dyspnea index versus controls, cluster I COPD patients had otherwise preserved lung volumes, exercise capacity and physical activity levels. Cluster II COPD patients had a higher smoking history and greater hyperinflation versus cluster I COPD patients. Cluster III COPD patients had reduced physical activity versus controls and clusters I and II COPD patients, and lower FEV1/FVC versus clusters I and II COPD patients. Conclusions The results emphasize heterogeneity within GOLD 1 COPD, supporting an individualized therapeutic approach to patients. Trial registration www.clinicaltrials.gov. NCT01360788 and NCT01072396.


Introduction
According to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) [1] spirometric classification, mild airflow obstruction is defined by a post-bronchodilator forced expired volume in 1 second (FEV 1 ) to forced vital capacity (FVC) ratio at a fixed cut-off of <0.70 and an FEV 1 80% predicted [2]. Although this grading severity system has proved to be of value in the assessment of chronic obstructive pulmonary disease (COPD), it is a simplistic approach, poorly representing the complexity of COPD [3].
According to the Burden of Obstructive Lung Disease (BOLD) study [4], which used the 2006 GOLD consensus report [5], patients with mild COPD represent nearly 45% of patients with COPD, the remainder being GOLD stage 2 to 4. Paradoxically, there is limited information on patients with mild COPD even though they represent a large portion of patients with COPD. While the latest GOLD statement places emphasis on a more broad assessment of the disease [2], there is still a need to refine the GOLD classification, to avoid misclassification of patients with mild COPD. Accordingly, phenotypes could be one promising approach to the clinical heterogeneity of COPD [6]; potentially helping to identify a better type of approach to use for patients with mild disease.
We took advantage of a cohort of symptomatic and asymptomatic patients with mild COPD to explore possible heterogeneity in GOLD 1 COPD and to evaluate whether different subtypes of patients could be identified within this GOLD category. We used cluster analysis to divide our population into subgroups (clusters) according to the clinical parameters included in the study. The participants were characterized in five different domains: 1) baseline characteristics; 2) symptoms; 3) baseline lung function; 4) peak exercise response; and 5) levels of physical activity (steps/day, daily energy expenditure >3 metabolic equivalents [METs], daily time >3 METs). Since the clinical significance of this relatively new category of patients with mild COPD has been questioned [7][8][9], we also included healthy control subjects in the cluster analysis to investigate how the GOLD 1 patients would be differentiated. Based on the notion that considerable heterogeneity exists within GOLD 2 to 4 COPD [10], we hypothesized that a similar phenomenon would be seen within the GOLD 1 category and that different clinical phenotypes could be identified.

Study Design and Subjects
Data for this study were obtained, pre-randomization, from a single-center study, aimed at characterizing mild COPD and its exercise response to bronchodilation (ClinicalTrials.gov identifier: NCT01360788) and a multicenter clinical study involving 14 investigation sites, aimed at evaluating exercise response to bronchodilation in mild-to-moderate COPD (NCT01072396). The protocols for the individual trials are available in S1 Protocol  [11]. A total of 85 patients meeting the GOLD 1 COPD spirometric classification criteria (postbronchodilator FEV 1 80% predicted and FEV 1 /FVC <0.70) [2] and a smoking history 10 pack-years were included in the study; 118 healthy subjects with normal spirometry (FEV 1 >80% predicted and FEV 1 /FVC 0.7) served as controls. All subjects must have been in a stable condition for at least 6 weeks before study enrolment. Patients with COPD treated with short or long-acting β-adrenergic bronchodilators were asked to withdraw from their medication from 8 and 36 hours prior to the visit, respectively; similarly, short or long-acting anticholinergic bronchodilators were discontinued 8 hours and 2 weeks prior to the visit, respectively. This was done in order to avoid any confounding effects on exercise testing or pulmonary function. In all groups, subjects were excluded if they presented with any medical condition, other than COPD, likely to influence exercise testing as well as participation in physical activities of daily life (i.e. cardiovascular, neurological, musculoskeletal, locomotor or other respiratory diseases as well as β-blocker therapy).

Symptoms
The baseline dyspnea index (BDI) [12] was used to quantify the degree of dyspnea on a scale of 0 to 12, where a lower score denotes worse severity. Cough was considered present when occurring daily for 3 months per year, during at least 2 consecutive years.

Pulmonary Function Testing
Standard pulmonary function tests, including spirometry, lung volumes (by plethysmography) and diffusion capacity (DLCO) were obtained according to previously described guidelines [13] and related to predicted normal values [14]. The FEV 1 /FVC ratio was compared with the lower limit of the normal (LLN) range according to the National Health and Nutrition Examination Survey (NHANES) III predicted values [15]. The predicted value for inspiratory capacity (IC) was obtained by subtracting the functional residual capacity (FRC) predicted value from the total lung capacity (TLC) predicted value. Maximum voluntary ventilation (MVV) was estimated by multiplying FEV 1 by 35 [16].

Exercise Testing
Peak exercise capacity was determined using a walking exercise test, either an incremental shuttle walk test (ISWT) (NCT01360788) or an incremental treadmill exercise test (NCT01072396).
Incremental shuttle walking test. The originally described test [17], was modified to add three additional speed steps in order to reach symptom limitation in all participants [18]. Subjects were allowed to run in order to attain maximal exercise capacity. During the ISWT, subjects breathed through a facemask, connected to a portable gas exchange analyzer (Oxycon Mobile, Viasys Healthcare, Jaeger, Germany), which measured oxygen consumption (V 0 O 2 ), carbon dioxide output (V 0 CO 2 ) and minute ventilation (V 0 E). Dyspnea and leg fatigue Borg scores [19] were obtained at baseline and at end of exercise; with higher scores indicating worse severity. Finally, the locus of symptom limitation was determined by asking whether participants stopped exercise because of dyspnea/leg fatigue/both or for another reason.
Incremental treadmill exercise test. The incremental treadmill test was performed in a ramp-fashion adapted from the protocol established by Porszasz et al. [20], with a 10 W•min -1 and a 15 W•min -1 increase for patients with GOLD 1 COPD and control subjects, respectively. As for the ISWT, subjects were connected to a gas exchange analyzer using a mouthpiece and a nose clip. Finally, the same procedure as for the ISWT was implemented for effort perception and locus of symptom limitation.

Levels of Physical Activity
Physical activity in daily life was monitored during 7 to 14 consecutive days via a monitor (Sen-seWear ArmBand, BodyMedia Inc., Pittsburgh, PA, USA), which was worn on the right upper arm for at least 12 hours per day. This device produced estimates of the steps taken per day, as well as daily time and energy expenditure associated with at least moderate intensity (>3 METs). We report the mean daily values over the period of measure.

Cluster Analysis
Hierarchical cluster analysis was used to define homogeneous groups of individuals based on given parameters [21]. This analysis was performed using Ward's minimum-variance method and distances between individuals were measured in the metric of the pooled within-cluster covariance matrix as proposed by Art and colleagues [22]. The analysis results in groups (clusters) of members who share strong associations, while these associations are weak between members of different clusters [23]. Hierarchical clustering methods first assigned each individual to their own cluster. Then the most similar pairs of clusters (in terms of the chosen distance metric) were merged into a new cluster, so that there was one less cluster. The iteration process continued by merging the next two similar clusters, or new clusters, until all individuals could be included in a cluster. The parameters included in the analysis are shown in Table 1. The number of clusters was determined by using three statistics (pseudo F statistic, pseudo t 2 statistic and cubic clustering criterion), which performed best in the simulation study of Milligan and Cooper [24].

Ethics Statement
The parent clinical trials, from which data were obtained for this cluster analysis study, were carried out in compliance with the approved protocols, the principles laid down in the

Statistical Analysis
Results obtained in all patients with GOLD 1 COPD and controls were first shown as frequency distributions and compared between the two groups using Pearson's Chi-squared statistic tests. Second, comparisons were made between the clusters, which were identified through the cluster analysis. Quantitative variables, expressed as mean ± standard deviation (SD), were compared among clusters using an analysis of variance (ANOVA) model. Following a significant finding, Tukey's post hoc multiple comparisons technique was used to compare each cluster with the other clusters. Qualitative variables, expressed as percentages, were compared among clusters using Pearson's chi-squared statistic test. All analyses were done at the level of significance of p<0.05.

Heterogeneity in GOLD 1 COPD
The frequency distributions for pulmonary function, peak V 0 O 2 , BDI score and physical activity are provided in Fig 1. FEV 1 % predicted, FEV 1 /FVC and DLCO % predicted were lower while TLC, FRC and reserve volume (RV) were higher in patients with GOLD 1 COPD compared with controls (all p<0.001). For all these variables, with the exception of FEV 1 /FVC ratio, a considerable degree of overlap between GOLD 1 COPD and controls was seen. Compared with controls, peak V 0 O 2 was lower on average by 15% in patients with COPD, who also expressed a lower BDI score. The number of steps per day tended to be reduced in COPD compared with control (p = 0.09). No difference was observed for daily time spent at physical activity >3 METs between the two groups (p = 0.47).

Cluster Analysis
We obtained a three-cluster solution, which best fitted the parameters and subjects included in the study; this decision was based on local peaks of the cubic clustering criterion and pseudo F statistic combined with a small value of the pseudo t 2 statistic and a larger value for the next cluster fusion (Fig 2a) [24]. This was also in accordance with the dendrogram issued from the hierarchical Ward's clustering method (Fig 2b) [25]. Cluster I included 105 controls and 18 patients with GOLD 1 COPD, while clusters II and III were mostly composed of patients with GOLD 1 COPD (Fig 3). The characteristics of the patients with COPD in the three clusters, excluding the controls from this analysis, are presented in Table 2. Patients in the three clusters had a similar body mass index and sex distribution; patients in cluster III were older than those in cluster II (p = 0.03). Smoking history was significantly higher in patients belonging to cluster II than cluster I (p = 0.002). Prevalence of cough and dyspnea BDI scores was similar across the clusters (Table 2). Pulmonary function data are provided in Table 2 01; Fig 4). The three clusters of patients with COPD had similar FEV 1 (Fig 4), but FRC and RV were significantly increased in cluster II compared with cluster I (p = 0.01 and p = 0.04, respectively). Cluster II also tended to display a lower IC/TLC ratio compared with cluster I (0.43 ± 0.08 versus 0.48 ± 0.07; p = 0.07). Finally, cluster III was differentiated from clusters I and II by a significantly lower FEV 1 /FVC ratio (p = 0.002 and p = 0.008, respectively).
Although patients in the three clusters had similar peak V 0 O 2 , only patients in clusters II and III showed a significantly reduced peak V 0 O 2 compared with controls (p<0.01; Fig 5). The number of step per day and amounts of physical activity with energy expenditure >3 METs was significantly reduced in patients in cluster III compared with controls (p<0.01) and patients in clusters I (p<0.001) and II (p<0.001). As indicated in Table 3, patients belonging to cluster III differed from those of cluster II on the basis of a significantly higher V 0 E/MVV ratio at peak exercise (p<0.001). When compared with patients in clusters I and II, patients in Cluster III had significantly higher V 0 E/V 0 O 2 ratio (p = 0.005 and p = 0.05, respectively), higher respiratory exchange ratio at peak exercise (p = 0.01 and p<0.001, respectively) and were mainly limited by dyspnea (p = 0.02; Table 3).

Discussion
This study highlights heterogeneity in the clinical manifestations of GOLD 1 COPD, as defined by the 2014 GOLD consensus report [1]. Three clusters of patients with GOLD 1 COPD could be identified: cluster I was characterized by reduced DLCO and decreased BDI dyspnea scores (compared with controls) with preserved lung volumes, exercise capacity and physical activity levels; cluster II showed more prominent static hyperinflation (FRC) and gas trapping (RV) but preserved levels of physical activity; and cluster III exhibited marked reduction in physical activity levels and higher V 0 E/MVV ratio, V 0 E/V 0 O 2 and respiratory exchange ratio at peak exercise.
Heterogeneity in the clinical manifestations of COPD has been highlighted in patients involved in the ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints) cohort [10]. The present study extends these results by showing a similar    phenomenon within the GOLD 1 COPD category. One potential implication of these findings is that all patients within the GOLD 1 COPD category should not be considered as having the same disease. Some of them (cluster I) may exhibit preserved functional capacity and physical activity levels despite evidence of airflow obstruction. It could be argued that patients belonging to cluster I were actually healthy subjects who were misclassified based on a fixed FEV 1 /FVC ratio [26]. This issue of misclassification is supported by the fact that the FEV 1 /FVC ratio was >LLN in 39% of subjects. Conversely, decreased BDI dyspnea scores and reduced DLCO found in this cluster would argue that these patients were nevertheless showing some pathophysiological features of COPD. Clearly, the delineation between healthy smokers and mild COPD is not necessarily perfect, illustrating that the differentiation between health and disease is likely to be a continuum. However, by combining both healthy controls and patients with GOLD 1 COPD in a cluster analysis we have illustrated that the majority of GOLD 1 patients (67/85, 79%) stood out as being different from most healthy controls, which supports the notion that these individuals exhibit clinical features of a "true" disease. Patients with GOLD 1 COPD included in cluster II were mostly characterized by a substantial smoking history (>50 pack-years) and, from a physiological standpoint, by FRC and RV >120% predicted and reduced exercise capacity. Surprisingly, the levels of physical activity were still preserved in cluster II. It is also interesting to consider that vital capacity was preserved in these individuals despite static hyperinflation and gas trapping. This finding in patients with mild airflow obstruction has been previously reported in cross-sectional studies and may have important implications in terms of maintaining ventilatory capacity during exercise [27][28][29][30].
Patients with mild COPD in cluster III were mainly characterized by a lower FEV 1 /FVC ratio, reduced exercise capacity and striking reduction in the level of physical activity compared with the other clusters. Reduced physical activity level has already been reported in patients with GOLD 1 COPD [31,32]. Our data add to the existing literature by showing that this reduction in physical activity may be occurring only in a subset of patients with GOLD 1 COPD. Considering the strong negative prognostic implications of low physical activity levels in COPD [33,34], our analysis may have identified a category of mild COPD that is at higher risk of poor outcomes. Taking into account that increasing physical activity represents an important objective of pulmonary rehabilitation [35], our results support a possible role of this intervention in mild COPD, particularly when FEV 1 /FVC is low. Interestingly, this profound reduction in physical activity level seen in cluster III in comparison with the other two clusters of patients with COPD was present despite similar peak V 0 O 2 . This dissociation between peak exercise capacity and level of physical activity is important because it illustrates that these two parameters are assessing different concepts and that although preserved peak exercise capacity is permissive to physical activity, it does not guarantee an active lifestyle [36]. We do not have a clear explanation for the reduced level of physical activity in cluster III. These patients had a higher erosion of the ventilatory reserve at peak exercise [37] in comparison with the other clusters and they were mostly limited by dyspnea at peak exercise. In the face of a similar V 0 E/ V 0 CO 2 , the higher V 0 E/V 0 O 2 and respiratory exchange ratio observed at peak exercise in these patients may reflect greater metabolic acidosis, perhaps due to higher reliance of the limb muscles on glycolytic metabolism. Although we can only speculate on this issue, it is interesting to consider that evidence of limb muscle dysfunction has been reported in patients with mild COPD [38,39]. Being more physically inactive, this subset of patients may be at a greater risk of developing limb muscle dysfunction.
In this study, we used the GOLD classification [1] to stratify our patients because of its wide clinical application. However, we appreciate the fact that any attempt to categorize disease severity based on FEV 1 cut-offs is arbitrary in nature and that, in fact, COPD severity is a continuum. Stratifying patients into subcategories is particularly useful when it helps in disease prognostication or in individualizing clinical management. We acknowledge that we have not reached this goal with the current study. The main purpose of the present cluster analysis was to highlight heterogeneity in GOLD 1 COPD patients; an information potentially useful for future studies in this specific patient population. Our results emphasize that the clinical manifestations of COPD are heterogeneous, even within the same GOLD severity category, and that the evaluation of a patient should not rely solely on FEV 1 . We appreciate that respiratory symptoms were measured only once and that they may fluctuate over time [40]. In order to avoid potential misclassifications for respiratory symptoms, all participants were studied in a stable condition. Given the majority of men in our study, caution should be taken before applying the findings to women with mild COPD. One further potential limitation was that exacerbations were not systematically recorded in this population. Patients involved in study NCT01360788 did not report any exacerbation in the year preceding their involvement in the study, whereas patients involved in study NCT01072396 had to be stable for 6 weeks before the trial. Therefore, we are confident that exacerbation was not a major issue in this population and that this information would not have had a substantial impact on the outcomes of the cluster analysis. It is acknowledged that the sample size for this study is relatively small, since larger sample sizes are not available. However, this is the first time that such a group of patients with GOLD 1 COPD has been thoroughly investigated, and the results should be followed up with a larger sample size when available.
Patient data used in this study were pooled from two clinical trials. Some patients were initially identified through a lung cancer screening study, during which spirometry was performed, and when they had completed their participation in the study, they were referred on for participation in NCT01360788. Patients in NCT01072396 were recruited through respirology clinics in order to evaluate the exercise response to bronchodilation in mild-to-moderate COPD. The resultant patient population for our cluster analysis included a mixture of asymptomatic and symptomatic patients; this is reflected in the heterogeneity that was found in this population. How truly representative this cohort is of the entire GOLD 1 COPD population is difficult to assess but we nevertheless believe that we covered a spectrum of the GOLD 1 COPD population.