COPD-diagnosis is confirmed by post-bronchodilator (BD) spirometry. However, epidemiological studies often rely on pre-BD spirometry, self-reports, or medical records. This population-based study aims to determine COPD-prevalence based on four different operational definitions and their level of agreement, and to compare associations between COPD-definitions and risk factors.
COPD-prevalence in 1,793 adults from the general Dutch population (aged 18–70 years) was assessed based on self-reported data, Electronic Medical Records (EMR), and post-BD spirometry: using the FEV1/FVC below the lower limit of normal (LLN) and GOLD fixed cut-off (FEV1/FVC <0.70). Using spirometry as a reference, sensitivity was calculated for self-reported and EMR-based COPD. Associations between COPD and known risk factors were assessed with logistic regression. Data were collected as part of the cross-sectional VGO study (Livestock Farming and Neighboring Residents’ Health Study).
The highest prevalence was found based on spirometry (GOLD: 10.9%, LLN: 5.9%), followed by self-report (4.6%) and EMR (2.9%). Self-reported or EMR-based COPD identified less than 30% of all COPD-cases based on spirometry. The direction of association between known risk factors and COPD was similar across the four definitions, however, magnitude and significance varied. Especially indicators of allergy were more strongly associated with self-reported COPD compared to the other definitions.
COPD-prevalence varied depending on the used definition. A substantial number of subjects with spirometry-based COPD cannot be identified with questionnaires or medical records which can cause underestimation of COPD-prevalence. The influence of the different COPD-definitions on associations with known risk factors was limited.
Citation: Borlée F, Yzermans CJ, Krop E, Aalders B, Rooijackers J, Zock J-P, et al. (2017) Spirometry, questionnaire and electronic medical record based COPD in a population survey: Comparing prevalence, level of agreement and associations with potential risk factors. PLoS ONE 12(3): e0171494. https://doi.org/10.1371/journal.pone.0171494
Editor: Christophe Leroyer, Universite de Bretagne Occidentale, FRANCE
Received: October 25, 2016; Accepted: January 21, 2017; Published: March 8, 2017
Copyright: © 2017 Borlée et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: In consultation with the Medical Ethical Committee that approved the study protocol, data from the VGO study are not publicly available due to privacy protection of participants. The study’s privacy regulations stated that only researchers from NIVEL, IRAS, and RIVM (consortium partners) have access to the study database. Sharing an anonymized and de-identified dataset is not possible as it would still contain Electronical Medical Records and the personal data of participants, which could potentially lead to the identification of subjects. Researchers may reach a privacy agreement to access the data by contacting Prof. Dr. Dick Heederik (firstname.lastname@example.org) or Dr. L.A.M. Smit (email@example.com).
Funding: The Livestock Farming and Neighbouring Residents’ Health (VGO) study was funded by the Ministry of Health, Welfare and Sports and the Ministry of Economic Affairs of The Netherlands. LAMS received also a grant from the Lung Foundation Netherlands (Grant number: 3.2.11.022) (www.longfonds.nl/) to support this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Chronic obstructive pulmonary disease (COPD) is a leading cause of mortality and morbidity worldwide and expected to increase in the coming decades . Epidemiological studies estimating COPD prevalence show remarkable variation due to differences in measurement methodology . Halbert et al. conducted a meta-analysis to quantify the global prevalence of COPD . Objective definitions based on spirometry tended to produce higher prevalence estimates than patient reported diagnosis and physician diagnosis (9.2% versus 4.9% versus 5.2%, respectively). This likely reflects the underestimation and under-diagnosis of the disease prevalence . COPD based on post-bronchodilator (BD) spirometry is therefore preferred in epidemiological studies and very common. Objective measurements are also preferred because they are not influenced by symptom-perception, recall-bias and access to health care . However, the advantage of self-reports or medical records are the relatively low costs, allowing large sample sizes and “big data” analysis.
Studies comparing COPD-prevalence based on different data sources in the same population also found that the definitions used to assess COPD greatly influence prevalence estimates [5–10]. A study from de Marco et al. showed that the effect of risk factors for the development of COPD, such as gender, age and Body Mass Index (BMI), may also depend on the definition used . However, most of these studies were conducted in patient populations [7,9,10]. In the few studies that compared COPD-definitions in the general population, only pre-BD spirometry results were available [5,6,11]. To our knowledge, this is the first population-based study that compares post-BD spirometry-based COPD with COPD-prevalence based on other data sources.
For spirometry-based COPD, the recommended cut-off for the Forced Expiratory Volume in 1 second (FEV1)/ Forced Vital Capacity (FVC) is the lower limit of normal (LLN) based on the Global Lung Initiative-2012 (GLI) reference equations that take into account sex, age, and height [12,13]. Another commonly used cut-off point for COPD is the ratio between post-BD FEV1 and FVC <0.70 (Global Initiative for Chronic Obstructive Lung Disease (GOLD)) . This GOLD-definition has been criticized since the FEV1/FVC ratio generally decreases with age which results in over-diagnosis in elderly and under-diagnosis in younger people [14,15].
A comparison of different definitions for determining COPD-prevalence will give more insight into the possible effects of using various COPD-definitions on prevalence estimates and their associations with potential risk factors.
The objectives of this study are: 1) to compare COPD-prevalence and the level of agreement based on four different operational definitions: self-reported COPD, COPD based on general practitioners’ (GP) Electronic Medical Records (EMR) and COPD based on post-BD spirometry: LLN and GOLD-definition, 2) to compare associations between COPD (four operational definitions) and potential risk factors and severity measures and 3) to analyze COPD-prevalence based on pre-BD spirometry and to assess whether associations with potential risk factors are different from COPD based on post-BD spirometry.
Materials and methods
Data of the present study are derived from the cross-sectional VGO study (Dutch acronym for Livestock Farming and Neighboring Residents’ Health), which aims to investigate health of residents living in the vicinity of livestock farms. In 2012, a questionnaire survey was conducted among 14,163 adults from the general population (aged 18–70 years) in the south of the Netherlands. Recruitment and inclusion criteria have been described previously by Borlée et al.. Questionnaire participants who gave consent for further contact for a follow-up study, and who were not working or living on a farm were eligible for a medical survey (n = 8,714). Based on their home addresses, twelve temporary research centers were established. Between March 2014 and February 2015, all participants living within a distance of approximately 10 km of a temporary research center (n = 7,180) were invited to the nearest research center for medical examination which resulted in 2,494 participants (response 34.7%). The medical examination consisted of a second and more extended questionnaire, length and weight measurements, a lung function measurement (pre- and post-BD spirometry), collection of serum, EDTA-blood, nasal and buccal cells, and a nasal swab. In addition, fecal samples were taken by the participants at home and sent to the laboratory by mail. In this study we conducted analyses on subjects with a pre- and post-BD measurement with a sufficient quality (quality C or better), with good quality EMR available and with non-missing self-reported COPD (n = 1,793) (see Fig 1 for a flow chart of the study population).
The VGO study protocol was approved by the Medical Ethical Committee of the University Medical Centre Utrecht (protocol number 13/533). All 2,494 subjects signed informed consent. Patients’ privacy was ensured by keeping medical information and address records separated at all times by using a Trusted Third Party.
Data sources and COPD-definitions
Self-reported COPD was defined as a positive answer to the question: ‘Have you ever been told by a doctor that you had chronic obstructive pulmonary disease or emphysema?’ Questionnaire data on respiratory health was assessed with the first questionnaire collected in November 2012 as described previously . This was a two-page questionnaire with questions on respiratory health adopted from the European Community Respiratory Health Survey-III (ECRHS-III) screening questionnaire .
Electronic Medical Records.
EMR-based COPD was defined as: ICPC-code R91 (Chronic bronchitis) or R95 (Emphysema/COPD) recorded in 2010–2012. EMR data were available through the GPs who all participated in the Netherlands Institute for Health Services Research (NIVEL) Primary Care Database . The practice had to meet the following EMR quality criteria: 1) record diagnostic information in the patients’ EMR using the International Classification of Primary Care ICPC (4), 2) assign ICPC-codes to at least 70% of the morbidity records, and 3) record morbidity data at least 46 weeks per year. In addition, patients had to be registered at the GP for at least three-quarters of the year 2012. All subjects included in the data analysis gave written permission to link their study data to their EMR.
Two COPD-definitions were used based on spirometry data 1) a post-BD measurement of FEV1/FVC below the LLN, and 2) a post-BD measurement of FEV1/FVC <0.70 (GOLD). LLN was calculated with GLI-reference values based on age, gender and height . Pre- and post-BD spirometry was conducted according to European Respiratory Society (ERS) guidelines and the European Community Respiratory Health Survey III (ECRHS-III) . Participants stopped using inhalers and oral lung medication 4 and 8 hours prior to the lung function test, respectively. An EasyOne Spirometer (NDD Medical Technologies, Inc.) was used which measures flow and volume by ultra-sound transit time. After the pre-BD test, four puffs of short-acting beta-agonist (salbutamol, 100 μg per puff) were administered to the participant using a standard spacer. The post-BD measurement was performed at least 15 minutes after the lastly administered puff. To increase the quality of the spirometry data, we attempted to obtain four acceptable spirograms (pre- and post) per subject. The quality of all lung function curves were manually reviewed in NDD software by a specialist. The three best curves were selected or ranked manually when the best curves that were chosen by the NDD software program were not the best curves based on predefined ATS/ERS and NDD criteria . In total 97.8% of the participants who conducted a lung function test had a pre- and/or post-BD measurement with a quality of C or higher (quality C: at least two reproducible curves or reproducibility within 200 ml) (N = 2,322/2,375, respectively see Fig 1).
Potential risk factors and severity measures of COPD
Patient characteristics and severity measures of COPD were collected with the second, more extended, questionnaire which subjects completed before the medical examination. The questionnaire comprised amongst others items on symptoms and diseases, smoking habits, education and profession. Body Mass Index (BMI, kg/m2) was assessed during the medical examination. Atopy was defined as the presence of specific serum IgE antibodies to one or more common allergens and/or a total IgE higher than 100 IU/ml. Specific IgE to common allergens (house dust mite, grass, cat and dog) and total IgE levels were determined in serum with enzyme immunoassays as described before . To gain more insight into asthma-COPD overlap, we included self-reported current asthma as a potential risk factor. Self-reported current asthma was defined as: self-reported ever asthma AND either one or more asthma-like symptoms (wheezing/whistling in the chest, chest tightness, shortness of breath at rest/following strenuous activity/at night-time or asthma attacks) or use of inhaled or oral medication for breathing problems in the last year (described before by de Marco et al.. Three severity measures for COPD were computed for all participants: GOLD grades , self-reported health status, and the Clinical COPD Questionnaire (CCQ)-score . The CCQ-score is developed to identify activity limitations and emotional dysfunction of COPD patients.
We conducted a detailed non-response analysis in order to detect possible selection bias. Characteristics of different population subsets were compared (see Fig 1 and Table 1). The likelihood of agreeing to follow-up, or being a participant was modeled for different characteristics with logistic regression and adjusted for age, gender and smoking habits. In order to study the effect of potential selection bias, we compared the association between self-reported COPD and risk factors among different populations subsets (see S1 Table.).
Agreement between the presence of COPD based on the three different data sources was determined by calculating Cohen’s kappa. Using the results of the post-BD spirometry as reference standards for COPD, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) for self-reported and EMR-reported COPD were calculated.
The association between each potential risk factor or severity measure with COPD was assessed by means of multiple logistic regression analysis. All analyses were adjusted for age (as a continuous variable), gender and smoking habits. To include both the qualitative effect of smoking status and the quantitative effect of smoking exposure, we included ever smoking and pack-years of smoking together as confounders . Sensitivity analyses were conducted: 1) with COPD based on pre-BD measurements; 2) on subjects aged ≥40 years, since COPD diagnosis is more reliable in older patients [25,26]; and 3) after excluding subjects with self-reported current asthma.
Data were analyzed using SAS 9.4 (SAS Institute Inc., Cary, NC, USA).
Subjects who agreed to be contacted for a follow-up study were slightly older (mean age 51.1 vs. 49.8 years), were more often former smokers (38.8% vs. 31.4%) and had more often asthma (both self-reported and EMR-based asthma) compared to subjects who disagreed (Table 1). Subjects who participated in the medical examination were older (mean age 54.7 vs. 49.1 years), more often female (54.6% vs. 52.2%) and more often former smokers (44.6% vs. 35.7%) compared to subjects who were invited but did not participate. Selection bias did not seem to affect associations between potential risk factors and self-reported COPD in different population subsets (S1 Table).
COPD-prevalence and the level of agreement
The highest COPD-prevalence was found based on spirometry using the GOLD-definition (10.9%), followed by LLN-definition (5.9%), self-report (4.6%) and EMR (2.9%) (see Table 2).
In total 243 COPD cases were ascertained by at least one definition. The overlap between COPD-prevalence based on the four different definitions was low (see Fig 2). Only 9.1% of COPD cases (n = 22) had COPD according to all four definitions. A substantial part (59.7%) was only ascertained by spirometry: 27.6% by both the LLN and the GOLD-definition, 31.3% by the GOLD-definition alone and 0.8% by the LLN-definition alone. In total, 73.2% (145/198) of spirometry-based COPD was not identified by self-reported and or EMR-based data.
Legend Fig 2: In total 243 COPD cases were ascertained by at least one definition. In total, 1793 subjects who had a pre- and post-BD lung function measurement with a sufficient quality (C or better), Electronic Medical Records (EMR) of good quality, and without missing data on self-reported COPD (see Fig 1) were included. Self-report = self-reported data based on the ECRHSIII screening questionnaire, EMR = Electronic Medical Records, spirometry LLN = post-bronchodilator measurement of FEV1/FVC lower than FEV1/FVC-lower limits of normal, spirometry GOLD = post-bronchodilator measurement of FEV1/FVC < 0.70.
The highest agreement, expressed as Cohen’s kappa , was found between COPD based on the two spirometry definitions (ĸ = 0.65), followed by a moderate agreement between self-reported and EMR-based COPD (ĸ = 0.52) (Table 3). Agreement between spirometry-based COPD compared with self-reported or EMR-based COPD was fair (LLN-definition: self-report: ĸ = 0.30, EMR: ĸ = 0.31, GOLD-definition: self-report: ĸ = 0.25, EMR: ĸ = 0.26).
Agreement between the three different data sources was determined with Cohen’s Kappa.
Self-reported or EMR-based COPD identified less than 30% of all subjects with spirometry-based COPD (sensitivity varied between 0.19 and 0.30, specificity: 0.97–0.99) (Table 3).
As expected, since LLN is a subset of the GOLD definition, the proportion of subjects with self-reported or EMR-based COPD confirmed by spirometry-based COPD (PPV) was higher when compared with the GOLD-definition (PPV self-report: 0.50, PPV EMR: 0.71) than with the LLN-definition (PPV self-report: 0.38, PPV EMR: 0.52).
Associations between COPD-definitions and potential risk factors and severity measures
Overall, the direction of associations was consistent across all four COPD-definitions (Table 4). A low BMI (<20 vs. 20–25) and pack years of smoking were significant risk factors for each COPD-definition with comparable magnitude. However, the magnitude and significance of other associations varied between the definitions.
In particular, the association of age and gender with COPD varied according to the definition used. Age was significantly positively associated with COPD, except when the LLN-definition was used. The negative association with female gender was only statistically significant when the GOLD-definition was used, whereas the EMR-based definition showed a non-significant positive association. The positive association between self-reported allergy and COPD was only significant when using self-reported COPD or EMR-based COPD. When focusing on indicators for objectively measured allergy, we found strong positive associations between self-reported COPD and all three definitions of IgE sensitization (>1 positive specific IgE, total IgE > = 100 IU/ml, and a combination of both). EMR-based COPD and COPD based on LLN-definition were only associated with total IgE > = 100 IU/ml. Current asthma was positively associated with all four definitions, nonetheless, a substantially stronger association was observed with self-reported COPD. Indicators for COPD severity were positively associated with COPD regardless of the definition used, but stronger associations were observed with self-reported and EMR-based COPD.
Sensitivity analyses of the 1626 subjects aged ≥40 years showed a small increase in COPD-prevalence based on all four definitions (self-report: n = 81 (5.0%), EMR: n = 52 (3.2%), LLN: n = 103 (5.7%), GOLD: n = 196 (12.1%))(S2 Table). The associations between COPD and potential risk factors did not change. Analyses without patients with current asthma showed a lower prevalence of self-reported COPD (n = 52 (3.0%) vs. n = 82 (4.6%)), prevalence based on the other definitions did not show major changes (S3 Table). A stronger association was observed between self-reported COPD and age and a low BMI. The association between self-reported COPD and self-reported allergy and indicators for objectively measured allergy became weaker.
Pre- versus post bronchodilator spirometry
COPD-prevalence increased when using pre-BD measurements (LLN pre: 9.1% vs. post: 5.9%, GOLD pre: 16.4% vs. post: 10.9%) (see Table 5). In general, similar associations with risk factors were identified by using pre- or post-BD spirometry, although associations were stronger and more often significant when COPD was based on post-measurements.
In a general population sample of adults aged 20–72 years from the Netherlands, we found that COPD-prevalence varied depending on the used definition (2.9–10.9%). The overlap between COPD-prevalence based on the four different operational definitions was low. Self-reported or EMR-based COPD identified less than 30% of all subjects with spirometry-based COPD, but specificity was high. Despite the variation in prevalence estimates, low overlap and low sensitivity, the direction of associations between potential risk factors and all four operational definitions of COPD were more or less similar, although the magnitude and statistical significance of the associations varied between the definitions. The combination of a relatively low prevalence and high specificity of self-reported and EMR-based COPD compared to both LLN and GOLD as a reference explains the minor changes in the associations between risk factors with the different COPD-definitions . A high specificity causes a relatively low number of ‘false positive’ COPD cases in the ‘true positive’ COPD group. COPD-prevalence was substantially higher based on pre- instead of post-BD measurements. We found similar associations with risk factors when using pre- or post-BD spirometry, but the associations with risk factors were stronger and more often significant when COPD was based on post-BD measurements.
The highest COPD-prevalence was found based on the GOLD-definition (10.9%), followed by spirometry LLN-definition (5.9%), self-report (4.6%) and EMR (2.9%). Prevalence estimates were comparable with spirometry-based prevalence estimates in the larger general population studies. The PLATINO-study (n = 5,571, age ≥ 40 years) in Latin America found a prevalence varying between 7.8%-19.7% based on post-BD GOLD-definition . The BOLD-study (n = 9,425, 52–60 years) estimated world-wide COPD-prevalence to be 10.1% (GOLD-2 or higher) based on post-BD measurements . The number of studies in the general population comparing prevalence estimates based on different data sources is limited. Celli et al. found a twice as high COPD-prevalence based on pre-BD spirometry compared with self-reported COPD (self-report: 7.7%, LLN: 14.2%, GOLD: 16.8%) in the United States population (n = 9,838, mean±SD 48.3 ± 13.6 years). Despite the use of pre-BD spirometry, a Swedish study conducted in a population-based sample (n = 3,892, mean±SD 51.7 ± 10.6 years), found a lower COPD-prevalence compared with our results, (LLN: 4.2%, GOLD: 9.4%, self-report: 0.8%). This is possibly explained due to exclusion of subjects with physician-diagnosed asthma in the Swedish study. Mohangoo et al.(n = 12,699, mean±SD 39±23 years) found almost twice as high self-reported “asthma or COPD” prevalence compared to the prevalence based on GP data. Our study also found higher COPD-prevalence based on self-reported data compared to GP data.
Other studies also confirmed underestimation of COPD in the general population when using self-reported or EMR-data [2,3,5,7,10]. Pulmonary specialists may argue that COPD only based on spirometry is an overestimation since for a clinical COPD diagnosis also other indicators are needed like respiratory symptoms, family history of COPD, or history of exposure to risk factors . We want to emphasize that this study aims to assess COPD for epidemiological usage and not for clinical case finding. Therefore COPD based on only lung function criteria is justifiable. Therefore COPD based on only lung function criteria is justifiable. On the other hand there are also arguments for underestimation of COPD prevalence based on spirometry since the likelihood of producing a reproducible spirometric measurement decreases with disease severity. We excluded non-reproducible tests and therefore it is likely to selectively exclude a higher proportion of subjects with airflow obstruction . Furthermore, COPD is a slowly progressive disease and symptoms slowly worsen over time . People adapt to these slowly developing respiratory problems and might be unaware of their condition and may not visit a GP. This could explain the low sensitivity of self-reported or EMR–based COPD. Furthermore, self-reported COPD or a diagnosis of COPD in EMR will also depend on the severity of the disease which is highly associated with care seeking behavior . The CCQ-score and self-reported health–indicators of the severity of the disease—were both more strongly associated with self-reported- and EMR-based COPD compared to spirometry-based COPD. Decline in lung function occurs faster in earlier stages of the disease. Therefore, early diagnosis may slow disease progression by physical activity and prevention of exposure to smoke and other noxious agents. In addition, pharmacological intervention may control symptoms and improve quality of life [30,31].
A follow-up study by de Marco et al.  in young adults (ECRHS-II study, n = 4,636, 20–44 years old at the time of inclusion) studied risk factors of new-onset COPD and compared associations between risk factors and several pre-BD spirometric COPD-definitions. The association with LLN-based COPD incidence and gender, age, and being underweight lost their statistical significance compared to GOLD-based COPD incidence. We found similar associations with age, gender and underweight and these associations were also stronger with pre-BD GOLD-based COPD compared with LLN. However, being underweight was stronger associated with LLN-based COPD than GOLD when using post-BD measurements.
In our study, most associations between risk factors and different COPD-definitions had a similar magnitude and overlapping confidence intervals, except for the associations with allergy indicators. We found strong positive associations between self-reported COPD and indicators for allergy. Allergy is associated with asthma, and the association between COPD and current asthma was more prominent for self-reported COPD than for the other COPD-definitions. The associations between self-reported COPD and allergy indicators became weaker when subjects with current asthma were excluded, this indicates that some misclassification in self-reported COPD may be present due to overlap with asthma. Except for allergy indicators, this study overall indicates that for epidemiological studies with the aim to evaluate risk factors for COPD, the influence of the used definition seems to be limited. However, we focused only on risk factors that are known to be associated with COPD, and we can only speculate that the influence of different COPD-definitions on associations with unknown risk factors is limited.
Our population-based study is unique in the simultaneous use of three different data sources to assess COPD: post-BD spirometry, GP registrations, and ECRHS questionnaire items. We applied stringent quality standards to both spirometry and EMR data. In most population-based epidemiological studies based on spirometry, only pre-BD lung function measurements are available. It is not unsurprising that the prevalence estimates were higher when COPD was based on pre-BD spirometry. By using post-BD spirometry we studied fixed airway obstruction, which will reduce the number of false-positives due to overlap with asthma [32,33]. Nevertheless, the influence of using pre-BD instead of post-BD definitions on associations with potential risk factors, including current asthma, was limited. As expected, associations were somewhat stronger and more often significant when COPD was based on post-measurements, since a reduction in the number of false-positives will reduce measurement error and consequently will strengthen risk factor associations. To the best of our knowledge, this was not studied before in a population–based study. Another strength of our study was the extensive non-response analysis from the source population up to the current study population. We previously compared characteristics of non-responders and responders of the questionnaire survey (source-population). This study continued the non-response analysis by comparing characteristics of responders and non-responders in different stages of the data collection. Participants of the medical examination were older, more often female and more often former smokers compared to subjects who were invited but did not participate. Both in the previous analysis  and in the present study we were able to demonstrate that selection bias did not affect the associations under study, e.g. the association between self-reported COPD and potential risk factors (see S1 Table).
The three different data sources were not collected at the same time, which is a limitation of our study. Questionnaire data on COPD were collected in November 2012, EMR from 2010–2012 were used, and spirometry was conducted between March 2014 and February 2015. However, it is unlikely that the lack of overlap is to a large degree explained by COPD development during the relatively short data collection period. Previous studies that did collect self-reported data and spirometry data synchronically, also found a large degree of non-overlap [5,6].
General population studies are often conducted in urban populations. Our study population lived in a rural area outside the larger cities and farmers were excluded. The prevalence of GP-diagnosed COPD in the study area did not differ from the prevalence in other Dutch rural areas without livestock farming (42.6 vs. 47.1 prevalence per 1000 for patients aged >40 years, average over 2007–2013), and we have no reason to expect that agreement between different COPD-definitions would be different in other areas.
The operational definition used for COPD greatly influences prevalence estimates. Self-reported or EMR-based COPD identified less than 30% of all subjects with COPD based on persistent airflow limitation, which implies that a substantial number of subjects with COPD cannot be identified by questionnaires or medical records. However, the effect of the different COPD-definitions on associations with potential risk factors was limited, except for indicators of allergy, which were more strongly associated with self-reported COPD compared to the other definitions. In addition, the use of pre-BD spirometry instead of post-BD spirometry resulted in higher prevalence estimates, but had a minimal effect on associations with potential risk factors. Researchers using these operational definitions to group individuals according to COPD status, need to be aware of the impact of such choices. Results of this study may be informative for population-based epidemiological studies with the aim to evaluate potential risk factors for COPD.
S1 Table. Association between self-reported COPD and several characteristics compared between different population subsets to study potential selection bias.
Legend Table S1. OR and 95% CI were adjusted for age, gender, and ever smoking. Bold type indicates statistical significance (p <0.05). Self-reported COPD was defined as a positive answer to the question: ‘Have you ever been told by a doctor that you had chronic obstructive pulmonary disease or emphysema?’. The sub-populations are represented in Fig 1 (main article).
S2 Table. Associations between risk factors and severity measures with four different definitions of COPD, only subjects older than 39 years of age are included.
Legend Table S2. Data are presented as mean ±SD or %, unless otherwise stated. OR and 95% CI were adjusted for age, gender, ever smoking and pack years (number of pack years was mean-centered for ex- and current smokers). Bold type indicates statistical significance (p <0.05). Self-reported: self-reported data based on the ECRHSIII screening questionnaire, EMR: Electronic Medical Records, spirometry: post-bronchodilator lung function measurement. Used definitions for COPD based on different databases are presented in Table 2. *Mean packyears are calculated for ex-smokers and current smokers. † GOLD 1: FEV1/FVC<0.70 and FEV1≥ 80% predicted, GOLD 2–4: FEV1/FVC <0.7 and FEV1 <80% predicted ‡ Clinical COPD Questionnaire (CCQ)-score (van der Molen et al. ‘Development, validity and responsiveness of the Clinical COPD Questionnaire.’ Health Qual Life Outcomes 2003;1:13.) NA: Not available, as very few (LLN) or no (GOLD) subjects with spirometry-based COPD had FEV1/FVC > 0.7. §Less than good self-reported health: bad/moderate/reasonable, reference category: good/excellent self-reported health
S3 Table. Associations between risk factors and severity measures with four different definitions of COPD, subjects with current asthma are excluded.
Legend S3 Table. Data are presented as mean ±SD or %, unless otherwise stated. OR and 95% CI were adjusted for age, gender, ever smoking and pack years (number of pack years was mean-centered for ex- and current smokers). Bold type indicates statistical significance (p <0.05). Self-reported: self-reported data based on the ECRHSIII screening questionnaire, EMR: Electronic Medical Records, spirometry: post-bronchodilator lung function measurement. Used definitions for COPD based on different databases are presented in Table 2. *Mean packyears are calculated for ex-smokers and current smokers. † GOLD 1: FEV1/FVC<0.70 and FEV1≥ 80% predicted, GOLD 2–4: FEV1/FVC <0.7 and FEV1 <80% predicted ‡ Clinical COPD Questionnaire (CCQ)-score (van der Molen et al. ‘Development, validity and responsiveness of the Clinical COPD Questionnaire.’ Health Qual Life Outcomes 2003;1:13.) NA: Not available, as very few (LLN) or no (GOLD) subjects with spirometry-based COPD had FEV1/FVC > 0.7. §Less than good self-reported health: bad/moderate/reasonable, reference category: good/excellent self-reported health.
We would like to thank colleagues of the Institute for Risk Assessment Sciences for their contribution to the medical examination: Marieke Oldenwening, Siegfried de Wind, Amena Magielsen-van As, Saskia Martens, Sigrid Nieuwenweg, Anna Strumphler, Edwina Traanman and Dorine Verhoef. Jack Spithoven and Nena Burger are acknowledged for handling the biological samples. Jack Spithoven performed the IgE analyses. We thank Eef van Otterloo for technical support. We thank ‘Stichting Informatie Voorziening Zorg’ (Trusted Third Party) for ensuring participants privacy.
- Conceptualization: FB CJY DH LAMS.
- Data curation: LAMS.
- Formal analysis: FB LAMS.
- Funding acquisition: CBMM DH CJY LAMS.
- Investigation: FB EK BA.
- Methodology: FB LAMS.
- Project administration: FB EK.
- Resources: FB EK BA.
- Software: CED.
- Supervision: CBMM DH CJY LAMS.
- Validation: BA.
- Visualization: FB.
- Writing – original draft: FB.
- Writing – review & editing: FB CJY EK BA JR JPZ CED CBMM FS DH LAMS.
- 1. Vestbo J, Hurd SS, Agusti AG, Jones PW, Vogelmeier C, Anzueto A, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am J Respir Crit Care Med 2013 Feb 15;187(4):347–365. pmid:22878278
- 2. Halbert RJ, Natoli JL, Gano A, Badamgarav E, Buist AS, Mannino DM. Global burden of COPD: systematic review and meta-analysis. Eur Respir J 2006 Sep;28(3):523–532. pmid:16611654
- 3. Dirven JA, Muris JW, van Schayck CP. COPD screening in general practice using a telephone questionnaire. COPD 2010 Oct;7(5):352–359. pmid:20854050
- 4. Peat JK, Toelle BG, Marks GB, Mellis CM. Continuing the debate about measuring asthma in population studies. Thorax 2001 May;56(5):406–411. pmid:11312411
- 5. Murgia N, Brisman J, Claesson A, Muzi G, Olin AC, Toren K. Validity of a questionnaire-based diagnosis of chronic obstructive pulmonary disease in a general population-based study. BMC Pulm Med 2014 Mar 21;14:49-2466-14-49.
- 6. Celli BR, Halbert RJ, Isonaka S, Schau B. Population impact of different definitions of airway obstruction. Eur Respir J 2003 Aug;22(2):268–273. pmid:12952259
- 7. Prieto-Centurion V, Rolle AJ, Au DH, Carson SS, Henderson AG, Lee TA, et al. Multicenter study comparing case definitions used to identify patients with chronic obstructive pulmonary disease. Am J Respir Crit Care Med 2014 Nov 1;190(9):989–995. pmid:25192554
- 8. Mohangoo AD, van der Linden MW, Schellevis FG, Raat H. Prevalence estimates of asthma or COPD from a health interview survey and from general practitioner registration: what's the difference? Eur J Public Health 2006 Feb;16(1):101–105. pmid:16141304
- 9. Romanelli AM, Raciti M, Protti MA, Prediletto R, Fornai E, Faustini A. How Reliable Are Current Data for Assessing the Actual Prevalence of Chronic Obstructive Pulmonary Disease? PLoS One 2016 Feb 22;11(2):e0149302. pmid:26901166
- 10. Cooke CR, Joo MJ, Anderson SM, Lee TA, Udris EM, Johnson E, et al. The validity of using ICD-9 codes and pharmacy records to identify patients with chronic obstructive pulmonary disease. BMC Health Serv Res 2011 Feb 16;11:37-6963-11-37.
- 11. de Marco R, Accordini S, Marcon A, Cerveri I, Anto JM, Gislason T, et al. Risk factors for chronic obstructive pulmonary disease in a European cohort of young adults. Am J Respir Crit Care Med 2011 Apr 1;183(7):891–897. pmid:20935112
- 12. Pellegrino R, Viegi G, Brusasco V, Crapo RO, Burgos F, Casaburi R, et al. Interpretative strategies for lung function tests. Eur Respir J 2005 Nov;26(5):948–968. pmid:16264058
- 13. Quanjer PH, Stanojevic S, Cole TJ, Baur X, Hall GL, Culver BH, et al. Multi-ethnic reference values for spirometry for the 3-95-yr age range: the global lung function 2012 equations. Eur Respir J 2012 Dec;40(6):1324–1343. pmid:22743675
- 14. Shirtcliffe P, Weatherall M, Marsh S, Travers J, Hansell A, McNaughton A, et al. COPD prevalence in a random population survey: a matter of definition. Eur Respir J 2007 Aug;30(2):232–239. pmid:17666557
- 15. van Dijk W, Tan W, Li P, Guo B, Li S, Benedetti A, et al. Clinical relevance of fixed ratio vs lower limit of normal of FEV1/FVC in COPD: patient-reported outcomes from the CanCOLD cohort. Ann Fam Med 2015 Jan-Feb;13(1):41–48. pmid:25583891
- 16. Borlee F, Yzermans CJ, van Dijk CE, Heederik D, Smit LA. Increased respiratory symptoms in COPD patients living in the vicinity of livestock farms. Eur Respir J 2015 Dec;46(6):1605–1614. pmid:26250492
- 17. Jarvis D, Newson R, Burney P, On Behalf of 26 Principal Investigators. Change in respiratory symptoms in young adults as they age: European community respiratory health survey 3 (ECRHS 3). Eur Respir J 2013;42(Suppl 57):1809.
- 18. Verheij RA. NIVEL Zorgregistraties eerste lijn: huisarts. (Netherlands Primary Care DatabaseL General Practitioners). 2014; Available at: www.nivel.nl. Accessed 1/28, 2015.
- 19. Burney P, Jarvis D. Lung function protocol, data sheets and lung function questionnaire. 2015; Available at: www.ecrhs.org.
- 20. Miller MR, Hankinson J, Brusasco V, Burgos F, Casaburi R, Coates A, et al. Standardisation of spirometry. Eur Respir J 2005 Aug;26(2):319–338. pmid:16055882
- 21. Doekes G, Douwes J, Wouters I, de Wind S, Houba R, Hollander A. Enzyme immunoassays for total and allergen specific IgE in population studies. Occup Environ Med 1996 Jan;53(1):63–70. pmid:8563861
- 22. de Marco R, Marcon A, Rossi A, Anto JM, Cerveri I, Gislason T, et al. Asthma, COPD and overlap syndrome: a longitudinal study in young European adults. Eur Respir J 2015 Sep;46(3):671–679. pmid:26113674
- 23. van der Molen T, Willemse BW, Schokker S, ten Hacken NH, Postma DS, Juniper EF. Development, validity and responsiveness of the Clinical COPD Questionnaire. Health Qual Life Outcomes 2003 Apr 28;1:13. pmid:12773199
- 24. Leffondre K, Abrahamowicz M, Siemiatycki J, Rachet B. Modeling smoking history: a comparison of different approaches. Am J Epidemiol 2002 Nov 1;156(9):813–823. pmid:12396999
- 25. Buist AS, McBurnie MA, Vollmer WM, Gillespie S, Burney P, Mannino DM, et al. International variation in the prevalence of COPD (the BOLD Study): a population-based prevalence study. Lancet 2007 Sep 1;370(9589):741–750. pmid:17765523
- 26. Menezes AM, Perez-Padilla R, Jardim JR, Muino A, Lopez MV, Valdivia G, et al. Chronic obstructive pulmonary disease in five Latin American cities (the PLATINO study): a prevalence study. Lancet 2005 Nov 26;366(9500):1875–1881. pmid:16310554
- 27. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977 Mar;33(1):159–174. pmid:843571
- 28. Armstrong BG. Effect of measurement error on epidemiological studies of environmental and occupational exposures. Occup Environ Med 1998 Oct;55(10):651–656. pmid:9930084
- 29. Alexopoulos EC, Malli F, Mitsiki E, Bania EG, Varounis C, Gourgoulianis KI. Frequency and risk factors of COPD exacerbations and hospitalizations: a nationwide study in Greece (Greek Obstructive Lung Disease Epidemiology and health ecoNomics: GOLDEN study). Int J Chron Obstruct Pulmon Dis 2015 Dec 11;10:2665–2674. pmid:26715845
- 30. Kjeldgaard P, Dahl R, Lokke A, Ulrik CS. Detection of COPD in a high-risk population: should the diagnostic work-up include bronchodilator reversibility testing? Int J Chron Obstruct Pulmon Dis 2015 Feb 23;10:407–414. pmid:25759573
- 31. Viegi G, Pistelli F, Sherrill DL, Maio S, Baldacci S, Carrozzi L. Definition, epidemiology and natural history of COPD. Eur Respir J 2007 Nov;30(5):993–1013. pmid:17978157
- 32. Schermer TR, Smeele IJ, Thoonen BP, Lucas AE, Grootens JG, van Boxem TJ, et al. Current clinical guideline definitions of airflow obstruction and COPD overdiagnosis in primary care. Eur Respir J 2008 Oct;32(4):945–952. pmid:18550607
- 33. Gupta RP, Perez-Padilla R, Marks G, Vollmer W, Menezes A, Burney P. Summarising published results from spirometric surveys of COPD: the problem of inconsistent definitions. Int J Tuberc Lung Dis 2014 Aug;18(8):998–1003. pmid:25199020
- 34. van Dijk CE, Zock JP, Baliatsas C, Smit LA, Borlee F, Spreeuwenberg P, et al. Health conditions in rural areas with high livestock density: Analysis of seven consecutive years. Environ Pollut 2016 Dec 30.