Occupational exposure to endotoxin is associated with decrements in pulmonary function, but how much variation in this association is explained by genetic variants is not well understood.
We aimed to identify single nucleotide polymorphisms (SNPs) that are associated with the rate of forced expiratory volume in one second (FEV1) decline by a large scale genetic association study in newly-hired healthy young female cotton textile workers.
DNA samples were genotyped using the Illumina Human CVD BeadChip. Change rate in FEV1 was modeled as a function of each SNP genotype in linear regression model with covariate adjustment. We controlled the type 1 error in study-wide level by permutation method. The false discovery rate (FDR) and the family-wise error rate (FWER) were set to be 0.10 and 0.15 respectively.
Two SNPs were found to be significant (P<6.29×10−5), including rs1910047 (P = 3.07×10−5, FDR = 0.0778) and rs9469089 (P = 6.19×10−5, FDR = 0.0967), as well as other eight suggestive (P<5×10−4) associated SNPs. Gene-gene and gene-environment interactions were also observed, such as rs1910047 and rs1049970 (P = 0.0418, FDR = 0.0895); rs9469089 and age (P = 0.0161, FDR = 0.0264). Genetic risk score analysis showed that the more risk loci the subjects carried, the larger the rate of FEV1 decline occurred (Ptrend = 3.01×10−18). However, the association was different among age subgroups (P = 7.11×10−6) and endotoxin subgroups (P = 1.08×10−2). Functional network analysis illustrates potential biological connections of all interacted genes.
Citation: Zhang R, Zhao Y, Chu M, Mehta A, Wei Y, Liu Y, et al. (2013) A Large Scale Gene-Centric Association Study of Lung Function in Newly-Hired Female Cotton Textile Workers with Endotoxin Exposure. PLoS ONE 8(3): e59035. https://doi.org/10.1371/journal.pone.0059035
Editor: Dan E. Arking, Johns Hopkins University, United States of America
Received: November 29, 2012; Accepted: February 8, 2013; Published: March 19, 2013
Copyright: © 2013 Zhang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported by grants from USA: The National Institute for Occupational Safety and Health (RO104241), National Institute of Environmental Health Sciences (ES00002) and China: National Natural Science Foundation of China (81072389, 30901232), Natural Science Foundation of the Jiangsu Higher Education Institutions of China (10KJA33034), the Research Fund for the Doctoral Program of Higher Education of China (20113234110002), the Research and Innovation Project for College Graduates of Jiangsu Province (CXZZ11_0733), and the grant from the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
According to an official statement from the American Thoracic Society (ATS) , cigarette smoking is not the sole meaningful cause of chronic obstructive pulmonary disease (COPD). There is also strong evidence suggesting a causal relationship between COPD and occupational exposures (e.g. endotoxin ) or genetic syndromes (e.g. α1-antitrypsin deficiency ). Generally released from bacterial lysis, endotoxin is ubiquitous in the airborne environment . High endotoxin levels are observed in cotton textile mills processing cotton. Exposure to cotton dust leads to chronic respiratory disease and excessive loss of pulmonary function –. Recently, genome-wide association studies (GWAS) – and large-scale meta-analysis of GWAS – of COPD also offered mechanistic insight into pulmonary function regulation due to individual genetic heterogeneity (e.g. 6p21.32 region) in non-occupationally exposed populations.
Even for occupationally exposed populations, there is evidence that single nucleotide polymorphisms (SNPs) may play an important role in pulmonary function while taking endotoxin effect into account. Our previous study in 20-year longitudinal cohort suggested that Tyr113His and His139Arg polymorphisms in microsomal epoxide hydrolase (mEH) gene , rs1800629 in tumor necrosis factor (TNF) gene and rs909253 in lymphotoxin alpha (LTA) gene , may modify the association between endotoxin exposure and annual decline of FEV1 in cotton textile workers, though no statistically significant marginal effects of those polymorphisms were observed.
In this study, we aimed to explore the association between the rate of FEV1 decline and genetic heterogeneity in previously unexposed newly-hired young healthy female cotton textile workers entering endotoxin-exposed work areas in yarn preparation based on large scale genomic data. To the best of our knowledge, there is no large scale association study of lung function in occupational settings.
Materials and Methods
The institutional review boards of the Harvard School of Public Health, the Putuo District People’s Hospital, and the Human Resources Administration of China approved the study. All participants gave written informed consent before the study.
The study population was derived from the Newly-Hired Chinese Textile Workers Study, an intensive, repeated measures prospective cohort study in the cotton textile industry consisting of 384 adults in Shanghai, China. All subjects were self-reported Han Chinese ethnicity, females, and lifelong non-smokers, asymptomatic of cardiopulmonary disease and naive to occupational endotoxin exposure before entering cotton textile mill. Detailed information on subject selection, methods for testing pulmonary function, and exposure assessment has been described previously –. A baseline examination took place in March of 1997, and follow-up examinations occurred at 3 months, 12 months, and 18 months later. Only subjects with DNA samples that met quality assurance requirements for genotyping and analysis (n = 301; refer to quality control section for details) were included in this study. Among the 301 subjects, one hundred and sixty-three (54.2%) subjects with average age of 18.4 years (range 16.0–28.8 years) were newly employed, while 138 (45.8%) subjects with average age of 33.1 years (range 16.6–46.7 years) were currently working in converted silk mills. Sixty-two (20.6%) subjects were followed up to 3 months, 42 (14.0%) subjects were followed up to 12 months, and 197 (65.4%) subjects were followed up to 18 months.
Exposure Assessment and Pulmonary Function Tests
Briefly, airborne cotton dust was sampled with a settled vertical elutriator (General Metalworks Corp., Mequon, WI) fixed in work area using 37 mm PVC filters, and endotoxin concentration was analyzed using chromogenic Limulus amoebocyte lysate assay, as previously described . The workers remained in the same area throughout the observation. We recorded the endotoxin concentration for all work areas at each follow-up. Spirometric measurements were conducted by a trained technician at cotton mills using an 8-liter water-sealed field spirometer (W.E. Collins Co., Braintree, MA) calibrated twice each day with a 3-liter syringe. Forced expiratory spirograms were carried out before and after work shifts on the first day back to work after a two day rest, and each worker performed up to seven trials to produce three acceptable curves . The present analysis focused on change rate in pre-shift FEV1 over the 18 month period of follow-up.
DNA Isolation and SNP Genotyping
DNA samples were extracted from whole blood using a DNA purification kit (Gentra Systems, Inc., Research Triangle, NC). Of the 384 DNA samples collected from subjects who participated in the baseline examination, 55 samples were excluded due to inadequate DNA content or low DNA quality. Genotyping was performed using the Illumina HumanCVD Beadchip (IBC array) by laboratory personnel without the knowledge of phenotype information. The IBC array incorporates about 50,000 SNPs to efficiently capture genetic diversity across >2,000 genic regions related to cardiovascular, inflammatory and metabolic phenotypes. Genetic variation within the majority of these regions is captured at density equal to or greater than that afforded by genome-wide genotyping products . Of the 329 samples that were sent for genotyping, 312 were genotyped successfully.
We performed systematic quality control procedures to filter both unqualified samples and SNPs before the association analysis. Briefly, SNPs were removed for a total of 49,094 genotyped ones if they were located in non-autosomal chromosomes (1,126), had a call rate ≤95% (954), minor allele frequency (MAF) <0.05 (19,357), P<0.001 for Hardy-Weinberg equilibrium (HWE) test (46). For these 312 samples, one subject was removed with a genotyping call rate <95%. None was ambiguous for genetic sex. Seven subjects failed the cryptic relatedness check. We also detected outliers (three individuals were excluded) and population stratification using EIGENSTRAT 4.2 , a method based on principal component analysis. As a result, a total of 301 subjects and 27,611 SNPs were left for subsequent analysis. Quality control procedures were implemented in PLINK (version 1.07) .
As presented in Figure S1 in File S1, the FEV1 declines linearly with increased exposure time, but the rate of FEV1 decline varies among individuals. We aimed to evaluate the association between SNP and the rate of FEV1 decline. Firstly, we estimated the rate of monthly FEV1 declines for each individual in a linear regression model through Equation 1. For the ith individual, FEV1ij is the observed FEV1 value at different months (monthj = 0, 3, 12 and 18), u0i is baseline FEV1 and eij is the random error at jth months. The estimated slope (ESi) reflected the rate of FEV1 decline as a function of month for each individual. The mean and median R2 of the linear model for all 301 subjects are 0.63 and 0.73 respectively indicating the present model is not bad to describe the monthly decline rate of FEV1. ESi was then used as the outcome in the following association analysis.(1)
Secondly, we assessed the associations between ES and each SNP by fitting multiple-variable linear regression models (Equation 2) adjusted for height, age, FEV1 at baseline and the average level of log-transformed endotoxin exposure. In Equation 2, ei is the random error for ith individual. Two genetic models (dominant and additive) of inheritance for each SNP were considered respectively. In additive model, SNPs with low frequency (≤5%) of rare homozygous were excluded for lack of statistical robustness .(2)
The bootstrap re-sampling analysis was then performed to control the false positive discovery for significant SNPs . We generated 2,000 bootstrap samples, and SNPs were tested 2,000 times respectively. The results are not likely to be falsely positive if the SNP has more than 1,600 times (80%) bootstrap P value less than the predefined threshold (e.g. 0.005 for 10 SNPs).
Gene-gene and gene-environment interactions were tested using Equation 3. Here, βA and βB are the main effects of factor A and B, respectively. βAB is the interaction.(3)
We also constructed a genetic risk score (GRS) by counting the number of risk genotypes of top 10 SNPs that the individuals carried . The association between the GRS and the rate of FEV1 decline was also tested in multiple-variable linear regression models to evaluate the join effects of multiple genetic factors. Additionally, stratification analysis of GRS by age and endotoxin level was performed.
Due to the linkage disequilibrium (LD) structure among SNPs, a standard Bonferroni correction would yield a significance level of approximately 1.81×10−6, which is very conservative. And, the Human CVD array has a dense gene-centric design, similar studies have used a less stringent level around 1×10−4 previously , . In this exploratory study, we controlled the false discovery rate (FDR) and the family-wise error rate (FWER) in study-wide level by permutation method (see details in Appendix S1). The FDR was set to be 0.10, meanwhile the FWER were set to be 0.15 (corresponding threshold for each SNP was 6.29×10−5). We imputed un-genotyped SNPs using Minimac software  based on LD information from hg/19 1000 Genomes database (with ASN as the reference set, released November 2010). Gene functional network was built using MetaCore™ online software  (GeneGo, Inc., Carlsbad, CA) to illustrate potential biological connections of interacted genes identified in this study. Statistical analysis and data management were carried out through: PLINK (Version 1.07)  and R software (version 2.14.0; The R Foundation for Statistical Computing). Manhattan plots were generated using Haploview (version 4.2) .
The mean decline of FEV1 for the overall 301 subjects was 6.79±25.09 ml per-month. Individuals were classified to different subgroups by age, height, FEV1 at baseline and endotoxin level respectively and the rate of FEV1 decline (ES) in each subgroup was presented in Table 1. The large standard deviation (SD) of ES reflects a large individual variation of ES among subjects, which probably be caused by different environmental exposure, demographic characteristics or genetic background. We fitted a multi-variable linear regression model (Equation 2) and predicted the ES for each subject. The SD of the predictive ES was presented in Table S1 in File S1. When deducted the effects of non-genetic factors, the SD of predictive ES ranges from ∼4 to ∼6. We assumed that the genetic factors might contribute to the remained variation of ES.
Results of Individual SNP Analysis
In additive model, the frequency of rare homozygosity of the top-10 SNPs was less than 2.5% (data not shown) and none was significant. Thus, only SNPs identified by the dominant model are presented (Table 2, Tables S2 and S3 in File S1). Two risk SNPs (rs1910047 and rs9469089) were significant with P<6.29×10−5. Individuals carrying the TT genotype of rs1910047 had average 4.32 ml decline of FEV1 per-month, while those carrying the TA or AA genotypes had 18.32 ml decline of FEV1 per-month. After adjustment of covariates, individuals carrying the AA genotype had more rapid FEV1 decline (15.17 ml/moth) compared with those carrying the TA or AA genotype (P = 3.70×10−5, FDR = 0.0778). For rs9469089, individuals carrying the GC or CC genotype also had more rapid FEV1 decline (12.50 ml/month) than those carrying the GG genotype (P = 6.19×10−5, FDR = 0.0967).
Eight SNPs (rs32588, rs4855881, rs10515978, rs601675, rs11761231, rs10129426, rs10201627 and rs1049970) with P<5×10−4 were considered suggestive for accelerated changes of FEV1. Two SNPs (rs32586 and rs32589) are in perfect LD with rs32588 (R2 = 1.00). rs3749073 is also in high LD with rs10201627 (R2 = 0.98) (Table S4 in File S1). The bootstrap re-sampling analysis was then performed for the top-10 SNPs to internally validate the results. All top-10 SNPs had a bootstrap P value<0.005 more than 1,600 times out of 2,000 bootstrap samples (over 80% significant), indicating that those SNPs were not likely to be false-positive results.
Manhattan plot using P values derived from the dominant model are presented in Figure S2 in File S1. A small genomic control inflation factor (λ) of 1.026 indicated little inflation of the large scale genotyping study results due to population stratification (Figure S3 in File S1). We again did a sensitivity analysis with further adjustment of first eight PCs generated by EIGENSTRAT 4.2. However, none PCs was significant in the model (P values range from 0.24 to 0.82). As presented in Table S5 in File S1, the results do not vary much whenever PCs adjusted or not.
After imputation, a cluster of adjacent SNPs in high or low LD with targeted SNP were observed with P value from 10−5 to 10−3 (Appendix S2 and Table S6 in File S2). We also tested those 10 SNPs in additive model (Table S7 in File S1). All SNPs were significant at 0.005 level, except for rs10515978 (P = 0.0159). Considering 73 subjects are young women who may still have been in the lung function growth period of their development, we did another sensitivity analysis only in those subjects with age ≥18 years. As a result, the effects of SNPs do not vary much between all samples and only adult samples (Table S8 in File S1).
Results of Gene-gene and Gene-environment Interaction Analysis
We tested all pair-wise gene-gene interactions for these ten SNPs, and interactions with P value <0.05 and FDR <0.10 were presented in Table 3. rs1910047 which had significant marginal effect had synergistic interaction with rs1049970 (P = 0.0418, FDR = 0.0895). The other three interactions which composed of four SNPs, also showed synergy between SNPs. The interaction P values (FDRs) were 0.0051 (0.0313), 0.0052 (0.0313) and 0.0095 (0.0390) for rs10515978_rs32588, rs32588_ rs11761231 and rs11761231_ rs4855881 respectively. We mapped these interacted SNPs to their nearest genes (Table S3 in File S1), and built network (Figure 1) using the six genes (TBX3, CDH5, PPARGC1B, LMAN1, PODXL and APEH). There are four highlighted functional paths corresponding to four pairs of gene-gene interactions: TBX3 - KLF4 - VE_cadherin (CDH5); PERC (PPARGC1B) - ESR1(nuclear) - ATF_6 alpha - LMAN1; PERC (PPARGC1B) - ESR1(nuclear) - Progesterone receptor - p300 - NF_kB - Podocalyxin_like 1 (PODXL); Podocalyxin_like 1 (PODXL) - NF_kB - c_Myc - APEH. The information of all 63 nodes in the network is given in Appendix S3. Fourteen nodes (Ubiquitin, AP_1 and other twelve blue-circle marked ones) directly connect with the four functional paths and form the gene-gene interaction functional network.
Four gene-gene interaction functional paths were highlighted, including TBX3 - VE_cadherin (CDH5), PERC (PPARGC1B) - LMAN1, PERC (PPARGC1B) - Podocalyxin_like 1 (PODXL) and Podocalyxin_like 1 (PODXL) - APEH.
From the results of gene-environment interaction analysis (Table 4), three SNPs (rs9469089, rs1049970 and rs10201627) with FDR <0.10 exhibited interaction with age (P = 0.0161, FDR = 0.0264; P = 0.0206, FDR = 0.0264 and P = 0.0356, FDR = 0.0308 respectively); rs4855881 had interaction with endotoxin level (P = 0.0454, FDR = 0.4086).
Results of Genetic Risk Score Analysis
As presented in Figure 2 and Table S9 in File S1, the more risk loci the subjects carried, the larger the rate of FEV1 decline occurred (Ptrend = 3.01×10−18). Bootstrap re-sampling analysis showed that bootstrap Ptrend was less than 1×10−15 with 92.75% of 2,000 bootstrap samples. Additionally, we performed stratification analysis of GRS by age and endotoxin level (Table S10 in File S1). Though, the rate of FEV1 decline became larger with the increase of GRS, the association differed among age subgroups or endotoxin subgroups, which indicated that there were interactions between GRS and age (P = 7.11×10−6) or GRS and endotoxin (P = 1.08×10−2) adjusted for other covariates (Figure 3).
The more risk loci the subjects carried, the larger the rate of FEV1 decline occurred (Ptrend = 3.01×10−18).
Each box and horizontal line represents the Coefficient and 95% CI derived from the linear regression model adjusted for height, age and FEV1 at baseline, and average log transformed endotoxin level. P values of interaction between GRS and age/endotoxin were acquired from linear regression model with covariates adjustment.
Host genetic factors probably influence susceptibility to the rate of FEV1 decline after occupational exposure to endotoxin. This exploratory study is the first to provide evidence that genetic factors may play an important role in determining the rate of FEV1 decline using a large scale gene-centric aproach. We found that two SNPs (rs1910047 and rs9469089) were significantly associated with the rate of FEV1 decline, as well as eight suggestive associated SNPs. Further analysis indicated that potential gene-gene interaction and gene-environment interaction were ubiquitous in the genetic architecture of complex traits.
rs1910047 is located at 12p24.21, upstream of two genes: T-box 5 (TBX5, 280 kb upstream) and T-box 3 (TBX3, 7 kb upstream). T-box genes encode transcription factors that can mediate the production of branching signals by the lung mesenchyme in the embryonic mouse lung. TBX4;TBX5 double heterozygous mutants showed decreased lung branching, indicating that T-box genes may play an important role in the process of lung growth , . In addition, the Tbx5 was reported to be associated with asthma susceptibility in the developing lung in rat models . Thus, it is biological plausible that SNP variation may result in aberrant activity of the TBX5 gene, which may affect lung function, accelerating FEV1 decline. TBX3 is a downstream target of Wnt/beta-catenin pathway, which is a key regulator in cell proliferation and differentiation . Recent evidence has suggested that TBX3 is over-expressed in a number of cancers including lung cancer , as well as in transformed lung fibroblast cells .
rs9469089 is located at 6p21.32, within the first intron of RNF5 (encoding membrane-bound ubiquitin ligase). RNF5 controls the membrane fraction of ATG4B and limits LC3 (ATG8) processing, aberrant of which may limit basal levels of autophagy and influence susceptibility to bacterial infection . Previous studies have collectively suggested that RNF5 negatively regulates the virus-triggered immune response , . Besides, rs9469089 is about 2 kb downstream of advanced glycosylation end product-specific receptor (AGER), which is consistently reported to be associated with lung function –.
rs1049970 which interacted with rs1910047 is a missense variant of the cadherin 5, type 2 (CDH5) gene that may play an important role in endothelial cell biology through control of the cohesion and organization of the intercellular junctions. Deficiency or truncation of Cdh5 induced endothelial apoptosis and abolished transmission of the endothelial survival signal in a mouse model . Both two SNPs are involved in cell stability, including proliferation, differentiation and apoptosis. Thus, it is biologically conceivable that these two SNPs, or those in LD with them, may directly regulate targeted genes, together resulting in more rapid FEV1 decline.
It is noteworthy that four SNPs interacted with each other adjacently, producing three pairs of end-to-end interactions in this study (rs10515978_rs32588, rs32588_rs11761231 and rs11761231_rs4855881). rs10515978 is within the intron of lectin, mannose-binding, 1 (LMAN1), mutational inactivation of which was a frequent and early event potentially contributing to colorectal tumorigenesis . rs32588 is within the exon of peroxisome proliferator-activated receptor gamma, coactivator 1 beta (PPARGC1B), which was associated with airway hyperreactivity in asthmatic patients . rs11761231 was reported to be associated with rheumatoid arthritis in Caucasians –. rs4855881 is located at 3p21.31, within the intron of N-acylaminoacyl-peptide hydrolase (APEH). Deletions of which are commonly at the 3p region, and have been found in various types of carcinomas, including lung cancer .
Functional network analysis reveals potential biological connections among these four pairs interacted genes. Fourteen nodes as well as other ones directly or indirectly connect to the four functional paths and form the network. Among the fourteen key nodes, genetic variation in the gene encoding p53 can impair the response to cell damage and increase the loss of alveolar epithelial cells . Also, it’s reported that p53 may modify the effect of diminishing PM10 exposure on lung function decline . Vascular endothelial growth factor receptor (VEGFR) was reported to mediate the anti-inflammatory, anti-protease, and anti-apoptosis effects of the lung, hence contribute to an attenuation of emphysema and destructive pulmonary function in COPD . Signal transducer and activator of transcription 3 (Stat3) plays an essential role in the pathogenesis of IgG immune complex (IC)-induced acute lung injury . Beoseanrceh morphogenetic protein 4 (BMP4) plays a potential regulatory role of lung fibroblast function . Combined with our gene-gene interaction results, it is possible that the interaction among genes is associated with the rate of FEV1 decline through some possible functional paths.
rs10129426 is about 4 kb downstream of BCL2-associated athanogene 5 (BAG5), which encodes anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins . According to the UCSC and TFSearch  databases, polymorphisms of this gene may influence transcription factor binding (GATA-binding factor 2) .
rs10201627 which is located in the micro-RNA binding sites of the 3′-UTR of G protein-coupled receptor 55 (GPR55), was predicted to affect miR-26b binding. And it was found that expression of miR-26b was down-regulated in lung cancer tissues . Moreover, rs3749073, as the sole non-synonymous polymorphism (Gly195Val) in the GPR55, is in high LD (r2 = 0.98) with rs10201627. It was predicted that GPR55 with Val195 appeared to induce less phosphorylation of extracellular signal-regulated kinase (ERK) than Gly195 . It was also reported that GPR55 promoted cancer cell proliferation by ERK .
Actually, among these top ten SNPs, five SNPs (rs1910047, rs1049970, rs4855881, rs10129426 and rs10201627) are related to cell proliferation or apoptosis and three SNPs (rs9469089, rs32588 and rs11761231) are related to the immune inflammatory response.
We then again investigated the marginal effects of rs1051740 (Tyr113His), rs1051741, which was in high LD with rs2234922 (His139Arg, not genotyped in our chip), rs1800629 and rs909253 in this study according to our previous studies of long-term pulmonary function decline , . None of them reached the statistical significance level. However, rs1051740, rs1051741 and rs909253 could still be significant modifiers in the association between endotoxin exposure and the rate of FEV1 decline. The effect of endotoxin exposure was different among subjects carrying different genotypes of SNP. The P values of heterogeneity for these three SNPs were 4.83×10−6, 6.07×10−3 and 1.04×10−2 respectively. Although, the subjects in this current study were all newly-hired young healthy female cotton workers, and therefore different from those prospectively followed for 20 years in our previous studies, the results still support what we reported previously. Additionally, four gene-gene interactions and four gene-environment interactions in this study again provided evidence that multiple factors might contribute to the rate of FEV1 decline interactively.
Overall, the more risk loci carried by the cotton worker, the larger the rate of FEV1 decline over 18 months. However, younger workers (age <25 years) had more accelerated decline per-month than relatively older ones (age ≥25 years) when GRS increased. Moreover, for those carrying same number of risk loci, younger workers had more rapid decline of FEV1 than relatively older ones. This provided evidence that endotoxin exposure tended to cause more serious damage to lung function of young workers, even with same genetic background.
Because all the workers of the cohort were lifelong young female non-smokers with relatively pure occupational exposure to endotoxin, this population provided a unique opportunity to observe gene effect to the rate of FEV1 decline when adjusted environmental effect. The results contribute scientific support to do further functional studies of those genes in relation to the pathogenesis of endotoxin-related airways disease.
However, it is important to recognize the limitations of our study. Firstly, there is no similar external population to validate the observed SNP associations. Although we have performed internal statistical validation by bootstrap re-sampling procedures to minimize false positive discovery, we could not exclude the possibility of false discoveries among our findings. Besides, we controlled both FDR and FWER in study-wide level (0.10 and 0.15 respectively), which guaranteed the confidence of what we found. Secondly, the sample size was not large enough to detect some SNPs with modest effects. Thus, study with enlarged sample size were warranted. Finally, airborne endotoxin concentrations were estimated from sampling airborne cotton dust at fixed positions in work areas, rather than from sampling the air in the personal breathing zone of each participant, since there is no personal sampling technology developed for these kinds of exposure. The lack of personal air sampling data may be a possible source of exposure misclassification for this study. Thus, we used average of repeated measurement of endotoxin exposure level for each subject.
In summary, we performed the first large scale exploratory study of genetic variation in the rate of FEV1 decline for newly-hired young healthy female cotton textile workers exposed to endotoxin. Genetic variants that play a role in immune inflammatory response, cell proliferation and cell apotosis, together with environmental factors interact to affect the rate of FEV1 decline after initiation of exposure and this decrement happens over short time. Additional replication and gene function studies are necessary to confirm what we found.
Supplementary tables (Table S1–S5, S7–S10) and figures (Figure S1–S3).
The association results of imputed SNPs and genotyped SNPs 500 kb around targeted SNPs (Table S6).
Description of the FWER and FDR controlling procedure.
Regional plot of the top-10 SNP using P values derived from multi-variable linear regression model adjusted for height, age and FEV1 at baseline, and average log transformed endotoxin level.
The authors thank all cotton textile workers and textile industry staff who participated in this study. We are grateful to Helian Dai and Marcia Chertok for research assistance, Dr. Hakon Hankonarson of Children’s Hospital of Pennsylvania for running the IBC-CVD chips, and Drs Honggang Yi and Zhihang Peng for double-checking results of statistical analysis. We also appreciate two anonymous reviewers for their valuable suggestions.
Conceived and designed the experiments: DCC FC HS ZH. Performed the experiments: AM LS HZ. Analyzed the data: RZ YZ YW JB. Wrote the paper: RZ YZ MC YL PX HY.
- 1. Eisner MD, Anthonisen N, Coultas D, Kuenzli N, Perez-Padilla R, et al. (2010) An official American Thoracic Society public policy statement: Novel risk factors and the global burden of chronic obstructive pulmonary disease. Am J Respir Crit Care Med 182: 693–718.
- 2. Harkema JR, Hotchkiss JA (1993) Ozone- and endotoxin-induced mucous cell metaplasias in rat airway epithelium: novel animal models to study toxicant-induced epithelial transformation in airways. Toxicol Lett 68: 251–263.
- 3. Janus ED, Phillips NT, Carrell RW (1985) Smoking, lung function, and alpha 1-antitrypsin deficiency. Lancet 1: 152–154.
- 4. Liebers V, Bruning T, Raulf-Heimsoth M (2006) Occupational endotoxin-exposure and possible health effects on humans. Am J Ind Med 49: 474–491.
- 5. Beck GJ, Schachter EN, Maunder LR, Schilling RS (1982) A prospective study of chronic lung disease in cotton textile workers. Ann Intern Med 97: 645–651.
- 6. Castellan RM, Olenchock SA, Kinsley KB, Hankinson JL (1987) Inhaled endotoxin and decreased spirometric values. An exposure-response relation for cotton dust. N Engl J Med 317: 605–610.
- 7. Kennedy SM, Christiani DC, Eisen EA, Wegman DH, Greaves IA, et al. (1987) Cotton dust and endotoxin exposure-response relationships in cotton textile workers. Am Rev Respir Dis 135: 194–200.
- 8. Wilk JB, Walter RE, Laramie JM, Gottlieb DJ, O’Connor GT (2007) Framingham Heart Study genome-wide association: results for pulmonary function measures. BMC Med Genet 8 Suppl 1S8.
- 9. Wilk JB, Chen TH, Gottlieb DJ, Walter RE, Nagle MW, et al. (2009) A genome-wide association study of pulmonary function measures in the Framingham Heart Study. PLoS Genet 5: e1000429.
- 10. Pillai SG, Ge D, Zhu G, Kong X, Shianna KV, et al. (2009) A genome-wide association study in chronic obstructive pulmonary disease (COPD): identification of two major susceptibility loci. PLoS Genet 5: e1000421.
- 11. Hancock DB, Eijgelsheim M, Wilk JB, Gharib SA, Loehr LR, et al. (2010) Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function. Nat Genet 42: 45–52.
- 12. Repapi E, Sayers I, Wain LV, Burton PR, Johnson T, et al. (2010) Genome-wide association study identifies five loci associated with lung function. Nat Genet 42: 36–44.
- 13. Artigas MS, Loth DW, Wain LV, Gharib SA, Obeidat M, et al. (2011) Genome-wide association and large-scale follow up identifies 16 new loci influencing lung function. Nat Genet 43: 1082–1090.
- 14. Hang J, Zhou W, Wang X, Zhang H, Sun B, et al. (2005) Microsomal epoxide hydrolase, endotoxin, and lung function decline in cotton textile workers. Am J Respir Crit Care Med 171: 165–170.
- 15. Zhang H, Hang J, Wang X, Zhou W, Sun B, et al. (2007) TNF polymorphisms modify endotoxin exposure-associated longitudinal lung function decline. Occup Environ Med 64: 409–413.
- 16. Wang XR, Pan LD, Zhang HX, Sun BX, Dai HL, et al. (2002) Follow-up study of respiratory health of newly-hired female cotton textile workers. Am J Ind Med 41: 111–118.
- 17. Wang XR, Pan LD, Zhang HX, Sun BX, Dai HL, et al. (2003) Lung function, airway reactivity, and atopy in newly hired female cotton textile workers. Arch Environ Health 58: 6–13.
- 18. Wang XR, Pan LD, Zhang HX, Sun BX, Dai HL, et al. (2003) A longitudinal observation of early pulmonary responses to cotton dust. Occup Environ Med 60: 115–121.
- 19. Olenchock SA, Christiani DC, Mull JC, Ye TT, Lu PL (1990) Airborne endotoxin concentrations in various work areas within two cotton textile mills in the People’s Republic of China. Biomed Environ Sci 3: 443–451.
- 20. Keating BJ, Tischfield S, Murray SS, Bhangale T, Price TS, et al. (2008) Concept, design and implementation of a cardiovascular gene-centric 50 k SNP array for large-scale genomic association studies. PLoS One 3: e3583.
- 21. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38: 904–909.
- 22. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 23. Stark JR, Wiklund F, Gronberg H, Schumacher F, Sinnott JA, et al. (2009) Toll-like receptor signaling pathway variants and prostate cancer mortality. Cancer Epidemiol Biomarkers Prev 18: 1859–1863.
- 24. Dai J, Gu J, Lu C, Lin J, Stewart D, et al. (2011) Genetic variations in the regulator of G-protein signaling genes are associated with survival in late-stage non-small cell lung cancer. PLoS One 6: e21120.
- 25. Qi L, Ma J, Qi Q, Hartiala J, Allayee H, et al. (2011) Genetic risk score and risk of myocardial infarction in Hispanics. Circulation 123: 374–380.
- 26. Talmud PJ, Drenos F, Shah S, Shah T, Palmen J, et al. (2009) Gene-centric association signals for lipids and apolipoproteins identified via the HumanCVD BeadChip. Am J Hum Genet 85: 628–642.
- 27. Zuniga J, Buendia I, Zhao Y, Jimenez L, Torres D, et al.. (2011) Genetic variants associated with severe pneumonia in A/H1N1 influenza infection. Eur Respir J 39: 604–610. Epub Jul 607.
- 28. Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR (2012) Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 44: 955–959.
- 29. Ekins S, Nikolsky Y, Bugrim A, Kirillov E, Nikolskaya T (2007) Pathway mapping tools for analysis of high content data. Methods Mol Biol 356: 319–350.
- 30. Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21: 263–265.
- 31. Cebra-Thomas JA, Bromer J, Gardner R, Lam GK, Sheipe H, et al. (2003) T-box gene products are required for mesenchymal induction of epithelial branching in the embryonic mouse lung. Dev Dyn 226: 82–90.
- 32. Arora R, Metzger RJ, Papaioannou VE (2012) Multiple roles and interactions of Tbx4 and Tbx5 in development of the respiratory system. PLoS Genet 8: e1002866.
- 33. Carpe N, Mandeville I, Ribeiro L, Ponton A, Martin JG, et al. (2010) Genetic influences on asthma susceptibility in the developing lung. Am J Respir Cell Mol Biol 43: 720–730.
- 34. Lu J, Li XP, Dong Q, Kung HF, He ML (2010) TBX2 and TBX3: the special value for anticancer drug targets. Biochim Biophys Acta 1806: 268–274.
- 35. Yin XJ, Xu JN, Zou CQ, He FS, Fang FD (2004) Genes differentially expressed in human lung fibroblast cells transformed by glycidyl methacrylate. Biomed Environ Sci 17: 432–441.
- 36. Kuang E, Okumura CY, Sheffy-Levin S, Varsano T, Shu VC, et al. (2012) Regulation of ATG4B Stability by RNF5 Limits Basal Levels of Autophagy and Influences Susceptibility to Bacterial Infection. PLoS Genet 8: e1003007.
- 37. Zhong B, Zhang Y, Tan B, Liu TT, Wang YY, et al. (2011) The E3 ubiquitin ligase RNF5 targets virus-induced signaling adaptor for ubiquitination and degradation. J Immunol 184: 6249–6255.
- 38. Zhong B, Zhang L, Lei C, Li Y, Mao AP, et al. (2009) The ubiquitin ligase RNF5 regulates antiviral responses by mediating degradation of the adaptor protein MITA. Immunity 30: 397–407.
- 39. Carmeliet P, Lampugnani MG, Moons L, Breviario F, Compernolle V, et al. (1999) Targeted deficiency or cytosolic truncation of the VE-cadherin gene in mice impairs VEGF-mediated endothelial survival and angiogenesis. Cell 98: 147–157.
- 40. Roeckel N, Woerner SM, Kloor M, Yuan YP, Patsos G, et al. (2009) High frequency of LMAN1 abnormalities in colorectal tumors with microsatellite instability. Cancer Res 69: 292–299.
- 41. Lee SH, Jang AS, Woo Park S, Park JS, Kim YK, et al. (2011) Genetic effect of single-nucleotide polymorphisms in the PPARGC1B gene on airway hyperreactivity in asthmatic patients. Clin Exp Allergy 41: 1533–1544.
- 42. Korman BD, Seldin MF, Taylor KE, Le JM, Lee AT, et al. (2009) The chromosome 7q region association with rheumatoid arthritis in females in a British population is not replicated in a North American case-control series. Arthritis Rheum 60: 47–52.
- 43. Burton P, Clayton D, Cardon L, Craddock N, Deloukas P, et al. (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447: 661–678.
- 44. Barton A, Thomson W, Ke X, Eyre S, Hinks A, et al. (2008) Rheumatoid arthritis susceptibility loci at chromosomes 10p15, 12q13 and 22q13. Nat Genet 40: 1156–1159.
- 45. Kohno T, Takayama H, Hamaguchi M, Takano H, Yamaguchi N, et al. (1993) Deletion mapping of chromosome 3p in human uterine cervical cancer. Oncogene 8: 1825–1832.
- 46. Korthagen NM, van Moorsel CH, Barlo NP, Kazemier KM, Ruven HJ, et al. (2012) Association between variations in cell cycle genes and idiopathic pulmonary fibrosis. PLoS One 7: e30442.
- 47. Imboden M, Schwartz J, Schindler C, Curjuric I, Berger W, et al. (2009) Decreased PM10 exposure attenuates age-related lung function decline: genetic variants in p53, p21, and CCND1 modify this effect. Environ Health Perspect 117: 1420–1427.
- 48. Guan XJ, Song L, Han FF, Cui ZL, Chen X, et al. (2013) Mesenchymal stem cells protect cigarette smoke-damaged lung and pulmonary function partly via VEGF-VEGF receptors. J Cell Biochem 114: 323–335.
- 49. Tang H, Yan C, Cao J, Sarma JV, Haura EB, et al. (2011) An essential role for Stat3 in regulating IgG immune complex-induced pulmonary inflammation. FASEB J 25: 4292–4300.
- 50. Pegorier S, Campbell GA, Kay AB, Lloyd CM (2010) Bone morphogenetic protein (BMP)-4 and BMP-7 regulate differentially transforming growth factor (TGF)-beta1 in normal human lung fibroblasts (NHLF). Respir Res 11: 85.
- 51. Takayama S, Sato T, Krajewski S, Kochel K, Irie S, et al. (1995) Cloning and functional analysis of BAG-1: a novel Bcl-2-binding protein with anti-cell death activity. Cell 80: 279–284.
- 52. Heinemeyer T, Wingender E, Reuter I, Hermjakob H, Kel AE, et al. (1998) Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL. Nucleic Acids Res 26: 362–367.
- 53. German Z, Chambliss KL, Pace MC, Arnet UA, Lowenstein CJ, et al. (2000) Molecular basis of cell-specific endothelial nitric-oxide synthase expression in airway epithelium. J Biol Chem 275: 8183–8189.
- 54. Gao W, Shen H, Liu L, Xu J, Xu J, et al. (2011) MiR-21 overexpression in human primary squamous cell lung carcinoma is associated with poor patient prognosis. J Cancer Res Clin Oncol 137: 557–566.
- 55. Ishiguro H, Onaivi ES, Horiuchi Y, Imai K, Komaki G, et al. (2011) Functional polymorphism in the GPR55 gene is associated with anorexia nervosa. Synapse 65: 103–108.
- 56. Andradas C, Caffarel MM, Perez-Gomez E, Salazar M, Lorente M, et al. (2011) The orphan G protein-coupled receptor GPR55 promotes cancer cell proliferation via ERK. Oncogene 30: 245–252.