Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Do Variants Associated with Susceptibility to Pancreatic Cancer and Type 2 Diabetes Reciprocally Affect Risk?

  • Lang Wu,

    Affiliations Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America, Center for Clinical and Translational Science; Mayo Clinic, Rochester, Minnesota, United States of America

  • Kari G. Rabe,

    Affiliation Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America

  • Gloria M. Petersen

    Petersen.Gloria@mayo.edu

    Affiliation Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America

Abstract

Objectives

Although type 2 diabetes mellitus is a known risk factor for pancreatic cancer, the existence of shared genetic susceptibility is largely unknown. We evaluated whether any reported genetic risk variants of either disease found by genome-wide association studies reciprocally confer susceptibility.

Methods

Data that were generated in previous genome-wide association studies (GENEVA Type 2 Diabetes; PanScan) were obtained through the National Institutes of Health database of Genotypes and Phenotypes (dbGaP). Using the PanScan datasets, we tested for association of 38 variants within 37 genomic regions known to be susceptibility factors for type 2 diabetes. We further examined whether type 2 diabetes variants predispose to pancreatic cancer risk stratified by diabetes status. Correspondingly, we examined the association of fourteen pancreatic cancer susceptibility variants within eight genomic regions in the GENEVA Type 2 Diabetes dataset.

Results

Four plausible associations of diabetes variants and pancreatic cancer risk were detected at a significance threshold of p = 0.05, and one pancreatic cancer susceptibility variant was associated with diabetes risk at threshold of p = 0.05, but none remained significant after correction for multiple comparisons.

Conclusion

Currently identified GWAS susceptibility variants are unlikely to explain the potential shared genetic etiology between Type 2 diabetes and pancreatic cancer.

Introduction

It has been long recognized that Type 2 diabetes mellitus (T2DM) is a risk factor for development of pancreatic cancer (PaC). A meta-analysis of 17 case-control and 19 cohort or nested case-control studies found that the combined odds ratio (OR) was 1.82 for PaC risk among diabetic subjects compared to nondiabetic subjects [1]. Another meta-analysis supported a conclusion that long-standing diabetes of at least 5 years duration has a RR of 2.0 (95% CI, 1.2 to 3.2) for development of PaC [2]. Recently, a study analyzing 15 case-control studies from the Pancreatic Cancer Case-Control Consortium (PanC4) found that the OR for a diagnosis of diabetes two or more years before the diagnosis of PaC (long-standing diabetes) is 1.90 (95% CI:1.72 to 2.09), and such an association even persists for 20 or more years after diabetes was diagnosed (OR = 1.30, 95% CI 1.17 to 2.03) [3]. Thus, longstanding diabetes can increase risk of developing PaC. Interestingly, a population-based case-control study found that first-degree relative history of T2DM is associated with increased risk of PaC (OR 1.95, 95% CI 1.23–3.09) [4]. Any underlying link to explain these observations is unknown. Given the observations regarding family history, one potential explanation is that there is a shared genetic etiology for risk of both T2DM and PaC. With the significant research investment in genetic analysis of both diseases, particularly in genome-wide association studies (GWAS), it is now possible to test whether reported genetic susceptibility variants might be reciprocally associated with risk of developing either condition.

GWAS is a design that aims to agnostically identify genetic susceptibility to complex diseases[5]. This study design has been extensively employed in T2DM and PaC, and many genetic susceptibility variants have been identified [6,7]. For example, the GENEVA Genes and Environment Initiatives in Type 2 Diabetes study, which includes the Nurses' Health Study (NHS) and the Health Professionals Follow-up Study (HPFS) subsets, contributed to understanding the genetic predisposition to T2DM [8]. PanScan studies, comprised of PanScan 1, PanScan 2, and PanScan 3, have reported eight susceptibility loci for PaC [9,10,11].

We hypothesized that the susceptibility variants identified though these published GWAS contribute to a shared etiology. A study examining the association between T2DM susceptibility variants and PaC risk using the PanScan 1 dataset reported three variants that were associated with PaC risk [12]. However, this study did not validate their findings using an independent dataset. Associations between PaC susceptibility variants and T2DM risk were not estimated, thus limiting the knowledge on the potential role of these GWAS-derived variants in any shared etiology. To systematically evaluate this question, we analyzed the combined PanScan 1and 2 data to evaluate whether T2DM susceptibility variants can predispose to PaC risk. Additionally, since around 15% of pancreatic cancer patients have a diagnosis of diabetes two or more years before the cancer diagnosis [3], it was of interest to assess whether the association between T2DM susceptibility variants and PaC risk can be differentiated by diabetes status of the studied individuals. We thus conducted a stratified analysis based on diabetes status (contrasting long-standing diabetes and no history of diabetes). We then used the GENEVA Type 2 Diabetes datasets to evaluate whether reported PaC susceptibility variants are associated with risk of T2DM. The overall study design is illustrated in Fig. 1.

thumbnail
Figure 1. Study Design.

Distinct comparisons examined T2DM SNPs in PanScan datasets and examined PaC SNPs in the GENEVA T2DM dataset.

https://doi.org/10.1371/journal.pone.0117230.g001

Materials and Methods

Genotype data

Two publicly available GWAS datasets of GENEVA Type 2 Diabetes and PanScan were obtained through dbGaP [13] to investigate our hypothesis.

PanScan included genotypes generated in two phases: PanScan 1 and PanScan 2 (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000206.v3.p2). Briefly, PanScan 1 consisted of 12 prospective cohort studies and a Mayo Clinic case-control study. The Illumina HumanHap550v3.0 genotyping platform genotyped 558,542 SNPs. Genotyping in PanScan 2 augmented PanScan 1 with 8 case-control studies from the PanC4. The Illumina Human610_Quadv1_B genotyping platform genotyped 581,188 SNPs. Age, sex, study site, and race comprised available covariate data. The Mayo Clinic subset drawn from PanScan 1 and 2 comprised 654 cases and 618 controls, with additional covariate information available: smoking status, first degree family history of cancer, body mass index (BMI), and diabetes status [14].

The GENEVA Genes and Environment Initiatives in Type 2 Diabetes study consisted of GWAS data generated using the NHS and HPFS cohorts. This dataset (http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000091.v2.p1) included two subsets: NHS (1581 cases and 1810 controls), and HPFS (1164 T2DM cases and 1338 controls, and 68 cases of uncertain diabetes type). Both studies used the Affymetrix Affy_6.0 genotyping platform and 934,940 SNPs were genotyped. Available covariates included family history of diabetes, high blood pressure, high blood cholesterol, smoking status, physical activity, age, BMI, alcohol intake, fat intake measures, magnesium intake, cereal fiber intake, heme iron intake, glycemic load, and gender.

Quality Control filters

PanScan genotype data had been filtered previously for call rate, relatedness, controls that became cases, and sex chromosome abnormalities. If duplicates were genotyped, the individual with the higher call rate was used. The final analysis included 3360 cases and 3468 controls in the combined PanScan data. The Mayo Clinic subset analysis consisted of 63 cases and 23 controls in the long-standing diabetes group (a diabetes diagnosis of 2 or more years earlier than diagnosis of pancreatic cancer for cases, or 2 or more years earlier than date of enrollment for controls), and 448 cases and 557 controls without diabetes.

The GENEVA datasets had been pre-filtered for sex chromosome abnormalities, sample identity, and call rate. We further filtered for relatedness of the sample and Hardy–Weinberg equilibrium (HWE) for variants studied in both NHS and HPFS sets. For the HPFS, we additionally filtered out subjects with uncertain diabetes status. The final sample for our analysis consisted of 1579 cases and 1801 controls from the NHS set (all females), 1162 cases and 1336 controls from the HPFS set (all males), for a total of 2741 cases and 3137 controls.

Genetic Variant Selection

Thirty-eight T2DM susceptibility SNPs [8,15,16,17,18,19,20,21,22,23,24,25] and 14 PaC susceptibility SNPs [9,10,11] from published GWAS were included in our analyses. Thirty-seven of the 38 T2DM predisposition variants and five of the 14 PaC predisposition variants had been genotyped and were available in the PanScan datasets and GENEVA T2DM datasets, respectively. To represent variants which were not captured in the GWAS data, we captured the genotype by identifying SNPs in high LD (r2>0.5) as determined by Haploview [26]. If multiple variants were identified for a variant of interest not in the GWAS, the SNP with the smallest p-value was chosen as representative for the association.

Statistical Analysis

Unconditional multivariable logistic regression assuming an additive model was employed in the association analyses. Available covariates were adjusted in the analyses. For associations reaching p = 0.05 threshold, a Bonferroni correction was conducted to adjust for multiple comparisons. Specifically, for the PaC dataset analysis, a threshold of 0.05/37 = 0.0014 was used since all 38 variants from 37 genomic regions were tested; for the T2DM dataset analysis, a threshold of 0.05/6 = 0.008 was used since 11 variants from 6 genomic loci were tested. Additionally, a correction of false discovery rate (FDR) was conducted to evaluate the significance [27].

For the PanScan data, covariates in the model included age, sex, study site, genotypic race (Eigenstrat principal components 1 and 2), and other significant principal components. We further conducted a stratified analysis in the Mayo Clinic subset according to diabetes status to investigate whether the T2DM susceptibility variants have a specific effect on either (1) long-standing diabetic PaC patients (cases who had long-standing diabetes 2 or more years before the diagnosis of pancreatic cancer) versus long-standing diabetic PaC controls, or (2) PaC cases and controls without diabetes.

In the GENEVA Type 2 Diabetes datasets, we conducted similar adjusted analysis. Covariates in the model included age, principal components (the first two components were used for NHS and 20 components were used for HPFS), family history of diabetes, high blood pressure, high blood cholesterol, smoking status, physical activity, BMI, alcohol intake, fat intake measures, magnesium intake, cereal fiber intake, heme iron intake, and glycemic load. As the NHS and HPFS set each contained a single gender, we did not include gender as a covariate in analyses of the separate sets but included gender in the combined set.

In all analyses, odds ratio (OR) and 95% confidence intervals (CI) were computed using Plink (http://pngu.mgh.harvard.edu/purcell/plink/) [28].

Results

T2DM SNPs and pancreatic cancer risk in the PanScan datasets

All 38 T2DM susceptibility variants that had been reported in the literature were included in the analyses as either observed in the GWAS data (n = 37) or represented by SNPs in high LD (n = 1). The results from the combined PanScan datasets are shown in Table 1.

thumbnail
Table 1. Association between Type 2 diabetes (T2DM) susceptibility variants and pancreatic cancer risk.

https://doi.org/10.1371/journal.pone.0117230.t001

In the combined data, four SNPs were associated with PaC risk. These included two variants, FTO rs8050136 (OR = 1.08, 95% CI 1.01–1.16, p = 0.03) and BCL11A rs243021 (OR = 0.90, 95% CI 0.84–0.97, p = 0.005), which confirmed the reported results of the PanScan 1 data (same direction and similar effect size) [12]. The other two variants were newly identified as JAZF1 rs1635852 (OR = 0.9, 95% CI 0.84–0.96, p = 0.003) and GCK rs4607517 (OR = 1.11, 95% CI 1.01–1.21, p = 0.03). In the PanScan 2 dataset (data not shown), the three SNPs (rs1387153, rs243021, and rs8050136) previously identified in PanScan 1 [12] did not replicate.

After stratifying by diabetes status in the Mayo Clinic subset, we detected five significant associations at p = 0.05 level. Focusing on long-standing diabetic PaC cases and controls, we identified SLC30A8 rs13266634 (OR = 4.92, 95% CI 1.71–14.2, p = 0.003). Focusing on PaC cases and controls without diabetes, we identified DGKB-TMEM195 rs2191348 (OR = 1.23, 95% CI 1.03–1.47, p = 0.02), KCNQ1 rs231362 (OR = 1.24, 95% CI 1.00–1.53, p = 0.046), ADCY5 rs2877716 (OR = 1.28, 95% CI 1.05–1.58, p = 0.02) and BCL11A rs243021 (OR = 0.83, 95% CI 0.69–0.99, p = 0.04) (Table 2).

thumbnail
Table 2. Association analysis of Type 2 diabetes mellitus (T2DM) susceptibility variants and pancreatic cancer risk stratified by diabetes status using the Mayo Clinic subset.

https://doi.org/10.1371/journal.pone.0117230.t002

Overall, four associations in PanScan sets reached statistical significance at a threshold of p = 0.05. However, none of them were significant after correction for multiple comparisons using the Bonferroni correction and FDR correction.

Association with type 2 diabetes

Among the 14 reported PaC susceptibility variants, 11 were genotyped in the GENEVA diabetes GWAS (n = 5) or were represented by SNPs in high LD (n = 6). Variants ABO rs505922, PDX1 rs9581943 and TERT rs2736098 were not captured in these datasets. Only the association between LINC-PINT rs6971499 and T2DM risk appeared significant at p = 0.05 level in the combined dataset of NHS and HPFS, however, it was no longer significant after correction for multiple comparisons (Table 3).

thumbnail
Table 3. Association between pancreatic cancer susceptibility variants with Type 2 diabetes (T2DM) risk in the GENEVA study.

https://doi.org/10.1371/journal.pone.0117230.t003

Discussion

Based on previous results that demonstrated a relationship between T2DM and PaC, we hypothesized a shared genetic etiology. Using published GWAS data, we tested the association between reported T2DM susceptibility variants and PaC risk, as well as the association between reported PaC susceptibility variants and T2DM risk. The analyses showed that only one PaC susceptibility variant was associated with T2DM risk at a weak significance level; similarly, there were only weak associations between T2DM susceptibility variants and PaC risk. These associations, found at a significance threshold of p = 0.05 became not significant after adjustment for multiple comparisons. We also did not replicate any of the three T2DM SNPs from the PanScan 1 dataset analysis in the PanScan 2 dataset.

One design strength of our study is that we could perform initial stratified analyses by diabetes status of PaC cases and controls to evaluate whether associations between T2DM susceptibility variants and PaC risk differ by diabetes history. We found no associations significant, despite one potential association showing a relatively large effect size (rs13266634, OR = 4.92). We discounted the large effect size for rs13266634 because it was likely due to the subgroup sample sizes of 63 cases and 23 controls, and the frequency of the minor allele (T) in the control subjects was not congruent with reports in the general population (0.46 vs 0.29). The sample size of the PaC cases and controls without diabetes (448 cases and 557 controls), was likely underpowered as well, but our findings on DGKB-TMEM195 rs2191348, KCNQ1 rs231362, ADCY5 rs2877716 and BCL11A rs243021 may warrant further evaluation from larger studies.

Our study has other limitations. We did not evaluate susceptibility variants from candidate gene association studies. Many biologically plausible candidate genes have been reported to be associated with T2DM and PaC in different studies [29,30]. However, it is well known that the replication rate for SNPs derived using the candidate gene association design is relatively low [6]. Thus rather than test variants which were not always demonstrated to be associated with disease risk, we focused on GWAS derived susceptibility SNPs which were consistently replicated.

One possible explanation for these results is that the variants we analyzed from the GWAS studies account for only a modest genetic susceptibility to disease risk. There may still be genetic loci beyond these GWAS-derived SNPs that could play a role that link these two diseases. It is also likely that the majority of the shared etiology of PaC and T2DM involve other factors beyond genetic level, such as obesity, epigenetic or environmental factors. Shared family environment could potentially explain the observed association between family history of diabetes and pancreatic cancer risk. Complementary genetic methods, such as family-based linkage studies or high throughput sequencing studies, may offer alternatives to characterize the potential shared genetic etiology of the two conditions, and may reveal novel associations or interactions.

In conclusion, we found that GWAS-derived susceptibility variants do not explain the potential shared genetic etiology of PaC and T2DM. We do report interesting associations that may warrant further study using independent datasets with larger sample sizes.

Acknowledgments

We thank Martha Matsumoto and Sarah Perkins for assistance with PanScan data preparation. The authors also thank the investigators and participants in the Pancreatic Cancer Cohort Consortium (PanScan), the Pancreatic Cancer Case-Control Consortium (PanC4) studies, and the GENEVA Genes and Environment Initiatives in Type 2 Diabetes study. DbGaP provided access to the datasets (dbGaP Study Accession: phs000091.v2.p1).

Author Contributions

Conceived and designed the experiments: LW GMP. Performed the experiments: LW KGR. Analyzed the data: LW. Contributed reagents/materials/analysis tools: KGR GMP. Wrote the paper: LW.

References

  1. 1. Huxley R, Ansary-Moghaddam A, Berrington de Gonzalez A, Barzi F, Woodward M (2005) Type-II diabetes and pancreatic cancer: a meta-analysis of 36 studies. British journal of cancer 92: 2076–2083. pmid:15886696
  2. 2. Everhart J, Wright D (1995) Diabetes mellitus as a risk factor for pancreatic cancer. A meta-analysis. JAMA: the journal of the American Medical Association 273: 1605–1609.
  3. 3. Bosetti C, Rosato V, Li D, Silverman D, Petersen GM, et al. (2014) Diabetes, antidiabetic medications and pancreatic cancer risk: an analysis from the international pancreatic cancer case-control consortium. Annals of oncology: official journal of the European Society for Medical Oncology / ESMO.
  4. 4. Austin MA, Kuo E, Van Den Eeden SK, Mandelson MT, Brentnall TA, et al. (2013) Family history of diabetes and pancreatic cancer as risk factors for pancreatic cancer: the PACIFIC study. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 22: 1913–1917. pmid:23966578
  5. 5. Bush WS, Moore JH (2012) Chapter 11: Genome-wide association studies. PLoS computational biology 8: e1002822. pmid:23300413
  6. 6. Klein AP (2012) Genetic susceptibility to pancreatic cancer. Molecular carcinogenesis 51: 14–24. pmid:22162228
  7. 7. Imamura M, Maeda S (2011) Genetics of type 2 diabetes: the GWAS era and future perspectives [Review]. Endocrine journal 58: 723–739. pmid:21778616
  8. 8. Zeggini E, Scott LJ, Saxena R, Voight BF, Marchini JL, et al. (2008) Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nature genetics 40: 638–645. pmid:18372903
  9. 9. Amundadottir L, Kraft P, Stolzenberg-Solomon RZ, Fuchs CS, Petersen GM, et al. (2009) Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer. Nature genetics 41: 986–990. pmid:19648918
  10. 10. Petersen GM, Amundadottir L, Fuchs CS, Kraft P, Stolzenberg-Solomon RZ, et al. (2010) A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nature genetics 42: 224–228. pmid:20101243
  11. 11. Wolpin BM, Rizzato C, Kraft P, Kooperberg C, Petersen GM, et al. (2014) Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer. Nature genetics 46: 994–1000. pmid:25086665
  12. 12. Pierce BL, Austin MA, Ahsan H (2011) Association study of type 2 diabetes genetic susceptibility variants and risk of pancreatic cancer: an analysis of PanScan-I data. Cancer causes & control: CCC 22: 877–883. pmid:21445555
  13. 13. Mailman MD, Feolo M, Jin Y, Kimura M, Tryka K, et al. (2007) The NCBI dbGaP database of genotypes and phenotypes. Nature genetics 39: 1181–1186. pmid:17898773
  14. 14. Wu L, Goldstein AM, Yu K, Yang XR, Rabe KG, et al. (2014) Variants Associated with Susceptibility to Pancreatic Cancer and Melanoma Do Not Reciprocally Affect Risk. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.
  15. 15. Bouatia-Naji N, Bonnefond A, Cavalcanti-Proenca C, Sparso T, Holmkvist J, et al. (2009) A variant near MTNR1B is associated with increased fasting plasma glucose levels and type 2 diabetes risk. Nature genetics 41: 89–94. pmid:19060909
  16. 16. Lyssenko V, Nagorny CL, Erdos MR, Wierup N, Jonsson A, et al. (2009) Common variant in MTNR1B associated with increased risk of type 2 diabetes and impaired early insulin secretion. Nature genetics 41: 82–88. pmid:19060908
  17. 17. Rung J, Cauchi S, Albrechtsen A, Shen L, Rocheleau G, et al. (2009) Genetic variant near IRS1 is associated with type 2 diabetes, insulin resistance and hyperinsulinemia. Nature genetics 41: 1110–1115. pmid:19734900
  18. 18. Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, et al. (2007) Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science 316: 1331–1336. pmid:17463246
  19. 19. Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, et al. (2007) A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316: 1341–1345. pmid:17463248
  20. 20. Sladek R, Rocheleau G, Rung J, Dina C, Shen L, et al. (2007) A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature 445: 881–885. pmid:17293876
  21. 21. Unoki H, Takahashi A, Kawaguchi T, Hara K, Horikoshi M, et al. (2008) SNPs in KCNQ1 are associated with susceptibility to type 2 diabetes in East Asian and European populations. Nature genetics 40: 1098–1102. pmid:18711366
  22. 22. Yasuda K, Miyake K, Horikawa Y, Hara K, Osawa H, et al. (2008) Variants in KCNQ1 are associated with susceptibility to type 2 diabetes mellitus. Nature genetics 40: 1092–1097. pmid:18711367
  23. 23. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, et al. (2010) New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nature genetics 42: 105–116. pmid:20081858
  24. 24. Saxena R, Hivert MF, Langenberg C, Tanaka T, Pankow JS, et al. (2010) Genetic variation in GIPR influences the glucose and insulin responses to an oral glucose challenge. Nature genetics 42: 142–148. pmid:20081857
  25. 25. Voight BF, Scott LJ, Steinthorsdottir V, Morris AP, Dina C, et al. (2010) Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis. Nature genetics 42: 579–589. pmid:20581827
  26. 26. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, et al. (2002) The structure of haplotype blocks in the human genome. Science 296: 2225–2229. pmid:12029063
  27. 27. Benjamini Y, Yekutieli D (2001) The control of the false discovery rate in multiple testing under dependency. Ann Statist 29: 1165–1188.
  28. 28. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. American journal of human genetics 81: 559–575. pmid:17701901
  29. 29. Barroso I, Luan J, Middelberg RP, Harding AH, Franks PW, et al. (2003) Candidate gene association study in type 2 diabetes indicates a role for genes involved in beta-cell function as well as insulin action. PLoS biology 1: E20. pmid:14551916
  30. 30. Lin Y, Yagyu K, Egawa N, Ueno M, Mori M, et al. (2011) An overview of genetic polymorphisms and pancreatic cancer risk in molecular epidemiologic studies. Journal of epidemiology / Japan Epidemiological Association 21: 2–12. pmid:21071884