Figure 1.
Flow-chart showing the analysis pipeline.
Direction of the arrows represents the workflow of the study design with performed analysis indicated. Solid lines indicate analysis performed within data of human pancreatic islets. Dashed lines indicate analysis performed against external databases. Light grey boxes indicate input data of human pancreatic islets. Dark grey boxes indicate output of significant data. White boxes indicate follow-up studies for look-up or functional- and biological validation of significant results.
Table 1.
Number of significant mQTL results in human pancreatic islets.
Figure 2.
Depiction and distance analysis of associations between genotype and DNA methylation of significant mQTLs in human pancreatic islets.
Depiction of (A) the most significant cis-mQTL; rs1771445 vs. cg02372404, and (B) the least significant cis-mQTL; rs196489 vs. cg06433283, among all identified cis-mQTLs in human pancreatic islets. Data is presented as Box and Whisker plots with P-values adjusted for multiple testing. (C) Distance analysis between SNPs and CpG sites of significant cis-mQTLs plotted as the number of identified mQTLs within each distance bin. Distance summary: minimum = 0 kb, 10%ile = 1.88 kb, 25%ile = 7.62 kb, 50%ile = 26.31 kb, 75%ile = 74.76 kb, 90%ile = 164.5 kb, maximum = 499.6 kb. (D) The strength of associations plotted against the distance between SNPs and CpG sites of significant cis-mQTLs after correction for multiple testing. Depiction of (E) the most significant trans-mQTL; rs17660464 vs. cg22968622, and (F) the least significant trans-mQTL; rs6440971 vs. cg10438649, among all identified trans-mQTLs in human pancreatic islets. Data is presented as Box and Whisker plots with P-values adjusted for multiple testing. (G) Quantile-Quantile plots (Q-Q plots) of –log10 (P-values) illustrating the distribution of P-values for all analyzed SNP-CpG pairs in the cis- (red dots) and trans- (blue dots) mQTL analysis in relation to a theoretical null distribution (grey diagonal line). Bold dots indicate significant mQTLs identified in the cis- (red dots) and trans-(blue dots) mQTL analysis after correction for multiple testing.
Figure 3.
Genomic distribution of CpG sites of significantly identified mQTLs in human pancreatic islets.
(A) Chromosomal distribution of CpG sites of significant cis- and trans-mQTLs in comparison to all analyzed CpG sites on the Infinium Human Methylation450 BeadChip. (B) All analyzed CpG sites on the Infinium Human Methylation450 BeadChip have been annotated to genomic regions based on their relation to the nearest gene (TSS: proximal promoter, defined as 200 bp or 1500 bp upstream of transcription start site; UTR: untranslated region) or in relation to the nearest CpG island (CpG island: DNA stretch of 200 bp or more with a C+G content of >50% and an observed CpG/expected CpG in excess of 0.6; Shore: the flanking region of CpG islands, 0–2000 bp; Shelf: regions flanking island shores, i.e., covering 2000–4000 bp distant from the CpG island). Distribution of CpG sites of significant mQTLs in relation to (C) the nearest gene and (D) in relation to CpG islands. *Significantly different distribution (P<0.05) of CpGs of significant cis- or trans-mQTLs from what is expected by chance based on a Chi-squared-test when compared with all analyzed CpG sites on the Infinium HumanMethylation450 BeadChip.
Table 2.
Number of significant eQTL results in the human pancreatic islets.
Figure 4.
CIT analysis identifies mQTLs where DNA methylation potentially mediates genetic associations with mRNA expression in human pancreatic islets.
(A) Depiction of possible relationship models between genotype as a causal factor (G), DNA methylation as a potential mediator (M) and islet mRNA expression as a phenotypic outcome (E). Left diagram: The causal or methylation mediated model. Middle diagram: The reactive or methylation-consequential model (reverse causality). Right diagram: The independent model. (B) Illustration of the study approach to identify if DNA methylation of CpG sites potentially mediates the causal association between SNPs and islet mRNA expression. Left: Workflow steps. Middle: Tested relationships between G, M and E in the different steps. Right: Number of identified sites in each step. Bottom: Conditions that must be fulfilled to conclude a mathematical definition of a causal relationship between G, M and E. Significantly called as causal at 5% FDR (causal hypothesis Q<0.05).
Table 3.
Identified cis-mQTLs/eQTLs where methylation of CpG sites is a potential mediator of genetic association with mRNA expression in human pancreatic islets.
Table 4.
KEGG pathways with enrichment of genes annotated to CpG sites of significant cis-mQTLs in human pancreatic islets.
Figure 5.
Identified mQTL/eQTL candidate genes GPX7, GSTT1 and SNX19 affect β-cell number and apoptosis.
Associations identified in the mQTL/eQTL analyses of human pancreatic islets. (A) rs835342 located approximately 5 kb upstream of GPX7 associates with DNA methylation of cg18087326 located 406 bp upstream of the GPX7 transcription start site (TSS) as well as with mRNA expression of GPX7. (B) rs4822453 located ∼121 kb downstream of GSTT1 associates with DNA methylation of cg17005068 located 241 bp upstream of the GSTT1 TSS as well as with mRNA expression of GSTT1. (C) rs3751035 located within exon 1 of SNX19 associates with DNA methylation of cg08912652 located within the gene body of SNX19 as well as with mRNA expression of SNX19. Data are presented as Box and Whisker plots with P-values adjusted for multiple testing. (D) qPCR quantification of siRNA mediated knockdown of Gpx7 (siGpx7), Gstt1 (siGstt1) and Snx19 (siSnx19) compared to negative control siRNA (siNC). * P<0.01, the graphs show the average of four independent knockdown experiments presented as mean ± SEM. (E) Knockdown of Gpx7 and Gstt1 resulted in increased combined caspase-3/7 activity compared to negative control siRNA under both control (white bars) and lipotoxic (black bars) conditions. * P<0.05, the graph shows the average of three independent knockdown experiments presented as mean ± SEM. (F) Knockdown of Snx19 (siSnx19) resulted in increased cell number compared to negative control siRNA (siNC) under both control (white bars) and lipotoxic (black bars) conditions. * P<0.05, the graph shows the average of six independent knockdown experiments presented as mean ± SEM.
Figure 6.
Diabetes SNPs reported by GWAS associate with DNA methylation in human pancreatic islets.
Depiction of some identified associations between SNP and DNA methylation in islets of reported type 1 diabetes loci: (A) INS, (B) HLA and (C) PTPN2; type 2 diabetes loci: (D) KCNJ11, (E) WFS1 and (F) ADCY5; and glucose-trait loci: (G) PDX1 and (H) GRB10. P-values adjusted for multiple testing. HLA rs1063355 and WFS1 rs1801216 were identified through proxy search and are in linkage with the GWAS reported diabetes SNPs HLA rs9272346 and WFS1 rs1801214, respectively.
Figure 7.
Distribution of CpG sites significantly associated with one or more mRNA transcripts, separated based on negative or positive correlations.
(A) 20,376 combinations in the region 0–500 kb upstream of transcription start site and (B) 5,718 intragenic combinations. Negative correlations were enriched in the region surrounding the transcription start site, both (C) upstream and (D) downstream. (E) 5,221 combinations 0–100 kb downstream of the gene. Associations corrected for multiple testing using false discovery rate at 5% (Q<0.05).
Figure 8.
mQTLs/eQTLs of GPX7 and SNX19 identified in the genome-wide analysis were biologically validated in pancreatic islets from a different set of human donors.
Biological validation of associations for (A) GPX7 rs835342 with DNA methylation of cg18087326 as well as with mRNA expression of GPX7 and (B) SNX19 rs3751035 with DNA methylation of cg08912652 as well as with mRNA expression of SNX19 in a set of human pancreatic islets from donors (n = 37) not included in the genome-wide mQTL/eQTL analysis. DNA methylation was analyzed using Pyrosequencing and mRNA expression using Affymetrix microarray. Data are presented as Box and Whisker plots with P-values.
Table 5.
Number of significant mQTL results in human pancreatic islets when including imputed genotyped data.