Pleiotropic genetic variants have independent effects on different phenotypes. C-reactive protein (CRP) is associated with several cardiometabolic phenotypes. Shared genetic backgrounds may partially underlie these associations. We conducted a genome-wide analysis to identify the shared genetic background of inflammation and cardiometabolic phenotypes using published genome-wide association studies (GWAS). We also evaluated whether the pleiotropic effects of such loci were biological or mediated in nature. First, we examined whether 283 common variants identified for 10 cardiometabolic phenotypes in GWAS are associated with CRP level. Second, we tested whether 18 variants identified for serum CRP are associated with 10 cardiometabolic phenotypes. We used a Bonferroni corrected p-value of 1.1×10-04 (0.05/463) as a threshold of significance. We evaluated the independent pleiotropic effect on both phenotypes using individual level data from the Women Genome Health Study. Evaluating the genetic overlap between inflammation and cardiometabolic phenotypes, we found 13 pleiotropic regions. Additional analyses showed that 6 regions (APOC1, HNF1A, IL6R, PPP1R3B, HNF4A and IL1F10) appeared to have a pleiotropic effect on CRP independent of the effects on the cardiometabolic phenotypes. These included loci where individuals carrying the risk allele for CRP encounter higher lipid levels and risk of type 2 diabetes. In addition, 5 regions (GCKR, PABPC4, BCL7B, FTO and TMEM18) had an effect on CRP largely mediated through the cardiometabolic phenotypes. In conclusion, our results show genetic pleiotropy among inflammation and cardiometabolic phenotypes. In addition to reverse causation, our data suggests that pleiotropic genetic variants partially underlie the association between CRP and cardiometabolic phenotypes.
Citation: Ligthart S, de Vries PS, Uitterlinden AG, Hofman A, CHARGE Inflammation working group, Franco OH, et al. (2015) Pleiotropy among Common Genetic Loci Identified for Cardiometabolic Disorders and C-Reactive Protein. PLoS ONE 10(3): e0118859. https://doi.org/10.1371/journal.pone.0118859
Academic Editor: Yun Li, University of North Carolina, UNITED STATES
Received: July 11, 2014; Accepted: January 20, 2015; Published: March 13, 2015
Copyright: © 2015 Ligthart et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: GWAS data is available on the webpage of the GIANT (http://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium), MAGIC (http://www.magicinvestigators.org/), GLGC (http://www.sph.umich.edu/csg/abecasis/public/lipids2010/), CARDIOGRAMplusC4D (http://www.cardiogramplusc4d.org/), DIAGRAM (http://diagram-consortium.org/) and ICBP (http://www.georgehretlab.org/icbp_088023401234-9812599.html) consortia. The CRP GWAS is already freely available for researchers now (contact person: Abbas Dehghan), however, it will be available soon for public at the webpage of LocusZoom (http://csg.sph.umich.edu/locuszoom/). The IRB protocol for the WGHS does not allow public release of individual level data. WGHS data are available to researchers who are approved for analysis on site (contact person: Daniel Chasman).
Funding: This work was supported by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek grant [916.12.154 to A.D.]. The WGHS is supported by [HL043851] and [HL080467] from the National Heart, Lung, and Blood Institute and [CA047988] from the National Cancer Institute, the Donald W. Reynolds Foundation and the Fondation Leducq, with collaborative scientific support and funding for genotyping provided by Amgen. The CRP GWAS meta-analysis was supported by the National Institutes of Health (NIH) ([N01-HC 25195 1]; [RO1 HL64753]; [R01 HL076784]; [1 R01 AG028321]). Part of the funding for genotyping in the WGHS was provided by Amgen. Amgen has not been involved in the design, analysis and interpretation of the study. This does not alter the authors’ adherence to PLOS ONE policies on sharing data and materials.
Competing interests: The authors have declared that no competing interests exist.
The risk of cardiometabolic diseases, the world’s leading cause of mortality, is higher in people with elevated levels of systemic inflammation, independent of traditional cardiometabolic risk factors . Elevated levels of C-reactive protein (CRP), as a measurement of systemic inflammation, are associated with hypertension , type 2 diabetes (T2D) [3,4], coronary artery disease (CAD) [1,5,6], stroke [7,8], peripheral artery disease , and mortality . Although observational data suggest a link between CRP and cardiometabolic phenotypes, Mendelian randomization approaches have provided evidence against a causal link between CRP and these cardiometabolic phenotypes [11–14].
Genome-wide association studies (GWAS) have discovered multiple single-nucleotide polymorphisms (SNPs) associated with inflammatory markers including CRP and different cardiometabolic phenotypes including T2D, coronary artery disease (CAD), lipids and hypertension [15–21]. From these GWAS we already learned that several genes, such as IL6R, APOC1, GCKR and HNF1A, are associated both with systemic inflammation and cardiometabolic phenotypes such as CAD, lipids and diabetes [15,17,21,22]. This phenomenon of one genetic locus affecting more than one phenotype is called genetic “pleiotropy”. In general, two types of pleiotropy can be defined. As previously defined by Solovieff et al., “biological pleiotropy” refers to a gene that has independent biological effects on more than one phenotype, and “mediated pleiotropy” refers to the situation where the genetic effect on phenotype B is mediated by phenotype A that is causally related to phenotype B . Although both types of pleiotropy are interesting, only biological pleiotropy refers to the genuine pleiotropy where the effect of the genetic variant on two or more phenotypes is independent.
We hypothesize that in addition to reverse causation, genetic loci with pleiotropic effects may underlie the association between CRP and cardiometabolic phenotypes. To this end, we applied a simple and robust approach to point out these pleiotropic genetic variants . First, we examined whether common variants identified for cardiometabolic phenotypes are associated with serum CRP levels as a measure of systemic inflammation. Second, we conversely examined whether variants so far identified for serum CRP associate with cardiometabolic phenotypes. In addition, we adjusted the association between the SNP and CRP for the cardiometabolic phenotypes and vice versa to distinguish a genuine biological pleiotropic effect from mediated pleiotropy.
Materials and Methods
To examine the overlap between genes for inflammation and cardio-metabolic disorders we collected GWAS meta-analyses data from published GWAS on cardiometabolic phenotypes and CRP [15–17,19,22]. These GWAS are mainly conducted in individuals from European ancestry (Table 1). We tested the genetic association of cardiometabolic SNPs with systemic inflammation using the largest published GWAS meta-analysis on CRP levels from the CHARGE (the Cohorts for Heart and Aging Research in Genomic Epidemiology) inflammation working group . Testing the genetic association of the CRP SNPs with 10 cardiometabolic phenotypes we used the recent GWAS data from the following consortia: Coronary Artery Disease Genome-wide Replication and Meta-analysis plus the Coronary Artery Disease, CARDIoGRAMplusC4D , International Consortium for Blood Pressure, ICBP , the Meta-Analyses of Glucose and Insulin-related traits Consortium, MAGIC [17,18], DIAbetes Genetics Replication And Meta-analysis, DIAGRAM , The Genetic Investigation of Anthropometric Traits, GIANT  and Global Lipids Genetic Consortium, GLGC . Additionally, we carried out analyses in a population based cohort study to explore the type of pleiotropy of the overlapping SNPs.
Cardiometabolic SNPs and association with CRP
We first compiled a list of genome-wide significant SNPs (p-value < 5×10-8) previously identified in large GWAS on cardiometabolic traits to test the genetic association in the CRP GWAS. The following cardiometabolic traits were included to generate the SNP list: coronary artery disease (51 SNPs in CARDIOGRAMplusC4D, n = 130,681 with 63,746 cases) ; blood pressure (29 SNPs in ICBP, n = 69,395) ; fasting glucose, fasting insulin (53 SNPs in MAGIC, n = 133,010) [17,18]; type 2 diabetes (55 SNPs in DIAGRAM, n = 149,821 with 34,840 cases) ; body-mass index (38 SNPs in GIANT, n = 123,865) ; LDL cholesterol, HDL cholesterol, triglycerides and total cholesterol (102 loci in GLGC, n = 100,184) . When the SNP was not available in the CRP GWAS, we searched for a proxy with an r2 > 0.8. For 6 SNPs, this was not possible. LD-based pruning was performed (r2 threshold of 0.3) using HapMap LD information to make sure that independent SNPs were included in the analysis . The SNP with the lowest p-value in one of the cardiometabolic GWAS was chosen. The final list included 283 independent SNPs that are genome-wide significantly associated with one or more cardiometabolic phenotypes.
CRP SNPs and association with cardiometabolic phenotypes
We used the publically available GWAS meta-analyses data to test whether any of the 18 independent genome-wide significant SNPs identified in the CRP GWAS were associated with the following cardiometabolic phenotypes: LDL cholesterol, HDL cholesterol, triglycerides and total cholesterol (GLGC); body mass index (GIANT); systolic blood pressure (ICBP); coronary artery disease (CARDIoGRAMplusC4D consortium); fasting glucose and fasting insulin (MAGIC); type 2 diabetes (DIAGRAM). All available GWASs provided p-values for all 18 CRP SNPs, except the GWAS on CAD and the glycemic phenotypes which were based on a custom chip array (Metabochip array ) containing 79,000 SNPs and this array did not include 8 of the CRP SNPs. For the SNPs that were not on the Metabochip, we used for fasting glucose and fasting insulin the previous GWAS published by Dupuis et al. , for type 2 diabetes only the stage 1 GWAS including all HapMap SNPs  and for CAD we used the summary data from the CARDIoGRAM meta-analysis only .
CRP and cardiometabolic measures
Coronary artery diseases was defined in the CARDIoGRAMplusC4D consortium using standard criteria for myocardial infarction or coronary artery disease namely symptoms of angina pectoris, previous myocardial infarction or cardiac intervention . Hypertension was defined in the ICBP consortium as systolic blood pressure ≥ 140 mmHg or diastolic blood pressure ≤ 90 mmHg . Fasting glucose and fasting insulin were measured in MAGIC using standard laboratory techniques . Type 2 diabetes was in the DIAGRAM consortium defined as fasting plasma glucose level ≥ 7.0 mmol/l or non-fasting glucose plasma level ≥ 11.0 mmol/l and/or treatment with oral antidiabetic medication or insulin . LDL cholesterol, HDL cholesterol, triglycerides and total cholesterol were measured in the GLGC using standard laboratory techniques .
We used the discovery panel of the recently published GWAS meta-analysis on serum CRP (CHARGE Inflammation) . The meta-analysis included 15 studies in the discovery panel (n = 65,000). CRP was natural log-transformed (lnCRP) and effects represented a 1-unit change in lnCRP per copy increase in risk allele.
In this study we evaluated 463 possible SNP-phenotype associations including 283 independent cardiometabolic SNPs in the CRP GWAS and 18 independent CRP SNPs in 10 different cardiometabolic GWAS. To address the issue of multiple testing we used a Bonferroni corrected alpha of 1.1×10-4 (0.05/463 tests) as a robust threshold for a significant association between the SNP and the phenotype in our study .
In a quantile-quantile (Q-Q) plot, a nominal probability distribution is compared against an empirical distribution. In the scenario that the nominal p-values form a straight line on a Q-Q plot when they are plotted against the empirical distribution, all relations are null. When the observed distribution is deflected to the left from the uniform null distribution, lower p-values are observed compared to that expected by chance (enrichment). We used QQ-plots to evaluate whether SNPs that are genome-wide significant associated with the cardiometabolic phenotype, were in the CRP GWAS distributed differently from what is expected under the null-hypothesis. Vice versa, we evaluated whether genes identified for CRP were in the cardiometabolic GWAS distributed differently from what is expected under the null-hypothesis. We used Fisher’s combined probability test to test for significant enrichment in the QQ-plots .
Evaluation of the type of pleiotropy
To clarify the type of genetic pleiotropy (biological or mediated), we performed additional analyses in the Women’s Genome Health Study (WGHS) including 23,294 women . In the first model, we analyzed the age-adjusted association between CRP (dependent variable) and the lead SNP for CRP in the pleiotropic regions. To examine whether the association is independent of cardiometabolic traits we further adjusted this association for BMI, lipid levels (HDL-cholesterol, LDL-cholesterol, triglycerides and total cholesterol) and HbA1C. We used HbA1C as a proxy for glycemic metabolism given the fact that glycated hemoglobin is an acceptable marker of average blood glucose level in the last 2–3 months . In addition, we adjusted the association for age and in a stepwise manner we added lipids, BMI and HbA1C to the model to evaluate the different effects of the phenotypes on the association. Last, we analyzed the association between the pleiotropic SNP and the associated cardiometabolic phenotypes unadjusted and adjusted for CRP. As we tested 43 SNP-phenotype associations in the WGHS, we used a Bonferroni corrected alpha of 1.2×10-03 as a threshold of study-wide significance. All regression analyses were carried out in the statistical software R version 2.15.3 .
Women’s Genome Health Study (WGHS)
The WGHS is a prospective cohort of female North American health care professionals representing participants in the Women’s Health Study who provided a blood sample at baseline and consent for blood-based analyses. Participants were 45 or older at enrollment and free of cardiovascular disease, cancer or other major chronic illness. The current data are derived from 23,294 WGHS participants with whole genome genetic data and verified self-reported, European ancestry. The study protocol was approved by the institutional review board of the Brigham and Women’s Hospital (Boston, MA, USA). All participants provided written informed consent to participate in the study.
BMI (weight in kilograms divided by height in meters squared) was calculated from responses to the baseline questionnaire. All baseline blood samples underwent measurement for high-sensitivity C-reactive protein (hsCRP) via a validated immunoturbidometric method (Denka Seiken, Tokyo, Japan). Concentrations of total cholesterol (TC) and HDL-C were measured enzymatically on a Hitachi 911 autoanalyzer (Roche Diagnostics) with day-to-day reproducibility of 1.36% and 1.07% for TC concentrations of 129.8 and 277.2 mg/dL, respectively, (throughout this report, concentrations and units given are those reported in the referenced sources) and of 1.98% and 2.68% for HDL-C concentrations of 35 and 55 mg/dL, respectively. LDL-C was determined directly (Genzyme) with reproducibility of 2.16% and 1.98% for concentrations of 76.2 and 148.7 mg/dL, respectively. Triglycerides were measured enzymatically, with correction for endogenous glycerol, using a Hitachi 917 analyzer and reagents and calibrators from Roche Diagnostics; reproducibility was 1.52% and 1.49% for triglyceride concentrations of 82.5 and 178.8 mg/dL, respectively. Hemoglobin A1c was measured using turbidimetric immunoinhibition directly from packed red blood cells (Roche Diagnostics) with reproducibility of 3.63% and 3.77% at levels of 5.2% and 8.8%, respectively. A total of 22,773 participants with genotyped and covariates available were included in this study.
DNA extracted from the baseline blood samples underwent SNP genotyping via the Illumina Infinium II assay for querying of a genome-wide set of SNPs from the Illumina HumanHap300 Duo “+” platform. This panel including the standard content of approximately 318,237 SNPs covering the entire genome from the HumanHap300 panel with an additional focused panel of 45,571 SNPs selected to enhance coverage of cardiovascular candidate genes and SNPs with suspected functional consequences. For the current analysis, all samples had successful genotype information for >98% of the SNPs, while all SNPs had successful genotype information for >90% of the samples. SNPs with significance p <10-6 for deviations from Hardy-Weinberg equilibrium were excluded from analysis. Self-reported European ancestry was confirmed in the 23,294 samples on the basis of a principal component analysis using PLINK among 1,443 ancestry informative SNPs selected for Fst >0.4 in the HapMap2. In total, 339,875 genotyped SNPs passing the criteria for inclusion also had minor allele frequency at least 1 percent. On the basis of linkage disequilibrium relationships in the HapMap (release 22), genotypes for a total of 2,621,896 SNPs were imputed from the 23,294 samples passing the quality criteria using Mach v. 1.0.16.
Pathway analysis was performed on the pleiotropic loci that we identified using Ingenuity Pathway Analysis software tool (IPA Ingenuity Systems). The Ingenuity Knowledge Base (including only genes) was used as a reference set and we considered molecules and/or direct and indirect relationships. The confidence filter was set to experimentally observed or high (predicted). Pathways were generated with a maximum size of 35 genes and we allowed up to 25 pathways. The significance p-value associated with enrichment of pathways was calculated using the right-tailed Fisher’s exact test, considering the number of query molecules that participate in that pathway and the total number of molecules that are known to be associated with that pathway in the reference set. A False Discovery Rate of five percent was used as a threshold of significance using the Benjamini-Hochberg method.
Cardiometabolic SNPs in CRP GWAS
First, we used QQ-plots to evaluate whether the p-values for the associations of the 283 cardiometabolic SNPs with serum CRP are distributed differently from what is expected under the null hypothesis in each trait group. As depicted in Fig. 1, the leftward deviation of the dotted lines in the QQ-plots shows that the 283 SNPs known for cardiometabolic phenotypes to have p-values smaller than expected under the null hypothesis in the CRP GWAS (p-value: 7.2×10-306).
QQ-plot was used to evaluate whether SNPs that are genome-wide significant associated with the cardiometabolic phenotypes, were in the CRP GWAS distributed differently from what is expected under the null-hypothesis. The observed p-values (dotted line) for the phenotypes deviated significantly leftwards indicating that these p-values are smaller than expected under null hypothesis.
A total of 19 SNPs out of 283 independent cardiometabolic SNPs (6.7%) were associated with CRP after correction for multiple testing (p-value threshold 1.1×10-04). These 19 SNPs were located within or close to 12 different genes APOC1, HNF1A, GCKR, IL6R, PPP1R3B, HNF4A, PABPC4, BCL7B, FTO, TMEM18, PLTP and MC4R. Table 2 shows the SNPs with the lowest p-values in the 12 pleiotropic loci based on the CRP GWAS, i.e. the lowest p-value in that genomic locus. The eight SNPs in Table 2 with the lowest p-value were already known to be associated with CRP based on the recent CRP GWAS . The next four SNPs were not identified in the genome-wide association study of CRP. The first novel association was rs1558902 with a p-value of 2.2×10-6. This SNP is located in the first intron of the FTO gene on chromosome 16. The second novel signal was the SNP rs2867125 which is located on chromosome 2, near 46kb downstream of TMEM18. This SNP had a p-value of 5.0×10-6 in the CRP meta-analysis. The third association was with rs6065906 which is located on chromosome 20, near the PLTP and PCIF1 gene (p-value = 6.7×10-6). The last finding was rs571312 which is located 2 Mb upstream of the MC4R gene on chromosome 18 (p-value = 3.8×10-5).
Among the associated SNPs, we observed many SNPs with different directions of effect on the cardiometabolic phenotypes and CRP than one would expect based on the association of CRP and these phenotypes in observational data. As an example, the A-allele of the SNP rs4420638 in the APOC1 locus increases serum CRP levels. However, this allele is associated with a decrease in the level of total cholesterol, LDL-cholesterol and triglycerides. We also observed such effects for the G-allele of the SNP rs1183910 in the HNF1A locus. This allele increases serum CRP levels and is associated with a decline in total cholesterol and LDL-cholesterol.
Out of the 12 pleiotropic loci, 6 loci had the same lead SNP in both the CRP and one or more of the cardiometabolic GWAS. In the other 6 loci the lead SNPs were different between the CRP GWAS and the cardiometabolic GWAS. However, in the majority of these loci the lead SNPs of the cardiometabolic GWAS were in high LD (r2 > 0.8) with the lead SNP in the CRP GWAS. In the IL6R locus we observed the lowest LD between the top hit in the CRP GWAS and the CAD GWAS (r2 = 0.52).
CRP SNPs in cardiometabolic GWAS
We used the same QQ-plots as described previously to evaluate whether the association p-values for the 18 CRP SNPs are distributed differently from what is expected under the null hypothesis in the different cardiometabolic GWAS. As depicted by the leftward deviation of the dotted lines in the QQ-plots for CAD (p = 1.4×10-09), the cholesterol phenotypes (HDL-cholesterol, p = 6.4×10-69; LDL-cholesterol, p = 2.9×10-166; total cholesterol, p = 3.6×10-169 and triglycerides, p = 2.5×10-196) and the glycemic phenotypes (fasting glucose, p = 2.4×10-12 and fasting insulin 5.5×10-04), the p-values for the association between the 18 CRP SNPs and these phenotypes are significantly smaller than expected under the null hypothesis (Fig. 2). We did not observe such a significant deviation in the QQ-plots of BMI (p = 0.18) and SBP (p = 0.06).
QQ-plots were used to evaluate whether SNPs that are genome-wide significant associated with CRP, were in the cardiometabolic GWAS distributed differently from what is expected under the null-hypothesis. The observed p-values (dotted line) for the phenotypes HDL-cholesterol, fasting glucose, type 2 diabetes and coronary artery disease deviated significantly leftwards indicating that these p-values are smaller than expected under null hypothesis.
Results of the association of the 18 genome-wide significant associations with CRP-level are depicted in Fig. 3 (S1 and S2 tables). We observed 9 associations with one or more of the 10 cardiometabolic phenotypes close to or within the genes APOC1, HNF1A, IL6R, GCKR, IL1F10, PPP1R3B, HNF4A, PABPC and BCL7B (p-value < 1.1×10-4). Only 1 gene (IL1F10) was not identified in the previous analysis where we tested the association between the cardiometabolic SNPs and CRP. Among all 9 associations, we found three associations that are not reported in the GWAS for that specific phenotype. The first was rs1183910 with CAD (p-value 5.6×10-6). This SNP is located in the first intron of the HNF1A gene on chromosome 12. The second was rs6734238 with total cholesterol (p-value 1.16×10-5). This SNP is located on chromosome 2, nearby the IL1F10 gene and other interleukin 1 family genes. The third was rs4420638 with T2D (p-value 4.0x10-6) nearby the APOC1 gene on chromosome 19.
P-values for the associations between the 18 CRP SNPs and BMI, lipids, glycemic phenotypes, SBP and coronary artery disease. The genes on the x-axis represent the genes in which the CRP SNPs are located or closest by. The numbers on the y-axis indicate the p-values of the association between the SNPs and the cardiometabolic phenotypes. Significant associations are colored as depicted in the figure legend. For BMI and SBP, no significant associations were observed. CAD, coronary artery disease; FG, fasting glucose; FI, fasting insulin; HDLC, HDL-cholesterol; LDLC, LDL-cholesterol; T2D, type 2 diabetes; TC, total cholesterol; TG, Triglycerides.
Comparable with the previous associations results, we observed many different direction of effects. For instance, the A-allele of the SNP rs4420638 in the APOC1 locus increases serum CRP levels and is associated with a lower risk of type 2 diabetes. Furthermore, the G-allele of the SNP rs1183910 in the HNF1A locus increases serum CRP levels and is associated with a lower risk of coronary artery disease.
Exploring the type of pleiotropy
We observed a total number of 13 genetic regions with pleiotropic effects on CRP and cardiometabolic phenotypes: 12 regions identified in the first step testing the cardiometabolic SNPs with CRP and 1 additional region identified in the second step testing the associations of the CRP SNPs with the cardiometabolic phenotypes. Table 3 shows the unadjusted and adjusted associations between the 13 overlapping SNPs and CRP-level using individual level data from the WGHS. There was no significant association in the WGHS between the SNPs located near PLTP and MC4R and CRP after correction for multiple testing. The effect sizes of the genetic loci in or near the genes APOC1, HNF1A, IL6R, PPP1R3B, HNF4A and IL1F10 did not diminish substantially after adjustment for BMI, cholesterol levels and HbA1C suggesting biological pleiotropy. For BCL7B, FTO and TMEM18 the effect sizes decreased considerably implying mediated pleiotropy. The estimate of the association between rs1260326 (GCKR) and CRP decreased substantially after adjustment but was still strongly associated. We observed the same scenario for the association between rs4660293 (PABPC4) and CRP. When we added the phenotypes in a stepwise manner to the model, we observed for the mediated pleiotropic loci FTO and TMEM18 that the effect was mainly mediated through BMI (S3 Table). For BCL7B and PABPC4, lipids appeared to be the most important mediators. Fig. 4 shows graphically the biological and mediated pleiotropic effects.
Overlapping loci among inflammation and cardiometabolic phenotypes and type of pleiotropy according to the additional analyses. We identified six overlapping loci with mediated pleiotropic effects on CRP (left) and seven with a biological pleiotropic effect (right).
The results for the associations between the pleiotropic SNPs and the associated cardiometabolic phenotypes are presented in S4 Table. Eight SNPs were not significantly associated with the cardiometabolic phenotype in the WGHS after adjustment for multiple testing. The majority of the estimates in- or decreased slightly after adjustment for CRP. However, the estimates between APOC1 and HbA1C, PABPC4 and triglycerides and BCL7B and HDL-cholesterol decreased considerably after the adjustment for CRP.
The results from the pathway analysis including all 13 pleiotropic genes are listed in the S5 Table. A total number of 13 canonical pathways were significantly enriched using an FDR of five percent. The top pathways included the FXR/RXR activation (p = 7.4×10-09), LXR/RXR activation (p = 4.6×10-05), Maturity Onset Diabetes of the Young (MODY) signaling (p = 7.6×10-05), hepatic cholestasis (p = 1.1×10-04) and acute phase response signaling (p = 1.3×10-04).
We observed several overlapping common genetic risk factors for cardiometabolic phenotypes and systemic inflammation. The additional analyses provided evidence for six biological pleiotropic loci with independent effects on both CRP and the cardiometabolic phenotype. These pleiotropic loci suggest a shared genetic background for CRP and cardiometabolic phenotypes. In addition, 5 pleiotropic loci appeared to have an effect on CRP mediated through the cardiometabolic phenotypes. Taken together, our results highlight the complex shared genetic architecture of cardiometabolic phenotypes and chronic inflammation.
Several of the identified biological pleiotropic loci suggest that the association between CRP and cardiometabolic phenotypes is not only reverse causation, but also shared independent genetic effects. Both the HNF1A and HNF4A loci were associated with CRP after adjustment for the cardiometabolic phenotypes. The effect directions were the same for type 2 diabetes and CRP, implying people carrying the risk allele for type 2 diabetes also have higher CRP values. We observed this also for the PPP1R3B locus where people carrying the risk allele for higher cholesterol also experience higher CRP levels. In both cases the effect on CRP is independent of the effect on the corresponding cardiometabolic trait.
Three of the associations that were not reported in the GWAS on CRP-level (FTO, TMEM18 and MC4R) are associations with SNPs discovered in the GWAS on BMI by Speliotes et al. . Moreover, these SNPs were also the leading findings in this large BMI GWAS meta-analysis. Our additional analyses clearly showed that after adjustment for BMI, the effects of FTO and TMEM18 decreased substantially, resulting in a non-significant association, which suggests that their effect on inflammation is indeed mediated by BMI. This is in line with previous research that already provided evidence for a causal role of BMI in inflammation . Conversely, none of the SNPs identified in the CRP GWAS were associated with BMI when we tested these SNPs in the BMI GWAS.
Our results also suggest a role for lipids in systemic inflammation. When we adjusted the association between BCL7B loci and CRP for the cardiometabolic phenotypes including lipids, the association was not present anymore. This locus appears to increase systemic inflammation through their effect on lipids. Also the association between PABPC4 and CRP decreased after adjustment for CRP, but there was a significant residuals effect suggesting partly mediated effects through lipids. The observation that lipids may cause inflammation is in line with previous studies that have shown an important role for oxidized LDL-cholesterol molecules and free fatty acids in the development of systemic inflammation . However, in addition to the mediated pleiotropic loci among lipids and CRP, we also observed loci with independent effects (biological pleiotropy) on lipids and CRP including APOC1, HNF1A and HNF4A, highlighting the complex interrelationship of lipids and inflammation. Moreover, the pathway analysis confirmed the role of the pleiotropic genes in both inflammation and lipid metabolism.
We observed little overlap between risk loci for CAD and CRP. Apart from the IL6R gene, our results suggest an association with CAD at the HNF1A locus. The HNF1A gene is an important hepatic nuclear transcription factor that has been associated in GWAS with lipids and diabetes [19,21]. This gene is known to regulate many target genes involved in lipid metabolism and transport . A previous study has associated this locus with different cardiovascular phenotypes including coronary artery calcification and incident CHD . Unfortunately we were not able to look-up 9 CRP SNPs in the larger CAD Metabochip GWAS because these variants were not on the Metabochip and no appropriate proxies were available. This might partly explain the little overlap between CRP and CAD genetic risk variants.
In the additional analyses we used glycated hemoglobin (HbA1C) to adjust for fasting glucose, fasting insulin, T2D and other components of the glucose homeostasis. HbA1C represents the average glucose level in the last 3 months, implying that this is only a proxy for the complex glucose homeostasis rather than a covariate that reflects its entire biological metabolism. Therefore, there may still be residual confounding from other biological pathways that have an effect on glucose and insulin levels. This could explain the observed residual effect of GCKR on CRP after adjustment for the cardiometabolic phenotypes.
In the evaluation of the type of pleiotropy, we adjusted the association between the pleiotropic SNP and CRP for the cardiometabolic phenotypes. For some variants we observed a convincing attenuation in the effect estimates (BCL7B, FTO and TMEM18). For other variants, the attenuation was less pronounced (GCKR and PABPC4). From these results we cannot conclude whether the residual effect is residual confounding or a true residual effect. Additionally, for several variants the effect estimates were the same or even increased after adjustment suggesting biological pleiotropy. The latter increase in estimate might be due to negative confounding where the SNP has an opposite direction of effect on the cross-associated phenotype compared to CRP and the effect of this phenotype on CRP is in the same direction (negative confounding). We also analyzed the association between the pleiotropic SNP and the cardiometabolic phenotypes unadjusted and adjusted for CRP. Although there is ample of evidence against a causal role for CRP in the development of cardiometabolic phenotypes, for some associations the effect estimates attenuated considerably [11–14]. This might be explained by the fact that CRP is correlated with many intermediate phenotypes that mediate the association between the SNP and the cardiometabolic phenotype.
Among some pleiotropic SNPs we observed opposite direction of effects on the phenotypes than one would expect based on their effects on the health of the possessor and the association of CRP and these cardiometabolic phenotypes in observational data. This phenomenon is known as “antagonistic pleiotropy” . For instance, the SNP in the HNF1A locus increases serum CRP level according to the G allele and decreases LDL-cholesterol level. For biological pleiotropic loci we can substantiate this antagonistic effect. A genetic locus may have a deleterious effect on one phenotype, but an independent beneficial effect on a second phenotype. An explanation for these findings might be the fact that the effect sizes and variances explained by the genetic variants are small and therefore they only play a minor role in the phenotypical correlations. Moreover, the high frequency of seemingly detrimental alleles in human populations may partly be the effect of antagonistic pleiotropy . As expected, among the loci where no independent effect was observed (mediated pleiotropy), we did not observe antagonistic pleiotropy.
Our study has certain strengths. We used the largest available GWAS data on lipids, blood pressure, BMI, CAD, glycemic traits, T2D and CRP from the GLGC, ICBP, GIANT, CARDIoGRAMplusC4D, MAGIC, DIAGRAM and CHARGE Inflammation consortia to attain as much power as possible. By including only genome-wide significant findings, we restricted the analysis to the most robust genetic associations. Moreover, we used a conservative method to correct for multiple testing, lowering the probability of false positive findings. Nonetheless, some limitations should be acknowledged. Although we used the largest available GWAS sample sizes, the identified common genetic variants for above mentioned phenotypes only explain a modest fraction of the genetic variance of these phenotypes (ranging from 5 to 12 percent). Therefore, the effects of the cardiometabolic SNPs on CRP and vice-versa may still be too small to detect cross-phenotype associations, resulting in an underestimation of the amount of genetic overlap. Moreover, we only focused on common SNPs and it might be that also rare variants underlie the shared genetic associations. The method we applied to distinguish “biological” from “mediated” pleiotropy is a classical and widely used approach in the field of epidemiology. However, we cannot completely rule out reverse causation or unknown confounders as potential drivers of the association between the genetic variant and CRP. Furthermore, we only studied GWAS including participants of European ancestry. We are aware of differences in haplotype structures between different ethnicities; however, the results are likely to be generalizable given the biological pathways.
In conclusion, we observed several genetic loci with independent effects on both CRP and one or more cardiometabolic phenotypes. These results suggest that the association between CRP and cardiometabolic phenotypes is partly explained by a shared genetic background.
S1 Table. The associations of CRP SNPs with body mass index and cholesterol levels.
S2 Table. The associations of CRP SNPs with coronary artery disease, glycaemic phenotypes and blood pressure.
S3 Table. Pleiotropic SNPs and their association with CRP stepwise adjusted for cardiometabolic phenotypes.
S4 Table. Pleiotropic SNPs and their association with cardiometabolic phenotype.
S5 Table. Pathway analysis results from the 13 pleiotropic genes.
We thank all collaborating scientists from the consortia GLGC, ICBP, GIANT, CARDIoGRAMplusC4D, MAGIC, DIAGRAM for making the meta-analysis results publically available. The authors would also like to thank all the members of the CHARGE Inflammation Working Group: Abbas Dehghan, Josée Dupuis, Maja Barbalic, Joshua C Bis, Gudny Eiriksdottir, Chen Lu, Niina Pellikka, Henri Wallaschofski, Johannes Kettunen, Peter Henneman, Jens Baumert, David P Strachan, Christian Fuchsberger, Veronique Vitart, James F Wilson, Guillaume Paré, Silvia Naitza, Megan E Rudock, Ida Surakka, Eco JC de Geus, Behrooz Z Alizadeh, Jack Guralnik MD, Alan Shuldiner, Toshiko Tanaka, Robert YL Zee, Renate B Schnabel, Vijay Nambi, Maryam Kavousi, Samuli Ripatti, Matthias Nauck, Nicholas L Smith, Albert V Smith, Jouko Sundvall, Paul Scheet, Yongmei Liu, Aimo Ruokonen, Lynda M Rose, Martin G Larson, Ron C Hoogeveen, Nelson B Freimer, Alexander Teumer, Dipl-Math, Russell P Tracy, Lenore J Launer, Julie E Buring, Jennifer F Yamamoto, Aaron R Folsom, Eric JG Sijbrands, James Pankow, Paul Elliott, John F Keaney, Wei Sun, Antti-Pekka Sarin, João D Fontes, Sunita Badola, Brad C Astor, Albert Hofman, Anneli Pouta, Karl Werda, Karin H Greiser, Oliver Kuss, Henriette E Meyer zu Schwabedissen, Joachim Thiery, Yalda Jamshidi, Ilja M Nolte, Nicole Soranzo, Timothy D Spector, Henry Völzke, Alexander N Parker, Thor Aspelund, David Bates, Lauren Young, Kim Tsui, David S Siscovick, Xiuqing Guo, Jerome I Rotter, Manuela Uda, David Schlessinger, Igor Rudan, Andrew A Hicks, Brenda W Penninx, Barbara Thorand, Christian Gieger, Joe Coresh, Gonneke Willemsen, Tamara B Harris, Andre G Uitterlinden, Marjo-Riitta Järvelin, Kenneth Rice, Dörte Radke, Veikko Salomaa, Ko Willems van Dijk, Eric Boerwinkle, Ramachandran S Vasan, Luigi Ferrucci, Quince D Gibson, Stefania Bandinelli, Harold Snieder, Dorret I Boomsma, Xiangjun Xiao, Harry Campbell, Caroline Hayward, Peter P Pramstaller, Cornelia M van Duijn, Leena Peltonen, Bruce M Psaty, Vilmundur Gudnason, Paul M Ridker, Georg Homuth, Wolfgang Koenig, Christie M Ballantyne, Jacqueline CM Witteman, Emelia J Benjamin, Markus Perola, Daniel I Chasman. Correspondence to the leading author of the CHARGE Inflammation Working Group Abbas Dehghan, MD, PhD, Erasmus Medical Center, Dr Molewaterplein 50, 3015 GE Rotterdam, Netherlands (E-mail: firstname.lastname@example.org).
Conceived and designed the experiments: SL AD. Performed the experiments: SL AD DIC. Analyzed the data: SL AD DIC. Contributed reagents/materials/analysis tools: AH AGU OHF. Wrote the paper: SL PSdV AGU AH OHF DIC AD.
- 1. Danesh J, Wheeler JG, Hirschfield GM, Eda S, Eiriksdottir G, Rumley A, et al. (2004) C-reactive protein and other circulating markers of inflammation in the prediction of coronary heart disease. N Engl J Med 350: 1387–1397. pmid:15070788
- 2. Sesso HD, Buring JE, Rifai N, Blake GJ, Gaziano JM, Ridker PM (2003) C-reactive protein and the risk of developing hypertension. JAMA 290: 2945–2951. pmid:14665655
- 3. Dehghan A, Kardys I, de Maat MP, Uitterlinden AG, Sijbrands EJ, Bootsma AH, et al. (2007) Genetic variation, C-reactive protein levels, and incidence of diabetes. Diabetes 56: 872–878. pmid:17327459
- 4. Pradhan AD, Manson JE, Rifai N, Buring JE, Ridker PM (2001) C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus. JAMA 286: 327–334. pmid:11466099
- 5. Cushman M, Arnold AM, Psaty BM, Manolio TA, Kuller LH, Burke GL, et al. (2005) C-reactive protein and the 10-year incidence of coronary heart disease in older men and women: the cardiovascular health study. Circulation 112: 25–31. pmid:15983251
- 6. Blake GJ, Rifai N, Buring JE, Ridker PM (2003) Blood pressure, C-reactive protein, and risk of future cardiovascular events. Circulation 108: 2993–2999. pmid:14638538
- 7. Ridker PM, Cushman M, Stampfer MJ, Tracy RP, Hennekens CH (1997) Inflammation, aspirin, and the risk of cardiovascular disease in apparently healthy men. N Engl J Med 336: 973–979. pmid:9077376
- 8. Rost NS, Wolf PA, Kase CS, Kelly-Hayes M, Silbershatz H, Massaro JM, et al. (2001) Plasma concentration of C-reactive protein and risk of ischemic stroke and transient ischemic attack: the Framingham study. Stroke 32: 2575–2579. pmid:11692019
- 9. Ridker PM, Cushman M, Stampfer MJ, Tracy RP, Hennekens CH (1998) Plasma concentration of C-reactive protein and risk of developing peripheral vascular disease. Circulation 97: 425–428. pmid:9490235
- 10. Hindorff LA, Rice KM, Lange LA, Diehr P, Halder I, Walston J, et al. (2008) Common variants in the CRP gene in relation to longevity and cause-specific mortality in older adults: the Cardiovascular Health Study. Atherosclerosis 197: 922–930. pmid:17888441
- 11. Elliott P, Chambers JC, Zhang W, Clarke R, Hopewell JC, Peden JF, et al. (2009) Genetic Loci associated with C-reactive protein levels and risk of coronary heart disease. JAMA 302: 37–48. pmid:19567438
- 12. Collaboration CRPCHDG (2011) Association between C reactive protein and coronary heart disease: mendelian randomisation analysis based on individual participant data. BMJ: British Medical Journal 342.
- 13. Smith GD, Lawlor DA, Harbord R, Timpson N, Rumley A, Lowe GDO, et al. (2005) Association of C-reactive protein with blood pressure and hypertension life course confounding and Mendelian randomization tests of causality. Arteriosclerosis, thrombosis, and vascular biology 25: 1051–1056. pmid:15731495
- 14. Timpson NJ, Nordestgaard BG, Harbord RM, Zacho J, Frayling TM, Tybjærg-Hansen A, et al. (2011) C-reactive protein levels and body mass index: elucidating direction of causation through reciprocal Mendelian randomization. International Journal of Obesity 35: 300–308. pmid:20714329
- 15. The CDC, Deloukas P, Kanoni S, Willenborg C, Farrall M, Assimes TL, et al. (2012) Large-scale association analysis identifies new risk loci for coronary artery disease. Nat Genet 45: 25–33. pmid:23202125
- 16. International Consortium for Blood Pressure Genome-Wide Association S, Ehret GB, Munroe PB, Rice KM, Bochud M, Johnson AD, et al. (2011) Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478: 103–109. pmid:21909115
- 17. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, Jackson AU, et al. (2010) New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 42: 105–116. pmid:20081858
- 18. Scott RA, Lagou V, Welch RP, Wheeler E, Montasser ME, Luan J, et al. (2012) Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat Genet 44: 991–1005. pmid:22885924
- 19. Morris AP, Voight BF, Teslovich TM, Ferreira T, Segre AV, Steinthorsdottir V, et al. (2012) Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet 44: 981–990. pmid:22885922
- 20. Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, et al. (2010) Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet 42: 937–948. pmid:20935630
- 21. Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, et al. (2010) Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466: 707–713. pmid:20686565
- 22. Dehghan A, Dupuis J, Barbalic M, Bis JC, Eiriksdottir G, Lu C, et al. (2011) Meta-analysis of genome-wide association studies in >80 000 subjects identifies multiple loci for C-reactive protein levels. Circulation 123: 731–738. pmid:21300955
- 23. Solovieff N, Cotsapas C, Lee PH, Purcell SM, Smoller JW (2013) Pleiotropy in complex traits: challenges and strategies. Nat Rev Genet 14: 483–495. pmid:23752797
- 24. Olden M, Teumer A, Bochud M, Pattaro C, Köttgen A, Turner ST, et al. (2013) Overlap between common genetic polymorphisms underpinning kidney traits and cardiovascular disease phenotypes: the CKDGen Consortium. American Journal of Kidney Diseases 61: 889–898. pmid:23474010
- 25. Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O’Donnell CJ, de Bakker PIW (2008) SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics 24: 2938–2939. pmid:18974171
- 26. Voight BF, Kang HM, Ding J, Palmer CD, Sidore C, Chines PS, et al. (2012) The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits. PLoS Genet 8: e1002793. pmid:22876189
- 27. Schunkert H, Konig IR, Kathiresan S, Reilly MP, Assimes TL, Holm H, et al. (2011) Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat Genet 43: 333–338. pmid:21378990
- 28. McIntyre LM, Martin ER, Simonsen KL, Kaplan NL (2000) Circumventing multiple testing: a multilocus Monte Carlo approach to testing for association. Genet Epidemiol 19: 18–29. pmid:10861894
- 29. Peng G, Luo L, Siu H, Zhu Y, Hu P, Hong S, et al. (2010) Gene and pathway-based second-wave analysis of genome-wide association studies. European Journal of Human Genetics 18: 111–117. pmid:19584899
- 30. Ridker PM, Chasman DI, Zee RY, Parker A, Rose L, Cook NR, et al. (2008) Rationale, design, and methodology of the Women’s Genome Health Study: a genome-wide association study of more than 25,000 initially healthy american women. Clin Chem 54: 249–255. pmid:18070814
- 31. Nathan DM, Singer DE, Hurxthal K, Goodson JD (1984) The clinical information value of the glycosylated hemoglobin assay. N Engl J Med 310: 341–346. pmid:6690962
- 32. Team RC (2012) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria Available from: http://wwwR-projectorg/.
- 33. Rocha VZ, Libby P (2009) Obesity, inflammation, and atherosclerosis. Nat Rev Cardiol 6: 399–409. pmid:19399028
- 34. Odom DT, Zizlsperger N, Gordon DB, Bell GW, Rinaldi NJ, Murray HL, et al. (2004) Control of pancreas and liver gene expression by HNF transcription factors. Science 303: 1378–1381. pmid:14988562
- 35. Reiner AP, Gross MD, Carlson CS, Bielinski SJ, Lange LA, Fornage M, et al. (2009) Common Coding Variants of the HNF1A Gene Are Associated With Multiple Cardiovascular Risk Phenotypes in Community-Based Samples of Younger and Older European-American Adults The Coronary Artery Risk Development in Young Adults Study and The Cardiovascular Health Study. Circulation: Cardiovascular Genetics 2: 244–254. pmid:20031592
- 36. Williams GC (2001) Pleiotropy, natural selection, and the evolution of senescence. Science’s SAGE KE 2001: 13.
- 37. Carter AJ, Nguyen AQ (2011) Antagonistic pleiotropy as a widespread mechanism for the maintenance of polymorphic disease alleles. BMC Med Genet 12: 160. pmid:22151998