Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genome-Wide Identification of Expression Quantitative Trait Loci (eQTLs) in Human Heart

  • Tamara T. Koopmann ,

    Contributed equally to this work with: Tamara T. Koopmann, Michiel E. Adriaens

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

  • Michiel E. Adriaens ,

    Contributed equally to this work with: Tamara T. Koopmann, Michiel E. Adriaens

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

  • Perry D. Moerland,

    Affiliation Bioinformatics Laboratory, Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Center, Amsterdam, The Netherlands

  • Roos F. Marsman,

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

  • Margriet L. Westerveld,

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

  • Sean Lal,

    Affiliation Muscle Research Unit, Department of Anatomy, Bosch Institute, The University of Sydney, Sydney, Australia

  • Taifang Zhang,

    Affiliation Department of Medicine, University of Miami School of Medicine, Miami, Florida, United States of America

  • Christine Q. Simmons,

    Affiliation Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, Tennessee, United States of America

  • Istvan Baczko,

    Affiliation Department of Pharmacology and Pharmacotherapy, Faculty of Medicine, University of Szeged, Szeged, Hungary

  • Cristobal dos Remedios,

    Affiliation Muscle Research Unit, Department of Anatomy, Bosch Institute, The University of Sydney, Sydney, Australia

  • Nanette H. Bishopric,

    Affiliations Department of Medicine, University of Miami School of Medicine, Miami, Florida, United States of America, Department of Molecular and Cellular Pharmacology, University of Miami School of Medicine, Miami, Florida, United States of America

  • Andras Varro,

    Affiliation Department of Pharmacology and Pharmacotherapy, Faculty of Medicine, University of Szeged, Szeged, Hungary

  • Alfred L. George Jr,

    Affiliation Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, Tennessee, United States of America

  • Elisabeth M. Lodder,

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

  • Connie R. Bezzina

    c.r.bezzina@amc.uva.nl

    Affiliation Department of Experimental Cardiology, Heart Failure Research Centre, Academic Medical Center, Amsterdam, The Netherlands

Genome-Wide Identification of Expression Quantitative Trait Loci (eQTLs) in Human Heart

  • Tamara T. Koopmann, 
  • Michiel E. Adriaens, 
  • Perry D. Moerland, 
  • Roos F. Marsman, 
  • Margriet L. Westerveld, 
  • Sean Lal, 
  • Taifang Zhang, 
  • Christine Q. Simmons, 
  • Istvan Baczko, 
  • Cristobal dos Remedios
PLOS
x

Abstract

In recent years genome-wide association studies (GWAS) have uncovered numerous chromosomal loci associated with various electrocardiographic traits and cardiac arrhythmia predisposition. A considerable fraction of these loci lie within inter-genic regions. The underlying trait-associated variants likely reside in regulatory regions and exert their effect by modulating gene expression. Hence, the key to unraveling the molecular mechanisms underlying these cardiac traits is to interrogate variants for association with differential transcript abundance by expression quantitative trait locus (eQTL) analysis. In this study we conducted an eQTL analysis of human heart. For a total of 129 left ventricular samples that were collected from non-diseased human donor hearts, genome-wide transcript abundance and genotyping was determined using microarrays. Each of the 18,402 transcripts and 897,683 SNP genotypes that remained after pre-processing and stringent quality control were tested for eQTL effects. We identified 771 eQTLs, regulating 429 unique transcripts. Overlaying these eQTLs with cardiac GWAS loci identified novel candidates for studies aimed at elucidating the functional and transcriptional impact of these loci. Thus, this work provides for the first time a comprehensive eQTL map of human heart: a powerful and unique resource that enables systems genetics approaches for the study of cardiac traits.

Introduction

It is well established that many cardiac traits and susceptibility to heart disease are heritable [1], [2], [3], [4], [5], [6], [7]. Several genome-wide association studies (GWAS) have uncovered common genetic variation, in the form of single nucleotide polymorphisms (SNPs), impacting on cardiac traits such as susceptibility to atrial fibrillation [8], ventricular fibrillation [9], heart rate [10] and electrocardiographic (ECG) indices of cardiac conduction [11], [12], [13], [14] and repolarization [15], [16]. There is widespread consensus that functional studies of GWAS-defined loci will advance our understanding of the molecular underpinnings of the associated traits.

SNPs identified by GWAS are considered to impact the respective clinical phenotype, either directly or indirectly by virtue of linkage disequilibrium (LD) with the causal variant(s) in the context of a haplotype. Many trait-associated haplotypes occur in non-coding regions of the genome [17] and are hypothesized to modulate the respective trait through effects on gene expression [18]. Such SNPs are particularly challenging to understand because they may exert effects on the trait either by affecting the expression of a neighbouring gene (cis-effect) or the expression of a gene located elsewhere in the genome (trans-effects). One way of understanding GWAS signals thus entails interrogating trait-associated variants for association with differential transcript abundance by expression quantitative trait locus (eQTL) analysis. Studying gene expression level effects of disease-associated haplotypes has successfully uncovered the molecular mechanisms underlying loci associated with increased risk of myocardial infarction [19], coronary artery disease [20] and colorectal cancer [21]. In recent years, multiple genome-wide eQTL resources have become available for various tissues including brain, liver and adipose tissue [22], [23], [24], [25], [26], [27], [28], [29]. Because eQTLs may be tissue-specific, a similar resource for human heart is anticipated to have great value [23], [29], [30], [31].

To this end, we have generated a human heart eQTL resource by genome-wide genotyping and determination of transcript abundance in 129 human donor heart samples. We subsequently overlaid previously identified cardiac trait GWAS signals with the identified eQTLs to identify candidate causal genes for the effects at these GWAS loci. This work provides an eQTL map of human heart, a resource that is likely to play an important role in furthering our understanding of the mechanisms associated with loci identified in GWAS on cardiac traits.

Results

General design of study

We collected left ventricular samples from 180 non-diseased human hearts of unrelated organ donors whose hearts were explanted to obtain pulmonary and aortic valves for transplant surgery or explanted for heart transplantation but not used due to logistical reasons (e.g. no tissue-matched recipient was available). The subjects were assumed to be mainly of Western European descent. mRNA and DNA were isolated according to standard procedures. Transcript abundance was measured using the HumanHT-12 v4.0 whole genome array (Illumina) and genotyping was carried out using the HumanOmniExpress genome-wide SNP arrays (Illumina).

Data preprocessing and normalization

Gene transcript abundance: Of the 47,231 transcripts whose expression levels were measured on the array, only those that were expressed above background level and for which the probe sequence mapped unambiguously to the genome and did not contain common SNPs, were used in further analyses. This procedure left 18,402 transcripts for eQTL analysis. Model-based background correction and normalization across arrays and transcripts was performed to correct for technical variance present in gene expression levels. A total of 162 arrays passed the standardized microarray gene expression quality control.

Genotyping: Manhattan distance clustering and principal component analysis of the genotype data of 154 samples that were successfully genotyped, revealed 13 genetic outliers (Figure S1). To ensure a genetically homogenous group for further analysis, samples pertaining to these clusters were removed. An additional 12 samples were removed due to low call rate (<95%), high proportion of alleles identical-by-state (>95%), or extreme heterozygosity (FDR 1%). Only SNPs with a minor allele frequency (MAF) higher than 0.15 were considered in eQTL analysis. This cutoff was chosen to ensure sufficient power to detect eQTLs within a broad range of effect sizes (Figure S2). Imputation was performed using the HAPMAP Phase III data (see Materials & Methods for details). This left 129 samples (74 male, 55 female; age 41±14), 18,402 transcripts and 897,683 SNPs for eQTL analysis.

Genome-wide eQTL mapping

Each of the measured transcripts was tested for association with all SNPs using linear modeling, taking age, sex and clinical/university center as covariates. We thus identified 6402 significant eQTLs (FDR ≤0.05). To remove redundant signals and identify independent expression-controlling loci, we performed linkage-disequilibrium (LD)-pruning. For this we grouped SNPs exhibiting LD (r2>0.6) into clusters, revealing 771 independent loci regulating 429 unique transcripts. These results are comparable to eQTL studies in other non-diseased tissues of similar sample size [22], [23], [24], [28], [29].

Of these 771 eQTLs, 770 were cis-eQTLs for 428 unique transcripts (p<2.82×10−5; FDR ≤0.05), where the associated SNPs lie within 1 Mb of the transcriptional start site (TSS) of the cognate transcript. For the four most significant cis-eQTLs, box-and-whisker plots and mean-standard-error plots for the individual genotypes are given in Figure 1. An overview of the most significant cis-eQTLs is given in Table 1 and the complete results are given in supplemental Table S1.

thumbnail
Figure 1. Overview plots for top cis eQTLs.

An overview of the 4 most significant cis eQTLs: rs11150882 with C17orf97 (panel A), rs11158569 with CHURC1 (panel B), rs2779212 with ZSWIM7 (panel C) and rs2549794 with ERAP2 (panel D). On the left of each panel, box-and-whisker plots of mRNA levels for all genotypes. On the right, mean and standard-error plots of mRNA levels for all genotypes are illustrated. Right upper corner gives the association p-value and the gene name.

https://doi.org/10.1371/journal.pone.0097380.g001

thumbnail
Table 1. Overview of the 30 most significant cis eQTLs, reported as independent LD-pruned SNP clusters (see Materials & Methods).

https://doi.org/10.1371/journal.pone.0097380.t001

Of the independent significant eQTLs, one was found to be in trans (p<2.12×10−11; FDR ≤0.05), with the expression of LOC644936 located on chromosome 5 being seemingly modulated by an eQTL (rs852423) on chromosome 7. However, as LOC644936 is a known pseudogene of ACTB and rs852423 is located within ACTB, we cannot rule out the possibility that rs852423 is in fact a cis eQTL for ACTB rather than a trans eQTL for LOC644936. Using BLAST to align the microarray probe sequence of LOC644936 to the human transcriptome uncovered a partial match with ACTB in addition to a 100% match with LOC644936.

Integration of eQTL data with cardiac GWAS loci

In order to provide candidate genes for the reported heart-related GWAS loci, we listed the 102 SNPs previously associated with a cardiac trait at genome-wide statistical significance (pgwas ≤5×10−8), representing 74 independent loci (LD-pruned with r2>0.6, see Materials & Methods). These corresponded to loci associated with ventricular fibrillation/sudden cardiac death, atrial fibrillation, heart rate, PR interval, QRS duration and QTc interval. Of these, the 64 SNPs that displayed a MAF of 15% or higher in the eQTL sample were overlaid with the eQTL data to identify transcripts under genetic regulation by these loci. All GWAS SNPs were tested for association with transcript levels of all 18,402 transcripts in this study. We identified a cis association between rs9912468, a modulator of QRS duration [12] with the level of expression of the PRKCA transcript at genome-wide statistical significance (p = 2.90×10−9, see Figure 2A). Besides PRKCA, no other GWAS SNP displayed an eQTL association p-value that passed the stringent Bonferroni-corrected p-value threshold (p<0.05/64 SNPs ×18,402 transcripts ∼ 4×10−8). A total of 34 SNPs were associated with the transcript level of a gene at a p≤0.05 (Table 2). Among these, rs8049607, a modulator of QTc-interval [16] was found to be associated in cis with the transcript level of LITAF (p<5×10−4, Figure 2C), and rs7612445 and rs6882776, both associated with heart rate [10] were associated in cis with the transcript levels of GNB4 (p<2×10−4, Figure 2B) and NKX2-5 (p<6×10−3, Figure 2D), respectively. The number of nominal associations for the 64 cardiac trait-associated SNPs tested represents a more than 7-fold enrichment (p<0.05, see Materials & Methods) compared to a random selection of 64 variants from the entire set of SNPs used in eQTL analysis.

thumbnail
Figure 2. eQTL overview plots for 4 cardiac trait GWAS candidate genes.

An overview of 4 GWAS cis eQTLs: rs9912468 with PRKCA (panel A), rs7912445 with GNB4 (panel B), rs8049607 with LITAF (panel C) and rs6882776 with NKX2-5 (panel D). On the left of each panel, box-and-whisker plots of mRNA levels for all genotypes. On the right, mean and standard-error plots of mRNA levels for all genotypes are illustrated. Right upper corner gives the association p-value and the gene name.

https://doi.org/10.1371/journal.pone.0097380.g002

Discussion

We conducted a genome-wide eQTL analysis in 129 samples of normal human myocardium, identifying genetic variation regulating gene expression in human heart and uncovering 771 genome-wide significant independent eQTLs. This resource, heretofore unavailable in human heart will contribute to advancing our understanding of the genetic mechanisms underlying loci associated with cardiac traits. All but one of the eQTLs identified were cis eQTLs. Other eQTL studies have identified only few trans eQTLs [22], [24], [28], [29], illustrating the general difficulty of detecting trans-regulatory variants in eQTL studies [31], [32]. Based on larger eQTL studies in other tissues [22], [24], [25], [26], [29] as many as 4000 independent cardiac cis eQTLs are expected to be present, hence the results presented here are a subset of this theoretical complete set of cardiac eQTLs.

In recent years, many novel loci associated with a number of cardiac traits, including cardiac arrhythmia and ECG indices, have been discovered. However, the identification of (novel) genes at these loci has lagged behind. The availability of a cardiac eQTL resource is likely to aid in the dissection of these loci by providing a means of prioritizing candidate genes for follow-up functional studies. Indeed, our current findings already provide candidate genes for a number of these loci (Table 2). One such example is the PRKCA gene for the effect observed on QRS duration for the rs9912468-tagged haplotype on chromosome 9. PRKCA encodes protein kinase C alpha, a fundamental regulator of cardiac contractility and Ca2+ handling in cardiomyocytes [33]. The mechanism by which it regulates QRS duration is unknown. Other candidates include the LITAF gene (encoding lipopolysaccharide-induced TNF factor) for the rs8049607-tagged haplotype associated with QTc-interval and the GNB4 gene (encoding guanine nucleotide binding protein) for the rs7612445-tagged haplotype associated with heart rate. None of these eQTLs (for PRKCA, LITAF and GNB4) have been previously identified in non-cardiac tissues.

The utility of this approach is further evidenced by the fact that the 64 GWAS SNPs were enriched in nominally significant eSNPs as compared to a random selection of 64 variants from the entire set of SNPs used in eQTL analysis. Such an enrichment was reported before for GWAS loci in general based on eQTLs identified in lymphoblastoid cell lines from HAPMAP samples [18].

The eQTLs we identified represent an enriched set of highly relevant candidates to test in future studies for association with cardiac traits and disease. Among the highly significant eQTLs listed in Table 1, at least two SNPs could also be interesting from a pharmacogenetic point of view. One is rs1222809 which was found to be strongly associated with the expression level of the DHFR gene encoding dihydrofolate reductase, a putative target of the drug methotrexate. Of note previous studies have provided evidence that rs1650697, which is in complete LD with rs1222809, may be associated with adverse events to methotrexate in patients with rheumatoid arthritis [34], [35]. The other potentially interesting eQTL from a pharmacogenetic point of view is rs4822466 which was found to be highly associated with the expression of GSTT1, a gene encoding the liver detoxifying enzyme Glutathione S-transferase T1.

The eQTLs we identified are expected to be enriched in the regulatory regions of the genome such as promoter regions, enhancers and transcription factor binding sites [36]. Recent work has begun to uncover these relationships for adult human heart [37]. However, formal testing for enrichment of eQTLs in the known regulatory regions [37] did not provide statistically significant enrichment (data not shown). At least in part, this may be due to the limited number of eQTLs we have identified.

A limitation of the presented study concerns the fact that not all transcripts have been tested for eQTL effects. Transcripts that were expressed below the (array-based) detection level or for which probe design was not optimal could not be tested. Conversely, not all haplotypes in the genome were tested as for instance we only tested SNPs with a MAF higher than 0.15. Furthermore, our sample size and therefore statistical power was limited, preventing the identification of eQTLs of smaller effect and trans eQTLs. The interpretation of the data concerning SNPs from GWAS presented in Table 2 must take these considerations into account. Additionally, the single trans eQTL we identified is likely a false discovery and will require further investigation.

Our study was conducted in left ventricular myocardium. However, it is well known that different cardiac compartments such as the atria or the specialized conduction system display different gene expression patterns [38], [39], [40], [41] and eQTL effects might thus differ across cardiac compartments. Furthermore, we have no information relating to cardiac traits such as ECG indices in the 129 individuals from whom the left ventricular samples were obtained; we were therefore unable to correlate gene expression with cardiac traits in these individuals [23], [42].

In summary, we here provide the first eQTL map of human left ventricular myocardium that will enable systems genetics approaches in the study of cardiac traits.

Materials and Methods

Ethics statement

Investigations using the human ventricular samples conformed to the principles outlined in the Helsinki Declaration of the World Medical Association. The ethical review boards of University of Szeged (Ethical Review Board of the University of Szeged Medical Center; Szeged, Hungary), Vanderbilt University (Institutional Review Board of Vanderbilt University School of Medicine; Nashville, USA), University of Miami (Institutional Review Board of the University of Miami School of Medicine; Miami, USA), and the University of Sydney (Human Research Ethics Committee (HREC); Sydney, Australia) approved procurement and handling of the human cardiac material. Written informed consent from the donor or the next of kin was obtained for use of this sample in research. All data was analyzed anonymously.

Sample collection

Left ventricular samples were obtained from 180 non-diseased human hearts of unrelated organ donors whose hearts were explanted to obtain pulmonary and aortic valves for transplant or valve replacement surgery or explanted for transplantation but not used due to logistical reasons. The tissues were ascertained at the University of Szeged (Hungary; n = 79), Vanderbilt University (Nashville, USA; n = 46), University of Miami (USA; n = 30), and the University of Sydney (Australia; n = 25) and assumed to consist mainly of subjects of Western European descent based on self-reported ethnicity. The Vanderbilt samples were procured with the assistance of the National Disease Research Interchange (Philadelphia, PA).

Generation and processing of gene expression data

Total RNA was extracted from the human left ventricular heart samples using the mirVana miRNA isolation kit (Ambion) at the AMC, Amsterdam, The Netherlands. Sample processing order was randomized. RNA quality was assessed by Agilent Bioanalyzer (minimum RIN = 7) and spectrophotometry (minimum 260 nm:280 nm = 1.8). The Illumina TotalPrep-96 RNA Amplification Kit was used to generate cRNA starting from 200 ng total RNA. Genome-wide gene expression data was generated using Illumina HumanHT-12 v4 BeadArrays, containing 47,231 probes representing 28,688 RefSeq annotated transcripts (ServiceXS, Leiden, The Netherlands), following the instructions of the manufacturer.

Raw expression data were imported into the Illumina BeadStudio and summarized at probe-level for each sample without normalization or background correction. The summarized data were subsequently imported into R (version 2.15.3) [43] using the beadarray package [44]. Quality control was performed using the ArrayQualityMetrics package in R [45]. Samples displaying transcriptional stratification using hierarchical clustering were omitted from the analysis. The summarized data of the 162 remaining samples was background corrected and quantile normalized using the neqc algorithm [46] across all samples. The neqc algorithm is the current standard data-preprocessing method for Illumina gene expression BeadArrays [47], and has been applied in eQTL studies with comparable sample size [29], [30].

Probes containing common SNPs (HAPMAP Phase III release 2) [27], [29] and probes whose sequence did not align or aligned ambiguously to the human reference genome (HG19), according to up-to-date Illumina HumanHT-12 v4.0 BeadArray annotation available from the Bioconductor project, were left out of the analysis. Additionally, probes with median expression levels below a study specific threshold (the median expression levels of Y chromosome transcripts in the female subjects of the sample population) were not considered for subsequent analyses.

Genotyping and genotype imputation

DNA was extracted for genotyping from 162 heart samples that passed the gene expression analysis quality control criteria (see above) at the AMC, Amsterdam, The Netherlands. Genome-wide SNP genotyping was carried out using Illumina HumanOmniExpress Beadchips interrogating 733,202 genetic markers (Genome Analysis Center, Helmholtz Zentrum München, Germany). A total of 8 samples had sample quality issues (and were not hybridized) or failed hybridization, leaving genotype data for 154 samples. Quality control was performed in the GenABEL [48] package in R using default settings. Samples with low call rate (<95%), extreme heterozygosity (FDR 1%) or high proportion of alleles identical-by-state (>95%) were removed. Additionally, any remaining samples showing genetic stratification through Manhattan distance hierarchical clustering (using the popgen [49] package in R), and confirmed with principal component analysis [48], were not considered (Figure S1).

Power calculations were performed (with a fixed FDR of 0.05) to assess the influence of MAF on power in relation to observed gene expression fold changes. Based on these results, a MAF threshold of 0.15 was chosen to ensure sufficient power to detect cis eQTLs within a broad range of effect sizes (Figure S2). Additionally, assuming Hardy-Weinberg equilibrium, a MAF of 0.15 or higher yields an expected number of three individuals homozygous for the minor allele, which we considered the minimum for fitting a meaningful additive genetic model.

Imputation was performed using the MACH software [50] and the HAPMAP Phase III data. Only SNPs imputed with sufficient confidence were considered, using the estimate of the squared correlation between imputed and true genotypes. By setting the cut-off at 0.30, most of the poorly imputed SNPs are filtered out, compared to only a small number (<1%) of well imputed SNPs [51].

eQTL statistical analysis

After pre-processing and stringent quality control of gene expression and genotypic data as described above, a total of 129 heart samples were used in eQTL analysis. Each transcript was tested for association with SNP genotypes genome-wide using linear modeling (assuming an additive genetic model), taking age, gender and tissue collection center as covariates, using the GenABEL package [48] in R. Correction for multiple testing was performed on the complete set of cis eQTL p-values in the qvalue package in R [52]. A q-value (FDR) ≤0.05 was considered significant for cis eQTLs, corresponding to a p-value of 2.82×10−5. Cis relations were defined as those within 1 Mb of a transcription start site (TSS), in accordance with previous reports demonstrating that over 90% of cis SNPs are situated within 100 Kb of a TSS [26], [27], [29], [47], [53]. SNPs with an LD R2 of larger than 0.6 were considered dependent and LD-pruned into clusters (LD clusters), in accordance with previous studies [23], [29], [30]. For trans eQTLs, only results with a p-value <5×10−8 were considered (corresponding to a target α (or p value) of 0.05 with a Bonferroni correction for 1 million independent tests [54], [55]). Correction for multiple testing was done by using a step-up Benjamini & Hochberg procedure on all p-values <5×10−8, and a q-value (FDR) ≤0.05 was considered genome-wide significant for trans eQTLs, corresponding to a p-value of 2.12×10−11.

eQTL biological interpretation and candidate gene prioritization

To prioritize candidate genes for further studies, additional data sources were integrated. Additional trait and disease associated SNPs were extracted from PubMed (www.ncbi.nlm.nih.gov/pubmed; search terms: ‘GWAS’ AND ‘cardiac’, ‘atrial fibrillation’, ‘sudden cardiac death’, ‘ECG [electrocardiographic]’, ‘PR interval’, ‘QRS’, ‘QT’, ‘repolarization’), the NHGRI catalog of published GWAS (http://www.genome.gov/gwastudies/), and GWAS central (https://www.gwascentral.org) on January 8, 2013. Analyses were restricted to samples of European ancestry. Results were classified into six categories: sudden cardiac death, atrial fibrillation, heart rate, PR duration, QRS duration and QTc duration. Next, each GWAS SNP passing genome-wide significance in the respective study (5×10−8, a target α of 0.05 with a Bonferroni correction for 1 million independent tests) was tested for association with expression of all 18,402 measured transcripts. To determine the number of independent loci, LD-pruning was performed by merging all GWAS SNPs with LD r2>0.6 (HAPMAP R22 and HAPMAP Phase III). The p-value threshold for significant eQTL effects was set at 4×10−8, a target α of 0.05 with a Bonferroni correction for 1,177,728 tests (64 independent loci ×18,402 transcripts).

To quantify the enrichment of eQTLs among the cardiac trait GWAS SNPs, we generated 100,000 randomized independent SNP sets of the same size as the number of independent GWAS loci, and with corresponding MAF distribution and proximity to genes. The number of nominally significant eQTL associations for the original independent GWAS loci is referred to as Q. Next, for each random set Si, we determined the number of eQTLs at nominal significance (p≤0.05), referred to as Qi. The simulations yielded a fold-enrichment score, calculated as the average over all random sets of the ratio between Q and Qi, and an empirical p-value, calculated as the proportion of simulations in which the number of eQTLs exceeds the number of nominally significant eQTL associations in the original independent GWAS loci.

Public access to microarray data

The microarray genotyping and gene expression data of the study have been deposited online at the Gene Expression Omnibus (GEO), with accession number GSE55232.

Supporting Information

Figure S1.

Manhattan distance hierarchical clustering dendogram of 154 genotyped subjects. Manhattan distance hierarchical clustering revealed several genotypic outliers. The clustering was repeated using principal component analysis, identifying the same groups of outliers.

https://doi.org/10.1371/journal.pone.0097380.s001

(TIF)

Figure S2.

Results of eQTL power analyses in relation to MAF and gene expression fold change. eQTL power analyses were performed for different minimum minor allele frequencies (0.05, 0.10, 0.15, 0.20, 0.30 and 0.40). The gene expression fold change is defined as log2 difference in gene expression observed per copy of the minor allele. In each analysis, for each log2 fold change X, all eQTLs with an absolute log2 fold change larger than X were considered, and the power was calculated as the percentage of those eQTLs for which the null hypothesis is rejected at FDR ≤0.05.

https://doi.org/10.1371/journal.pone.0097380.s002

(TIF)

Table S1.

Table of all significant eQTLs. This table contains the complete results for all significant non-diseased human heart eQTLs (FDR ≤0.05). It contains for each SNP-transcript pair the SNP ID, gene or transcript IDs (HGNC, Entrez Gene, RefSeq), genomic locations, minor and major allele, minor allele frequency, beta (effect size per copy of the minor allele), p-value and distance between SNP and gene. The table is sorted on HGNC official gene symbol.

https://doi.org/10.1371/journal.pone.0097380.s003

(XLS)

Acknowledgments

The authors are grateful to Michael Tanck for helpful discussion and advice on the statistical analysis of the genotyping data, and to Dr. Jolanda van der Velden for her help in acquisition of cardiac samples.

Author Contributions

Conceived and designed the experiments: TTK MEA CRB. Performed the experiments: TTK MLW. Analyzed the data: MEA PDM. Contributed reagents/materials/analysis tools: SL TZ CQS IB CdR NHB ALG AV RFM. Wrote the paper: TTK MEA CRB EML.

References

  1. 1. Jouven X, Desnos M, Guerot C, Ducimetiere P (1999) Predicting sudden death in the population: the Paris Prospective Study I. . Circulation 99: 1978–1983.
  2. 2. Friedlander Y, Siscovick DS, Weinmann S, Austin MA, Psaty BM, et al. (1998) Family history as a risk factor for primary cardiac arrest. Circulation 97: 155–160.
  3. 3. Kolder IC, Tanck MW, Bezzina CR (2012) Common genetic variation modulating cardiac ECG parameters and susceptibility to sudden cardiac death. Journal of molecular and cellular cardiology 52: 620–629.
  4. 4. Dekker LRC, Bezzina CR, Henriques JPS, Tanck MW, Koch KT, et al. (2006) Familial sudden death is an important risk factor for primary ventricular fibrillation: a case-control study in acute myocardial infarction patients. Circulation 114: 1140–1145.
  5. 5. Lubitz SA, Yin X, Fontes JoD, Magnani JW, Rienstra M, et al. (2010) Association between familial atrial fibrillation and risk of new-onset atrial fibrillation. JAMA : the journal of the American Medical Association 304: 2263–2269.
  6. 6. Myers RH, Kiely DK, Cupples LA, Kannel WB (1990) Parental history is an independent risk factor for coronary artery disease: the Framingham Study. American heart journal 120: 963–969.
  7. 7. Hawe E, Talmud PJ, Miller GJ, Humphries SE (2003) Family history is a coronary heart disease risk factor in the Second Northwick Park Heart Study. Annals of human genetics 67: 97–106.
  8. 8. Ellinor PT, Lunetta KL, Glazer NL, Pfeufer A, Alonso A, et al. (2010) Common variants in KCNN3 are associated with lone atrial fibrillation. Nat Genet 42: 240–244.
  9. 9. Bezzina CR, Pazoki R, Bardai A, Marsman RF, de Jong JS, et al. (2010) Genome-wide association study identifies a susceptibility locus at 21q21 for ventricular fibrillation in acute myocardial infarction. Nat Genet 42: 688–691.
  10. 10. den Hoed M, Eijgelsheim M, Esko T, Brundel BJ, Peal DS, et al. (2013) Identification of heart rate-associated loci and their effects on cardiac conduction and rhythm disorders. Nat Genet 45: 621–631.
  11. 11. Pfeufer A, van Noord C, Marciante KD, Arking DE, Larson MG, et al. (2010) Genome-wide association study of PR interval. Nat Genet 42: 153–159.
  12. 12. Sotoodehnia N, Isaacs A, de Bakker PI, Dorr M, Newton-Cheh C, et al. (2010) Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction. Nat Genet 42: 1068–1076.
  13. 13. Chambers JC, Zhao J, Terracciano CM, Bezzina CR, Zhang W, et al. (2010) Genetic variation in SCN10A influences cardiac conduction. Nat Genet 42: 149–152.
  14. 14. Holm H, Gudbjartsson DF, Arnar DO, Thorleifsson G, Thorgeirsson G, et al. (2010) Several common variants modulate heart rate, PR interval and QRS duration. Nat Genet 42: 117–122.
  15. 15. Newton-Cheh C, Eijgelsheim M, Rice KM, de Bakker PI, Yin X, et al. (2009) Common variants at ten loci influence QT interval duration in the QTGEN Study. Nat Genet 41: 399–406.
  16. 16. Pfeufer A, Sanna S, Arking DE, Muller M, Gateva V, et al. (2009) Common variants at ten loci modulate the QT interval duration in the QTSCD Study. Nat Genet 41: 407–414.
  17. 17. Cookson W, Liang L, Abecasis G, Moffatt M, Lathrop M (2009) Mapping complex disease traits with global gene expression. Nature reviews Genetics 10: 184–194.
  18. 18. Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, et al. (2010) Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS genetics 6: e1000888.
  19. 19. Musunuru K, Strong A, Frank-Kamenetsky M, Lee NE, Ahfeldt T, et al. (2010) From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature 466: 714–719.
  20. 20. Visel A, Zhu Y, May D, Afzal V, Gong E, et al. (2010) Targeted deletion of the 9p21 non-coding coronary artery disease risk interval in mice. Nature 464: 409–412.
  21. 21. Pittman AM, Naranjo S, Jalava SE, Twiss P, Ma Y, et al. (2010) Allelic variation at the 8q23.3 colorectal cancer risk locus functions as a cis-acting regulator of EIF3H. PLoS genetics 6: e1001126.
  22. 22. Zou F, Chai HS, Younkin CS, Allen M, Crook J, et al. (2012) Brain expression genome-wide association study (eGWAS) identifies human disease-associated variants. PLoS genetics 8: e1002707.
  23. 23. Hernandez DG, Nalls MA, Moore M, Chong S, Dillman A, et al. (2012) Integration of GWAS SNPs and tissue specific expression profiling reveal discrete eQTLs for human traits in blood and brain. Neurobiol Dis 47: 20–28.
  24. 24. Fu J, Wolfs MGM, Deelen P, Westra H-J, Fehrmann RSN, et al. (2012) Unraveling the regulatory mechanisms underlying tissue-dependent genetic variation of gene expression. PLoS genetics 8: e1002431.
  25. 25. Schadt EE, Molony C, Chudin E, Hao K, Yang X, et al. (2008) Mapping the genetic architecture of gene expression in human liver. PLoS Biol 6: e107.
  26. 26. Mehta D, Heim K, Herder C, Carstensen M, Eckstein G, et al. (2013) Impact of common regulatory single-nucleotide variants on gene expression profiles in whole blood. European journal of human genetics : EJHG 21: 48–54.
  27. 27. Dubois PCA, Trynka G, Franke L, Hunt KA, Romanos J, et al. (2010) Multiple common variants for celiac disease influencing immune gene expression. Nature genetics 42: 295–302.
  28. 28. Rotival M, Zeller T, Wild PS, Maouche S, Szymczak S, et al. (2011) Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans. PLoS genetics 7: e1002367.
  29. 29. Grundberg E, Small KS, Hedman ÃsK, Nica AC, Buil A, et al. (2012) Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nature genetics 44: 1084–1089.
  30. 30. Nica AC, Parts L, Glass D, Nisbet J, Barrett A, et al. (2011) The architecture of gene regulatory variation across multiple human tissues: the MuTHER study. PLoS genetics 7: e1002003.
  31. 31. Petretto E, Mangion J, Dickens NJ, Cook SA, Kumaran MK, et al. (2006) Heritability and tissue specificity of expression quantitative trait loci. PLoS genetics 2: e172.
  32. 32. Grundberg E, Kwan T, Ge B, Lam KC, Koka V, et al. (2009) Population genomics in a disease targeted primary cell model. Genome research 19: 1942–1952.
  33. 33. Kooij V, Boontje N, Zaremba R, Jaquet K, dos Remedios C, et al. (2010) Protein kinase C alpha and epsilon phosphorylation of troponin and myosin binding protein C reduce Ca2+ sensitivity in human myocardium. Basic research in cardiology 105: 289–300.
  34. 34. Owen SA, Hider SL, Martin P, Bruce IN, Barton A, et al. (2013) Genetic polymorphisms in key methotrexate pathway genes are associated with response to treatment in rheumatoid arthritis patients. Pharmacogenomics J 13: 227–234.
  35. 35. Wessels JA, de Vries-Bouwstra JK, Heijmans BT, Slagboom PE, Goekoop-Ruiterman YP, et al. (2006) Efficacy and toxicity of methotrexate in early rheumatoid arthritis are associated with single-nucleotide polymorphisms in genes coding for folate pathway enzymes. Arthritis and rheumatism 54: 1087–1095.
  36. 36. Brown CD, Mangravite LM, Engelhardt BE (2013) Integrative modeling of eQTLs and cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs. PLoS genetics 9: e1003649.
  37. 37. May D, Blow MJ, Kaplan T, McCulley DJ, Jensen BC, et al. (2012) Large-scale discovery of enhancers from human heart tissue. Nat Genet 44: 89–93.
  38. 38. Gaborit N, Le Bouter S, Szuts V, Varro A, Escande D, et al. (2007) Regional and tissue specific transcript signatures of ion channel genes in the non-diseased human heart. The Journal of physiology 582: 675–693.
  39. 39. Sharma S, Razeghi P, Shakir A, Keneson BJ 2nd, Clubb F, et al. (2003) Regional heterogeneity in gene expression profiles: a transcript analysis in human and rat heart. Cardiology 100: 73–79.
  40. 40. Nerbonne JM, Guo W (2002) Heterogeneous expression of voltage-gated potassium channels in the heart: roles in normal excitation and arrhythmias. Journal of cardiovascular electrophysiology 13: 406–409.
  41. 41. Tsubakihara M, Williams NK, Keogh A, dos Remedios CG (2004) Comparison of gene expression between left atria and left ventricles from non-diseased humans. Proteomics 4: 261–270.
  42. 42. Gaunt TR, Shah S, Nelson CP, Drenos F, Braund PS, et al. (2012) Integration of genetics into a systems model of electrocardiographic traits using HumanCVD BeadChip. Circulation Cardiovascular genetics 5: 630–638.
  43. 43. R-Core-Team (2012) R: A Language and Environment for Statistical Computing.
  44. 44. Dunning MJ, Smith ML, Ritchie ME, Tavare S (2007) beadarray: R classes and methods for Illumina bead-based data. Bioinformatics (Oxford, England) 23: 2183–2184.
  45. 45. Kauffmann A, Gentleman R, Huber W (2009) arrayQualityMetrics—a bioconductor package for quality assessment of microarray data. Bioinformatics (Oxford, England) 25: 415–416.
  46. 46. Shi W, Oshlack A, Smyth GK (2010) Optimizing the noise versus bias trade-off for Illumina whole genome expression BeadChips. Nucleic acids research 38: e204.
  47. 47. Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, et al. (2007) Population genomics of human gene expression. Nature genetics 39: 1217–1224.
  48. 48. Aulchenko YS, Ripke S, Isaacs A, van Duijn CM (2007) GenABEL: an R library for genome-wide association analysis. Bioinformatics (Oxford, England) 23: 1294–1296.
  49. 49. Tibshirani R, Walther G, Hastie T (2001) Estimating the number of clusters in a dataset via the Gap statistic. J R Statist Soc 63: 411–423.
  50. 50. Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, et al. (2007) A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science (New York, NY) 316: 1341–1345.
  51. 51. MACH-Development-Team (2013) MACH website.
  52. 52. Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci U S A 100: 9440–9445.
  53. 53. Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, et al. (2008) Genetics of gene expression and its effect on disease. Nature 452: 423–428.
  54. 54. Dudbridge F, Gusnanto A (2008) Estimation of significance thresholds for genomewide association scans. Genetic epidemiology 32: 227–234.
  55. 55. Pe'er I, Yelensky R, Altshuler D, Daly MJ (2008) Estimation of the multiple testing burden for genomewide association studies of nearly all common variants. Genetic epidemiology 32: 381–385.
  56. 56. Ellinor PT, Lunetta KL, Albert CM, Glazer NL, Ritchie MD, et al. (2012) Meta-analysis identifies six new susceptibility loci for atrial fibrillation. Nat Genet 44: 670–675.
  57. 57. Eijgelsheim M, Newton-Cheh C, Sotoodehnia N, de Bakker PI, Muller M, et al. (2010) Genome-wide association analysis identifies multiple loci related to resting heart rate. Hum Mol Genet 19: 3885–3894.
  58. 58. Nolte IM, Wallace C, Newhouse SJ, Waggott D, Fu J, et al. (2009) Common genetic variation near the phospholamban gene is associated with cardiac repolarisation: meta-analysis of three genome-wide association studies. PloS one 4: e6138.
  59. 59. Smith JG, Magnani JW, Palmer C, Meng YA, Soliman EZ, et al. (2011) Genome-wide association studies of the PR interval in African Americans. PLoS genetics 7: e1001304.
  60. 60. Arking DE, Pfeufer A, Post W, Kao WH, Newton-Cheh C, et al. (2006) A common genetic variant in the NOS1 regulator NOS1AP modulates cardiac repolarization. Nat Genet 38: 644–651.
  61. 61. Lubitz SA, Sinner MF, Lunetta KL, Makino S, Pfeufer A, et al. (2010) Independent susceptibility markers for atrial fibrillation on chromosome 4q25. Circulation 122: 976–984.