Myocardial infarction (MI) is a common complex disease with a genetic component. While several single nucleotide polymorphisms (SNPs) have been reported to be associated with risk of MI, they do not fully explain the observed genetic component of MI. We have been investigating the association between MI and SNPs that are located in genes and have the potential to affect gene function or expression. We have previously published studies that tested about 12,000 SNPs for association with risk of MI, early-onset MI, or coronary stenosis. In the current study we tested 17,576 SNPs that could affect gene function or expression. In order to use genotyping resources efficiently, we staged the testing of these SNPs in three case–control studies of MI. In the first study (762 cases, 857 controls) we tested 17,576 SNPs and found 1,949 SNPs that were associated with MI (P<0.05). We tested these 1,949 SNPs in a second study (579 cases and 1159 controls) and found that 24 SNPs were associated with MI (1-sided P<0.05) and had the same risk alleles in the first and second study. Finally, we tested these 24 SNPs in a third study (475 cases and 619 controls) and found that 5 SNPs in 4 genes (ENO1, FXN (2 SNPs), HLA-DPB2, and LPA) were associated with MI in the third study (1-sided P<0.05), and had the same risk alleles in all three studies. The false discovery rate for this group of 5 SNPs was 0.23. Thus, we have identified 5 SNPs that merit further examination for their potential association with MI. One of these SNPs (in LPA), has been previously shown to be associated with risk of cardiovascular disease in other studies.
Citation: Shiffman D, Kane JP, Louie JZ, Arellano AR, Ross DA, et al. (2008) Analysis of 17,576 Potentially Functional SNPs in Three Case–Control Studies of Myocardial Infarction. PLoS ONE 3(8): e2895. doi:10.1371/journal.pone.0002895
Editor: Florian Kronenberg, Innsbruck Medical University, Austria
Received: May 29, 2008; Accepted: July 15, 2008; Published: August 6, 2008
Copyright: © 2008 Shiffman et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study received funding from the University of California Discovery Grant Program, which is jointly funded by the University of California and the State of California with matching funds from Celera.
Competing interests: DS, JL, AA, DR, JC and JD are current employees of Celera. J.K. and M.M. received funding from the University of California Discovery Grant Program which is jointly funded by the University of California and the State of California with matching funds from Celera. S.E. had been a paid consultant of Celera.
Myocardial infarction (MI) is a prevalent and often fatal consequence of coronary heart disease. Each year approximately 865 thousand Americans are diagnosed with MI and about 180 thousand die from the disease . MI occurs when thrombosis—precipitated by ruptured or eroded atherosclerotic plaque—leads to acute ischemia and subsequent necrosis of the myocardium.
Risk factors for MI include age, sex, elevated LDL-cholesterol, hypertension, low HDL-cholesterol, smoking, type 2 diabetes, and family history of cardiovascular disease. Risk of MI has a genetic component as evidenced by large twin studies  which showed that death from cardiovascular disease is more highly correlated among identical twins than fraternal twins. Among genetic variants that are associated with MI, some can affect traditional risk factors ,  but for others the underlying biological explanation for the association is not known , .
The identification of genetic variants that are associated with MI is a challenging task, because variants that are associated with MI are expected to only modestly increase the risk of MI, and because a large number of variants could potentially be tested. Thus, very large studies are needed to detect modest association and account for multiple testing confidently . We have limited the magnitude of multiple testing in the past – by testing SNPs with high prior probability for association with MI. Previously, we described the results from investigating ~12,000 such SNPs –. Here we asked if we could identify SNPs that are associated with MI by investigating ~17,000 SNPs. We identified 5 SNPs that appear to be associated with MI and should be investigated in additional studies of MI.
To identify genetic polymorphisms associated with MI, we interrogated three case–control studies comprising cases with a history of MI and controls without a history of MI. The first two case–control studies (Study 1 and Study 2) identified SNPs nominally associated with MI. The hypotheses that these SNPs are associated with MI were tested in Study 3. We determined the allele frequency of each SNP in pools of case and control DNA prior to determining the genotype of a smaller number of SNPs for all individual DNA samples.
Participants in Study 1 and Study 2 were enrolled between July 1989 and May 2005 by the University of California, San Francisco (UCSF) Genomic Resource in Arteriosclerosis. UCSF samples received at the Celera genotyping facility by May 2004 were considered for inclusion in Study 1. Samples that arrived past that date were considered for Study 2. Cases in Study 1 and Study 2 included patients who had undergone diagnostic or interventional cardiac catheterization and patients of the UCSF Lipid Clinic. Controls were enrolled by the UCSF Genomic Resource in Arteriosclerosis and included UCSF staff, patients of UCSF Clinics, and senior citizens who participated in physical activities at regional community centers and events for senior citizens. A history of MI for Study 1 cases was verified by a clinical chart review or by The International Classification of Diseases, 9th Revision (ICD9) codes 410 or 411 in the patient records. MI status for Study 2 cases was determined by ICD9 codes 410 or 411 or by a self-reported history of MI. To characterize the accuracy of these self–reported histories, medical record review for a sample of Study 2 cases resulted in verification of the self–reported MI status for 98% of the sample (verification by electrocardiogram, cardiac enzymes or imaging). Controls had no history of MI, diabetes or symptomatic vascular disease. All participants of Study 1 and Study 2 chose Caucasian as their ethnicity in response to a multiple-choice questionnaire.
Participants in Study 3 were patients of the Cleveland Clinic Foundation (CCF) Heart Center who had undergone diagnostic or interventional cardiac catheterization between July 2001 and March 2003 and enrolled in the Genebank at Cleveland Clinic Study. A history of MI was verified by electrocardiogram, cardiac enzymes, or perfusion imaging. Controls had less than 50% coronary luminal narrowing. All participants in Study-3 selected North European, Eastern European, or ‘other Caucasian’ as the description of both their mother and father on the enrollment questionnaire. The demographic and risk factor characteristics of the participants in the 3 studies are presented in Table 1.
The 17,576 SNPs investigated in Study 1 are located in 10,152 genes. Of these 17,576 SNPs, 2767 were tested in at least one of 3 previously reported studies –. We previously reported that one of these SNPs (rs3798220 in LPA) was associated with cardiovascular disease . Most of these SNPs (65%) could potentially affect gene function or expression because they cause an amino-acid change in a predicted open reading frame (missense SNPs), or they are located in exon acceptor or donor splice site, and could change the splicing pattern of predicted open reading frames. We also considered as potentially functional some SNPs that are located in regions that are known to be involved in transcriptional regulation (predicted transcription factor binding sites), or RNA stability (3′ or 5′ untranslated regions, or predicted microRNA binding sites).
Allele Frequency and Genotype Determination
Initially, the allele frequency for each individual SNP was determined for all the cases and all the controls in pools of DNA. Pools were made by mixing equal volumes of standardized DNA from each individual member of the pool. Prior to pooling, DNA concentration for each sample was determined in triplicate using Picogreen fluorescent detection (Invitrogen). Measurements were repeated for samples which had high variation of fluorescence values (5% or greater coefficient of variation). DNA concentrations were determined from mean fluorescence values using a standard curve of salmon sperm DNA. DNA samples were then diluted to 6 ng/µL using automated liquid handling robotics (Beckman Coulter Fx, or Perkin Elmer Multiprobe II). The final concentration was confirmed using Picogreen fluorescent detection. Typically, several unique pools of DNA were made for cases and controls, made up of about 50 cases or controls. For each SNP, two real-time PCR reactions were performed, using 3 ng of pooled DNA in each reaction and allele-specific primers. The allele frequency in each pool was calculated from amplification curves for each allele. Genotyping of individual DNA samples was done by performing two real-time PCR reactions for each individual sample, using 0.3 ng DNA from each sample and allele specific primers.
Subjects of all three studies gave written informed consent and completed questionnaire approved by the Institutional Review Board of UCSF (Study 1 and Study 2) or the Cleveland Clinic Foundation (Study 3).
We assessed association between MI status and allele frequencies by two-tailed χ2 tests, and between MI status and genotype by logistic regression using an additive inheritance model (Wald test). In Study 2 and Study 3, since we tested a single prespecified risk allele for each SNP, we present one-sided P values and 90% confidence intervals (for odds ratios greater than one, there is 95% confidence that the true risk estimate is greater than the lower bound of a 90% confidence interval). We used a P threshold value of 0.05 in all three studies, and adjusted for multiple testing by calculating the False discovery rate (FDR) in Study 3. FDR was calculated using the MULTTEST procedure (SAS statistical package Version 9.1); for SNPs that were in the same gene, only the SNP with the higher (less significant) P value was included in the calculation.
We measured the allele frequencies of 17,576 putative functional SNPs in Study 1 cases and controls using pooled DNA samples and identified 1,949 SNPs that were associated with MI (P<0.05) and had minor allele frequency estimates that were greater than 1%. For these 1,949 SNPs, we determined allele frequencies in Study 2 cases and controls using pooled DNA samples and verified that the risk allele identified in Study 1 was also associated with risk of MI in Study 2. For those SNPs that were associated with MI and had the same risk alleles in both pooling studies, we then confirmed the association of the SNP with MI in Study 1 and Study 2 by genotyping individual DNA samples. We found that the risk alleles of 24 SNPs in 23 genes were associated with MI in both studies using an additive inheritance model (Table 2) and a P value threshold of 0.05. Next we tested the hypotheses that the risk alleles of these 24 SNPs would be associated with MI in Study 3. The power to detect association with MI for these 24 SNPs (based on the risk and allele frequency observed in Study 2) ranged from 41% (for rs3812475 in TRMT12) to 83% (for rs725660 in LOC388553). We found that the risk allele of 5 SNPs, in 4 genes (ENO1, FXN (2 SNPs), HLA-DPB2, and LPA) were associated with MI using an additive inheritance model after adjustment for age and sex (Table 3). The false discovery rate for these 5 SNPs was 0.23. The distribution of the genotypes for each of the SNPs did not deviate from what was expected under Hardy-Weinberg equilibrium (P>0.5). Further adjustment for traditional risk factors (dyslipidemia, hypertension, smoking status, and BMI), did not appreciably change the risk estimate for 4 of these SNPs (Table 3, LPA, FXN (2 SNPs), and HLA-DPB2). However, the risk for the ENO1 SNP was not statistically significant after further adjustment for traditional risk factors (OR = 1.09, 90% CI 0.85–1.38, P = 0.28). Dyslipidemia could be confounding the association of the ENO1 SNP with MI since this SNP trended toward association with dyslipidemia (P = 0.1).
We conducted an analysis of 17,576 SNPs that could potentially affect gene function or expression in three case-control studies of MI and identified 5 SNPs in four genes (ENO1, FXN (2 SNPs), HLA-DPB2, and LPA) that were associated with MI. The false discovery rate for this group of 5 SNPs was 0.23, indicating that several of these SNPs are expected to be associated with MI.
The first SNP is located in ENO1, a gene that encodes α-enolase, a glycolytic enzyme that catalyzes the conversion of 2-phospho-D-glycerate to phosphoenolpyruvate. α-enolase is also known to be a plasminogen receptor on the surface of hematopoietic cells and endothelial cells . Thus, α-enolase could contribute to fibrinolysis, hemostasis, and arterial thrombus formation–processes that are critical in the pathophysiology of MI. The SNP in ENO1 (rs1325920) is located about 1 kb upstream of the gene and could be involved in transcriptional regulation.
Two of the SNPs are in the FXN gene. The FXN gene encodes Frataxin, a mitochondrial protein involved in maintaining cellular iron homeostasis . Expanded GAA triplet repeats in intron 1 of FXN leads to silencing of the FXN gene and to accumulation of iron in the mitochondria, which makes mitochondria sensitive to oxidative stress . These changes lead to Friedreich's ataxia, an autosomal recessive disease of the central nervous system that is frequently associated hypertrophic cardiomyopathy . The two SNPs in FXN found to be associated with MI are located in the 3′ untranslated region of FXN (rs10890) and in a putative transcription factor binding site (rs3793456), thus one or both of these SNPs could have an effect on FXN gene expression. These two SNPs are in linkage disequilibrium (r2 = 0.57 in Study 1) and thus, are not independent of one another. Whether these SNPs are associated with increased sensitivity of mitochondria to oxidative stress or to other mild manifestations of Friedreich's Ataxia symptoms is not known.
The fourth SNP (rs3798220 in LPA) encodes a isoleucine to methionine substitution at amino acid 4399 of apolipoprotein(a). We have previously shown that this SNP is associated with coronary artery narrowing and with increased levels of plasma lipoprotein(a) in case-control studies . This SNP was also associated with incident myocardial infarction in the Cardiovascular Health Study, a population–based prospective study of about 5000 individuals aged 65 or older . The low minor allele frequency of this SNP in LPA (1% in the European American population of CHS ) suggests that this SNP accounts for only a small fraction of the total variability of plasma Lp(a) levels. Our previous data showed that Lp(a) levels were 5.9-fold higher in carriers of the 4399 methionine allele than in noncarriers ; and high levels of Lp(a) are associated with an increased risk for MI , . Additionally, one can speculate that the association of the 4399 methionine allele with increased risk of disease could also be due to the isoleucine to methionine change in apolipoprotein(a) that may result in a more deleterious form of Lp(a).
Lastly, the fifth SNP is in HLA-DPB2 (rs35410698) is also associated with MI in this study. HLA-DPB2 is a pseudogene in the Human Leukocyte Antigen (HLA) region. The HLA region is highly polymorphic, gene rich region. Linkage disequilibrium in this region can extend across hundred kilobases and encompass HLA as well as non-HLA genes . Therefore, additional genotyping of SNPs in this region would be needed in order to know which gene variant in this region could be associated with MI.
We analyzed case–control studies that were retrospectively collected and did not include fatal cases of MI. Therefore, SNPs specifically associated with fatal MI would not have been identified. There were some differences between the participants in these three studies, specifically, Study 3 controls were recruited from patients who underwent coronary catheterization, whereas Study 1 and Study 2 controls were recruited from a lipid clinic population and from community centers. Thus, SNPs that were associated with MI in Study 1 and Study 2 but not in Study 3 might be explained by the differences between these studies. For example, a SNP in THBS4 (rs1866389) that was found to be associated with MI in Study 1 and Study 2 but not in Study 3, has been previously reported to be associated with premature MI . However, the power to detect the association of THBS4 with MI in Study 3 was limited (40% power), thus the lack of association in Study 3 may represent a false negative result. The false discovery rate for the 5 SNPs that were associated with MI in Study 3 was 0.23. Thus we expect that some of the SNPs we identified could be false positive associations (type 1 error); replication from additional studies is required to validate the observed associations. We have looked for support for these associations in the published data from the Welcome Trust Case-Control Consortium data , unfortunately, none of the 5 SNPs we report here was genotyped in that study. Finally, although the SNPs in this study could potentially affect gene function, additional linkage disequilibrium analysis would be needed in order to determine if other SNPs in these region could better account for the associations with MI we observed.
We identified 5 SNPs in 4 genes that are likely associated with MI. These SNPs merit investigation in additional studies of MI.
Conceived and designed the experiments: DS JJD. Performed the experiments: ARA JC. Analyzed the data: DS JZL. Contributed reagents/materials/analysis tools: JPK DR MM SGE. Wrote the paper: DS ARA JJD.
- 1. American Heart Association (2002) Heart disease and stroke statistics: 2005 update. American Heart Association, Dallas.
- 2. Marenberg ME, Risch N, Berkman LF, Floderus B, de Faire U (1994) Genetic susceptibility to death from coronary heart disease in a study of twins. N Engl J Med 330: 1041–1046.
- 3. Cohen JC, Boerwinkle E, Mosley TH Jr, Hobbs HH (2006) Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med 354: 1264–1272.
- 4. Kathiresan S, Melander O, Anevski D, Guiducci C, Burtt NP, et al. (2008) Polymorphisms associated with cholesterol and risk of cardiovascular events. N Engl J Med 358: 1240–1249.
- 5. McPherson R, Pertsemlidis A, Kavaslar N, Stewart A, Roberts R, et al. (2007) A common allele on chromosome 9 associated with coronary heart disease. Science 316: 1488–1491.
- 6. Helgadottir A, Thorleifsson G, Manolescu A, Gretarsdottir S, Blondal T, et al. (2007) A common variant on chromosome 9p21 affects the risk of myocardial infarction. Science 316: 1491–1493.
- 7. Shiffman D, Rowland CM, Sninsky JJ, Devlin JJ (2006) Polymorphisms associated with coronary heart disease: better by the score. Curr Opin Mol Ther 8: 493–499.
- 8. Shiffman D, Ellis SG, Rowland CM, Malloy MJ, Luke MM, et al. (2005) Identification of four gene variants associated with myocardial infarction. Am J Hum Genet 77: 596–605.
- 9. Shiffman D, Rowland CM, Louie JZ, Luke MM, Bare LA, et al. (2006) Gene Variants of VAMP8 and HNRPUL1 Are Associated With Early-Onset Myocardial Infarction. Arterioscler Thromb Vasc Biol 26: 1613–1618.
- 10. Luke MM, Kane JP, Liu DM, Rowland CM, Shiffman D, et al. (2007) A polymorphism in the protease-like domain of apolipoprotein(a) is associated with severe coronary artery disease. Arterioscler Thromb Vasc Biol 27: 2030–2036.
- 11. Pancholi V (2001) Multifunctional alpha-enolase: its role in diseases. Cell Mol Life Sci 58: 902–920.
- 12. Gottesfeld JM (2007) Small molecules affecting transcription in Friedreich ataxia. Pharmacol Ther 116: 236–248.
- 13. Al-Mahdawi S, Pinto RM, Varshney D, Lawrence L, Lowrie MB, et al. (2006) GAA repeat expansion mutation mouse models of Friedreich ataxia exhibit oxidative stress leading to progressive neuronal and cardiac pathology. Genomics 88: 580–590.
- 14. Shiffman D, O'Meara ES, Bare LA, Rowland CM, Louie JZ, et al. (2008) Association of Gene Variants With Incident Myocardial Infarction in the Cardiovascular Health Study. Arterioscler Thromb Vasc Biol 28: 173–179.
- 15. Kamstrup PR, Benn M, Tybaerg-Hansen A, Nordestgaard BG (2008) Extreme lipoprotein(a) levels and risk of myocardial infarction in the general population: the Copenhagen City Heart Study. Circulation 117: 176–184.
- 16. Bennet A, Di Angelantonio E, Erqou S, Eiriksdottir G, Sigurdsson G, et al. (2008) Lipoprotein(a) levels and risk of future coronary heart disease: large-scale prospective data. Arch Intern Med 168: 598–608.
- 17. de Bakker PI, McVean G, Sabeti PC, Miretti MM, Green T, et al. (2006) A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nat Genet 38: 1166–1172.
- 18. Topol EJ, McCarthy J, Gabriel S, Moliterno DJ, Rogers WJ, et al. (2001) Single nucleotide polymorphisms in multiple novel thrombospondin genes may be associated with familial premature myocardial infarction. Circulation 104: 2641–2644.
- 19. The Wellcome Trust Case Control Consortium (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 447: 661–678.