Novel Association of HK1 with Glycated Hemoglobin in a Non-Diabetic Population: A Genome-Wide Evaluation of 14,618 Participants in the Women's Genome Health Study

  • Guillaume Paré ,

    Contributed equally to this work with: Guillaume Paré, Daniel I. Chasman

    Affiliations: Center for Cardiovascular Disease Prevention, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America, Donald W. Reynolds Center for Cardiovascular Research, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

  • Daniel I. Chasman ,

    Contributed equally to this work with: Guillaume Paré, Daniel I. Chasman

    Affiliations: Center for Cardiovascular Disease Prevention, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America, Donald W. Reynolds Center for Cardiovascular Research, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

  • Alexander N. Parker,

    Affiliation: Amgen, Inc., Cambridge, Massachusetts, United States of America

  • David M. Nathan,

    Affiliation: Diabetes Center, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

  • Joseph P. Miletich,

    Affiliation: Amgen, Inc., Cambridge, Massachusetts, United States of America

  • Robert Y. Zee,

    Affiliations: Center for Cardiovascular Disease Prevention, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America, Donald W. Reynolds Center for Cardiovascular Research, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

  • Paul M. Ridker

    Affiliations: Center for Cardiovascular Disease Prevention, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America, Donald W. Reynolds Center for Cardiovascular Research, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America

Novel Association of HK1 with Glycated Hemoglobin in a Non-Diabetic Population: A Genome-Wide Evaluation of 14,618 Participants in the Women's Genome Health Study

  • Guillaume Paré, 
  • Daniel I. Chasman, 
  • Alexander N. Parker, 
  • David M. Nathan, 
  • Joseph P. Miletich, 
  • Robert Y. Zee, 
  • Paul M. Ridker
  • Published: December 19, 2008
  • DOI: 10.1371/journal.pgen.1000312


Type 2 diabetes is a leading cause of morbidity and mortality. While genetic variants have been found to influence the risk of type 2 diabetes mellitus, relatively few studies have focused on genes associated with glycated hemoglobin, an index of the mean blood glucose concentration of the preceding 8–12 weeks. Epidemiologic studies and randomized clinical trials have documented the relationship between glycated hemoglobin levels and the development of long-term complications in diabetes; moreover, higher glycated hemoglobin levels in the subdiabetic range have been shown to predict type 2 diabetes risk and cardiovascular disease. To examine the common genetic determinants of glycated hemoglobin levels, we performed a genome-wide association study that evaluated 337,343 SNPs in 14,618 apparently healthy Caucasian women. The results show that glycated hemoglobin levels are associated with genetic variation at the GCK (rs730497; P = 2.8×10−12), SLC30A8 (rs13266634; P = 9.8×10−8), G6PC2 (rs1402837; P = 6.8×10−10), and HK1 (rs7072268; P = 6.4×10−9) loci. While associations at the GCK, SLC30A8, and G6PC2 loci are confirmatory, the findings at HK1 are novel. We were able to replicate this novel association in an independent validation sample of 455 additional non-diabetic men and women. HK1 encodes the enzyme hexokinase, the first step in glycolysis and a likely candidate for the control of glucose metabolism. This observed genetic association between glycated hemoglobin levels and HK1 polymorphisms paves the way for further studies of the role of HK1 in hemoglobin glycation, glucose metabolism, and diabetes.

Author Summary

Type 2 diabetes is a leading cause of morbidity and mortality in both the developed and developing world. Because the main metabolic characteristic of diabetes is increased blood glucose concentration, we sought to uncover the genetic determinants of glycated hemoglobin, an index of the mean blood glucose concentration of the preceding 8–12 weeks. Taking advantage of new technologies allowing us to interrogate genetic variation on a whole-genome basis, we found that variations in the GCK, SLC30A8, G6PC2, and HK1 genes are important determinants of glycated hemoglobin concentrations. While associations with the GCK, SLC30A8, and G6PC2 genes have previously been identified in genetic studies of diabetes and blood glucose concentration, the findings at HK1 are novel. HK1 encodes the enzyme hexokinase, responsible for the first step in glucose utilization and a likely candidate for the control of glucose metabolism. This observed genetic association between glycated hemoglobin levels and HK1 genetic variants paves the way for further studies of the role of HK1 in glucose metabolism and diabetes.


Type 2 diabetes is a leading cause of morbidity and mortality. While genetic variants influence the risk of type 2 diabetes mellitus [1], relatively little is known about the role of genetic variations in the regulation of glucose concentration in healthy individuals. Indeed, only two genes, glucokinase (GCK) [2][4] and G6PC2 [5],[6], have been unequivocally associated with fasting blood glucose concentration in healthy subjects. Other genes, such as SLC30A8, have been linked to glucose metabolism without ever having been associated with blood glucose concentration per se [7].

Glycated proteins are formed post-translationally from the slow, non-enzymatic attachment of glucose to N-terminal valine and internal lysine amino groups [8]. The concentration of glycated hemoglobin is a function of the concentration of glucose to which the erythrocytes are exposed over their lifespan (120 days on average). The glycated hemoglobin assay reflects the integrated blood glucose concentration of the preceding 8–12 weeks [9], provides a better estimate of mean glycemia than routine determinations of blood glucose concentration, and is the most widely used index of chronic glycemia [10],[11]. Clinical trials have documented the relationship between glycated hemoglobin levels and the development of long-term complications in type 1 and type 2 diabetes [12],[13], and higher glycated hemoglobin levels in the subdiabetic range have been shown to predict type 2 diabetes risk [14] and cardiovascular disease [14][17].

Despite considerable effort to decipher the genetic basis of type 2 diabetes, relatively few studies have focused on the genetic basis of glycated hemoglobin levels. To examine the common genetic determinants of glycated hemoglobin levels, we performed a genome-wide association study evaluating 337,343 SNPs in 14,618 apparently healthy non-diabetic women of Caucasian ancestry.

Materials and Methods

WGHS Study Participants

The study population derived from the Women's Genome Health Study (WGHS) [18]. Briefly, participants in the WGHS include women from the United States Women's Health Study (WHS) with no prior history of cardiovascular disease, cancer, or other major chronic illness [19]. All participants provided a blood sample at the time of study enrollment from which genomic DNA was extracted.

For all WGHS participants, baseline glycated hemoglobin (HbA1c) was measured using the Tina-Quant turbidimetric inhibition immunoassay (Roche Diagnostics, Indianapolis, Ind) on a Hitachi 911 autoanalyzer using packed red blood cells. This assay is standardized against the approved International Federation of Clinical Chemists reference method and has a coefficient of variation of 7.2%. In addition to excluding women with self-reported diabetes at baseline, individuals with glycated hemoglobin of 7.0% or more were removed from analysis as this level has been proposed as diagnostic of drug-requiring diabetes [20]. The study was approved by the institutional review board of the Brigham and Women's Hospital.


DNA samples were genotyped with the Infinium II technology from Illumina. Either the HumanHap300 Duo-Plus chip or the combination of the HumanHap300 Duo and I-Select chips was used. In either case, the custom content was identical and consisted of candidate SNPs chosen without regard to allele frequency to increase coverage of genetic variation with impact on biological function including metabolism, inflammation or cardiovascular diseases. Genotyping at 318,237 HumanHap300 Duo SNPs and 45,571 custom content SNPs was attempted, for a total of 363,808 SNPs. Genetic context for all annotations are derived from human genome build 36.1 and dbSNP build 126.

SNPs with call rates <90% were excluded from further analysis. Likewise, all samples with percentage of missing genotypes higher than 2% were removed (N = 1426). No obvious bias distinguished failed from successful samples and failures are therefore assumed to be the result of random technical artifacts. Among retained samples, SNPs were evaluated for deviation from Hardy-Weinberg equilibrium using an exact method [21] and were excluded when the P-value was lower than 10−6. Samples were further validated by comparison of genotypes at 44 SNPs that had been previously ascertained using alternative technologies. SNPs with minor allele frequency >1% in Caucasians were used for analysis. After quality control, 308,625 HumanHap300 Duo SNPs and 28,718 custom content SNPs were left, for a total of 337,343 SNPs.

Population Stratification

Because population stratification can result in inflated type I error in genome-wide association analysis, a principal component analysis using 1443 ancestry informative SNPs was performed using PLINK [22] in order to confirm self-reported ancestry. Briefly, these SNPs were chosen based on Fst>0.4 in HapMap populations (YRB, CEU, CHB+JPT) and inter-SNP distance at least 500 kb in order to minimize linkage disequilibrium. Different ethnic groups were clearly distinguished with the two first components. Based on this analysis, 58 individuals were excluded from further evaluation as they did not cluster with other Caucasians, leaving 14,618 participants for the current study population. Two additional steps were taken to rule out the possibility that residual stratification within Caucasians was responsible for the associations observed. First, association analysis was done with correction by genomic control. Second, a principal component analysis [23] was performed in previously identified Caucasians (only) using 124,931 SNPs chosen to have pair-wise linkage disequilibrium lower than r2 = 0.4. The first ten components were then used as covariates in the association analysis. As adjustment by these covariates did not change the conclusions, we present analysis among Caucasian participants without further correction for sub-Caucasian ancestry. The value of the genomic control was 1.03916 before adjustment for the first ten components and 1.03328 after adjustment.

Association Analysis

To identify common genetic variants influencing glycated hemoglobin levels, we first attempted to discover which loci significantly contributed to glycated hemoglobin concentration. Glycated hemoglobin concentrations were adjusted for age, menopause and body mass index using a linear regression model in R to reduce the impact of clinical covariates on glycated hemoglobin variance. The normal distribution of adjusted glycated hemoglobin values was checked by visual inspection of the normal Quantile-Quantile plot. Adjusted glycated hemoglobin values were then tested for association with SNP genotypes by linear regression in PLINK [22], assuming an additive contribution of each minor allele [24]. Beta coefficients are given using the major allele as the reference allele (major and minor alleles are presented in Supplementary Table S2). We also tested for association using genotypic, recessive and dominant genetic models. Since no additional locus was identified using these models we only present the results of the additive genetic model. A conservative P-value cut-off of 5×10−8 was used to correct for the maximum of 1,000,000 independent statistical tests thought to correspond to all the common genetic variation of the human genome [25]. The false discovery rate was also calculated as described by Benjamini and Hochberg [26].

Once any locus with genome-wide significance was identified, a forward selection linear multiple regression model was used to further define the extent of the genetic association. Briefly, all genotyped SNPs within 100 kb of the most significantly associated SNP at each locus and passing quality control requirements were tested for possible incorporation into a multiple regression model. In stepwise fashion, a SNP was added to the model if it had the smallest P-value among all the SNPs not yet included in the model and if it was statistically significant after adjusting for multiple comparisons. SNPs selected by this algorithm were also used in haplotype analysis using WHAP [27], as implemented in PLINK [22].

Validation Sample

We sought to validate any novel findings from the WGHS in an independent replication sample comprised of 455 non-diabetic Caucasian participants recruited from the Boston metropolitan area [28]. A history of diabetes, or use of anti-diabetic medications, and/or fasting glucose levels >125 mg/dL, or abnormal oral glucose tolerance test results were used to exclude persons with diabetes in this cohort. Glycated hemoglobin was measured with an HPLC assay that serves as a primary reference assay for the National Glycohemoglobin Standardization Program, with coefficients of variation <2.5% for low and high standards. Genotype determination was performed with an ABI fluorescence-based allelic discrimination method (rs7072268) [29] or a single reaction kinetic thermal cycling method (rs2305198) [30]. The study was approved by the Brigham and Women's Hospital and Massachusetts General Hospital Institutional Review Boards for human subjects research.


Clinical characteristics of the 14,618 non-diabetic women in the primary WGHS analysis and of the 455 non-diabetic men and women in the external validation cohort are provided in Supplementary Table S1. Results of the genome-wide association analysis of adjusted baseline glycated hemoglobin concentration are presented in Table 1, which includes all SNPs with P-value lower than 10−6. Six SNPs at three different loci – G6PC2, GCK and HK1 - had an association P-value lower than our previously defined genome-wide significance threshold of 5×10−8: rs1402837, rs560887 and rs6709087 at G6PC2; rs730497 and rs4607517 at GCK; rs7072268 at HK1. In addition, two SNPs, one at the HK1 locus and one at the SCL30A8 locus, had association P-values just above this threshold level (5.5×10−8 and 9.8×10−8, respectively). Genetic context of all these loci is presented in Figure 1 along with the −log10 transformed P-values.

Figure 1. Genetic Context of Significant Associations.

Genomic context for each of four loci with significant association with glycated hemoglobin levels. (A) G6PC2 locus (2q24.3); (B) GCK locus (19p13.2); (C) SLC30A8 locus (8q24.11); (D) HK1 locus (10q21.3). Upper panel: Genes from RefSeq release 25. Only one isoform is shown when multiple splicing variants are known. Lower Panel: SNPs are shown according to their physical location and −log10 P-values for association with glycated hemoglobin. At each locus, the SNP with the lowest P-value is represented by a black dot. The color of each other SNP was determined according to its linkage disequilibrium (r2) with the SNP with the lowest P-value, with colors varying from blue (r2 = 0) to red (r2 = 1). In Figure 1-D, the chromosomal position of rs2305198 is shown as this SNP was selected by the model selection algorithm in addition to the SNP with the lowest P-value (see text for details). The red line represents the genome-wide significance threshold of 5×10−8. Also shown is the genetic distance in cM from the lowest P-value SNP (light grey line) along with the position of recombination hotspots (light grey vertical bars). Recombination rates and hotspots are based on HapMap data, as described by McVean et al.[46] and Winckler et al.[47].


Table 1. SNPs with a P-Value Lower than 10−6 for Association with Adjusted Glycated Hemoglobin.


To define further the extent of genetic associations at these 4 loci, we applied a forward model selection algorithm to each of them, identifying SNPs that were non-redundantly associated with glycated hemoglobin. Briefly, 38 SNPs at GCK, 31 at SLC30A8, 41 at HK1 and 44 at G6PC2 were initially assessed for possible inclusion in a multiple linear regression model. Using a P-value cut-off of 3×10−4 to account for the 154 SNPs considered, 5 SNPs were selected, representing the lead SNP at each of the 4 loci considered plus one SNP (rs2305198; located at 70799 Kb on chromosome 10) at the HK1 locus (see Table 2). Interestingly, this later SNP (rs2305198; HK1) was marginally significant in univariable analysis (p = 0.02), illustrating that its inclusion in the model and significant association are conditional on the genotypes at rs7072268 (HK1). Multiple regression beta coefficients and P-values of the 5 SNPs selected are shown in Table 2 and range from 5.4×10−8 to 1.8×10−25. Shown in Figure 2 are the quantile-quantile plots of association P-values before and after adjusting glycated hemoglobin concentration for the combined effect of these 5 SNPs. Among these SNPs, there was no strong evidence for non-additive effects of the minor allele as judged by lack of significance for a likelihood ratio test comparing the additive regression model to an alternative genotype model with an additional degree of freedom. No pairwise gene-gene interaction was observed between any of the model selected SNPs. That is, we tested every possible 2-SNPs combination among these 5 SNPs.

Figure 2. Quantile-Quantile plot of Glycated Hemoglobin Association P-Values.

The quantile-quantile plot of glycated hemoglobin association P-values is shown on the left. On the right, the same quantile-quantile plot is shown, but after adjusting glycated hemoglobin values for the additive genetic effect of the 5 SNPs retained by the model selection algorithm (rs1402837, rs730497, rs13266634, rs7072268 and rs2305198; see text for details).


Table 2. Multiple Linear Regression Statistics of SNPs Retained by the Forward Model Selection Algorithm.


We sought to replicate the novel association between glycated hemoglobin level and HK1 SNPs (rs2305198 and rs7072268) in the external validation sample of 455 non-diabetic men and women from the Boston metropolitan area. We included both SNPs (assuming an additive genetic effect) in a multiple regression model that also included age, sex, menopause and BMI as covariates. Both SNPs were successfully replicated, with two-sided P-values of 0.027 (rs2305198) and 0.050 (rs7072268), and both with consistent direction of effect. Furthermore, the effect size (i.e. beta coefficients) was larger than in the WGHS, with changes in glycated hemoglobin of 0.101% (rs2305198) and 0.085% (rs7072268) per allele in the replication sample, as opposed to 0.041% and 0.046% respectively in the WGHS. The association was validated in both men (N = 204) and women (N = 251), without any evidence of interaction with sex (data not shown). Both SNPs were in Hardy-Weinberg equilibrium, had allelic frequencies consistent with the ones observed in WGHS, and were successfully genotyped in over 93% of individuals. Of note, the P-values for rs2305198 and rs7072268 were non-significant (p>0.05) when each SNP was analyzed separately in a linear model. Overall, the P-values for rs2305198 and rs7072268 when both SNPs were included in a multiple regression model that used the combined WGHS and replication sample were 9.9×10−20 and 1.7×10−25, respectively.

The 2 SNPs at the HK1 locus selected by our algorithm were also used in haplotype analysis in WGHS individuals (Table 3). The estimate of the proportion of variance attributable to haplotypes, as well as their regression coefficients, is consistent with the linear model of these same SNPs, reinforcing the adequacy of the multiple linear model to explain the association. Linkage disequilibrium between these two HK1 SNPs was 0.49 for r2 and 0.90 for D'. The 2 SNPs at HK1 collectively explained 0.7% of the total variance in glycated hemoglobin concentration, whereas the G6PC2 SNP (rs1402837) explained 0.2%, the GCK SNP (rs730497) 0.3% and the SLC30A8 SNP (rs13266634) 0.2%. In comparison, clinical covariates accounted for 9.5% of the variance (Table 4), and together the candidate loci and the clinical variables accounted for 10.9% of total variance. As was the case in the WGHS, haplotype analysis of the replication sample yielded similar results as the multivariable linear regression, albeit with lesser statistical significance (data not shown).

Table 3. Haplotype Analysis of rs7072268 and rs2305198 (HK1 locus) in Relation to Adjusted Glycated Hemoglobin.


Table 4. Partition of Glycated Hemoglobin Variance According to Genetic and Clinical Variables.


Since several SNPs have been unequivocally associated with type 2 diabetes, we tested whether these “candidate” SNPs [1],[4],[31],[32] were associated with glycated hemoglobin as well (Table 5). When a SNP was not directly typed in WGHS, we used HapMap CEU linkage disequilibrium data to find the WGHS SNP with the highest r2 with that SNP. While all SNPs tested had a direction of association consistent with their effect on the risk of diabetes, only 10 SNPs had a (one-sided) P-value lower than 0.05 (Table 5). Overall, the one-sided association P-value for the 19 SNPs taken together (i.e. the P-value obtained by combining all 19 one-sided P-values; this was done using Fisher's method for combining P-values from independent tests) was 3.0×10−22. The P-value was 8.5×10−9 when the SLC30A8 SNP rs13266634 and the GCK SNP rs730497 were removed. It has previously been shown that the metabolic effects of rs8050136 (FTO locus) are mediated mainly through increased body weight [33],[34]. Adjustment of glycated hemoglobin for BMI would therefore be expected to dampen the association of rs8050136 (FTO locus) with glycated hemoglobin level. Indeed, non-adjusted glycated hemoglobin showed stronger association with rs8051136 (p = 0.007).

Table 5. Association of Glycated Hemoglobin with Type II Diabetes Candidate SNPs.



Four loci – GCK, SLC30A8, HK1 and G6PC2 – have been identified in this report as being significantly associated with glycated hemoglobin levels in non-diabetic individuals. While genetic variants of GCK [2][4] and G6PC2 [5],[6] are known to influence fasting glucose concentrations in non-diabetics, the association with the diabetes gene SLC30A8 [7] has not been previously identified. In contrast, genetic associations with glycated hemoglobin have previously been described with the candidate genes calpain-10 (CAPN10) [35], resistin (RETN) and adiponectin (ADIPOQ) loci [36], although none of these associations were replicated in our data.

The most striking association observed in the current report involves HK1, which is a novel finding. Four different hexokinase isozymes, named type I, II, III and glucokinase (GCK), are present in mammalian tissues. Hexokinase type I (HK1) is the predominant isoform found in erythrocytes, although it is also expressed in other human tissues that depend strongly on glucose utilization for their physiologic functioning, such as brain and muscle [37],[38]. Despite this wide tissue distribution, the main clinical manifestation of rare HK1 mutations is severe nonspherocytic hemolytic anemia [38]. To our knowledge, no common genetic variation has previously been correlated with any phenotype. The reason why erythrocytes are particularly susceptible to these rare mutations is not clear, but the prevailing assumption is that tissues strongly dependent on HK1, such as erythrocytes, might be preferably affected. One hypothesis to explain the strong association of rs7072268 and rs2305198 with glycated hemoglobin levels is that these SNPs affect hexokinase concentration or activity in erythrocytes (or act as proxies for other HK1 functional variants). Since intra-erythrocyte glucose concentration is thought to be the main determinant of hemoglobin glycation rate [39] and hexokinase is the initial and rate-limiting enzymatic step in erythrocyte glucose metabolism, after glucose transporter mediated diffusion, it is likely that variations in HK1 activity would affect glycation. Alternatively, genetic variations at HK1 might modulate systemic glucose metabolism, raising the possibility that these variations are associated with the risk of incident diabetes. Indeed, HK1 is expressed in muscle, an insulin-sensitive tissue recognized for its importance in glucose metabolism.

SLC30A8 functions as a zinc transporter in secretory vesicles of pancreatic beta cells, providing zinc for insulin maturation and storage [40],[41]. A coding non-synonymous polymorphism of SLC30A8 (R325W; rs13266634) has been demonstrated to be protective for type 2 diabetes [32], [42][45] and was shown to influence insulin secretion following an intravenous glucose challenge [7]. In accordance with these results, that same coding SNP was associated with lower glycated hemoglobin in our study.

Because many genetic variants have been previously associated with the risk of type 2 diabetes, we also tested these for association with glycated hemoglobin. Although not genome-wide significant, all of these associations had a consistent direction of effect (as compared with their association with diabetes). The low combined P-value shows that it is very unlikely that these glycated hemoglobin associations had such a consistent direction of effect by chance alone, suggesting that they likely reflect underlying “true” associations. Furthermore, it also suggests that if we had analyzed a much bigger sample, some of these associations might have become genome-wide significant. To address the possibility that the very low P-value from the analysis of all 19 SNPs stems solely from the inclusion of the very significant SLC30A8 and GCK SNPs, we repeated the procedure using only the weakly associated SNPs as a sensitivity analysis, with similar results. It therefore appears that genetic variations affecting the risk of future diabetes do not necessarily have a marked impact on glycated hemoglobin in healthy women, pointing to different, yet partially overlapping, genetic architectures for glycated hemoglobin and diabetes.

The limitations of this study include less than certain elimination of diabetic subjects in the WGHS cohort, since we did not perform oral glucose tolerance testing. Therefore, some subjects with undiagnosed diabetes might have been included in that population. However, we believe it is unlikely this would have affected the results for two reasons. First, when we reproduced our analysis using only women with glycated hemoglobin lower than 6.0%, we obtained similar results. Second, we were able to replicate our novel HK1 findings in the validation cohort in which OGTT was used to eliminate diabetic participants. Although HK1 polymorphisms are associated with glycated hemoglobin levels, the functional consequences of these polymorphisms needs to be established.

In this report, we demonstrate that glycated hemoglobin levels in non-diabetic women are affected by genetic variation at the GCK, SLC30A8, G6PC2 and HK1 loci. While associations at the GCK, SLC30A8 and G6PC2 loci confirm or extend previous work done on the genetic basis of diabetes or fasting glucose, our replicated findings at HK1 are novel. It will be of considerable interest to study HK1 in non-Caucasian populations, as well as in pre-diabetic and diabetic populations, although the confounding effects of treatment might obscure any role of these polymorphisms in the diabetic population. This hypothesis generating genetic observation paves the way for further studies of the role of HK1 in hemoglobin glycation, glucose metabolism and diabetes. Whether the differences in hemoglobin glycation associated with genetic polymorphisms demonstrated herein are paralleled by differences in glycation in other tissues that are thought to underlie long-term complications of diabetes remains an open question.

Supporting Information

Table S1.

Clinical Characteristics of the Samples Used.


(0.04 MB DOC)

Table S2.

Minor and Major Alleles.


(0.04 MB DOC)


We thank the study research team and the women who have participated in the study. We also thank Lynda Rose for her review of the manuscript and helpful comments.

Author Contributions

Conceived and designed the experiments: GP DIC DMN PMR. Performed the experiments: GP ANP. Analyzed the data: GP. Contributed reagents/materials/analysis tools: DIC JPM RYZ PMR. Wrote the paper: GP.


  1. 1. Frayling TM (2007) Genome-wide association studies provide new insights into type 2 diabetes aetiology. Nat Rev Genet 8: 657–662.
  2. 2. Weedon MN, Clark VJ, Qian Y, Ben-Shlomo Y, Timpson N, et al. (2006) A common haplotype of the glucokinase gene alters fasting glucose and birth weight: association in six studies and population-genetics analyses. Am J Hum Genet 79: 991–1001.
  3. 3. Weedon MN, Frayling TM, Shields B, Knight B, Turner T, et al. (2005) Genetic regulation of birth weight and fasting glucose by a common polymorphism in the islet cell promoter of the glucokinase gene. Diabetes 54: 576–581.
  4. 4. Sparso T, Andersen G, Nielsen T, Burgdorf KS, Gjesing AP, et al. (2008) The GCKR rs780094 polymorphism is associated with elevated fasting serum triacylglycerol, reduced fasting and OGTT-related insulinaemia, and reduced risk of type 2 diabetes. Diabetologia 51: 70–75.
  5. 5. Chen WM, Erdos MR, Jackson AU, Saxena R, Sanna S, et al. (2008) Variations in the G6PC2/ABCB11 genomic region are associated with fasting glucose levels. J Clin Invest.
  6. 6. Bouatia-Naji N, Rocheleau G, Van Lommel L, Lemaire K, Schuit F, et al. (2008) A Polymorphism Within the G6PC2 Gene Is Associated with Fasting Plasma Glucose Levels. Science.
  7. 7. Staiger H, Machicao F, Stefan N, Tschritter O, Thamer C, et al. (2007) Polymorphisms within novel risk loci for type 2 diabetes determine beta-cell function. PLoS ONE 2: e832.
  8. 8. Bunn HF (1981) Nonenzymatic glycosylation of protein: relevance to diabetes. Am J Med 70: 325–330.
  9. 9. Nathan DM, Turgeon H, Regan S (2007) Relationship between glycated haemoglobin levels and mean glucose levels over time. Diabetologia 50: 2239–2244.
  10. 10. Nathan DM, Singer DE, Hurxthal K, Goodson JD (1984) The clinical information value of the glycosylated hemoglobin assay. N Engl J Med 310: 341–346.
  11. 11. Singer DE, Coley CM, Samet JH, Nathan DM (1989) Tests of glycemia in diabetes mellitus. Their use in establishing a diagnosis and in treatment. Ann Intern Med 110: 125–137.
  12. 12. The Diabetes Control and Complications Trial Research Group (1993) The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N Engl J Med 329: 977–986.
  13. 13. UK Prospective Diabetes Study Group (1998) Intensive blood-glucose control with sulphonylureas or insulin compared with conventional treatment and risk of complications in patients with type 2 diabetes (UKPDS 33). Lancet 352: 837–853.
  14. 14. Pradhan AD, Rifai N, Buring JE, Ridker PM (2007) Hemoglobin A1c predicts diabetes but not cardiovascular disease in nondiabetic women. Am J Med 120: 720–727.
  15. 15. Khaw KT, Wareham N, Bingham S, Luben R, Welch A, et al. (2004) Association of hemoglobin A1c with cardiovascular disease and mortality in adults: the European prospective investigation into cancer in Norfolk. Ann Intern Med 141: 413–420.
  16. 16. Khaw KT, Wareham N (2006) Glycated hemoglobin as a marker of cardiovascular risk. Curr Opin Lipidol 17: 637–643.
  17. 17. Lawlor DA, Fraser A, Ebrahim S, Smith GD (2007) Independent associations of fasting insulin, glucose, and glycated haemoglobin with stroke and coronary heart disease in older women. PLoS Med 4: e263.
  18. 18. Ridker PM, Chasman DI, Zee RY, Parker A, Rose L, et al. (2008) Rationale, design, and methodology of the Women's Genome Health Study: a genome-wide association study of more than 25,000 initially healthy american women. Clin Chem 54: 249–255.
  19. 19. Ridker PM, Cook NR, Lee IM, Gordon D, Gaziano JM, et al. (2005) A randomized trial of low-dose aspirin in the primary prevention of cardiovascular disease in women. N Engl J Med 352: 1293–1304.
  20. 20. Peters AL, Davidson MB, Schriger DL, Hasselblad V (1996) A clinical approach for the diagnosis of diabetes mellitus: an analysis using glycosylated hemoglobin levels. Meta-analysis Research Group on the Diagnosis of Diabetes Using Glycated Hemoglobin Levels. Jama 276: 1246–1252.
  21. 21. Wigginton JE, Cutler DJ, Abecasis GR (2005) A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet 76: 887–893.
  22. 22. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
  23. 23. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38: 904–909.
  24. 24. Lettre G, Lange C, Hirschhorn JN (2007) Genetic model testing and statistical power in population-based association studies of quantitative traits. Genet Epidemiol 31: 358–362.
  25. 25. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, et al. (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449: 851–861.
  26. 26. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society Series B (Methodological) 57: 289–300.
  27. 27. Purcell S, Daly MJ, Sham PC (2007) WHAP: haplotype-based association analysis. Bioinformatics 23: 255–256.
  28. 28. Zee RY, Germer S, Thomas A, Raji A, Rhees B, et al. (2008) C-reactive protein gene variation and type 2 diabetes mellitus: a case-control study. Atherosclerosis 197: 931–936.
  29. 29. de Kok JB, Wiegerinck ET, Giesendorf BA, Swinkels DW (2002) Rapid genotyping of single nucleotide polymorphisms using novel minor groove binding DNA oligonucleotides (MGB probes). Hum Mutat 19: 554–559.
  30. 30. Wang J, Chuang K, Ahluwalia M, Patel S, Umblas N, et al. (2005) High-throughput SNP genotyping by single-tube PCR with Tm-shift primers. Biotechniques 39: 885–893.
  31. 31. Zeggini E, Scott LJ, Saxena R, Voight BF, Marchini JL, et al. (2008) Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet 40: 638–645.
  32. 32. Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, et al. (2007) Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science 316: 1331–1336.
  33. 33. Freathy RM, Timpson NJ, Lawlor DA, Pouta A, Ben-Shlomo Y, et al. (2008) Common variation in the FTO gene alters diabetes-related metabolic traits to the extent expected given its effect on BMI. Diabetes 57: 1419–1426.
  34. 34. Do R, Bailey SD, Desbiens K, Belisle A, Montpetit A, et al. (2008) Genetic variants of FTO influence adiposity, insulin sensitivity, leptin levels, and resting metabolic rate in the Quebec Family Study. Diabetes 57: 1147–1150.
  35. 35. Shima Y, Nakanishi K, Odawara M, Kobayashi T, Ohta H (2003) Association of the SNP-19 genotype 22 in the calpain-10 gene with elevated body mass index and hemoglobin A1c levels in Japanese. Clin Chim Acta 336: 89–96.
  36. 36. Krizova J, Dolinkova M, Lacinova Z, Sulek S, Dolezalova R, et al. (2007) Adiponectin and resistin gene polymorphisms in patients with anorexia nervosa and obesity and its influence on metabolic phenotype. Physiol Res.
  37. 37. Murakami K, Piomelli S (1997) Identification of the cDNA for human red blood cell-specific hexokinase isozyme. Blood 89: 762–766.
  38. 38. van Wijk R, van Solinge WW (2005) The energy-less red blood cell is lost: erythrocyte enzyme abnormalities of glycolysis. Blood 106: 4034–4042.
  39. 39. Burtis CA, Ashwood ER, Bruns DE (2005) Tietz Textbook of Clinical Chemistry and Molecular Diagnostics: Saunders. 2448 p.
  40. 40. Chimienti F, Devergnas S, Favier A, Seve M (2004) Identification and cloning of a beta-cell-specific zinc transporter, ZnT-8, localized into insulin secretory granules. Diabetes 53: 2330–2337.
  41. 41. Chimienti F, Devergnas S, Pattou F, Schuit F, Garcia-Cuenca R, et al. (2006) In vivo expression and functional characterization of the zinc transporter ZnT8 in glucose-induced insulin secretion. J Cell Sci 119: 4199–4206.
  42. 42. Omori S, Tanaka Y, Takahashi A, Hirose H, Kashiwagi A, et al. (2008) Association of CDKAL1, IGF2BP2, CDKN2A/B, HHEX, SLC30A8, and KCNJ11 with susceptibility to type 2 diabetes in a Japanese population. Diabetes 57: 791–795.
  43. 43. Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, et al. (2007) A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316: 1341–1345.
  44. 44. Sladek R, Rocheleau G, Rung J, Dina C, Shen L, et al. (2007) A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature 445: 881–885.
  45. 45. Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, et al. (2007) Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science 316: 1336–1341.
  46. 46. McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, et al. (2004) The fine-scale structure of recombination rate variation in the human genome. Science 304: 581–584.
  47. 47. Winckler W, Myers SR, Richter DJ, Onofrio RC, McDonald GJ, et al. (2005) Comparison of fine-scale recombination rates in humans and chimpanzees. Science 308: 107–111.