The existence of multiple inherited disorders of iron metabolism suggests genetic contributions to iron deficiency. We previously performed a genome-wide association study of iron-related single nucleotide polymorphisms (SNPs) using DNA from white men aged ≥25 y and women ≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS) Study with serum ferritin (SF) ≤12 µg/L (cases) and controls (SF >100 µg/L in men, SF >50 µg/L in women). We report a follow-up study of white, African-American, Hispanic, and Asian HEIRS participants, analyzed for association between SNPs and eight iron-related outcomes. Three chromosomal regions showed association across multiple populations, including SNPs in the TF and TMPRSS6 genes, and on chromosome 18q21. A novel SNP rs1421312 in TMPRSS6 was associated with serum iron in whites (p = 3.7×10−6) and replicated in African Americans (p = 0.0012). Twenty SNPs in the TF gene region were associated with total iron-binding capacity in whites (p<4.4×10−5); six SNPs replicated in other ethnicities (p<0.01). SNP rs10904850 in the CUBN gene on 10p13 was associated with serum iron in African Americans (p = 1.0×10−5). These results confirm known associations with iron measures and give unique evidence of their role in different ethnicities, suggesting origins in a common founder.
Citation: McLaren CE, McLachlan S, Garner CP, Vulpe CD, Gordeuk VR, et al. (2012) Associations between Single Nucleotide Polymorphisms in Iron-Related Genes and Iron Status in Multiethnic Populations. PLoS ONE 7(6): e38339. doi:10.1371/journal.pone.0038339
Editor: Ivan Cruz Moura, Institut national de la santé et de la recherche médicale (INSERM), France
Received: January 14, 2012; Accepted: May 3, 2012; Published: June 22, 2012
Copyright: © 2012 McLaren et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Support was provided by grant R01 HL083328 from the National Heart, Lung, and Blood Institute (C.E.M.), grant R01DK57892 from the United States National Institutes of Health (J.A.M.), and by a Merit Review award from the Department of Veterans Affairs (G.D.M.). The HEIRS Study was initiated and funded by NHLBI, in conjunction with NHGRI (National Human Genome Research Institute). Data collection for this study was supported by contracts N01-HC-05185 (University of Minnesota), N01-HC-05186 (Howard University), N01-HC-05188 (University of Alabama at Birmingham), N01-HC-05189 (Kaiser Permanente Center for Health Research), N01-HC-05190 (University of California, Irvine), N01-HC-05191 (London Health Sciences Centre), and N01-HC-05192 (Wake Forest University). Additional support was provided by the University of Alabama at Birmingham General Clinical Research Center (G.C.R.C.) grant M01-RR00032, Southern Iron Disorders Center (J.C.B.), Howard University GCRC grant M01-RR10284, Howard University Research Scientist Award UH1-HL03679-05 from the National Heart, Lung, and Blood Institute and the Office of Research on Minority Health (V.R.G.); and grant UC Irvine M01RR 00827-29 from the General Clinical Research Centers Program of the National Center for Research Resources National Institutes of Health. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The levels of iron, a micronutrient required for life and health, must be tightly regulated to avoid excess unbound iron that can generate toxic free radicals while at the same time maintaining adequate supplies for vital functions including oxygen carrying capacity. Disorders of iron metabolism underlie some of the most prevalent diseases in humans and encompass a broad spectrum of clinical manifestations, ranging from anemia to iron overload and neurodegenerative diseases. Body iron balance normally is maintained by control of iron uptake from the diet by the duodenal enterocytes and its transfer to the systemic circulation, as humans cannot actively excrete iron. Intestinal iron absorption and release of stored iron from macrophages are dependent on similar pathways. While many factors can lead to iron deficiency, most commonly it is attributable to blood loss, a lack of dietary abundance, or defective absorption that collectively affect two-thirds of the world’s population. However, the existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates makes plausible a genetic contribution to iron deficiency. It is widely known that genetics can play a significant role in the iron overload found in whites, the most common example being hereditary hemochromatosis attributable to mutations in the hemochromatosis gene, HFE. Less is known about the role of genetic factors in disorders of iron status in other ethnic groups or about genetic effects on susceptibility to iron deficiency.
In order to investigate the genetic contribution to iron deficiency in whites, we recently completed a genome-wide association study (GWAS) in iron-deficient white male and female participants in the Hemochromatosis and Iron Overload Screening (HEIRS) Study. Case-control status and seven quantitative iron-related measures were studied, including serum iron (SI), total iron-binding capacity (TIBC), unsaturated iron-binding capacity (UIBC), transferrin saturation (TfS), serum ferritin concentration (SF), serum transferrin receptor (sTfR), and body iron. The quantitative iron-related measures were significantly associated with presence of iron deficiency in the GWAS (p<0.001), and a high degree of concordance was observed in the results across the quantitative traits. The study found genome-wide statistically significant associations between at least one of the iron measures and SNPs on chromosomes 2p14, 3q22 (in the transferrin gene, TF), 6p22 (in the HFE gene), 7p21 and 22q11, with the association at TF being replicated in a follow-up case-control study.
Furthermore, mutations in the TMPRSS6 gene have been implicated in iron deficiency anemia refractory to oral iron therapy within white populations. Further evidence of genetic influences on iron status was found in a recent GWAS of four serum markers of iron status (serum iron, transferrin, transferrin saturation and serum ferritin). Along with confirming previously reported associations of the HFE C282Y mutation, significant associations between iron status markers and the TMPRSS6 gene and TF were reported. Tanaka and colleagues investigated genetic variants associated with iron concentrations in persons not affected by overt genetic disorders of iron metabolism and found SNPs in TMPRSS6 were strongly associated with lower serum iron concentration and other hematological variables. Most of the genetic studies of iron deficiency and iron status markers carried out to date have used samples from populations of white individuals.
In the current study, we have investigated SNPs and iron status in multiple ethnic groups, including not only whites but also African Americans, Hispanics, and Asians. Our aims were to investigate whether the same SNPs associated with iron deficiency in whites play a similar role in other ethnic groups and to identify additional SNPs that may play a role in iron deficiency in these populations. The role of 1239 candidate or known SNPs associated with iron deficiency and iron status measures was tested in white, African-American, Hispanic and Asian iron deficient case and normal control samples. For consistency, the same outcome measures of iron status previously examined in whites were assessed in the other ethnic groups in the current study. To our knowledge, this research represents the first major assessment of candidate genetic determinants of iron deficiency and iron status measures in non-white populations. In addition to studying the association between SNPs and the primary outcome of presence of iron deficiency, a unique aspect of the statistical approach was the estimation of the effect of increasing copies of the minor allele of selected SNPs on changes in the degree of iron deficiency in multiple ethnicities.
The current study utilized a subset of subjects who had been enrolled in the initial screening phase of the HEIRS Study at five Field Centers encompassing six geographic locations including Alabama, California, District of Columbia, Hawaii, and Oregon in the United States, and Ontario, Canada. Participants were eligible for the current study if they had not withdrawn consent and had agreed to blood storage. Selection criteria included self-report of white or Caucasian, African-American, Hispanic or Asian race/ethnicity, males at least 25 years of age and females at least 50 years. Females younger than 50 years were excluded because of pre-menopausal iron depletion from blood loss. Approval for the study was obtained from the following: Institutional Review Board of the University of California, Irvine; Institutional Review Board of the University of California, Berkeley; Institutional Review Board of the University of Minnesota; Howard University Institutional Review Board; Institutional Review Board of the University of Alabama at Birmingham; Institutional Review Board of Kaiser Permanente Northwest Center for Health Research; Institutional Review Board of Wake Forest University Health Sciences; and the University of Western Ontario Research Ethics Board for Health. Written informed consent was obtained from all participants.
Population samples consisting of cases of iron deficiency and controls were selected from the African-American, Asian, Hispanic, and white participants in the HEIRS Study. Cases of iron deficiency were defined as subjects having a serum ferritin concentration (SF) ≤12 µg/L, the point of total depletion of iron stores. Controls (SF >100 µg/L in men, SF >50 µg/L in women) were frequency matched 2:1 to cases by sex and geographic location. Cases and controls who were selected from African Americans (77 cases, 144 controls), Asians (51 cases, 102 controls), and Hispanics (79 cases, 160 controls) were new to this study and had not been included in the previous GWAS. White subjects included 357 cases and 358 controls from the previous GWAS as well as an additional 374 white controls added to achieve the desired 2:1 frequency matching.
Laboratory methods are described in detail elsewhere. Briefly, HFE C282Y and H63D genotypes were determined using the Invader® Assay (Third Wave Technologies, Madison WI). Lack of a detectable C282Y or H63D mutation was designated as HFE wild-type (wt/wt). Spectrophotometric measures of SI and UIBC levels, turbidometric immunoassay of SF (Roche Applied Science/Hitachi 911, Indianapolis, IN), and calculation of TfS were performed on non-fasting blood samples. SI, SF, UIBC, and sTfR were analyzed using Roche reagents on the Roche/Hitachi Modular P instrument (Roche Diagnostics, Indianapolis IN). TIBC was calculated as the sum of SI + UIBC. TfS was calculated as the ratio, SI/TIBC, and expressed as a percentage. Body iron (mg/kg), an index of iron deficiency, was assessed as follows: body iron = −[log10((sTfR × 1000)/SF) − 2.8229]/0.1207. In this approach, body iron is expressed as a positive value when stores are present and as a negative value in tissue iron deficiency. A body iron < −4 mg/kg body weight represents a deficit severe enough to produce anemia. However, positive values may occur in some cases of iron deficiency, for example, when sTfR is not elevated as a result of a lack of erythropoietin related to co-morbid conditions such as kidney disease. The sTfR/SF ratio was calibrated previously by quantitative phlebotomy performed in healthy subjects. To exclude common environmental causes of iron deficiency, antibody testing was performed for H. pylori, carcinoembryonic antigen (CEA), and celiac disease. C-reactive protein (CRP), alanine aminotransferase (ALT), and gamma-glutamyltransferase (GGT) were measured to identify acute phase protein elevations in SF.
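The derived measures defined above can be illustrated with a short Python sketch. It is not the study's software; the function names are hypothetical, and the units for sTfR (mg/L) and SF (µg/L) are an assumption suggested by the factor of 1000 in the formula rather than something stated in the text.

```python
import math


def tibc(si, uibc):
    """Total iron-binding capacity as the sum SI + UIBC (same units as the inputs)."""
    return si + uibc


def transferrin_saturation(si, uibc):
    """Transferrin saturation as the ratio SI/TIBC, expressed as a percentage."""
    return 100.0 * si / tibc(si, uibc)


def body_iron(stfr, sf):
    """Body iron (mg/kg) = -[log10((sTfR x 1000)/SF) - 2.8229] / 0.1207.

    Assumed units (not stated in the text): sTfR in mg/L, SF in ug/L.
    """
    ratio = (stfr * 1000.0) / sf
    return -(math.log10(ratio) - 2.8229) / 0.1207


# Hypothetical input values, chosen only to show the sign convention.
replete = body_iron(stfr=1.0, sf=100.0)   # low sTfR/SF ratio: positive body iron
depleted = body_iron(stfr=10.0, sf=8.0)   # high sTfR/SF ratio: negative body iron
```

Under this formula body iron crosses zero when the sTfR/SF ratio equals 10^2.8229, so larger ratios give negative values (tissue iron deficiency) and smaller ratios give positive values (stores present).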
Sample and SNP Selection, Genotyping and Quality Control Procedures
Buffy coat DNA was extracted and purified by SDS cell lysis followed by a salt precipitation method for protein removal using commercial Puregene® reagents (Gentra System, Inc., Minneapolis, MN, now Qiagen, Valencia, CA). Using GoldenGate methodology, a custom SNP set with 1536 SNPs per array was designed to cover the number of SNPs that had been chosen for genotyping. These included 1239 SNPs as follows: a) 107 unique SNPs chosen on the basis of our previous GWAS performed in whites that showed significant associations with iron-related outcomes (p-value <0.00005), b) 67 SNPs tagging regions identified in the GWAS, c) 36 SNPs associated with iron status that had been reported in the scientific literature and that were located in TF, HFE, and TMPRSS6 genes, among others, and d) 1029 tag SNPs located in candidate genes for iron metabolism. Additionally, 297 ancestry informative markers were genotyped to estimate the admixture proportions in the African-American and Hispanic samples (Table S1). A CEPH trio and a within-study replicate were placed on each plate to assess the concordance of genotype calls. Mendelian consistency for all trios was greater than 99% and reproducibility for the same CEPH individuals across all plates was greater than 98%. Reproducibility for the thirteen within-study replicates placed across plates was greater than 99%. Reported gender was compared to gender estimated by Illumina’s GenomeStudio, based on gender targets built into the custom OPA. Twelve samples were excluded due to a conflict between reported and genetically inferred gender. Because of allele frequency differences between the four ethnic groups, further SNP and sample quality control assessments were done separately for each group. SNPs were filtered based on call rate (<95%) and allele frequency (<0.005). Individuals were filtered on call rate (<95%), heterozygosity (>~50%) and IBS (>90%). 
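As a rough illustration of the SNP-level filters described above (call rate below 95% and allele frequency below 0.005 lead to exclusion), the following Python sketch applies both thresholds to a single SNP. The function names and the use of `None` for a missing call are assumptions made for this example; the study itself used GenABEL in R, and the sample-level filters (heterozygosity, IBS) are not shown.

```python
def call_rate(genotypes):
    """Fraction of samples with a non-missing call (None marks a missing call)."""
    if not genotypes:
        return 0.0
    return sum(g is not None for g in genotypes) / len(genotypes)


def minor_allele_frequency(genotypes):
    """Minor allele frequency from 0/1/2 allele counts, ignoring missing calls."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    freq = sum(called) / (2 * len(called))
    # The counted allele may be the major one, so report the smaller frequency.
    return min(freq, 1.0 - freq)


def snp_passes_qc(genotypes, min_call_rate=0.95, min_maf=0.005):
    """Keep a SNP only if call rate >= 95% and minor allele frequency >= 0.005."""
    return (call_rate(genotypes) >= min_call_rate
            and minor_allele_frequency(genotypes) >= min_maf)


# Hypothetical genotype vectors for 100 samples.
well_typed = [0] * 90 + [1] * 8 + [2] * 2      # complete calls, polymorphic
poorly_called = [0] * 90 + [None] * 10         # 90% call rate
monomorphic = [0] * 100                        # no minor alleles observed
```

Because allele frequencies differ between the four ethnic groups, a filter of this kind would be run separately within each group, as the text describes.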
Quality control tests were completed using the GenABEL library of the R statistical package (http://www.r-project.org/). The genotype distributions of each SNP were tested for fit to Hardy-Weinberg equilibrium (HWE) expectations. No SNPs were excluded from the association analysis based on the p-value of the HWE test. From the 1702 unique samples included for genotyping, there were 1084 white, 153 Asian, 212 African-American and 233 Hispanic individuals that passed the quality control assessments.
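To make the HWE check concrete, here is a minimal Python sketch of a goodness-of-fit test on genotype counts. It uses the textbook chi-square test with one degree of freedom, which is an assumption: the text does not say which HWE test was applied, and GenABEL may use a different (for example, exact) test. The function name is hypothetical.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of genotype counts against HWE expectations.

    Returns (chi2, p_value). For 1 df the upper-tail probability equals
    erfc(sqrt(chi2 / 2)).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return chi2, math.erfc(math.sqrt(chi2 / 2.0))


# Hypothetical counts: the first set matches HWE exactly, the second shows
# a marked deficit of heterozygotes.
in_equilibrium = hwe_chi_square(25, 50, 25)
het_deficit = hwe_chi_square(40, 20, 40)
```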
Eight iron-related outcomes were studied. The primary outcome was iron-deficient case-control status. Other indicators of iron status included SI, SF, TfS, sTfR, body iron, TIBC, and UIBC. With the exception of the dichotomous iron deficient case-control status variable, the variables were continuous quantitative traits. The distributions of the seven continuous outcomes and four continuous covariates were tested for their fit to the normal distribution. Natural logarithm transformations were applied to the SF, TfS and sTfR outcomes and the CEA, CRP, GGT and ALT covariates to improve their fit to the normal distribution.
Genotypes were coded as 0, 1 or 2 indicating the number of minor alleles in the genotype and were modeled as continuous variables in the multiple regression models. Covariates showing nominally significant effects on an outcome were included in multiple regression models that also included genotype effects. Odds ratios and regression coefficients were computed with odds ratios representing the multiplicative increase in risk attributed to the addition of one copy of the minor allele to the genotype (sometimes referred to as the single allele odds ratio) and regression coefficients representing the change in the outcome associated with increasing copies of the minor allele. For the African-American and Hispanic samples, ancestry proportions were estimated from ancestry informative markers (AIMs) using the STRUCTURE program with K = 2. The estimated proportion of the first ancestry component was included as a covariate in the linear and logistic regression models for the two admixed populations.
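The single allele odds ratio described above can be sketched in Python as follows. This is an illustration only: it fits a logistic model with an intercept and the 0/1/2 genotype code by Newton-Raphson, omits the covariates and ancestry proportion used in the study, and runs on hypothetical counts constructed so that each extra minor allele doubles the odds of being a case. The function names are not from the study.

```python
import math


def expand_counts(counts):
    """Turn {genotype: (cases, controls)} into parallel genotype and status lists."""
    x, y = [], []
    for genotype, (cases, controls) in counts.items():
        x += [genotype] * (cases + controls)
        y += [1] * cases + [0] * controls
    return x, y


def fit_logistic(x, y, iterations=25):
    """Fit logit P(case) = b0 + b1 * genotype by Newton-Raphson; returns (b0, b1)."""
    b0 = b1 = 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p            # score for the intercept
            g1 += (yi - p) * xi     # score for the genotype effect
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


# Hypothetical counts with case:control odds of 1:4, 1:2 and 1:1
# for 0, 1 and 2 copies of the minor allele.
counts = {0: (20, 80), 1: (20, 40), 2: (10, 10)}
genotypes, status = expand_counts(counts)
intercept, log_or = fit_logistic(genotypes, status)
single_allele_odds_ratio = math.exp(log_or)
```

In this constructed example the log-odds are exactly linear in the genotype code, so the fitted single allele odds ratio is 2. Real data rarely fit the additive model this cleanly, and the study's models also adjusted for covariates.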
After filtering based on the quality control assessments, there were 1134, 1115, 1113 and 1134 SNPs analyzed separately for association with the eight iron-related outcomes in the white, African-American, Hispanic and Asian population samples, respectively, adjusted for covariates. Although many of the tested SNPs are correlated through linkage disequilibrium and are not independent, we applied a conservative Bonferroni correction to adjust for the multiple tests. A total of 1134 independent tests were assumed for each population sample such that the Bonferroni multiple test-corrected nominal p-value of 0.05/1134 = 4.4×10−5 represented the threshold for statistical significance. No further multiple test correction was made for the eight iron-related outcomes. The analysis strategy treated the four population samples as independent experiments and SNP association results obtained in one population sample were assessed for replication across the others. For a SNP that showed a statistically significant association with an iron-related outcome in a population sample (p<4.4×10−5), in order to be considered as evidence for replication, the remaining population samples had to show an observed p-value <0.01 for at least one of the iron-related outcomes and the direction of the effects had to be consistent with what was observed in the original sample. The statistical analysis was done using the R statistical package (http://www.r-project.org/) and the genotype association analysis used the GenABEL library of R.
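The significance threshold and the replication rule stated above can be written out as a small Python sketch. The function names are hypothetical, the effect sizes in the example are invented for illustration, and the direction check assumes the same allele is counted in both samples.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


def shows_replication(discovery_beta, replication_results, p_cutoff=0.01):
    """Apply the replication rule to one SNP in one replication sample.

    replication_results is a list of (p_value, beta) pairs, one per
    iron-related outcome. Replication requires p < p_cutoff for at least
    one outcome, with an effect in the same direction as in the
    discovery sample.
    """
    return any(p < p_cutoff and beta * discovery_beta > 0
               for p, beta in replication_results)


threshold = bonferroni_threshold(0.05, 1134)   # about 4.4e-5, as in the text

# The p-values 0.00086 and 0.034 are taken from the text; the betas
# (0.5, 0.4) are hypothetical placeholders.
passes = shows_replication(0.5, [(0.00086, 0.4)])
too_weak = shows_replication(0.5, [(0.034, 0.4)])
```

Treating all 1134 SNPs as independent tests is conservative, as the text notes, because linkage disequilibrium means the effective number of tests is smaller.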
Iron-related Outcomes and Covariates
Table 1 shows the distribution of age, sex, the HFE C282Y/H63D genotype and the continuous outcomes, presented by population sample and iron deficient case-control status. Eight variables were assessed for significant covariate effects on the outcomes, including age, sex, the C282Y/H63D genotype, CagA strain infection status, and natural log of CEA, CRP, GGT, and ALT. The effect of the covariates on each outcome was assessed separately in each population sample using multiple regression models. Table 2 displays the variables that showed significant covariate effects (p-value <0.05 for regression coefficient of the variable) for each outcome and population sample. The relatively large sample size for whites compared to the other population samples could explain why more variables showed significant effects on the outcomes in that sample. The covariates shown in Table 2 were included in the multivariate models that were used to test for association between the genotypes and the outcomes in each population sample. There was a significant association between sex and the outcomes of body iron, natural log of SF, natural log of sTfR, UIBC, and TIBC, adjusted for additional covariates. Table S2 displays the observed differences in males and females with regard to the mean of the continuous iron-related outcomes, and supports the statistical adjustment for sex in the multivariate models.
Associations between SNPs and Iron-related Outcomes
SNPs significantly associated with iron-related outcomes, after correction for multiple comparisons.
Forty-nine SNPs showed statistically significant p-values, corrected for multiple testing, for at least one of the eight iron-related outcomes in at least one of the population samples (Table S3). Forty-eight of the significant associations were observed in the white population samples and one was found in the African Americans. The preponderance of significant results in the white sample was expected given that the sample size and statistical power of the sample far exceeded that of the other population samples, and because many of the SNPs were chosen based on the results of a GWAS that included 63% of the white samples analyzed in the current study. The 49 significantly associated SNPs were distributed within 14 distinct genomic regions. Eleven of the 49 associated SNPs were in the previously reported associated regions on chromosome 2p14, but not in known genes, and 20 of the 49 associated SNPs were in or around the TF gene on chromosome 3q22, which has been repeatedly shown to be associated with iron-related outcomes. Two SNPs on chromosome 7p21 and one SNP on 22q11 were significantly associated in the current study and were also reported in the recent GWAS. Hence, 69% of the 49 significantly associated SNPs were in regions that were reported in the recent GWAS. The remaining 15 SNPs were distributed within ten distinct genomic regions. Eight of the regions (including 10 SNPs) showed suggestive evidence for association in the previous GWAS but failed to reach genome-wide significance and hence, were not reported in that study. Four SNPs in the TMPRSS6 gene on chromosome 22q12 showed statistically significant associations. The SNP rs10904850 in the CUBN gene on chromosome 10p13 was significantly associated with serum iron in the African-American sample (observed p-value = 1.04×10−5), but showed no evidence for association in any of the other population samples.
SNPs in TF gene on chromosome 3q22.
SNPs in and around the TF gene on chromosome 3q22 showed the strongest evidence for association in the white population sample, as well as the strongest evidence for replication in the other population samples. Forty SNPs were genotyped in this region. Twenty SNPs met the multiple test corrected statistical significance threshold in the white sample for association with TIBC, 14 of which showed significant association with UIBC as well. Ten out of the 20 SNPs associated with TIBC were located within the boundaries of the TF gene itself. Strong evidence for association was found between TIBC and the TF gene SNPs in the other three population samples. Table 3 shows the association results for TIBC in the four population samples for the six TF SNPs that were significantly associated in the white sample and showed association in at least one other population. Although the non-white populations did not meet the multiple test corrected statistical significance threshold individually, the evidence for association within these populations is very strong given their role as replication samples. Weaker statistical evidence for association was observed between the TF region SNPs and the other iron-related outcomes in all samples. Figure 1 shows the –log10(p-values) for the 36 SNPs in the TF gene region that passed the quality control assessments and minimum allele frequency threshold in all four population samples. The location and structure of the TF gene are shown across the top of the figure. The most significant associations were found at rs3811647 (observed p-value = 5.02×10−15) and rs1525892 (observed p-value = 4.56×10−15), both located in the TF gene and indicated in Figure 1.
The analysis includes measured genotypes from 36 SNPs. The location and structure of the TF gene are shown along the top of the figure. Dashed lines indicate the threshold for statistical significance with a Bonferroni multiple test-corrected value of 4.4×10−5 and the threshold for replication of an association at a significance level of 0.01. The most significant associations with total iron-binding capacity (TIBC) were observed for six SNPs across the 7 kbp region (delineated) which includes exons 9, 10, and 11 of the transferrin gene, TF, with the measured SNP rs3811647 showing the strongest evidence for association in the white sample (O, 5.02×10−15) and replication at a nominal significance level of 0.01 in Hispanics (Δ, p = 0.00086), African-Americans (×, p = 0.0038), and Asians (+, p = 0.034).
SNPs on chromosome 18q21 and in TMPRSS6 gene on chromosome 22q12.
Table 4 shows the results for the SNPs on chromosomes 18q21 and 22q12 that had statistically significant associations in the white population sample and evidence for replication in at least one of the other three population samples. The SNP rs9948708 on chromosome 18q21 showed evidence for association with all the iron-related outcomes in the white sample with TIBC showing the most statistically significant association (observed p-value = 2.9×10−5). Similar results were observed in the Asian population sample, where TIBC was the most significantly associated of the outcomes (observed p-value = 0.0048) and at least nominal statistical evidence for association was observed at the majority of the outcomes. The regression parameter estimates are consistent across the white and Asian samples although the effects appear considerably stronger in the Asian sample. The larger effect size but reduced evidence of significance of the parameter estimates could be due to the lack of statistical power with the small Asian sample and differences in the minor allele frequencies; the minor allele frequency estimate for rs9948708 was 0.27 in the Asian population sample versus 0.42 in the white sample. There was no evidence for association in either the African-American or Hispanic samples.
Two SNPs on chromosome 22q12 showed statistically significant associations in the white sample with the Asian sample providing evidence for replication at rs2111833 and the African-American sample providing replication evidence for rs1421312 (Table 4). In the white sample, rs2111833 showed the strongest associations with serum iron (observed p-value = 4.7×10−7) and log-transformed transferrin saturation (observed p-value = 0.00014). The strongest associations with rs2111833 in the Asian sample were with UIBC (observed p-value = 0.0067) and TIBC (observed p-value = 0.007), with weaker evidence for association with serum iron (observed p-value = 0.044). Considerably stronger evidence for replication was seen at SNP rs1421312. Again, the most statistically significant associations in the white sample were with serum iron (observed p-value = 3.7×10−6) and the log-transformed transferrin saturation (observed p-value = 0.0018). In the African-American sample, serum iron and log-transformed transferrin saturation were the two most statistically significant associations with rs1421312, with observed p-values of 0.0012 and 0.0011, respectively. The regression coefficients for serum iron and log-transformed transferrin saturation for the white and African-American samples showed opposite signs, indicating that the direction of the effects was not consistent. However, the minor allele in the white sample (minor allele frequency = 0.40) is the major allele in the African-American sample (allele frequency = 0.61), so the opposite signs reflect associations with opposite minor alleles. If the genotypes in the African-American sample were re-coded to reflect the minor allele in the white sample then the signs of the regression coefficients would be in the same direction. No evidence for association with chromosome 22q12 was found in the Hispanic or Asian population samples.
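The re-coding argument above can be checked with a few lines of Python. The sketch below uses invented genotype and outcome values and a simple least-squares slope without covariates; it only demonstrates that counting the other allele (replacing g with 2 − g) reverses the sign of the regression coefficient while leaving its magnitude unchanged.

```python
def slope(x, y):
    """Least-squares slope of y on x (simple linear regression, no covariates)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def recode_to_other_allele(genotypes):
    """Switch the counted allele: 0, 1, 2 copies become 2, 1, 0 copies."""
    return [2 - g for g in genotypes]


# Hypothetical data: allele counts and a quantitative outcome.
genotypes = [0, 0, 1, 1, 2, 2]
outcome = [10.0, 12.0, 14.0, 16.0, 18.0, 20.0]

beta_original = slope(genotypes, outcome)
beta_recoded = slope(recode_to_other_allele(genotypes), outcome)
```

For this reason, effect directions are comparable across samples only when the same reference allele is counted in each, which is the point made for rs1421312.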
There were more females than males in each population sample. The association analyses for SNPs on chromosomes 18q21 and 22q12 were repeated using data from females only. Although the overall sample size was smaller, the results reported in Table 5 were generally similar to those based on data from males and females (Table 4). For example, in the white sample SNPs rs2111833 and rs1421312 on TMPRSS6 showed strong associations with serum iron with observed p-values of 5.2×10−6 and 7.9×10−6, respectively.
The primary aim of the current study was to identify SNPs that showed association with iron status in cases with iron deficiency and control subjects across multiple ethnicities. Because patients presenting with iron deficiency are often diagnosed by using multiple measures of iron status, it is important to assess associations with each of these measures. Thus, we tested for associations not only between SNPs and a diagnosis of iron deficiency, but also with SI, TIBC, UIBC, TfS, SF, sTfR, and body iron. Three chromosomal regions showed evidence for association with one or more of these measures across the multiple populations that were sampled, including SNPs in the TF gene on chromosome 3q22, the TMPRSS6 gene on chromosome 22q12, and on chromosome 18q21.
Twenty SNPs in the TF gene region were significantly associated with TIBC in the white sample (observed p<4.4×10−5). Of these, six SNPs showed replication in other ethnicities (Table 3). SNPs chosen for genotyping in the TF gene region that have previously shown an association with serum transferrin included rs1867504, rs4525863 and rs1830084; rs3811658 and rs1880669; rs1358024 and rs6794945; and rs3811647. In our study, the strongest statistical significance with TIBC, a marker for transferrin, was observed for rs3811647 in whites (observed p = 5.02×10−15). In our previous GWAS, this association was statistically significant (p = 7.0×10−9) and replicated in a sample of cases and controls selected from a population of white male and female veterans (observed p = 0.012). To our knowledge, our study is the first to replicate this observation in non-white population samples, with strongest evidence for replication in Hispanics (observed p = 0.00086).
A novel SNP rs9948708 on chromosome 18q21 satisfied the multiple-test corrected significance level for association with TIBC in the white sample (Table 4, observed p = 2.9×10−5), with evidence for replication in the Asian sample. To our knowledge, this SNP has not been reported in the scientific literature with regard to implications for iron deficiency. In contrast, associations between variants in the TMPRSS6 gene and hemoglobin levels were found in individuals with Indian Asian ancestry as well as in those with European ancestry. In our previous GWAS, we did not find genome-wide associations with TMPRSS6 SNPs in the white sample. However, in this follow-up study, the number of iron-replete controls was increased and that may have increased the power of the study to detect significant associations with serum iron in the white sample with SNP rs2111833 (Table 4, observed p = 4.7×10−7) and SNP rs1421312 (Table 4, observed p = 3.7×10−6), with evidence for replication in Asians and African-Americans, respectively.
Of the 49 SNPs that were statistically significantly associated with at least one iron-related outcome (observed p<4.4×10−5), only one SNP showed a statistically significant association at the multiple test-corrected significance level in a non-white sample. SNP rs10904850 in the CUBN gene on chromosome 10p13 was significantly associated with serum iron in the African-American sample (observed p = 1.0×10−5), but no evidence for association with this SNP was observed in any of the three other samples. The minor allele frequency was 0.13 in African Americans, compared with 0.33 in whites, 0.17 in Hispanics, and 0.19 in Asians. The CUBN gene plays a role in vitamin and iron metabolism by facilitating their uptake. Cubilin and megalin are responsible for reabsorbing transferrin iron in the kidney urine collecting system, and the influence of CUBN on chronic kidney disease in African Americans has been studied. We found that African-American controls had the lowest iron and TIBC concentrations of all the groups (Table 1). A possible explanation is that the cubilin SNP is affecting serum iron through an influence on efficiency of reabsorption of iron-bearing transferrin in the kidney.
In this study, four population samples were examined as a follow-up to a genome wide association study of iron deficiency conducted in white participants enrolled in the HEIRS Study. An advantage of the research design was the genotyping of a panel of markers that show large frequency differentials between major geographic ancestral groupings, which enabled estimation of ancestry proportions for African-American and Hispanic samples. Additionally, models incorporated information from relevant demographic and clinical measures to control for effects that included environmental causes of iron deficiency. Observed similarities and differences in age, sex, iron measures and the distribution of HFE genotypes are shown in Table 1. Although frequency matched by sex and geographic location, on average, whites were three years older than the African-American sample and 4.3 years older than Hispanics and Asians. The proportion of whites with HFE gene mutations was higher than that in the other groups; there were nine C282Y homozygotes (two cases and seven controls) compared with zero in the non-white samples. Generally, females had lower observed mean values for iron-related outcomes than males, especially within the controls (Table S2). This provides further support to the inclusion of sex as a covariate in multiple regression models when there was a significant association between sex and the outcome. A limitation to the study was the relatively small sizes of the non-white population samples, thus lack of association between some SNPs and iron measures may have been due to low statistical power.
In summary, SNPs were identified that showed association across the multiple populations that were sampled. We found a novel SNP in TMPRSS6 that was associated with serum iron in whites and replicated in African Americans, suggesting a role for this SNP in increasing the risk of iron deficiency in affected persons. Our results confirm known associations of SNPs in the TF and TMPRSS6 genes with TIBC and UIBC. They also give evidence of the role of these SNPs in different ethnic groups, a unique aspect of this study, and suggest the possibility of origins in a common founder.
Basis of SNP selection for genotyping. a) SNPs significantly associated with iron-related outcomes in a previous GWAS performed in whites (GWAS, n = 107), b) SNPs tagging regions identified in the GWAS (GWAS region, n = 67), c) SNPs associated with iron status and reported in the scientific literature (Literature, n = 36), d) tag SNPs located in candidate genes for iron metabolism (Gene, n = 1029), and e) ancestry informative markers (AIM, n = 297).
Descriptive statistics by population, iron deficient case-control status, and sex.
Forty-nine SNPs that showed statistically significant p-values, corrected for multiple testing, for at least one of eight iron-related outcomes in the white sample (48 SNPs) or the African-American sample (one SNP).
We sincerely thank the HEIRS Study participants for volunteering for this research study and all of the HEIRS Study investigators, a full listing of whom can be found in reference 9. In addition, we thank K. Goddard, co-investigator at the HEIRS Field Center, Kaiser Permanente Center for Health Research, Northwest (Portland, Oregon).
Conceived and designed the experiments: CEM SM CPG GDM. Performed the experiments: JHE JAM CLF LFB. Analyzed the data: SM CPG. Contributed reagents/materials/analysis tools: JHE LFB JAM. Wrote the paper: CEM SM CPG CDV VRG JHE PCA RTA JAM CLF BMS LFB JDC GDM.
- 1. Donovan A, Roy CN, Andrews NC (2006) The ins and outs of iron homeostasis. Physiology (Bethesda) 21: 115–123.
- 2. Bleackley MR, Wong AY, Hudson DM, Wu CH, Macgillivray RT (2009) Blood iron homeostasis: newly discovered proteins and iron imbalance. Transfus Med Rev 23: 103–123.
- 3. Lieu PT, Heiskala M, Peterson PA, Yang Y (2001) The roles of iron in health and disease. Mol Aspects Med 22: 1–87.
- 4. Anderson GJ, Frazer DM, McLaren GD (2009) Iron absorption and metabolism. Curr Opin Gastroenterol 25: 129–135.
- 5. WHO (2000) Turning the tide of malnutrition: Responding to the challenge of the 21st century. (WHO/NHD/00.7).
- 6. Leboeuf RC, Tolson D, Heinecke JW (1995) Dissociation between tissue iron concentrations and transferrin saturation among inbred mouse strains. J Lab Clin Med 126: 128–136.
- 7. Morse AC, Beard JL, Jones BC (1999) A genetic developmental model of iron deficiency: biological aspects. Proc Soc Exp Biol Med 220: 147–152.
- 8. McLaren CE, Barton JC, Eckfeldt JH, McLaren GD, Acton RT, et al. (2010) Heritability of serum iron measures in the hemochromatosis and iron overload screening (HEIRS) family study. Am J Hematol 85: 101–105.
- 9. Feder JN, Gnirke A, Thomas W, Tsuchihashi Z, Ruddy DA, et al. (1996) A novel MHC class I-like gene is mutated in patients with hereditary haemochromatosis. Nat Genet 13: 399–408.
- 10. Edwards CQ, Ajioka RS, Kushner JP (2000) Hemochromatosis: a genetic definition. In: Barton JC, Edwards CQ, editors. pp. 8–11. Cambridge, United Kingdom: Cambridge University Press.
- 11. McLaren CE, Garner CP, Constantine CC, McLachlan S, Vulpe CD, et al. (2011) Genome-wide association study identifies genetic loci associated with iron deficiency. PLoS One 6: e17390.
- 12. Finberg KE, Heeney MM, Campagna DR, Aydinok Y, Pearson HA, et al. (2008) Mutations in TMPRSS6 cause iron-refractory iron deficiency anemia (IRIDA). Nat Genet 40: 569–571.
- 13. Guillem F, Lawson S, Kannengiesser C, Westerman M, Beaumont C, et al. (2008) Two nonsense mutations in the TMPRSS6 gene in a patient with microcytic anemia and iron deficiency. Blood 112: 2089–2091.
- 14. Melis MA, Cau M, Congiu R, Sole G, Barella S, et al. (2008) A mutation in the TMPRSS6 gene, encoding a transmembrane serine protease that suppresses hepcidin production, in familial iron deficiency anemia refractory to oral iron. Haematologica 93: 1473–1479.
- 15. Benyamin B, McRae AF, Zhu G, Gordon S, Henders AK, et al. (2009) Variants in TF and HFE explain approximately 40% of genetic variation in serum-transferrin levels. Am J Hum Genet 84: 60–65.
- 16. Tanaka T, Roy CN, Yao W, Matteini A, Semba RD, et al. (2010) A genome-wide association analysis of serum iron concentrations. Blood 115: 94–96.
- 17. McLaren CE, Barton JC, Adams PC, Harris EL, Acton RT, et al. (2003) Hemochromatosis and Iron Overload Screening (HEIRS) study design for an evaluation of 100,000 primary care-based adults. Am J Med Sci 325: 53–62.
- 18. Adams PC, Reboussin DM, Barton JC, McLaren CE, Eckfeldt JH, et al. (2005) Hemochromatosis and iron-overload screening in a racially diverse population. N Engl J Med 352: 1769–1778.
- 19. Expert Scientific Working Group (1985) Summary of a report on assessment of the iron nutritional status of the United States population. Am J Clin Nutr 42: 1318–1330.
- 20. Skikne BS, Flowers CH, Cook JD (1990) Serum transferrin receptor: a quantitative measure of tissue iron deficiency. Blood 75: 1870–1876.
- 21. Cook JD, Flowers CH, Skikne BS (2003) The quantitative assessment of body iron. Blood 101: 3359–3364.
- 22. Pfeiffer CM, Cook JD, Mei Z, Cogswell ME, Looker AC, et al. (2007) Evaluation of an automated soluble transferrin receptor (sTfR) assay on the Roche Hitachi analyzer and its comparison to two ELISA assays. Clin Chim Acta 382: 112–116.
- 23. Flowers CA, Kuizon M, Beard JL, Skikne BS, Covell AM, et al. (1986) A serum ferritin assay for prevalence studies of iron deficiency. Am J Hematol 23: 141–151.
- 24. Aulchenko YS, Ripke S, Isaacs A, van Duijn CM (2007) GenABEL: an R library for genome-wide association analysis. Bioinformatics 23: 1294–1296.
- 25. Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association mapping in structured populations. Am J Hum Genet 67: 170–181.
- 26. Constantine CC, Anderson GJ, Vulpe CD, McLaren CE, Bahlo M, et al. (2009) A novel association between a SNP in CYBRD1 and serum ferritin levels in a cohort study of HFE hereditary haemochromatosis. Br J Haematol 147: 140–149.
- 27. Pichler I, Minelli C, Sanna S, Tanaka T, Schwienbacher C, et al. (2011) Identification of a common variant in the TFR2 gene implicated in the physiological regulation of serum iron levels. Hum Mol Genet 20: 1232–1240.
- 28. Chambers JC, Zhang W, Li Y, Sehmi J, Wass MN, et al. (2009) Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels. Nat Genet 41: 1170–1172.
- 29. Kozyraki R, Fyfe J, Verroust PJ, Jacobsen C, Dautry-Varsat A, et al. (2001) Megalin-dependent cubilin-mediated endocytosis is a major pathway for the apical uptake of transferrin in polarized epithelia. Proc Natl Acad Sci U S A 98: 12491–12496.
- 30. Boger CA, Chen MH, Tin A, Olden M, Kottgen A, et al. (2011) CUBN is a gene locus for albuminuria. J Am Soc Nephrol 22: 555–570.
- 31. Risch N, Burchard E, Ziv E, Tang H (2002) Categorization of humans in biomedical research: genes, race and disease. Genome Biol 3: 1–12.
- 32. Shriver MD, Mei R, Parra EJ, Sonpar V, Halder I, et al. (2005) Large-scale SNP analysis reveals clustered and continuous patterns of human genetic variation. Hum Genomics 2: 81–89.