Genetic variation at loci influencing adult levels of HbF have been shown to modify the clinical course of sickle cell disease (SCD). Data on this important aspect of SCD have not yet been reported from West Africa. We investigated the relationship between HbF levels and the relevant genetic loci in 610 patients with SCD (98% HbSS homozygotes) from Cameroon, and compared the results to a well-characterized African-American cohort.
Methods and Findings
Socio-demographic and clinical features were collected and medical records reviewed. Only patients >5 years old, who had not received a blood transfusion or treatment with hydroxyurea were included. Hemoglobin electrophoresis and a full blood count were conducted upon arrival at the hospital. RFLP-PCR was used to describe the HBB gene haplotypes. SNaPshot PCR, Capillary electrophoresis and cycle sequencing were used for the genotyping of 10 selected SNPs. Genetic analysis was performed with PLINK software and statistical models in the statistical package R. Allele frequencies of relevant variants at BCL11A were similar to those detected in African Americans; although the relationships with Hb F were significant (p <.001), they explained substantially less of the variance in HbF than was observed among African Americans (∼ 2% vs 10%). SNPs in HBS1L-MYB region (HMIP) likewise had a significant impact on HbF, however, we did not find an association between HbF and the variations in HBB cluster and OR51B5/6 locus on chromosome 11p, due in part to the virtual absence of the Senegal and Indian Arab haplotypes. We also found evidence that selected SNPs in HBS1L-MYB region (HMIP) and BCL11A affect both other hematological indices and rates of hospitalization.
Citation: Wonkam A, Ngo Bitoungui VJ, Vorster AA, Ramesar R, Cooper RS, Tayo B, et al. (2014) Association of Variants at BCL11A and HBS1L-MYB with Hemoglobin F and Hospitalization Rates among Sickle Cell Patients in Cameroon. PLoS ONE9(3): e92506. https://doi.org/10.1371/journal.pone.0092506
Editor: Ana Paula Arez, Instituto de Higiene e Medicina Tropical, Portugal
Received: January 12, 2014; Accepted: February 21, 2014; Published: March 25, 2014
Copyright: © 2014 Wonkam et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The molecular experiments of the study were funded by the National Health Laboratory Services (NHLS), South Africa; the University of Cape Town (UCT) Research Committee and Carnegie Research Development Grant, South Africa. The students' bursaries were supported by the Third World Academy of Sciences (TWAS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Sickle cell disease (SCD) affects the structure of erythrocytes by altering the normal biconcave shape to a crescent. During this process the hemoglobin S (HbS) mutation leads to polymerization and precipitation of hemoglobin during deoxygenation or dehydration resulting in sickling, abnormal adhesion of leukocytes and platelets, inducing inflammation, hypercoagulation, hemolysis and hypoxia, in addition to microvascular obstruction, and ultimately organ damage . SCD is prevalent among indigenous populations in tropical regions of Africa and Asia; 305,800 births with SCD are estimated to occur annually, nearly two-third of which take place in Africa. Sickle Cell Anaemia (SCA; the homozygous HbSS state) is by far the most prevalent and severe form of SCD . Only limited early detection and treatment initiatives have been implemented in Africa and as consequence death rates are high before the age of 5 years in this region .
Cameroon has a population about 20 million inhabitants and a growth rates of 3% per annum; the carrier frequency of SCA ranges from 8 to 34% . Cameroon has developed a national control program for SCA, although at present it remains a policy document without implementation; provision of neonatal screening and development of specialized centers for lifelong medical care and surveillance have yet to become part of the health system . Universal medical insurance coverage does not exist, and care of SCA patients is therefore dependent on financial support and care-giving by family members. Poverty in Cameroon affects more than 50% of the rural population and up to 30% of the urban population , which in turn means that the financial burden of medical care for SCA can often not be met  and patients frequently suffer exceptionally severe SCD sequelae such as of stroke  and neurocognitive compromise .
SCA is genetically characterised by a single point mutation in the globin locus, however, there are additional genetic factors that modulate the clinical course of this disease and patients manifest widely varying degrees of severity. Several genomic loci have now been identified that are associated with increased levels of fetal haemoglobin (HbF) and as a consequence influence the clinical course of SCA –. HbF is a highly heritable trait; based on samples of Europeans with a normal hematocrit 89% of the variation is genetically determined . To date at least 3 quantitative trait loci (QTL) have been shown to be associated with persistence of elevated HbF and account for 20–50% of the inter-individual variation in HbF. These loci include the combined XmnI-158 and OR51B5/6 locus on chromosome 11p15.4 , the HBS1L-MYB intergenic polymorphism (HMIP) locus on chromosome 6q23.3 , which may reflect the role of a single variant, and the BCL11A locus on chromosome 2q16.1 , . In order to stimulate HbF expression, experimental and clinical work resulted in adoption of hydroxyurea as adjunct treatment for high risk SCA patients , and this is now widely accepted therapy. Moreover, other investigators have recently demonstrated that inactivation of BCL11A in an SCD transgenic mouse model completely eliminates the hematologic and pathologic defects associated with SCD through induction of high-level pancellular HbF .
SCA was the first molecular disease to be recognized in medical practice and the description of the DNA-protein relationship underlying this disorder was a milestone in molecular medicine. It is both ironic and unfortunate, therefore, that the powerful new tools of genomics have not been applied to research on SCA on the African continent. To the best of our knowledge, only one study has reported data on selected genetic effects associated with increased level of HbF amongst SCA patients in sub-Saharan Africa  and no studies on the association of these SNPs with clinical outcomes has yet been reported from the region. The sickle mutation exists in Africa on diverse genetic haplotype backgrounds  and geographically separate African populations may vary in the effects of linked genetic variants found on specific haplotypes. Studies of SCA in African sub-populations could therefore potentially lead to important new findings about factors that influence the disease phenotype. The aim of this study was to explore the frequency and influence of 10 SNPs known to influence HbF within the HMIP, BCL11A, XmnI-158 and OR51B5/6 loci amongst a group of Cameroonian SCA patients and to compare these data to a well described cohort of African-American patients in the Cooperative Study of Sickle Cell Disease (CSSCD).
Materials and Methods
The study was performed in accordance with the guidelines of the Helsinki Declaration. Ethical approval was given by the National Ethical Committee Ministry of Public Health, Republic of Cameroon (No 033/CNE/DNM/07); and the University of Cape Town, Faculty of Health Sciences Human Research Ethics Committee (HREC RE: 132/2010). Written and signed informed consent was obtained from the adult participants who were 18 years or older, and for the children consent was obtained from parents/guardians with an assent from the participants older than seven years of age.
Patients and assessment of clinical events
The study was conducted at the Yaoundé Central Hospital and Laquintinie Hospital in Douala, the primary port and second largest city in Cameroon. Socio-demographic and clinical data were collected by means of a structured questionnaire. Parents/guardians as well as adult SCA patients were interviewed; patients’ medical records were reviewed, to delineate their clinical features over the past three years. Specifically, the occurrences of vaso-occlusive painful crisis (VOC), hospital outpatient visits, hospitalisations, overt strokes, blood transfusions and administration of hydroxyurea were recorded. VOC events were defined as episodes that could not be attributed to causes other than SCA and required a hospital visit and treatment with prescription analgesics. Anthropomorphic variables (body mass Index (BMI), and blood pressures (BP) were measured in the outpatient setting.
The sampling strategy was not restricted to hospital-based patients in an attempt to avoid the bias that might result from including only the sickest patients. To accomplish this goal two SCA patients’ associations in Cameroon were engaged for collaboration and additional patients were recruited during their monthly meetings. No incentive was provided for participation in the study. Only patients who had not received a blood transfusion or hospitalisation in the past 6 weeks where included. None were currently treated with hydroxyurea.
Hemoglobin electrophoresis and complete routine blood count of the SCA patients were conducted upon arrival at the hospital. Two methods of HbF detection were employed in this study for successive groups of patients; initially, as a result of limited capacity, the alkali denaturation test (ADT) was used, and subsequently high performance liquid chromatography (HPLC) was performed. HbF determinations were performed at the hematological laboratory of the Centre Pasteur in Yaoundé; we excluded from the analysis measurements done in patients <5 years old because HbF levels are not yet stable at this age. HbF levels were measured by ADT in 344 patients (55.5%). HbF measurements by HPLC and ADT techniques were found to have differences in median values (P = 0.001), with ADT yielding the median value of 11.7% compared to 6.8% for the HPLC method.
As previously reported, HbF was measured in the CSSCD samples by ADT and HbA2 was measured by Diethylamino Ethanol column chromatography . For HbF in the CSSCD study, we also excluded from the analysis measurements done in patients <5 years old .
DNA samples were extracted from peripheral blood, following instructions for the commercial kit (Puregene blood kit (Qiagen, USA) at the molecular diagnostic laboratory, Gyneco-Obstetric and Paediatric Hospital, Yaoundé, Cameroon. Genotype analyses were performed in the Division of Human Genetics, Faculty of Health Sciences, University of Cape Town.
Molecular Diagnostic Testing for SCA (HbSS).
Analysis for presence of the sickle mutation was carried out on 200 ng DNA by PCR, on a thermocycler (BIORAD, USA) to amplify a 770 bp segment of the β-globin gene, followed by Dde I (GIBCO-BRL, USA) restriction analysis of the PCR product. Only patients SCD-HbSS were included in the analysis, according to a previously reported method .
Haplotyping of the β-globin gene cluster.
Five restriction fragment length polymorphism (RFLP) regions in the β-globin gene cluster were amplified using published primers and methods to analyse the XmnI (5'Gγ), HindIII (Gγ), HindIII (Aγ), HincII (3˙'Ψβ), and HinfI (5'β) . RFLP sites and the fragments were visualised by agarose gel electrophoresis; β-globin gene haplotypes were defined by examining the combination of the restriction sites.
SNPs genotyping in the HMIP, BCL11A, XmnI-158 and OR51B5/6 loci: For the purpose of obtaining genotype for variants associated with HbF levels that had been previously identified in SCA patients, ten regions containing specific SNPs were amplified: viz, for the BCL11A locus, SNPs rs11886868 and rs4671393; for the HMIP1/2 loci: SNPs rs28384513, rs9376090, rs9399137, rs9389269; rs9402686 and rs9494142; for the OR51B5/6 loci: SNP rs5006884, for HBG loci, SNP rs7482144.
PCR was performed to determine genotypes using SNaPshot multiplex ready reaction mix (Applied Biosystems, California, USA) as previously reported . For SNP genotyping, the accuracy of the SNaPshot method was shown to be comparable to Biplex Invader and Pyrosequencing, while lower in cost when multiplex reactions are used . Two singleplex reactions allowed the amplification of SNPs rs7482144 and rs9399137; and three multiplex reactions allowed the amplification of rs11868668, rs28384513 and rs9389269 (mutiplex1); rs9376090, rs9402686 and rs5006884 (mutiplex 2); rs9494142 and rs4671393 (mutiplex 3).
Capillary electrophoresis was used for the determination of genotype; 0.5 μl treated SNaPshot reaction mixture was mixed with 5 μl HiDi formamide (Applied Biosystems) and 0.2 μl GeneScan -120 Liz size standard. This mixture was loaded onto a Micro Amp 96 well reaction Plate (Applied Biosystems) and analysed using the GeneScan-120 Liz and run model at E5_36_POP7.
For Cameroonian samples, we confirmed the genotyping results for both SNaPshot and RFLP-PCR by sequencing a sub-set of samples (10%). Sequencing was performed using BigDye terminator mix (Promega, Wisconsin, USA) and 5X BigDye terminating buffer (Promega, Wisconsin, USA). Cycle sequencing was then performed on the Applied Biosystems thermal cycler (Gene Amp PCR system 9700). An electropherogram was generated using sequence analysis software and imported into Bioedit Sequence alignment editor 7.0.0 (Tom Hall, Isis Pharmaceuticals, Inc.) for the alignment of the sequence with a reference sequence to identify the SNP of interest.
For the CSSCD patients, all DNA genotyping was performed by using the mass spectrometry-based MassArray iPLEX platform from Sequenom as previously reported . For SNPs passing quality control, the genotyping success rate was >93% and the consensus error rate, estimated from replicates, was <0.3%.
Descriptive statistics were obtained for all quantitative data using SPSS (IBM, USA version 21.0). Normality was confirmed by the Shapiro-Wilk Test followed, by the use of parametric (Chi-squared test and t-test) or non-parametric tests (Mann-Whitney U test for 2 samples or the Kruskal-Wallis ONE way ANOVA for more than 2 samples). Significance was set at the 0.05 level. Genetic analysis was performed with PLINK software , testing only the additive genetic model under a linear regression framework. For quality control, a Hardy-Weinberg test was performed on all genotype results: two SNPs were dropped because of significant violation of HWE (rs1188686 in BCL11A, MAF 0.31; HWE P-value: 0.000609; and rs9389269 in the HBS1L-MYB locus, MAF: 0.17; HWE P-value: 2.42E-05). A third SNP was monomorphic (rs9376090 in the HBS1L-MYB locus, all the patients were T/T homozygous). In the CSSCD patients, on Hardy-Weinberg equilibrium test, P value was >0.05 for all SNPs, except for the -158 (G>A) XmnI polymorphism in the Gγ-globin (HBG2) gene promoter (rs7482144), which was in disequilibrium (P = 0.002) . For a polymorphism in LD (linkage disequilibrium) with the sickle cell mutation, this finding is expected in a population of SCD patients .
When several SNPs were genotyped at the same locus, we used conditional analysis and stepwise regression to determine whether one or more independent association signals were present.
To correct for the skewness of the HbF distribution, we log10-transformed and normalized the data to obtain the quantitative trait used in the association analysis (after correcting for age, gender, and electrophoresis technique and history of transfusion). Statistical models to investigate the relationship between HbF-associated SNPs and SCD complications were conducted in the statistical package R, version 2.5.1.
Socio-demographic and Hematological variables
The description of the study sample is presented in Table 1. Among the 610 patients 50.3% were female; the mean age (± SD) of patients was 17.3±10 years (range: 5–54 years), and the majority of patients were children aged 5–10 years (32.2%; n = 196) or adolescents 11–20 years (50%; n = 305). Patients lived mostly in the urban areas of Yaoundé and Douala (93%; n = 567), the two largest cities in Cameroon. Among the participants, 41% (n = 238), 45.6% (n = 265) and 13.2% (n = 78) attended formal primary, secondary or tertiary education levels, respectively.
Marital status of parents was distributed as follows: married (58%; n = 235), single (28%; n = 113), widows (9%, n = 36) and divorced (5%; n = 21). Forty two percent (n = 158) of fathers and 23% (n = 108) of mothers were formally employed, while 41% (n = 153) of fathers and 39% (n = 183) of mothers were working in the informal sector. The majority of parents (75%) earned monthly direct incomes that were < 300 USD. The median age at diagnosis of SCA, based on haemoglobin electrophoresis, was 3 years (range: 1month - 29 years). Only 31% (n = 180) of patients were diagnosed before their first year of life.
After molecular analysis, the vast majority of patients were determined to have SCA (HbSS) (ie, 97.4%; n = 594); 15 (2.4%) patients were HbS-β thalassemia and one patient had an HbSC genotype.
There were no significant differences among the socio-demographic variables (marital status of parents, employment status, <300 USD direct revenue; urban vs. rural), in association with HbF levels or in association with the clinical events (number of VOC, number of hospital attendance or number of hospitalisations, overt strokes). The hematological parameters were generally comparable to those of African Americans in the CSSCD; the WBC count and the occurrence of VOC were somewhat more frequent in the Cameroonians (Table 1).
Haplotypes in the beta-globin gene cluster
Based on analysis of 1082 chromosomes, the allele frequencies of various haplotypes in the beta-globin genes cluster showed that Benin (74%; n = 799) and Cameroon (19%; n = 207) were most prevalent forms, followed by atypical haplotype (6%; n = 66). Bantu (n = 5); Senegal (n = 3) and Indian-Arab (n = 2) alleles were very rare.
In combination, the Benin/Benin haplotypes represented 57% (n = 307), Benin/Cameroon 25% (n = 137), Benin/atypical 8% (n = 45) and Cameroon/Cameroon 5% (n = 26) haplotypes were most prevalent. There were no significant differences in the association with clinical events or HbF levels among the haplotype combinations.
Association of Genetic Variants to HbF Levels
We did not replicate the association of rs7482144 in HBG2 to the HbF levels. The adjacent rs5006884 OR51B5/6, was also not significantly associated to HbF level in Cameroonian SCA patients (Table 2). On the other hand, the two principal known HbF loci were significantly associated in the study patients (Table 2). The effect of minor alleles at this these loci resulted either in a depression of mean HbF values (rs28384513, HBS1L-MYB), or an increase in HbF (rs4671393 in BCL11A; rs9399137 and rs9494142 in HBS1L-MYB) (figure 1). As previously reported in African Americans at the BCL11A loci, when we conditioned the association analysis on rs4671393, rs11886868 was no longer significant, suggesting that these markers tag the same causal polymorphism n this Cameroonian SCA cohort.
Boxes have lines at the lower quartile, median, and upper quartile. The plot whiskers extend up and down from the median a distance 1.5 times the interquartile range of the boxes (truncated at zero where necessary). Outliers are the points outside the whiskers indicated as circles.
The largest allelic effect (0.41, Table 2) in the Cameroonian patients was detected at the HBS1L-MYB locus, rs9399137; leading to changes in median values of HbF of 10.6%, 10.3% and 8.7% for the C/C, C/T and TT alleles, respectively. We further explored whether the multiple variants at the HBS1L-MYB locus represented independent signals of association. Using stepwise regression, we found that rs28384513, rs9399137, rs9376090, rs9389269 rs9402686 and rs9494142 are independent association signals in the Cameroonian sample (Table S1 and Figure S1).
We disaggregated the patient sample, based on the HbF assessment technique (ADT vs HPLC), and found that the significant associations with HF levels, examined independently, were present in both sub-groups studied using the different assay methods, in rs11886868 (BCL11A), rs4671393 (BCL11A), rs28384513 (HMIP 1) and rs9494142 (HMIP 2) (Table S2).
HbF-associated variants and hematological parameters
We tested correlations between the 10 HbF-associated SNPs and various blood cell parameters [RBC count, WBC count, mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), platelet count, and monocyte levels]. BCL11A rs4671393 was associated with a wide range of haematological indices (Table 3). In addition, rs9402686 (HMIP 2) was significantly associated with platelet count; rs28384513 (HMIP 1) with WBC counts, and rs9399137 (HMIP 2) with Hb levels, rs9402686 (HMIP 2) and rs9494142 (HMIP 2) were both associated with MCV.
Effect of HbF-associated variants on pain crises and rate of hospitalization
Two SNPs [rs28384513 (HMIP 1) and rs9494142 (HMIP 2)] were associated with the number of hospitalisations (P = 0.028 and P = 0.04, respectively, independent of HbF level and various socio-demographical parameters (figure 2; Table S3). We did not, however, find an association between the genotyped SNPs and the numbers of VOC, and overt stroke episodes. In addition, there was no differential in MAF in the three loci across various age ranges suggesting that there is not a strong selective advantage for these loci.
Two specific SNPs were associated, with rate of hospitalization, a potential marker of overall disease severity. Boxes have lines at the lower quartile, median, and upper quartile. Outliers are the points outside the whiskers indicated as circles.
Considerable progress has been made in understanding disease modifying genetic factors in SCD, prompting speculation that this information may be of potential therapeutic value. In an earlier study, Thein and colleagues concluded that at least three independent genetic variants (rs28384513, rs9399137 and rs6929404) explain the linkage peak observed at 6q23 in the HBS1L-MYB intergenic region associated with HbF levels . An association signal with HbF levels in the HBS1L-MYB intergenic region was also identified in a large non-anemic Sardinian cohort . Lettre et al.  genotyped SNPs rs28384513 and rs9399137, originally reported by Thein et al.  as well as three SNPs reported in the Sardinian study (rs7776054, rs9389268, and rs4895441), in the US CSSCD and a Brazilian SCD cohort and replicated the association with HbF levels . Moreover, Lettre et al. also explored the intronic SNP in BCL11A, rs11886868, that strongly correlates with HbF levels in the CSSCD and confirmed that rs4671393 in BCL11A provides a strong signal of association with HbF levels in SCD populations . In a Tanzanian SCD cohort, SNPs in HBS1L-MYB and BCL11A were also found to have a significant impact on the HbF level . Most recently, researchers found that common genetic variation at BCL11A associated with HbF levels lies in noncoding sequences associated with an erythroid enhancer chromatin signature, this led to the suggestion that the GWAS-marked BCL11A enhancer represents an attractive target for therapeutic genome engineering for the β-hemoglobinopathies .
In the present study, the first to our knowledge examining these genetic relationships in West Africans, we examined the effect of 10 common genetic polymorphisms on HbF and various clinical outcomes in SCA patients from Cameroon. We have confirmed robust genetic associations in BCL11A and HBS1L-MYB intergenic region loci, as previously reported in African Americans and Afro- Brazilians  and Tanzanians . In the CSSCD, five SNPs explained >20% of residual HbF variation; in the present study 4 SNPs explained about 10% of HbF variation. As a correlate, in the present study, we found much lower effect sizes for the relevant markers, especially in the BCL11A loci (Table 2); this could in part be due to our smaller sample size, or to the reduction in precision through the use of ADT to measure HbF levels in the majority of patients . Nevertheless, HbF was measured in the CSSCD samples in the 1980s by ADT ; thus, the difference in effect size could well be a valid finding that deserves further investigation; additional fine-mapping of BCL11 should therefore be considered in future studies. Alternatively, another potential explanation could be possible poor assay precision of ADT techniques to measure HbF on the Cameroonian samples. Despite this possible limitation in HbF phenotyping, the replication of this level of genetic effect sizes for human complex quantitative traits, explained using a few genetic polymorphisms, is relatively strong in comparison to many other quantitative human traits.
As also reported in the present study, multiple variants at the HBS1L-MYB locus were shown to be independent in African-American, Afro-Brazilian  and Tanzanian SCD patients . Together the polymorphisms explained 8.3% of the variation in HbF levels in our SCA patients (Table 2). In addition, we have shown that rs9399137, which acts as a tagging SNP for the HMIP-2 sub-locus in European populations , , occurred at a low frequency in this Cameroonian cohort, as noted previously in both African-American  and Tanzanian patients ; thus, rs9399137, is not an appropriate proxy for the causative sequence variant that influences HbF at HMIP-2 on African chromosomes . However, we report, in the HMIP-2 sub-locus, a much higher frequency of rs9389269 (0.18) in Cameroonian as compared to the Tanzanian SCD patients (0.03); this observation could indicate a high degree of variation in the MAF of this SNP amongst SCD patients from various African population groups. Specifically, a 3 bp deletion in perfect LD with rs9399137 in Chineses, Europeans and some Africans was reported to be the candidate functional motif for the HMIP-2 region . But, in the current Cameroonian samples, rs9389269 was not linked to rs9399137 (r2 = 0; D’ = 0.138; Table S1). Using 1000Genome data (Phase I Release Version 3 20101123), we also estimated the LD between rs9389269 and rs9399137 among Luhya in Webuye from Kenya (AWK) and Yoruba in Ibadan from Nigeria (YRI); equally, the results did not indicate any linkage between the two SNPs in any of the two native African populations: r2 = 0.012; D' = 0.233 for AWK and r2 = 0.002; D' = 1.000 for YRI. Thus, the candidate for functional motif for the HMIP-2 region in sub-Saharan African populations deserves to be explored in future study.
The importance of the HMIP locus in African populations might therefore have been underestimated by datasets using markers tailored to European studies . Fine-mapping this locus could lead to new findings in African patients with SCD.
The association between the XmnI polymorphism (rs7482144) in the proximal promoter of the γ-globin (HBG2) gene and HbF levels is well documented in SCA patients , . Due to the virtual absence of the Senegal and Indian Arab haplotypes that contain the XmnI variant, we lacked power to replicate the association of rs7482144 in HBG2 with HbF levels; in the CSSCD, rs7482144 explains 2.2% of the variation in HbF levels. Other African populations with the Senegal S haplotype that contains the rs7482144 SNP would be better suited to study the effects of this variant. Similarly, a strong signal adjacent to the HBB cluster, recently detected in African-American patients at rs5006884 in OR51B5/6 , was not significant in our patients. In a report that included Tanzanian patients, the positive association of HbF with rs5006884 disappeared when linkage disequilibrium with rs7482144 was taken into account ; this additional HbF locus does not seem to play an important role in Cameroonians or Tanzanians.
We also described the prevalence of various haplotypes in the β-globin gene cluster in Cameroonian SCD patients and reveal that the Benin haplotype was the most prevalent. It is perhaps anomalous that the “Cameroon” haplotype, which seems to occur more frequently in Sudan , is not dominant in this population. The results raise the question of the geographical origin of this haplotype and also suggest that the detailed geographic distribution of known sickle haplotypes is still not well established. The refinement of the molecular structure (i.e. haplotype) in HBB and migration patterns may yield important information about the evolutionary history of SCA that could also be clinically relevant . Contrary to our findings, a recent report argues that the S haplotype itself (beyond HbF regulation) correlates with severity . Differences in haplotypes and SNP allele frequencies between SCA patients from different geographic populations underscore the need to conduct GWAs in regional sub-Saharan African populations.
Understanding the effect that HbF-associated variants have more generally on blood cell production and other parameters will be important in our efforts to define their biological roles and for the development of targeted therapies ,. As in African Americans and Afro-Brazilians studies , we replicated the finding that in combination HBS1L-MYB rs9399137, and BCL11A rs4671393 SNPs affected platelet counts. In addition BCL11A rs4671393 SNPs was associated with wider range of hematological indices in Cameroonians (Table 3), independently of HbF level. Significant associations of HBS1L-MYB rs9399137 with RBC indices (RBC count, MCV, MCH), and platelet and monocyte counts have been reported in healthy non-anemic Europeans . However, in African-American and Brazilian patients Lettre et al. did not replicate these findings .
Previous analyses of the CSSCD dataset showed that increased HbF levels correlate with less severe complications, fewer pain crises  and improved survival . Having shown that four sequence variants at two loci influence HbF levels (table 2; Fig. 1), we next asked whether these SNPs correlate with pain crisis or stroke in SCA: we did not find any significant associations. However, two SNPs in HBS1L-MYB were associated with the number of hospitalisations (figure 2); if confirmed, this could be the first evidence of the clinical effect that is associated with these variants. This result could suggest that clinical genotyping of these variants, and others, yet to be found, may potentially be useful to risk stratify SCD patients, and to serve as a guide to adjusting therapeutic and follow up strategies. This finding deserves further investigation with a larger sample or, ideally, a prospective birth cohort, to fully appreciate the potential clinical value. Clearly much remains to be learned about the myriad genetic factors which modify the course of SCA in Africans.
Limitations and perspectives
A limitation of this study was our inability to use the HPLC method to measure HbF in all patients; this method was not available in Cameroon at the beginning of the sample collection. Even though we have shown that, use of the older assay did not affect the main outcomes ie, the association of targeted SNPs to HbF (supplementary data 2). The systematic use of HPLC for HbF measurement should have allowed more precise estimation of effect sizes of the genotyped SNPs as well as appropriate comparison to the other populations of SCA patients. In addition, phenotyping HbF levels with ADT instead of HPLC in the majority of samples could have affected the study of the associations of haplotypes, HbF levels and various clinical events. Self-report of clinical variables like VOC episodes can also lack precision and pain tolerance and financial factors could have been limiting factors for hospital attendance. However, it has also proven difficult to detect an association of genetic markers with VOC in affluent settings  .
Despite the above limitations, this study represent an important step forward in the understanding of influence of SNPs in modifying genes on HbF level in Cameroon, and in sub-Saharan African patients generally. While more than 70% of SCA patients world-wide live in Africa, most advances in the molecular understanding and management of SCA have been based on research conducted in either the US or the UK. Clearly contemporary research tools must now be widely implemented in Africa. This study therefore also has a capacity-building dimension, as it was fully performed from design, molecular analysis and reporting, on the African continent. Increased capacity to conduct both clinical and laboratory research on SCA in Africa could in turn create major collaborative research opportunities at the regional and international levels.
This study has confirmed the associations of SNPs in BCL11A and HBS1L-MYB and fetal haemoglobin in Cameroonian SCA patients and the association of two specific SNPs with rate of hospitalization, a potential marker of overall disease severity. These results highlight the differential frequency of SNPs associated to HbF levels amongst various demographic groups of SCA patients. Additional data in other African site are needed to define an updated map of these variants on the continent, as well as potential new loci that could influence fetal hemoglobin.
Linkage analyses of the targeted of SNPs pairs, studied in the BCL11A and HBS1L-MYB loci.
HbF assessment technique and SNPs Associations.
Effect of HbF-associated variants on pain crises and hospitalization rates.
Conceived and designed the experiments: AW AAV JN. Performed the experiments: VJNB AAV RR AW JN. Analyzed the data: AW VJNB AAV BT RSC GL. Contributed reagents/materials/analysis tools: AW RR JN BT RSC GL. Wrote the paper: AW RR BT RSC GL. Revised and approved the manuscript: AW VJNB AAV RR RSC BT GL JN.
- 1. Bartolucci P, Galactéros F (2012) Clinical management of adult sickle-cell disease. Current Opinion in Hematology 19: 149–55.
- 2. Piel FB, Patil AP, Howes RE, Nyangiri OA, Gething PW, et al. (2013) Global epidemiology of sickle haemoglobin in neonates: a contemporary geostatistical model-based map and population estimates. Lancet 381: 142–51.
- 3. Grosse SD, Odame I, Atrash HK, Amendah DD, Piel FB, et al. (2011) Sickle cell disease in Africa: a neglected cause of early childhood mortality. American Journal of Preventive Medicine 41: S398–405.
- 4. Weatherall DJ, Clegg JB (2001) Inherited haemoglobin disorders: an increasing global health problem. Bulletin of the World Health Organization 79: 704–12.
- 5. Wonkam A, Mba CZ, Mbanya D, Ngogang J, Ramesar R, et al. (2013) Psychosocial Burden of Sickle Cell Disease on Parents with an Affected Child in Cameroon. Journal of Genetic Counseling Jul 24. [Epub ahead of print]
- 6. World Bank (2010) Education for all-fast track initiative: support to the education sector. Report No. 48373-CM, 1–2.
- 7. Njamnshi AK, Mbong EN, Wonkam A, Ongolo-Zogo P, Djientcheu VD, et al. (2006) The epidemiology of stroke in sickle cell patients in Yaounde, Cameroon. Journal of the Neurological Sciences 250: 79–84.
- 8. Ruffieux N, Njamnshi AK, Wonkam A, Hauert CA, Chanal J, et al. (2013) Association between biological markers of sickle cell disease and cognitive functioning amongst Cameroonian children. Child Neuropsychology 19: 143–60.
- 9. Lettre G, Sankaran VG, Bezerra MA, Araújo AS, Uda M, et al. (2008) DNA polymorphisms at the BCL11A, HBS1L-MYB, and beta-globin loci associate with fetal hemoglobin levels and pain crises in sickle cell disease. . Proceedings of the National Academy of Sciences U S A. 105: 11869–74.
- 10. Thein SL, Menzel S (2009) Discovering the genetics underlying foetal haemoglobin production in adults. British Journal of Haematology 145: 455–67.
- 11. Bae HT, Baldwin CT, Sebastiani P, Telen MJ, Ashley-Koch A, et al. (2012) Meta-analysis of 2040 sickle cell anemia patients: BCL11A and HBS1L-MYB are the major modifiers of HbF in African Americans. Blood 120: 1961–1962.
- 12. Garner C, Tatu T, Reittie JE, Littlewood T, Darley J, et al. (2000) Genetic influences on F cells and other hematologic variables: a twin heritability study. Blood 95: 342–6.
- 13. Solovieff N, Milton JN, Hartley SW, Sherva R, Sebastiani P, et al. (2010) Fetal hemoglobin in sickle cell anemia: genome-wide association studies suggest a regulatory region in the 5 olfactory receptor gene cluster. Blood115: 1815–1822.
- 14. Thein SL, Menzel S, Peng X, Best S, Jiang J, et al. (2007) Intergenic variants of HBS1L-MYB are responsible for a major quantitative trait locus on chromosome 6q23 influencing fetal hemoglobin levels in adults. Proceedings of the National Academy of Sciences U S A 104: 11346–11351.
- 15. Uda M, Galanello R, Sanna S, Lettre G, Sankaran VG, et al. (2008) Genome-wide association study shows BCL11A associated with persistent fetal hemoglobin and amelioration of the phenotype of beta-thalassemia. Proceedings of the National Academy of Sciences U S A 105: 1620–1625.
- 16. Bauer DE, Kamran SC, Lessard S, Xu J, Fujiwara Y, et al. (2013) An erythroid enhancer of BCL11A subject to genetic variation determines fetal hemoglobin level. Science 342: 253–7.
- 17. Platt OS (2008) Hydroxyurea for the treatment of sickle cell anemia. The New England Journal of Medicine 358: 1362–9.
- 18. Xu J, Peng C, Sankaran VG, Shao Z, Esrick EB, et al. (2011) Correction of sickle cell disease in adult mice by interference with fetal hemoglobin silencing. Science 334: 993–6.
- 19. Makani J, Menzel S, Nkya S, Cox SE, Drasar E, et al. (2011) Genetics of fetal hemoglobin in Tanzanian and British patients with sickle cell anemia. Blood 117: 1390–2.
- 20. Labie D, Pagnier J, Lapoumeroulie C, Rouabhi F, Dunda-Belkhodja O, et al. (1985) Common haplotype dependency of high G gamma-globin gene expressionand high HbF levels inbeta-thalassemia and sickle cell anemia patients. Proceedings of the National Academy of Sciences U S A 82: 2111–2114.
- 21. Steinberg MH, Rosenstock W, Coleman MB, Adams JG, Platica O, et al. (1984) Effects of thalassemia and microcytosis on the hematologic and vasoocclusive severity of sickle cell anemia. Blood. Jun 63(6): 1353–60.
- 22. Saiki RK, Scharf S, Faloona F, Mullis KB, Horn GT, et al. (1985) Enzymatic amplification of beta-globin genomic sequences and restriction site analysis for diagnosis of sickle cell anemia. Science 230: 1350–4.
- 23. Steinberg MH, Lu ZH, Nagel RL, Venkataramani S, Milner PF, et al. (1998) Hematological effects of atypical and Cameroon beta-globin gene haplotypes in adult sickle cell anemia. American Journal of Hematology 59: 121–6.
- 24. Pati N, Schowinsky V, Kokanovic O, Magnuson V, Ghosh S (2004) A comparison between SNaPshot, pyrosequencing, and biplex invader SNP genotyping methods: accuracy, cost, and throughput. Journal of Biochemical and Biophysical Methods 60: 1–12.
- 25. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: A tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics 81: 559–575.
- 26. Menzel S, Jiang J, Silver N, Gallagher J, Cunningham J, et al. (2007) The HBS1L-MYB intergenic region on chromosome 6q23.3 influences erythrocyte, platelet, and monocyte counts in humans. Blood 110: 3624–3626.
- 27. Ondei LS, Zamaro PJ, Mangonaro PH, Valêncio CR, Bonini-Domingos CR (2007) HPLC determination of hemoglobins to establish reference values with the aid of statistics and informatics. Genetics and molecular research 6: 453–60.
- 28. Creary LE, Ulug P, Menzel S, McKenzie CA, Hanchard NA, et al. (2009) Genetic variation on chromosome 6 influences F cell levels in healthy individuals of African descent and HbF levels in sickle cell patients. PLoS ONE 4: e4218.
- 29. Farrell JJ, Sherva RM, Chen ZY, Luo HY, Chu BF, et al. (2011) A 3-bp deletion in the HBS1L-MYB intergenic region on chromosome 6q23 is associated with HbF expression. Blood 117: 4935–45.
- 30. Elderdery AY, Mills J, Mohamed BA, Cooper AJ, Mohammed AO, et al. (2012) Molecular analysis of the B-globin gene cluster haplotypes in a Sudanese population with sickle cell anemia. International Journal of Laboratory Hematology 34: 262–266.
- 31. Hanchard N, Elzein A, Trafford C, Rockett K, Pinder M, et al. (2007) Classical sickle beta-globin haplotypes exhibit a high degree of long range haplotype similarity in African and Afro-Caribbean populations. BioMed Central Genetics 8: 52.
- 32. Bean CJ, Boulet SL, Yang G, Payne AB, Ghaji N, et al. (2013) Acute chest syndrome is associated with single nucleotide polymorphism-defined beta globin cluster haplotype in children with sickle cell anaemia. British Journal of Haematology 163: 268–276.
- 33. Platt OS, Thorington BD, Brambilla DJ, Milner PF, Rosse WF, et al. (1991) Pain in sickle cell disease. Rates and risk factors. The New England Journal of Medicine 325: 11–16.
- 34. Platt OS, Brambilla DJ, Rosse WF, Milner PF, Castro O, et al. (1994) Mortality in sickle cell disease. Life expectancy and risk factors for early death. The New England Journal of Medicine 330: 1639–1644.
- 35. Galarneau G, Coady S, Garrett ME, Jeffries N, Puggal M, et al. (2013) Gene-centric association study of acute chest syndrome and painful crisis in sickle cell disease patients. Blood 122: 434–42.