Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Comprehensive analysis of mitochondrial and nuclear DNA variations in patients affected by hemoglobinopathies: A pilot study

  • Ylenia Barbanera ,

    Contributed equally to this work with: Ylenia Barbanera, Francesco Arcioni, Hovirag Lancioni

    Roles Data curation, Formal analysis, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

  • Francesco Arcioni ,

    Contributed equally to this work with: Ylenia Barbanera, Francesco Arcioni, Hovirag Lancioni

    Roles Data curation, Resources, Writing – original draft, Writing – review & editing

    Affiliation Pediatric Oncohematology, Hospital Santa Maria della Misericordia, Perugia, Italy

  • Hovirag Lancioni ,

    Contributed equally to this work with: Ylenia Barbanera, Francesco Arcioni, Hovirag Lancioni

    Roles Conceptualization, Formal analysis, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy

  • Roberta La Starza,

    Roles Investigation, Writing – review & editing

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

  • Irene Cardinali,

    Roles Investigation, Software, Writing – original draft

    Affiliation Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy

  • Caterina Matteucci,

    Roles Formal analysis, Investigation

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

  • Valeria Nofrini,

    Roles Formal analysis, Visualization

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

  • Antonella Roetto,

    Roles Investigation, Validation

    Affiliation Department of Clinical and Biological Sciences, University of Turin, Hospital San Luigi Gonzaga, Turin, Italy

  • Antonio Piga,

    Roles Investigation, Validation, Visualization

    Affiliation Department of Clinical and Biological Sciences, University of Turin, Hospital San Luigi Gonzaga, Turin, Italy

  • Paola Grammatico,

    Roles Visualization

    Affiliation Department of Molecular Medicine, Laboratory of Medical Genetics, San Camillo-Forlanini Hospital, Sapienza University, Rome, Italy

  • Maurizio Caniglia,

    Roles Resources, Visualization

    Affiliation Pediatric Oncohematology, Hospital Santa Maria della Misericordia, Perugia, Italy

  • Cristina Mecucci,

    Roles Resources, Supervision, Writing – review & editing

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

  • Paolo Gorello

    Roles Conceptualization, Investigation, Methodology, Supervision, Writing – original draft, Writing – review & editing

    paolo.gorello@unipg.it

    Affiliation Department of Medicine, Hematology, University of Perugia, Perugia, Italy

Abstract

The hemoglobin disorders are the most common single gene disorders in the world. Previous studies have suggested that they are deeply geographically structured and a variety of genetic determinants influences different clinical phenotypes between patients inheriting identical β-globin gene mutations. In order to get new insights into the heterogeneity of hemoglobin disorders, we investigated the molecular variations on nuclear genes (i.e. HBB, HBG2, BCL11A, HBS1L and MYB) and mitochondrial DNA control region. This pilot study was carried out on 53 patients belonging to different continents and molecularly classified in 4 subgroup: β-thalassemia (β++, β00 and β+0)(15), sickle cell disease (HbS/HbS)(20), sickle cell/β-thalassemia (HbS/β+ or HBS/β0)(10), and non-thalassemic compound heterozygous (HbS/HbC, HbO-Arab/HbC)(8). This comprehensive phylogenetic analysis provided a clear separation between African and European patients either in nuclear or mitochondrial variations. Notably, informing on the phylogeographic structure of affected individuals, this accurate genetic stratification, could help to optimize the diagnostic algorithm for patients with uncertain or unknown origin.

Author summary

Worldwide most frequent genetic diseases are represented by a very heterogeneous group of congenital hemolytic anaemias which include β-thalassemia and sickle cell anaemia. They are deeply geographically structured and characterized by high complexity of the genetic background associated with variable clinical severities. Due to this heterogeneity we investigated the nuclear and mitochondrial DNA marker variations in patients with hemoglobinopathies in order to provide the first comprehensive genetic stratification of these hereditary disease. This study contributes to detail the multifaceted picture of hemoglobin disorders.

Introduction

Worldwide, the most frequent genetic diseases are represented by a very heterogeneous subgroup of congenital hemolytic anaemias, which includes β-thalassemia and sickle cell anaemia [1]. Individuals with β-thalassemia have a diverse range of clinical manifestations, from an asymptomatic state to a transfusion-dependent form, depending mainly on the β-globin gene mutations [2,3]. However, individuals with identical β-thalassemia genotypes can exhibit variable clinical severities [4], because of the high complexity of the genetic background associated with the disease [5,6]. The term sickle cell disease (SCD) encompasses a group of disorders characterized by the presence of at least one hemoglobin S allele (HbS; p.Glu6Val in HBB) and a second HBB allele carrying pathogenic variant resulting in abnormal hemoglobin polymerization. SCD (HbS/HbS) caused by the homozygous HBB variant p.Glu6Val is the most common cause of SCD in the US. SCD caused by compound heterozygous HBB pathogenic variants includes sickle-hemoglobin C disease (HbS/HbC) and two types of sickle β-thalassemia (HbS/β+-thalassemia and HbS/β°-thalassemia). Other β-globin chain variants such as D-Punjab, O-Arab, and E also result in SCD when inherited with HbS. The marked variability in phenotype severity is due to the different genotypes of the disease and the co-presence of other independent genetic factors that may worsen or alleviate the phenotype [7,8]. Single nucleotide polymorphism (SNPs) in several genetic modulators and cis-regulatory elements involved in the regulation of human HbF, i.e. HBG2, B cell CLL/lymphoma 11A (BCL11A), Kruppel-like factor 1 (KLF1) and MYB, can delay fetal-to-adult hemoglobin (Hb) switch, ameliorating the severity of β-thalassemia and sickle cell disease [920].

Other studies focusing on different human populations reported that the SNPs are deeply geographically structured and further loci i.e KCNK10, GPR65, RNASE2, RNASE3, and C/EBPE are involved in HbF expression regulation [2125].

In the last decades, mitochondrial DNA (mtDNA) has been proven to be an effective tool to reconstruct the maternal origins of individuals and geographically map samples, but it also plays a key role in various human diseases. Due to its maternal inheritance and the lack of recombination [26,27], its differentiation is generated only by the sequential accumulation of new mutations from a female ancestor that makes the mtDNA a molecular register along maternal lines. Over millennia, the molecular differentiation gave rise to groups of mtDNAs that share a set of variants (called haplotypes) acquired from the same common female ancestor. Even though whole genomic approaches are now answering to questions related to the origin and diversification of species, the mtDNA continues to be an important source of information to reconstruct the genetic history of specific populations. Studies based on sequence variation especially in the noncoding control region have allowed the classification of distinct geographic and ethnic affinities [2834].

Nowadays, a detailed worldwide mtDNA phylogeny is available, where different haplogroups are defined by specific mutational motifs and restricted to specific areas and population groups [35]. The development of phylogeography is closely related to mtDNA analysis, which was described as one of the best tools to study population variability [36].

The consequences of mitochondrial mutations have been also evaluated in human ageing as well as inherited and acquired diseases [37,38], however the only study focusing on the association of mtDNA variations with β-thalassemia claimed that mitochondrial mutations may increase disease susceptibility or the development of phenotype-associated disease in affected individuals, although concluded that there was no strong proof for the association of these polymorphisms with β-thalassemia [39]. Doubtless, as mentioned above, there should be a correlation between hemoglobinopathies and geographic origins of patients [40,41], that could be deepened also by considering the mtDNA variability.

The aim of this pilot study was to provide a first comprehensive analysis of hemoglobin and uniparental marker variants in 53 patients with hemoglobinopathies coming from different countries, in order to depict the phylogeographic structure of affected individuals.

Results and discussion

The study was carried out on 53 patients (M/F: 38/15) affected by hemoglobinophaties who referred to our laboratory at the University of Perugia/Hospital Santa Maria della Misericordia, in Umbria a region of Central Italy.

Except for the first patient for whom we do not know the origin, 16 patients (30.2%) were European, 28 (52.8%) African, four (7.6%) Asian and four (7.6%) American. Complete information about geographical characteristics and genotypes were reported in Table 1, while clinical phenotypes were detailed in S1 Text and S1S4 Tables.

thumbnail
Table 1. Geographic information and α/β genotypes of the 53 patients analysed in this study.

https://doi.org/10.1371/journal.pone.0240632.t001

Homozygotes and compound heterozygotes included: 15 β-thalassemia patients, 20 sickle cell disease (HbS/HbS) patients, ten sickle cell/β-thalassemia (HbS/β+ or β0) patients, eight compound heterozygotes for Hb variants of which seven HbS/HbC and one HbO-Arab/HbC. Ten patients carried α mutations: eight were heterozygous for α -3.7 and two for ααα anti 3.7 (Table 1). Seventeen different mutations of the HBB gene were found: 12 generated β-thalassemic defects, two produced Hb thalassemic variants (i.e. Hb-Lepore and HbE) and three were associated with the production of non-thalassemic Hb variants (HbS, HbC and HbO-Arab) (Table 1).

Overall, β-thalassemia patients carried 17 different gene mutations and 12 different genotypes. Seven out of 15 patients (46.6%) were European, two (13.3%) were African, four (26.7%) were Asian and one (6.7%) was American. No precise information was available for one patient (6.7%) (Table 1).

Sickle cell/β-thalassemia patients were characterized by four different β genotypes while α mutations (ααα anti 3.7) were found in two patients from Albania. Seven out of eleven patients were Europeans (63.6%), two (18.2%) Africans and one (9%) American (Table 1).

All sickle cell disease patients carried the same β genotype, while the presence of the -α3.7/αα genotype was demonstrated in five patients and the -α3.7/-α3.7 genotype was highlighted in one patient. Sixteen out of 20 (80%) were African patients, two were Americans (10%) and two were Europeans (10%) (Table 1).

Moreover, seven out of eight patients included in the cohort of compound heterozygotes who carried HbS/HbC genotype (patients 46–51, 53), while the other patient bore HbC/HbO-Arab genotype (patient 52). -α3.7/αα genotype was reported in two HbS/HbC cases. All eight patients were African (Table 1).

Hemoglobin phenotype modulating SNPs: Geographical distribution

The analysis of the five polymorphic loci related to HbF expression (i.e. HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T], BCL11A rs10189857 [A>G], HBS1L rs28384513 [A>C], HBS1L-MYB rs9399137 [T>C]), showed BCL11A rs10189857 [A>G] as the most frequent polymorphic site, since the allele G was detected in 33/53 patients (21 heterozygotes, 12 homozygotes). Conversely, HBS1L-MYB rs9399137 [T>C] was the less frequent polymorphic variation, as the allele C was present only in 6/53 patients (one homozygote and five heterozygotes). As referring to the other SNPs: HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T] and HBS1L-MYB [A>C] were detected in pts 9, 19 and 21 respectively, showing an intermediate frequency. No patients carrying C/EBPE rs45496295 [C>T] variation were found (Table 2, S5 Table).

thumbnail
Table 2. Molecular analysis of phenotype modulating SNPs, mtDNA sequence variation and geographical distribution of the 53 patients analysed in this study.

https://doi.org/10.1371/journal.pone.0240632.t002

Interestingly, we observed that frequencies of the five analysed SNPs were related to the geographical origin of patients (Fig 1). Although in our cohort there were only four Asian cases, all investigated SNPs were detected. Conversely, in the other patients (i.e. Europeans, Africans, Americans), the frequency of each SNP was considerably different. HBS1L-MYB rs9399137 T>C was not detected in both our African or American populations, while HBG2 rs7482144 C>T was not detected in European patients (Fig 1, Table 2, S1A and S1B Fig).

thumbnail
Fig 1. Nuclear SNP frequencies and geographic distribution of five nuclear SNPs in the 53 patients.

Black rings represent the number of patients in different geographic areas. Pie charts represent the frequencies of each SNP. The central number indicates the total alleles carrying SNPs.

https://doi.org/10.1371/journal.pone.0240632.g001

In particular, HBG2 rs7482144 [C>T] was found in nine patients, five African, three Americans and one Asian (S1A Fig, Table 2). The absence of this variation in European subjects did not completely fit with the allele frequencies retrieved from GnomAD database, which reported T alleles in the European population, although in a lower frequency (3839 T alleles and 10893 C alleles) (S1B and S1C Fig).

BCL11A rs1427407 [G>T] spread over all analysed ethnic group: 13 African patients, four Europeans, one American and one Asian. Likewise, BCL11A, rs10189857 [A>G] was present in 33/53 patients: 14 Africans, ten Europeans, four Asians, four Americans and no precise ethnic information were available for one patient (S1A Fig, Table 2). The allele frequencies of both BCL11A rs1427407 and BCL11A rs10189857 were in agreement with those reported in GnomAD database (Fig 1B and 1C).

HBS1L-MYB rs28384513 [A>C] was identified in ten African patients, nine Europeans and two Asians, while it was not detected in American patients (Fig 1A, Table 2). Notably GnomAD database reported the G allele also in this population, although in a lower frequency (9164 T alleles and 5824 G alleles) (S1B and S1C Fig). The low number of American patients included in our cohort (i.e. three pts) could explain the discrepancy between our results and the database.

Finally, the HBS1L-MYB rs9399137 [T>C], was detected only in 6/53 analysed patients, five European patients and one Asian (S1A and S1B Fig, Table 2). No African patients with HBS1L-MYB rs9399137 [T>C] were found in our cohort although this represented the largest group. Allele frequencies obtained by GnomAD database reported C alleles in American and African populations, although at lower frequencies (S1B and S1C Fig).

In conclusion, for three out of five SNPs our results were in agreement with GnomAD database. The only discrepancy regards HBG2 rs7482144 [C>T] which was not detected in European patients and HBS1L-MYB rs9399137 [T>C] which was not detected in Africans although these patients represented 30.2% and 52.8% of the cohort, respectively. These discrepancies may be due to a selection bias as European patients represented only Italian and Albania geographical areas, while 24/28 African patients came from seven Sub-Saharan countries and only 4/28 came from two North Africa countries. (Fig 1)

Analysis of mtDNA sequence variation and integration with hemoglobin marker variants

Mitochondrial control region sequences ranged from nucleotide position (np) 16024 to np 210, thus encompassing the HVSI and most of the HVSII (Table 2). The overall sequence alignment of samples revealed 99 polymorphic sites (S), represented by ten transversions and two indel (one deletion and one insertion). Nucleotide diversity (π) across all samples was estimated at 0.0173. The haplotype diversity was very high (Hd = 0.993) as well as the average number of nucleotide differences (k = 13,052).

When aligned to the revised Cambridge Reference Sequence, a total of 44 different haplotypes were obtained and classified through their mutational motifs into 13 different macro-haplogroups: B, D, G, JT, L0, L1, L2, L3*, M*, N9, R0, UK and X (Fig 2A). Seven haplotypes were shared between two or three samples, but five couples of those individuals declared to be maternally related. For the remaining two haplotypes, one (HT02) was found in two brothers from Algeria and one sample from Morocco the other (HT04) was detected in one patient from Ivory Coast and one from Benin.

thumbnail
Fig 2. Haplogroup frequency.

(A) Haplogroup frequency pie chart. (B) Geographic distribution of patients based on their declared origin and reported for different haplogroups. (C) Distribution of hemoglobinopathies subgroups in all patients for different mtDNA haplogroups.

https://doi.org/10.1371/journal.pone.0240632.g002

In order to better define the phylogenetic relationships between samples, we have constructed the network illustrated in Fig 3. Haplogroup labels are reported according to the Phylotree nomenclature [42]. The network topology showed two major mtDNA clusters, corresponding to super-haplogroups R and not-R. Within R, JT and H haplogroups prevailed. The latter formed a star-like cluster mainly represented by sub-haplogroups H1, H2 and H7. R0a, HV, B and UK were also represented. The not-R cluster included mostly L1, L2 and L3* sub-lineages, followed by L0 and D; also N9, M, X and G were revealed.

thumbnail
Fig 3. Network of mitochondrial variation.

Tree of the 53 mtDNA haplotypes observed among the patients, highlighted with different colors according to their geographic origin. The circles are proportional to the number of samples carrying the same haplotype. Haplogroups and sub-haplogroups are indicated with capital letters. Super-haplogroups R and not-R are defined in the right and left box, respectively.

https://doi.org/10.1371/journal.pone.0240632.g003

Among samples, the most represented lineage belonged to the super-haplogroup L (45%), consisting in L0 (8%), L1 (17%), L2 (54%) and L3* (21%), followed by R0 (25%) and M* (13%) (Fig 2B). The sibling haplogroup JT was detected with 9% of frequency, while B, N9, UK and X lineages were represented by only one sample each.

It is known that macro-haplogroup L represents the most ancestral mitochondrial lineage of all living modern humans and, if excluding the derived L3 branches M and N, it represents the majority of the typical sub-Saharan mtDNA variability. Within L, L2 is the most common haplogroup in Africa and it has been previously observed throughout the continent [43]. Our findings agree with the haplogroup frequency and origin of our samples specifically from Nigeria and Ivory Coast (but also Congo and Benin), except for Syria, from which two samples (brothers) derived. L3* was found in two samples from Nigeria, one from Guinea, one from Ghana and one from Italy. Even if L haplotypes account for <1% of mtDNAs in Europe, with L1b being the most common haplogroup, frequencies widely change, ranging from 3% in Southern Europe to 0.7% in Central Europe and 0.5% in Northern Europe.

Within R0, only one sample R0a and one HV1 were found, while the remaining patients were H, thus corresponding to a total of 11/53 (21%). Haplogroup H is the most frequent mitochondrial lineage in Europe (~40%) with a declining pattern from Western Europe towards the Near East and Caucasus (~10–20%). It was also found in Africa, especially the North region, with a peak of H1 in Maghreb and this is in accordance with the origin of our H samples (Italy, Albania, Morocco and Algeria).

As for the macro-haplogroup M, samples originated from Venezuela and Cambodia and belonged to the D4 sub-branches (D1f1 and D4b2d), which exactly reflect the geographic maximum distribution of these lineages (South America and East Asia, respectively) [44]. Even patient 53 from Morocco, belonging to M1a2a mitochondrial haplogroup, was congruent with the frequency distribution of the lineage, unlike African patients 28 and 29 (brothers) and 40, which belonged to G1a and M30c respectively, both sub-lineages typical of Asia.

The East Asian B4 haplogroup revealed in patient 27 from Dominican Republic could result odd, but we should keep in mind that a subclade of B4b is one of the five haplogroups found among indigenous people of the Americas, thus explaining a lower distribution also in Central America.

Patient 13 from China belonging to N9a10a perfectly reflects the East Asian origin and the distribution peak of the lineage [45].

After evaluating relationships between mitochondrial phylogeny and the geographic origin of individuals, we moved to a different level of resolution, including also the nuclear SNP data in a new network (S2 Fig, Table 2). We considered the mitochondrial control-region haplotypes, whose phylogenetic relationships are illustrated in the Fig 3, together with zygosity of the five variable genes, trying to detail the patients’ genotype. Patients 5, 14, 29 and 45 were the only homozygous wildtype for all variables: two from Italy (both belonging to the R0 haplogroup) and two from Nigeria (carrying L0 and G mitogenomes). A total of 41/53 patients showed the prevalence of wildtype homozygosity, while only six individuals (patients 6, 10, 21, 25, 34 and 44) were heterozygous for three polymorphisms. Fourteen patients were homozygous mutants for one SNP, while only patient 15 was homozygous mutant for three variables. It is to be highlighted that all individuals within L mitochondrial macro-haplogroup were homozygous wildtype for HBS1L-MYB rs9399137 [T>C], as patients harbouring the JT mitochondrial lineage were homozygous wildtype for HBG2 rs7482144 [C>T]. Conversely, C/EBPE rs45496295 [C>T] was always found as homozygous wildtype, thus it was excluded from the representation.

A further analysis was performed, trying to obtain a graphical overview of the possible integration between mtDNA control region variability and nuclear mutations associated to hemoglobinopathies subgroups (Table 2, S2 Text and S5 Table). In S3 Fig, different phenotypical subgroups were highlighted in the mitochondrial phylogenetic tree: β-thalassemia (β-thal) patients, sickle cell/β-thalassemia (HbS/β+ or β0), sickle cell disease (HbS/HbS) and compound heterozygotes (HbS/HbC and HbO-Arab/HbC). The overall network revealed a predominant distribution of the hemoglobinopathies subgroups within clades (S3 Fig). In particular, except for three haplotypes belonging to samples from Albania (patient 26) and South America (patiens 27 and 41) that lied in the R clade, sickle cell disease seemed to be common among not-R haplogroups, especially African lineages (L0, L1, L2 and L3*). It is to be noticed that no sickle cell disease patients belonged to macro-haplogroup R0 (inclusive of R0a, H and HV haplogroups) (Fig 2C). A similar distribution was observed for compound heterozygotes patients, all Africans; they lied in not-R clade (M1, L2 and L3* haplogroups), except for patient 52, from Morocco, that belonged to H1 lineage and it was the only patient characterized by HbC/HbO-Arab genotype. An opposite distribution could be observed for thalassemic patients, which spread among R haplogroups, except for four samples from South America (patient 7), Syria (patients 11 and 12, brothers) and Cambodia (patient 15). In particular, the R0 clade seemed to be overall characterized by β-thalassemic individuals. On the other hand, sickle cell/β-thalassemia patients (HbS/β+ or β0) seemed to be widespread in the network.

When we separated data in four different networks, one for each hemoglobinopathies subgroup coloured with the geographic origin of the patient, it was clear that Asian samples were all thalassemic, but clustered far from European thalassemic patients (S3E Fig). No African patient was affected by thalassemia (with the exception of a couple of brothers from Algeria). If excluding two samples with sickle cell/β-thalassemia patients (HbS/β+ or β0) (S3F Fig) Africans prevail in sickle cell disease grouping (S3G Fig) and are the only source of compound heterozygotes (S3H Fig). No star-like pattern of molecular relatedness emerged that would represent any event of local expansion.

To resume, we observed that European patients fall prevalently into the thalassemic (β++, β00 and β+0) or sickle cell/β-thalassemia (HbS/β+ or β0) subgroups, while Africans into the remaining two (i.e. HbS/HbS and HbS/HbC) (S4 Fig).

The clear structuring and sub-division between African and European clades could be detected also in the principal component analysis (PCA) performed by considering all variable sites (both nuclear and mtDNA SNPs) for the 53 patients (Fig 4). Moreover, American samples lied in two quarters, close to Europeans. The graph showed also three European samples close to the African quarters (left quarters): Albanian patients 17 (X haplogroup) and 26 (JT), and patient 35 (L3*) from Italy.

thumbnail
Fig 4. Two-dimensional Principal Component Analysis plot obtained by including all mitochondrial and nuclear DNA variations, considered separately in each sample and coloured depending on their geographic origin.

https://doi.org/10.1371/journal.pone.0240632.g004

On the other hand, five African samples stand close to the vertical line that separates left and right quarters: patient 24 (L3*) from Guinea; patients 29 (G), 43 (L3*), and 45 (L0) from Nigeria; and patient 44 (L0) from Congo. Interestingly, all these patients together with three Europeans (patients 17, 26, 35) carried at least HbS allele, which is very frequent in the Sub Saharan Africa. The prevalently European quarter (the first quarter on the right) shows four African patients (patients 3 and 4 siblings, and patients 48 and 52). The two most distant patients from African samples are from North Africa (Morocco and Algeria) confirming the closeness to European ones. Intriguingly, only two thalassemic samples lie on the left half of the PCA: they are patients 11 and 12, the only samples from West Asia, while the other two Asian samples were from China and Cambodia, and fall into the right half of the graph.

Even if the past and recent migrations could partially alter the picture, a high level of population and geographic structuring still persist.

Conclusions

This study represents the first comprehensive analysis of nuclear and mitochondrial DNA marker variations in patients with hemoglobinopathies and showed a clear separation between African and European patients allowing for depicting the phylogeographic structure of affected individuals. Further investigations on a larger cohort of patients, as well as on additional mitochondrial and nuclear variants using standard and Next Generation Sequencing methodologies, will allow to establish possible correlations between genetic data and clinical behaviour. Taking all together, this pilot study based on hemoglobin and uniparental marker variants could represent an integrated and powerful tool for enliven the multifaced pictures of hemoglobinophaties.

Materials and methods

Patients and DNA extraction

The study was carried out on 53 patients (M/F: 38/15) affected by hemoglobinophaties who referred to our laboratory at the University of Perugia/Hospital Santa Maria della Misericordia, in Umbria a region of Central Italy. (Table 1). α and β genotype of patients 1–12, 16–23, 26–39 and 46–52 were previously described [46]. Moreover patients 3–4, 11–12, 16–17 and 47, 49–50 were brothers. Patient 21 was mother of patient 7.

Peripheral blood samples were collected during routine medical examinations and genomic DNA was isolated through salting-out procedure [47].

At diagnosis and/or during follow-up all patients or their parents/guardians provided informed consent for sample collection, storage and for research collaborative studies. Protocols were approved by the Ethics Committee for Clinical Experimentation of the University of Perugia (protocol no. 2017–01) and analyses were performed in agreement with the Helsinki declaration.

Genomic DNA analysis

Five SNP associated with phenotype severity modulating respectively HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T], BCL11A rs10189857 [A>G], HBS1L rs28384513 [A>C] and HBS1L-MYB rs9399137 [T>C] were studied in all patients, using polymerase chain reaction (PCR)-based reverse dot-blot technique (β-Thal Modifier StripAssay; ViennaLab Diagnostics GmbH, Vienna, Austria).

C/EBPE, rs45496295 [C>T] were studied using primers: CEBP_F: 5’-GACAGGAGCTGGCTCTGAGT-3’ and CEBP_R: 5’-TCCGCAGAGTTAGGCCGT-3’

Nucleotide Sequencing was performed using BigDyeTM Terminator Cycler Sequencing Kit and ABI PRISMTM 310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA, USA) [48].

The SNP frequency in the different populations were obtained by GnomAD database (Genome Aggregation Database, https://gnomad.broadinstitute.org/).

Mitochondrial DNA analysis

Mitochondrial control region between sites 15877 and 259 was amplified using two primers designed on the revised Cambridge Reference Sequence (rCRS) with accession number NC_012920 [49]: forward 5’-CAAATGGGCCTGTCCTTGTA-3’ and reverse 5’-TGTGCAGACATTCAATTGTT-3’. The PCR fragments were first enzymatically purified (ExoSAP-IT®, USB Corporation, Cleveland, OH, USA), then cycle sequenced by application of ABI Prism™ BigDye Terminator chemistry on the ABI PRISMTM 310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA, USA), with the forward primer 5’-CCATTAGCACCCAAAGCTA-3’.

Sequences were assembled and aligned to rCRS using Sequencher™ 5.10 (Gene Codes Corporation). Whenever electropherograms showed ambiguities, new PCR amplifications and sequencing reactions were performed.

Final assembling encompassed at least the entire Hyper Variable Sequence HVSI and part of HVSII (from np 16024 to np 210). All mitochondrial control-region sequences were deposited in GenBank with accession numbers MT176183-MT176235.

For each sample, haplotypes were registered and classified in haplogroups according to the respective mutational motifs by referring to HaploGrep2 software based on PhyloTree database build 17, www.phylotree.org [42].

Genetic variation indices were estimated by using DnaSP 5.1 software [50].

The evolutionary relationships among haplotypes were visualized through the construction of a median-joining network using Network 10 software (www.fluxus-engineering.com).

By default, all polymorphisms relative to rCRS were weighted by a value of ten, except for the highly variable site 16519 and the variations in the poly-C stretch around np 16189 that were down-weighted to one. Conversely, triple weights were assigned to the mutational motifs 16223 and 73 in order to strengthen the topology of the network.

Principal component analyses (PCA) were performed using Excel software implemented by XLSTAT, as described elsewhere [51] in order to explain the variance of multivariate data (mitochondrial and nuclear DNA variation, considered separately in each sample).

Supporting information

S1 Fig. Nuclear SNP frequencies.

(A) Frequency per SNP of patients reported according to the geographic origin. (B) Frequency per continent of the ancestral allele and the variant one in the cohort of this study. (C) Frequency per continent of the ancestral allele and the variant one reported by GnomAD database. Please note that panels A and B are referred to coding DNA reference sequence while panel C is referred to the genomic reference sequence.

https://doi.org/10.1371/journal.pone.0240632.s001

(DOCX)

S2 Fig. Network of mitochondrial and nuclear variation.

Tree of the 53 mtDNA haplotypes observed among the patients in which zygosity of SNP was manually highlighted (rs45496295 was excluded because samples were all homozygous for it).

https://doi.org/10.1371/journal.pone.0240632.s002

(DOCX)

S3 Fig. Nuclear SNP frequency, geographic distribution and network mtDNA, in the subgroups of patients.

(A) Nuclear SNP frequency and Geographic distribution in β-thalassemia (β++, β00 and β+0) patients, (B) sickle cell/β-thalassemia (HbS/β+ or β0) patients, (C) sickle cell disease patients (HbS/HbS) (D) compound heterozygotes (HbS/HbC and HbO-Arab/HbC). Panel E-H: network plots obtained by including all mitochondrial and nuclear DNA variation, considered separately in each pathological group, and colored depending on their geographic origin.

https://doi.org/10.1371/journal.pone.0240632.s003

(DOCX)

S4 Fig. Network of mitochondrial variation and hemoglobinopathies.

Tree of the 53 mtDNA haplotypes observed among the patients in which different subgroups of hemoglobinopathies were highlighted.

https://doi.org/10.1371/journal.pone.0240632.s004

(DOCX)

S1 Table. Clinical data of β thalassemia patients.

https://doi.org/10.1371/journal.pone.0240632.s005

(DOCX)

S2 Table. Clinical data of sickle cell/thalassemia patients.

https://doi.org/10.1371/journal.pone.0240632.s006

(DOCX)

S3 Table. Clinical data of sickle cell disease patients.

https://doi.org/10.1371/journal.pone.0240632.s007

(DOCX)

S4 Table. Clinical data of compound heterozygotes patient.

https://doi.org/10.1371/journal.pone.0240632.s008

(DOCX)

S5 Table. Molecular characteristics of the genetic modulators of clinical phenotypes in homozygotes and compound heterozygotes for β gene defect.

https://doi.org/10.1371/journal.pone.0240632.s009

(DOCX)

References

  1. 1. Weatherall DJ. Thalassemia as a global health problem: recent progress toward its control in the developing countries. Ann N Y Acad Sci. 2010;1202:17–23. pmid:20712767.
  2. 2. Sedgewick AE, Timofeev N, Sebastiani P, So JCC, Ma ESK, Chan LC, et al. BCL11A is a major HbF quantitative trait locus in three different populations with beta-hemoglobinopathies. Blood Cells Mol Dis. 2008;41(3):255–8. Epub 2008/08/08. pmid:18691915; PubMed Central PMCID: PMC4100606.
  3. 3. Higgs DR, Engel JD, Stamatoyannopoulos G. Thalassaemia. Lancet. 2012;379(9813):373–83. Epub 2011/09/09. pmid:21908035.
  4. 4. Thein SL. Genetic association studies in β-hemoglobinopathies. Hematology Am Soc Hematol Educ Program. 2013;2013:354–61. pmid:24319204.
  5. 5. Giardine B, Borg J, Viennas E, Pavlidis C, Moradkhani K, Joly P, et al. Updates of the HbVar database of human hemoglobin variants and thalassemia mutations. Nucleic Acids Res. 2014;42(Database issue):D1063–9. Epub 2013/10/16. pmid:24137000; PubMed Central PMCID: PMC3964999.
  6. 6. Li CK. New trend in the epidemiology of thalassaemia. Best Pract Res Clin Obstet Gynaecol. 2017;39:16–26. Epub 2016/10/26. pmid:27847257.
  7. 7. Adam MP, Ardinger HH, Pagon RA, Wallace SE, Bean LJH, Stephens K, et al. GeneReviews. 1993.
  8. 8. Dadheech S, Jain S, Madhulatha D, Sharma V, Joseph J, Jyothy A, et al. Association of Xmn1–158 γG variant with severity and HbF levels in β-thalassemia major and sickle cell anaemia. Mol Biol Rep. 2014;41(5):3331–7. Epub 2014/02/02. pmid:24488321.
  9. 9. Lai Y, Zhou L, Yi S, Chen Y, Tang Y, Yang Z, et al. The association between four SNPs (rs7482144, rs4671393, rs28384513 and rs4895441) and fetal hemoglobin levels in Chinese Zhuang β-thalassemia intermedia patients. Blood Cells Mol Dis. 2017;63:52–7. Epub 2017/01/25. pmid:28160732.
  10. 10. Gilman JG, Huisman TH. DNA sequence variation associated with elevated fetal G gamma globin production. Blood. 1985;66(4):783–7. pmid:2412616.
  11. 11. Bianchi N, Cosenza LC, Lampronti I, Finotti A, Breveglieri G, Zuccato C, et al. Structural and Functional Insights on an Uncharacterized Aγ-Globin-Gene Polymorphism Present in Four β0-Thalassemia Families with High Fetal Hemoglobin Levels. Mol Diagn Ther. 2016;20(2):161–73. pmid:26897028.
  12. 12. Chaouch L, Moumni I, Ouragini H, Darragi I, Kalai M, Chaouachi D, et al. rs11886868 and rs4671393 of BCL11A associated with HbF level variation and modulate clinical events among sickle cell anemia patients. Hematology. 2016;21(7):425–9. Epub 2016/01/22. pmid:27077760.
  13. 13. Liu D, Zhang X, Yu L, Cai R, Ma X, Zheng C, et al. KLF1 mutations are relatively more common in a thalassemia endemic region and ameliorate the severity of β-thalassemia. Blood. 2014;124(5):803–11. Epub 2014/05/14. pmid:24829204; PubMed Central PMCID: PMC4118488.
  14. 14. Sankaran VG, Weiss MJ. Anemia: progress in molecular mechanisms and therapies. Nat Med. 2015;21(3):221–30. pmid:25742458; PubMed Central PMCID: PMC4452951.
  15. 15. Stadhouders R, Aktuna S, Thongjuea S, Aghajanirefah A, Pourfarzad F, van Ijcken W, et al. HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers. J Clin Invest. 2014;124(4):1699–710. Epub 2014/03/10. pmid:24614105; PubMed Central PMCID: PMC3973089.
  16. 16. Lettre G, Sankaran VG, Bezerra MA, Araújo AS, Uda M, Sanna S, et al. DNA polymorphisms at the BCL11A, HBS1L-MYB, and beta-globin loci associate with fetal hemoglobin levels and pain crises in sickle cell disease. Proc Natl Acad Sci U S A. 2008;105(33):11869–74. Epub 2008/07/30. pmid:18667698; PubMed Central PMCID: PMC2491485.
  17. 17. Thein SL, Menzel S, Peng X, Best S, Jiang J, Close J, et al. Intergenic variants of HBS1L-MYB are responsible for a major quantitative trait locus on chromosome 6q23 influencing fetal hemoglobin levels in adults. Proc Natl Acad Sci U S A. 2007;104(27):11346–51. Epub 2007/06/25. pmid:17592125; PubMed Central PMCID: PMC2040901.
  18. 18. Bernaudin F, Arnaud C, Kamdem A, Hau I, Lelong F, Epaud R, et al. Biological impact of α genes, β haplotypes, and G6PD activity in sickle cell anemia at baseline and with hydroxyurea. Blood Adv. 2018;2(6):626–37. pmid:29555644; PubMed Central PMCID: PMC5873235.
  19. 19. Al-Allawi N, Qadir SMA, Puehringer H, Chui DHK, Farrell JJ, Oberkanins C. The association of HBG2, BCL11A, and HMIP polymorphisms with fetal hemoglobin and clinical phenotype in Iraqi Kurds with sickle cell disease. Int J Lab Hematol. 2019;41(1):87–93. Epub 2018/09/14. pmid:30216683.
  20. 20. Sebastiani P, Farrell JJ, Alsultan A, Wang S, Edward HL, Shappell H, et al. BCL11A enhancer haplotypes and fetal hemoglobin in sickle cell anemia. Blood Cells Mol Dis. 2015;54(3):224–30. Epub 2015/01/30. pmid:25703683; PubMed Central PMCID: PMC4341902.
  21. 21. Akinsheye I, Solovieff N, Ngo D, Malek A, Sebastiani P, Steinberg MH, et al. Fetal hemoglobin in sickle cell anemia: molecular characterization of the unusually high fetal hemoglobin phenotype in African Americans. Am J Hematol. 2012;87(2):217–9. Epub 2011/12/03. pmid:22139998; PubMed Central PMCID: PMC3302931.
  22. 22. Pereira C, Relvas L, Bento C, Abade A, Ribeiro ML, Manco L. Polymorphic variations influencing fetal hemoglobin levels: association study in beta-thalassemia carriers and in normal individuals of Portuguese origin. Blood Cells Mol Dis. 2015;54(4):315–20. Epub 2015/02/21. pmid:25842369.
  23. 23. Sherva R, Sripichai O, Abel K, Ma Q, Whitacre J, Angkachatchai V, et al. Genetic modifiers of Hb E/beta0 thalassemia identified by a two-stage genome-wide association study. BMC Med Genet. 2010;11:51. Epub 2010/03/30. pmid:20353593; PubMed Central PMCID: PMC2853425.
  24. 24. Pakdee N, Yamsri S, Fucharoen G, Sanchaisuriya K, Pissard S, Fucharoen S. Variability of hemoglobin F expression in hemoglobin EE disease: hematological and molecular analysis. Blood Cells Mol Dis. 2014;53(1–2):11–5. Epub 2014/02/26. pmid:24581976.
  25. 25. Mejri A, Mansri M, Hadj Fredj S, Ouali F, Bibi A, Hafsia R, et al. First description of the rs45496295 polymorphism of the C/EBPE gene in β-thalassemia intermedia patients. Hemoglobin. 2016;40(6):411–6. Epub 2016/11/09. pmid:27829304.
  26. 26. Payne BA, Wilson IJ, Yu-Wai-Man P, Coxhead J, Deehan D, Horvath R, et al. Universal heteroplasmy of human mitochondrial DNA. Hum Mol Genet. 2013;22(2):384–90. Epub 2012/10/16. pmid:23077218; PubMed Central PMCID: PMC3526165.
  27. 27. Gustafsson CM, Falkenberg M, Larsson NG. Maintenance and Expression of Mammalian Mitochondrial DNA. Annu Rev Biochem. 2016;85:133–60. Epub 2016/03/24. pmid:27023847.
  28. 28. Torroni A, Schurr TG, Cabell MF, Brown MD, Neel JV, Larsen M, et al. Asian affinities and continental radiation of the four founding Native American mtDNAs. Am J Hum Genet. 1993;53(3):563–90. pmid:7688932; PubMed Central PMCID: PMC1682412.
  29. 29. Torroni A, Huoponen K, Francalacci P, Petrozzi M, Morelli L, Scozzari R, et al. Classification of European mtDNAs from an analysis of three European populations. Genetics. 1996;144(4):1835–50. pmid:8978068; PubMed Central PMCID: PMC1207732.
  30. 30. Chen YS, Torroni A, Excoffier L, Santachiara-Benerecetti AS, Wallace DC. Analysis of mtDNA variation in African populations reveals the most ancient of all human continent-specific haplogroups. Am J Hum Genet. 1995;57(1):133–49. pmid:7611282; PubMed Central PMCID: PMC1801234.
  31. 31. Watson E, Forster P, Richards M, Bandelt HJ. Mitochondrial footprints of human expansions in Africa. Am J Hum Genet. 1997;61(3):691–704. pmid:9326335; PubMed Central PMCID: PMC1715955.
  32. 32. Macaulay V, Richards M, Hickey E, Vega E, Cruciani F, Guida V, et al. The emerging tree of West Eurasian mtDNAs: a synthesis of control-region sequences and RFLPs. Am J Hum Genet. 1999;64(1):232–49. pmid:9915963; PubMed Central PMCID: PMC1377722.
  33. 33. Forster P, Torroni A, Renfrew C, Röhl A. Phylogenetic star contraction applied to Asian and Papuan mtDNA evolution. Mol Biol Evol. 2001;18(10):1864–81. pmid:11557793.
  34. 34. Brandstätter A, Peterson CT, Irwin JA, Mpoke S, Koech DK, Parson W, et al. Mitochondrial DNA control region sequences from Nairobi (Kenya): inferring phylogenetic parameters for the establishment of a forensic database. Int J Legal Med. 2004;118(5):294–306. pmid:15248073.
  35. 35. Olivieri A, Achilli A, Pala M, Battaglia V, Fornarino S, Al-Zahery N, et al. The mtDNA legacy of the Levantine early Upper Palaeolithic in Africa. Science. 2006;314(5806):1767–70. pmid:17170302.
  36. 36. Kivisild T. Maternal ancestry and population history from whole mitochondrial genomes. Investig Genet. 2015;6:3. pmid:25798216; PubMed Central PMCID: PMC4367903.
  37. 37. Larsson NG. Somatic mitochondrial DNA mutations in mammalian aging. Annu Rev Biochem. 2010;79:683–706. pmid:20350166.
  38. 38. Wallace DC. Mitochondrial DNA variation in human radiation and disease. Cell. 2015;163(1):33–8. pmid:26406369; PubMed Central PMCID: PMC4743751.
  39. 39. Jamali L, Banoei MM, Khalili E, Dadgar S, Houshmand M. Association of genetic variations in the mitochondrial D-loop with β-thalassemia. Mitochondrial DNA A DNA Mapp Seq Anal. 2016;27(3):1693–6. Epub 2014/09/18. pmid:25230702.
  40. 40. Menzel S, Thein SL. Genetic Modifiers of Fetal Haemoglobin in Sickle Cell Disease. Mol Diagn Ther. 2019;23(2):235–44. pmid:30478714.
  41. 41. Nagel RL. Beta-globin-gene haplotypes, mitochondrial DNA, the Y-chromosome: their impact on the genetic epidemiology of the major structural hemoglobinopathies. Cell Mol Biol (Noisy-le-grand). 2004;50(1):5–21. pmid:15040423.
  42. 42. van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009;30(2):E386–94. pmid:18853457.
  43. 43. Cerezo M, Achilli A, Olivieri A, Perego UA, Gómez-Carballa A, Brisighelli F, et al. Reconstructing ancient mitochondrial DNA links between Africa and Europe. Genome Res. 2012;22(5):821–6. pmid:22454235; PubMed Central PMCID: PMC3337428.
  44. 44. Derenko M, Malyarchuk B, Denisova G, Perkova M, Rogalla U, Grzybowski T, et al. Complete mitochondrial DNA analysis of eastern Eurasian haplogroups rarely found in populations of northern Asia and eastern Europe. PLoS One. 2012;7(2):e32179. Epub 2012/02/21. pmid:22363811; PubMed Central PMCID: PMC3283723.
  45. 45. Behar DM, van Oven M, Rosset S, Metspalu M, Loogväli EL, Silva NM, et al. A "Copernican" reassessment of the human mitochondrial DNA tree from its root. Am J Hum Genet. 2012;90(4):675–84. pmid:22482806; PubMed Central PMCID: PMC3322232.
  46. 46. Gorello P, Arcioni F, Palmieri A, Barbanera Y, Ceccuzzi L, Adami C, et al. The Molecular Spectrum of β- and α-Thalassemia Mutations in Non-Endemic Umbria, Central Italy. Hemoglobin. 2016;40(6):371–6. pmid:28361595.
  47. 47. Miller SA, Dykes DD, Polesky HF. A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 1988;16(3):1215. pmid:3344216; PubMed Central PMCID: PMC334765.
  48. 48. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977;74(12):5463–7. pmid:271968; PubMed Central PMCID: PMC431765.
  49. 49. Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999;23(2):147. pmid:10508508.
  50. 50. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2. Epub 2009/04/03. pmid:19346325.
  51. 51. Cardinali I, Lancioni H, Giontella A, Capodiferro MR, Capomaccio S, Buttazzoni L, et al. An Overview of Ten Italian Horse Breeds through Mitochondrial DNA. PLoS One. 2016;11(4):e0153004. Epub 2016/04/07. pmid:27054850; PubMed Central PMCID: PMC4824442.