Figure 1.
Analysis of IBS carried out on Romani and non-Romani individuals based on genome-wide SNP data.
(A) IBS values between Romani and non-Romani individuals. The non-Romani population sample was taken from the same Spanish region where the Romani samples were collected for the present study. The pink dots correspond to the two pairs of Romani individuals showing much higher IBS values than those observed between other Romani or non-Romani individuals. (B) All pairs of IBS values between Romani and non-Romani individuals are sorted from the lowest to the highest; the two highest values on the right of the figure correspond to the two pairs of Romani individuals in Figure 1A.
Figure 2.
Maximum parsimony tree of haplogroup M5 Romani mitogenomes.
The inset map shows the geographic location and sample size of all the M5 genomes observed in India subcontinent. The position of the revised Cambridge reference sequence (rCRS) is indicated for reading sequence motifs [52]. Mitochondrial DNA variants are indicated along the branches of the phylogenetic tree. An asterisk (*) as prefix indicates a position located in an overlapping region shared by two mtDNA genes. Mutations are transitions unless a suffix A, C, G, or T indicates a transversion. Other possible suffixes indicate insertions (+), synonymous substitution (s), mutational changes in tRNA (-t), mutational change in rRNA (-r), non-coding variant located in the mtDNA coding region (-nc) and an amino acid replacement (indicated in round brackets). Variants underlined represent recurrent mutations in this tree while a prefix ‘@’ indicates a back mutation. Mutational hotspot variants at positions 16182, 16183, and 16519, as well as variation around position 310 and length or point heteroplasmies were not considered for the phylogenetic reconstruction. The numbers in small squares attached to the haplogroup labels indicate the number of occurrences (mitogenomes) of the corresponding haplogroups found in public databases; the color of the squares indicates their geographic origin according to the legend inset. Spanish Romani complete genomes obtained in this study are indicated with yellow circles. More details on the geographic or ethnic origin of all the mitogenomes used in this network are provided in Table S1. The Indian M5a1b1a genome (FJ383591) seems to belong to M5a1b1a, but note that it lacks four diagnostic sites, most likely due to sequencing or documentation errors [53]–[55].
Figure 3.
Maximum parsimony tree of M5a1b1a1 HVS-I sequences.
Population codes are as follows: ALB = Albania; BOS = Bosnia; BRA = Brazil; BUL = Bulgaria; CRO = Croatia; CRZ = Czech Republic; CUB = Cuba; FIN = Finland; FRA = France; GER = Germany; GRE = Greece; HUN = Hungary; IND = India; ITA = Italy; LIT = Lithuania; PAK = Pakistan; POL = Poland; POR = Portugal; RUS = Russia; SLO = Slovakia; SPA = Spain; USA = United States of America. See Table S3 for detailed geographic information on these haplotypes. See caption to Figure 2 for more information on the features of the tree.
Figure 4.
Map showing the frequency of haplogroup M5a1b1a1 control region sequences (pie charts) in different European Romani groups.
The inset map represents this clade as ultimately originated in India; the numbers in the green circles represent the occurrences of M5a1b1a in non-Romani individuals in Eurasia (see Table S3 for references): 24 incidences in Europe and 9 incidences in India. References for the European Romani groups (red squares in the map) are as follows: 1 = Bulgaria [8], [10]; 2 = Croatia [17]; 3 = Hungary [13]; 4 = Lithuania [8]; 5 = Poland [12]; 6 = Slovakia [16]; 7 = Portugal [22]; 8 = Málaga (Southern Spain) [21]; 9 = Madrid (Central Spain) [8]; 10 = Barcelona (Northeastern Spain) [22].
Figure 5.
Maximum parsimony tree of haplogroup M35 mitogenomes.
The inset map shows the geographic location and sample size of all the M5 genomes observed in the Indian subcontinent. See caption to Figure 2 for more information on the features of the tree.
Figure 6.
Maximum parsimony tree of the Spanish Romani mitogenomes analyzed in the present study excluding those belonging to haplogroup M5 (Figure 2), and U3 (Figure 7).
See the caption to Figure 2 for more information on the features of the tree.
Figure 7.
Maximum parsimony tree of haplogroup U3 mitogenomes.
See caption to Figure 2 for more information on the features of the tree.
Figure 8.
Mitochondrial DNA haplogroup frequencies.
(A) European Romani populations; (B) Iberian Romani; (C) European Romani excluding those from Iberia. Note that HV(×H) represents all haplogroups within HV excluding the H branch; L represents all mtDNA clades excluding macro-haplogroups M and N; and the category ‘other’ represents a paragroup that includes all of the haplotypes that could not be unambiguously assigned to any of the other categories considered in the figure.