Figure 1.
Overview of the current model of eukaryote evolution.
The six “supergroups”—Opisthokonta, Amoebozoa, Archaeplastida, Chromalveolata, Rhizaria, and Excavata—are shown (the placement of Excavata is under debate) [21]–[23], [25], [29], [36], [37], [53], [54].
Table 1.
Protein domains in 172 eukaryotic genomes.
Figure 2.
Numbers of domains and domain combinations in select species.
The colors used correspond to the colors in Figure 1 (orange for Opisthokonta, red for Amoebozoa, green for Archaeplastida, blue for Chromalveolata, and purple for Excavata).
Figure 3.
Average ratios between the numbers of domain combinations and (number of domains)² for select groups of organisms.
Standard deviations are shown as error bars. The asterisk indicates the results for Deuterostomia when the amphioxus Branchiostoma floridae genome is excluded. The colors used correspond to the colors in Figure 1.
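The ratio plotted in Figure 3 can be illustrated with a minimal sketch, assuming each genome is summarized by two counts (its number of domain combinations and its number of domains). The function names and the example counts are hypothetical and are not taken from the study. Using the sample standard deviation is also an assumption, since the caption does not say which form was used.

```python
from statistics import mean, stdev


def combination_ratio(n_combinations, n_domains):
    """Ratio of domain combinations to the squared number of domains."""
    if n_domains <= 0:
        raise ValueError("n_domains must be positive")
    return n_combinations / n_domains ** 2


def group_summary(genomes):
    """Return (mean, sample standard deviation) of the ratio for a group.

    genomes: list of (n_combinations, n_domains) pairs, one per genome.
    """
    ratios = [combination_ratio(c, d) for c, d in genomes]
    # A single genome has no spread; report 0.0 rather than raising.
    spread = stdev(ratios) if len(ratios) > 1 else 0.0
    return mean(ratios), spread


# Hypothetical counts for illustration only (not data from the study).
example_group = [(200, 100), (400, 100)]
avg, sd = group_summary(example_group)
```

Dividing by the squared number of domains scales the observed combinations by the number of ordered domain pairs that could in principle occur, which makes genomes with very different domain repertoires comparable.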
Figure 4.
Clade-specific domains and domain combinations.
This figure shows the numbers of clade-specific domain combinations (black numbers after the slash) and core domain combinations (black numbers before the slash) for select clades. Below these are the numbers of clade-specific domains (gray numbers after the slash) and core domains (gray numbers before the slash). Numbers in brackets refer to domain combination counts when the amphioxus Branchiostoma floridae genome is excluded. The numbers of analyzed genomes are shown in parentheses below the clade names. For example, the 19 analyzed vertebrate genomes contain 1,416 clade-specific domain combinations, 102 of which are found in each of the 19 analyzed genomes. These 19 genomes also contain 380 clade-specific domains, 67 of which are present in each vertebrate genome. Ambulacraria is a clade of deuterostomes that includes echinoderms and hemichordates. To facilitate comparison of different taxonomic levels, established phyla are shown with a light-blue background, whereas super-phyla have a light-purple background. This figure was made using the “gathering” cutoffs provided by Pfam. For a detailed description of parameters, see Materials and Methods. Complete counts are shown in Table S6.
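The distinction between clade-specific and core counts in Figure 4 can be sketched with set operations. This is a minimal illustration under the simplest reading of the caption: an item is clade-specific if it occurs in at least one genome of the clade and in no genome outside it, and core if it is clade-specific and occurs in every genome of the clade. The function name, the genome names, and the `A~B` labels are hypothetical. The actual criteria are those given in Materials and Methods.

```python
def clade_specific_and_core(genomes, clade):
    """Find clade-specific and core items for one clade.

    genomes: dict mapping genome name -> set of items (domains or
             domain combinations) found in that genome.
    clade:   set of genome names belonging to the clade.
    Returns (clade_specific, core) as two sets.
    """
    inside = [items for name, items in genomes.items() if name in clade]
    outside = [items for name, items in genomes.items() if name not in clade]
    if not inside:
        raise ValueError("clade contains no analyzed genomes")

    seen_outside = set().union(*outside)
    # Clade-specific: present in at least one clade genome, absent elsewhere.
    specific = set().union(*inside) - seen_outside
    # Core: clade-specific items present in every genome of the clade.
    core = specific & set.intersection(*inside)
    return specific, core


# Hypothetical toy data for illustration only (not data from the study).
genomes = {
    "vertebrate_1": {"A~B", "C~D", "E~F"},
    "vertebrate_2": {"A~B", "C~D"},
    "fungus_1": {"C~D", "G~H"},
}
specific, core = clade_specific_and_core(
    genomes, {"vertebrate_1", "vertebrate_2"}
)
```

In this toy input, `C~D` is excluded from both sets because it also occurs outside the clade, and `E~F` is clade-specific but not core because one clade genome lacks it.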
Figure 5.
Taxonomic distribution of domain combinations.
A shows the distribution of the 34,778 distinct domain combinations encountered in this work over the five eukaryotic “supergroups” analyzed, plus Thecamonas trahens (see Figure 1). B shows the distribution of the 14,704 Holozoa-specific domain combinations over various groups of Holozoa. See Table 2 and Table S6 for detailed numbers.
Table 2.
Metazoan core domain combinations.
Figure 6.
Parallel evolution of the K Homology (KH)∼DEAD/DEAH box helicase combination between Bilateria and Micromonas (a group of green algae).
The complete diagram on which this simplified version is based is available in the supplementary materials.
Figure 7.
Independent domain combination evolution under an unweighted parsimony model.
The histogram in A shows the sum for reappearing domains versus the number of reappearances. B is a comparison between the sum of domains that appear only once versus the sum of domains that appear more than once.
Table 3.
The most frequently reemerging domain combinations.
Figure 8.
Normalized rates of independent domain combination evolution.
Sums of independently evolved domain combinations across major splits on the eukaryotic tree of life are shown, normalized by the number of genomes. “Opistho” stands for Opisthokonta and “Choano” stands for Choanoflagellata. Ambulacraria is a clade that includes echinoderms and hemichordates.
Figure 9.
Parallel evolution of the NACHT∼Ankyrin combination between Neoptera (winged insects) and fungi.
The complete diagram on which this simplified version is based is available in the supplementary materials; it shows that both major groups of fungi, Basidiomycota and Ascomycota, have one independent domain fusion event each.
Figure 10.
Parallel evolution of the Amidohydrolase∼Aspartate/ornithine carbamoyltransferase combination between Metazoa and Dictyostelium.
The complete diagram on which this simplified version is based is available in the supplementary materials.