Figure 1.
What do we mean by the age of a superfamily?
Ages are generated using a phylogenetic species tree and an occurrence profile of a superfamily across the genomes of these species. Parsimony algorithms predict the simplest scenario of loss and gain events on internal nodes of the tree which explain the occurrence profile at its leaves. Ages are normalised between 0, at the leaves of the tree, and 1, at its root. Ancient superfamilies are predicted an age of 1 and new-born superfamilies are estimated to have an evolutionary age .
Figure 2.
The relationships between superfamily ages, secondary structure and length.
Figure A gives a percentile plot of the age distributions of 5 SCOP classes. For ease of interpretation, plots of multi-domain and membrane proteins have been omitted. Each line represents the distribution of ages generated using a different phylogenetic tree. Noticeably, superfamilies' age distributions rise quicker than those of the other classes. Moreover, superfamilies classified as small under SCOP are significantly younger than the other classes. Figure B gives a boxplot of the length distributions for these SCOP classes. Roughly speaking, the ordering of the classes by length corresponds to their ordering by age.
superfamilies are longer and small proteins are shorter than the other classes. Figure C gives a percentile plot of the age distributions of superfamilies with different average domain lengths. Multi-domain superfamilies were omitted from this analysis. Ancient superfamilies are significantly longer than their new-born counterparts. Figure D gives a percentile plot of the age distributions of two populations of superfamilies: those containing a majority parallel strand direction and those with more antiparallel strands. The parallel population is significantly older than the antiparallel superfamilies.
Table 1.
Preferences of different amino acids for new-born or ancient superfamilies.
Figure 3.
Superfamily ages of greek key and jelly roll motifs.
Percentile plots for the age distributions of superfamilies containing a greek key or a jelly roll motif within their beta-sheet topologies. Domains annotated as containing at least one greek key motif are significantly older than those containing the jelly roll motif.