Table 1.
Assessment of the Identification and Classification Procedures
Figure 1.
Schematic Representation of the Procedure Employed to Classify the Three Major Types of Genes and Derived Sequences Identified in Human and Mouse According to Their Origin between and within Each Species
Dashed boxes denote key action steps in the procedure. See text for details.
Figure 2.
Analysis of Gene Coverage between Mouse and Human Paralogs
(A) Identification of orthologous duplicated pairs. Genes are labeled with letters; the same letter in human and mouse denotes best reciprocal orthologs (e.g., genes “a,” “c,” and “d”). Circled numbers at tree nodes represent gene duplication events. Dashed lines indicate orthology between human and mouse duplication nodes, which is inferred from the orthologous relationships between the products of that duplication in each organism.
(B) Distribution of orthologous duplication nodes in human according to the coverage of the shortest coding region relative to the longest one. The line corresponds to the exponential curve fitted to the observed data (see Materials and Methods).
(C) Distribution of the coverage of all duplicates found in human (columns), and probability of being functional as a function of coverage (Pf, dashed line).
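As an illustration of the quantity plotted in panels (B) and (C), the following minimal sketch bins duplicate pairs by coverage. It assumes, for illustration only, that coverage can be approximated as the length of the shorter coding region divided by the length of the longer one; the exact definition and the exponential fit are given in Materials and Methods and are not reproduced here. The function names `coverage` and `coverage_histogram` and the example lengths are hypothetical.

```python
from collections import Counter


def coverage(len_a: int, len_b: int) -> float:
    """Shorter coding-region length as a fraction of the longer one.

    Assumption: a plain length ratio stands in for the coverage
    measure used in the figure.
    """
    if len_a <= 0 or len_b <= 0:
        raise ValueError("coding-region lengths must be positive")
    return min(len_a, len_b) / max(len_a, len_b)


def coverage_histogram(pairs, n_bins=10):
    """Count duplicate pairs per equal-width coverage bin.

    Bin i holds coverages in [i/n_bins, (i+1)/n_bins); a coverage of
    exactly 1.0 is placed in the last bin.
    """
    counts = Counter()
    for len_a, len_b in pairs:
        c = coverage(len_a, len_b)
        idx = min(int(c * n_bins), n_bins - 1)
        counts[idx] += 1
    return [counts[i] for i in range(n_bins)]


# Hypothetical coding-region lengths (in nucleotides) for four duplicate pairs.
example_pairs = [(900, 1000), (500, 1000), (1000, 1000), (120, 1000)]
histogram = coverage_histogram(example_pairs)
```

With the example pairs above, two pairs fall in the highest coverage bin and one each in the second and sixth bins, which is the kind of column distribution shown in panel (C).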