Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life
Fig 7
Statistical LCD enrichment by LCD class in the malarial and human proteomes.
(A) Heatmap depicting the degrees of LCD enrichment (expressed as the lnOR) for each LCD class among the P. falciparum proteome (UniProt ID: UP000001450_36329). For LCD classes in which the number of LCDs in either the original or scrambled proteomes were 0, a value of 1 was added to all cells in the contingency table to calculate a biased lnOR (see Methods). (B) Binary classification for LCD categories for which enrichment was statistically significant (red squares) or statistically non-significant (black squares) after multiple-test correction. Grey squares indicate LCD categories that were excluded from statistical analysis since no LCDs were found in both the original and scrambled proteome. (C) Degrees of LCD enrichment in the human proteome (UniProt ID: UP000005640_9606). (D) Statistical significance for LCD enrichment in the human proteome. For all panels, the diagonals represent corresponding values for each primary LCD class. The data underlying these heatmaps can be found in the supplementary data available at [33].