Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life
Fig 5
Functions consistently enriched for proteins containing H-rich LCDs in eukaryotes.
(A) For all HX and XH LCD classes (where X represents any amino acid except histidine), the percentage of eukaryotic organisms with significant enrichment for each LCD class/GO term pair was calculated. The bar plot indicates the top 50 LCD class/GO term pairs with the highest percentage of eukaryotes exhibiting significant enrichment among the HX or XH LCD classes. Bars are colored according to LCD class (with the reciprocal classes QH and HQ assigned the same color), whereas GO terms are colored according to the GO-term category with Biological Process (BP) in red, Cellular Component (CC) in green, and Molecular Function (MF) in blue. Only GO terms that were significantly enriched (Šidák-corrected p < 0.05) and had a minimum depth of 4 in the gene ontology are shown. (B) Frequency of significant enrichment across organisms for each GO term associated with proteins containing HQ LCDs. GO-term analyses were also performed on the same set of proteins but with those that also contained a spatially distinct H-rich LCD (primary class), Q-rich LCD (primary class), or QX, XQ, HX, and XH LCD (where X is any residue other than Q or H) removed prior to analysis. For simplicity, only GO terms that were significantly enriched for ≥30 organisms and had a minimum depth of 4 in the gene ontology are shown.