Genome-Wide Analysis of Protein Disorder in Arabidopsis thaliana: Implications for Plant Environmental Adaptation
For each organism (Arabidopsis (green) and human (blue)) protein sequences and their corresponding Gene Ontology annotations are retrieved from Uniprot. For each protein, disordered regions (pink) are calculated using 3 different methods (Iupred, VSL2 and Disopred), and disordered-binding regions (DBRs) are predicted using ANCHOR. Proteins are assigned to GO:BP functional classes. For each GO functional class, a comparative analysis of the disorder levels of the proteins of each organism is performed, using different criteria for quantifying disorder in that given GO class. For those disorder criteria that assign a “yes/no” label to a given protein, contingency tables are constructed with the counts of disordered and not-disordered proteins in both organisms and a Chi-squared test is applied to them. For those criteria that quantify the disorder of a given protein, the tables contain the average values of that figure for both organisms, and a Wilcoxon rank sum test is applied.