Intrinsically disordered/unstructured proteins (IDPs) are extremely sensitive to proteolysis in vitro, but show no enhanced degradation rates in vivo. Their existence and functioning may be explained if IDPs are preferentially associated with chaperones in the cell, which may offer protection against degradation by proteases. To test this inference, we took pairwise interaction data from high-throughput interaction studies and analyzed to see if predicted disorder correlates with the tendency of chaperone binding by proteins. Our major finding is that disorder predicted by the IUPred algorithm actually shows negative correlation with chaperone binding in E. coli, S. cerevisiae, and metazoa species. Since predicted disorder positively correlates with the tendency of partner binding in the interactome, the difference between the disorder of chaperone-binding and non-binding proteins is even more pronounced if normalized to their overall tendency to be involved in pairwise protein–protein interactions. We argue that chaperone binding is primarily required for folding of globular proteins, as reflected in an increased preference for chaperones of proteins in which at least one Pfam domain exists. In terms of the functional consequences of chaperone binding of mostly disordered proteins, we suggest that its primary reason is not the assistance of folding, but promotion of assembly with partners. In support of this conclusion, we show that IDPs that bind chaperones also tend to bind other proteins.
Intrinsically disordered/unstructured proteins (IDPs) defy the classical structure–function paradigm because they exist and function without a well-defined 3-D structure. These proteins are extremely sensitive to degradation in the test tube, but show no enhanced degradation rates in the cell. To resolve this apparent contradiction, we tested whether IDPs are protected by interaction with accessory proteins, chaperones, often implicated in guarding other proteins in the cell. Our major finding is that disorder predicted by the IUPred algorithm actually shows negative correlation with chaperone binding in various species. To explain this finding, we argue that IDPs are protected in the cell from proteases by their special amino acid composition, and also by the tight regulation of intracellular proteases. Thus, the primary reason for their chaperone binding is not protection from degradation, but promotion of assembly with partners.
Citation: Hegyi H, Tompa P (2008) Intrinsically Disordered Proteins Display No Preference for Chaperone Binding In Vivo. PLoS Comput Biol 4(3): e1000017. doi:10.1371/journal.pcbi.1000017
Editor: Paul Harrison, McGill University, Canada
Received: August 16, 2007; Accepted: January 31, 2008; Published: March 7, 2008
Copyright: © 2008 Hegyi and Tompa. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by the following grants: OTKA K60694 from the Hungarian Scientific Research Fund, ETT 245/2006 from the Hungarian Ministry of Health, International Senior Research Fellowship ISRF 067595 from the Wellcome Trust to P.T., and a Marie Curie reintegration grant IRG-046572 from the European Commission to H.H.
Competing interests: The authors have declared that no competing interests exist.
Intrinsically disordered/unstructured proteins or protein domains (IDPs) are prevalent in proteomes – due to the inherent functional advantages structural disorder imparts on proteins , –. In vitro, IDPs have been noted for an increased speed of interaction, specificity without excessive binding strength, adaptability to multiple partners and ease of regulation by post-translational modification. These and other functional features explain a particularly high level of disorder in important regulatory proteins involved in signaling, and regulation of transcription, such as p53 , p27Kip1 , CREB  or BRCA1 . Whereas these features elucidate the prevalence of protein disorder in proteomes underlying the recent heightened interest in the subject, the phenomenon of structural disorder poses further serious questions. Due to their open and flexible conformational state, IDPs are exceptionally sensitive to proteolysis in vitro ,, which raises concerns in terms of their in vivo existence and functioning. The question most often asked is how IDPs function when they are supposedly rapidly degraded by proteases in vivo. That this is not the case, is shown by our recent observations that the physiological half-lives of IDPs determined in a high-throughput study  show very weak correlation with their disorder content . This suggests the involvement of additional factors and/or special mechanisms in the physiological function of IDPs, specifically addressed in this work. One particularly intriguing point is the possibility of the involvement of chaperones, which may offer direct protection by binding in the cell. Since chaperone action has already been implicated with some IDPs –, we have decided to analyze recent high-throughput interaction data to provide a systematic and coherent answer to this question.
Chaperones are energy-dependent protein machines that function to prevent their clients from misfolding and aggregation, or to assist their assembly and transport in the crowded intracellular milieu . Recently, it has been recognized that some IDPs also display chaperone activity, probably enabled by a more primitive mechanism that relies on “entropy transfer” from the chaperone to the misfolded partner . Although in the original formulations chaperone models have been described as assisting folding of misfolded globular proteins and RNA molecules, in some cases it has been described that a chaperone may also have an IDP client. For example, it has been shown that molecular chaperones α(s)- and β-casein prevent amyloid fibril formation by κ-casein . In another study, it was shown that chaperones promote the association of a microtubule-associated protein, tau, with microtubules . The suppression of α-synuclein toxicity and aggregation in a Drosophila model for Parkinson's disease may also point towards the involvement of a chaperone in the action of an IDP ,. α-synuclein aggregation is also affected by another chaperone, αB-crystallin . These examples show that some IDPs may require the involvement of chaperones for function, which could also explain the observed in vivo stability of these proteins. Whether this interdependence is general among IDPs, has been the subject of this study.
In recent high-throughput interaction studies large segments of the interactome, i.e. network of protein-protein interactions, have been described –. We have approached the above question by analyzing whether structural disorder correlates with the tendency of proteins to be binding partners of chaperones. We found that on the contrary, partners of chaperones tend to be ordered proteins, which apparently need more assistance for folding than IDPs. IDPs, on the other hand, need no help for folding, also suggested by many in vitro data on their functional efficacy, and probably use chaperone assistance for protection from aggregation and assembly into complexes.
We used the data about pairwise interactions published in the IntAct database (http://www.ebi.ac.uk/intact/site/index.jsf) . It is a collection of interactions between proteins detected with various types of methods, culled from numerous publications and also databases such as the MSD, the Macromolecular Structure Database . It contains system-wide interaction data regarding E. coli and yeast but only fragmented/partial interaction information about higher organisms. Complexes in the databases vary in size from two to more than a hundred components. Due to technical limitations, there is no information on the interaction of any two proteins within a complex of three components and above. Thus, to make sure our analysis focuses on the direct interaction of a protein with its partners, we selected complexes of exactly two components, regardless of the detection method, ensuring an actual physical interaction between the partners.
We grouped the interaction data in IntAct into three phylogenetic subgroups, handling the bacterial (mostly E. coli), unicellular eukaryotic (mostly yeast) and metazoa protein interaction data separately. The data reflects the status of IntAct as of December 6, 2006, which contained 729 bacterial, 7615 unicellular eukaryotic and 35,435 metazoa pairwise protein interactions (of the latter more than 24,000 were between D. melanogaster, 4,000 between human and more than 4,000 between C. elegans proteins).
Selecting chaperone-binding and non-chaperone-binding proteins
We identified chaperones among the interacting proteins based on their annotation in Swissprot and TrEMBL. However, we also identified “putative chaperones” by comparing all the interacting proteins with all the known chaperones in SwissProt and TrEMBL using Blastp  and designating a protein a putative chaperone if it had an at least 50% sequence identity and an almost full-length match (with the possible exception of 30 amino acids at either end) to any known chaperone. However, all the other proteins with a 50% or higher similarity (but not fulfilling the ‘almost full-length’ similarity) were excluded from both the chaperone and the non-chaperone class because of their perceived ambiguity regarding a chaperone function. To avoid false chaperone assignments among the short putative chaperones, we removed all the predicted chaperones with a length of less than 100 amino acids.
We excluded protein interactions with these ambiguous proteins. We also excluded those proteins that appear in pairwise interactions with both chaperones and non-chaperones. Although this step affected only 30 of the 175 chaperone-binding proteins in the bacteria group, for eukaryotes these numbers increased to 330 out of 574 and 505 out of 589 for metazoan proteins.
In addition, we compared the sequences of these unambiguously determined chaperone-binding and non-chaperone-binding proteins by Blastp and excluded those proteins in each group that matched a protein in the other group with at least 90% sequence identity. This step affected 0, 4, and 15 proteins in the bacteria, unicellular eukaryotic and metazoan protein group, respectively.
Determining the percentage intrinsic disorder of interacting proteins
For all the interacting proteins in the three taxonomic groups we determined the percentage intrinsic disorder by counting all the disordered amino acids as predicted by IUPred ,, dividing it with the total length of the protein and multiplying it with 100. We have selected IUPred for predictions because it has not been trained on potentially error-ridden data of disordered proteins. Rather, this algorithm estimates the total pair-wise interresidue interaction energy of sequences by applying low-resolution force-fields deduced from folded proteins. It has been observed that below a certain threshold the estimated energy is insufficient to overcome the large entropic penalty of folding, and (segment of) the protein cannot fold, but remains disordered. In this sense, IUPred score represents an assessment of the structural status of disordered proteins independent of prior rather heterogeneous data on IDPs.
Distribution of the percentage disorder in the three taxonomic groups
For all the interacting proteins in the three taxonomic groups we determined the distribution of the percentage disorder of both the chaperone-binding and non-chaperone-binding proteins, by counting the number of proteins in each disorder range, with increments of 5% disorder. We actually used the percentage disorder values, by dividing the number of proteins for each range with the total number of proteins in that group and multiplying it with 100 (so that the area under each distribution curve adds up to 100).
Identifying interacting and non-interacting proteins in E. coli and yeast
We also determined the percentage disorder distribution of proteins interacting with others and of those that do not seem to interact with any other protein in both E. coli and yeast. In this instance we did not focus on pairwise interactions but considered only SwissProt proteins (4931 E. coli and 6163 yeast proteins in SwissProt as of March, 2007) as only the latter have reliable annotations. In addition to any interaction data about a particular protein in IntAct we considered any protein in E. coli or yeast an interacting protein if it had the keywords ‘interaction(s)’ or ‘protein binding’ in its annotation.
Propensity for chaperone-binding normalized to propensity for general partner binding
We divided all the proteins in E. coli and also in yeast into equal-size groups, bins, depending on their disorder. For each bin we calculated the ratio of chaperone-binding to non-chaperone binding proteins (Figure 1A) and divided it with the ratio of binding to non-binding proteins (Figure 2A). Normalization was carried out by the formula R = (Nchapb/Nnon-chapb)/(Nbind/Nnonbind), where
- Nchapb : number of chaperone-binding proteins in a bin;
- Nnon-chapb: number of non-chaperone-binding proteins;
- Nbind: number of proteins binding at least one more protein;
- Nnonbind: number of proteins, not known to bind any other protein
- Nchapb+Nnon-chapb+Nnonbind = constant for each bin (E. coli: 446; Yeast: 572)
All the proteins detected in any kind of pairwise interactions in the IntAct database were taken into consideration. The percentage intrinsic disorder for each protein was calculated from disorder predicted by IUPred. The occurrence in each disorder range (with increments of 5% disorder) is given in % values, too, so that the area under each disorder curve amounts to 100. The median disorder values for the two sets of proteins are indicated in parentheses. (A) Bacteria (mostly E. coli). (B) Unicellular eukaryotes (mostly yeast). (C) Metazoa proteins.
The occurrence in each disorder range is given in % values, too, as in Figure 1 (but with increments of 10% disorder). (A) All binding (2907) and non-binding (2015) E. coli proteins in Swissprot; (B) All binding (3630) and non-binding (2200) yeast proteins in Swissprot.
Pfam-domain occurrence in proteins in pairwise interactions
We analyzed the interacting proteins in all the three taxonomic groups for Pfam domain occurrence . We ran Blastp  with the proteins in pairwise interactions as queries against the database of Pfam-A domain sequences . The e-value cutoff was set to 1e-5 and we took into consideration only the best match for each protein as we wanted to know only if the protein in question has a globular part or not.
Disorder of chaperone-binding and non-chaperone-binding proteins in 3 taxonomic groups
In Figure 1 the percentage distribution of the intrinsic disorder (as predicted by IUPred, ,) of chaperone-binding, and non-chaperone binding proteins is presented. Figure 1A, 1B, and 1C present data regarding bacterial, unicellular eukaryote- and metazoa proteins, respectively, with the median values of disorder for each set also indicated. In bacteria and unicellular eukaryote the distributions of the two sets of proteins are significantly different according to chi-square tests, with p-values 0.01 and 1e-05, respectively, whereas in metazoa the difference between disorder distributions is not significant, even though the median value for non-chaperone-binding proteins is almost twice that of the chaperone-binding proteins (18.26% vs. 9.91% median disorder). The lack of significance is most certainly due to the small number (72 altogether) of the chaperone-binding proteins in this category. (If we doubled the numbers in this category, which would not change the distributions in Figure 1C, we would end up with a significant difference with a p-value <0.005). The overlaps between the chaperone-binding and non-chaperone-binding proteins in the different taxonomic categories are shown in Table 1. It is also clear from the table that the ratio of shared proteins (expressed in the percentages of all chaperone-binding proteins in Table 1 increases with the increasing complexity of the studied organisms. Median values of the disorder of chaperone-binding and non-chaperone proteins also underscore that the latter has a larger disorder in all three taxonomic groups. For all three taxonomic groups the median values of non-chaperone-binding proteins are about twice as much as for chaperone-binding proteins (Figure 1).
Disorder is different for binding and non-binding proteins in E. coli and yeast
In the previous section we demonstrated that disorder shows anti-correlation with chaperone binding. We thought it is of interest to see if this reflects the general dependence of propensity for partner binding. To this end, we predicted the disorder distribution of all E. coli (Figure 2A) and yeast (Figure 2B) proteins known to be, or not to be, involved in pairwise interactions. For both organisms there is a clear-cut difference in disorder between binding and non-binding proteins most apparent at smaller values of disorder, with binding proteins being more disordered. For example, while there is practically no difference between proteins with zero disorder and with a maximum disorder of 10% for non-binding proteins in E. coli (both with about 40% relative occurrence), the values are sharply different for binding proteins: nearly 60% of all binding proteins have a disorder in the range of 0–10%, but only 25% of all binding proteins possess 0% disorder. The values are similar for yeast, with an even greater discrepancy between the two groups of proteins for this range of disorder. By a chi-square test, the two distributions of binding and non-binding proteins are significantly different, with p-values <1e-14 for both E. coli and yeast. This difference can be clearly attributed to the close link between intrinsic disorder and the involvement of proteins in physical interactions.
The propensity of chaperone-binding normalized with propensity of binding
Comparing Figures 1 and 2, one can conclude that both protein-binding and chaperone-binding vary as a function of intrinsic disorder, and next we asked how the tendency of chaperone-binding is related to the tendency of general partner-binding. Thus, in Figure 3A we proceeded in the following way: we divided all the proteins in E. coli into equal-size groups, bins, depending on their disorder. For each bin we calculated the R ratio of chaperone-binding to non-chaperone binding proteins (Figure 1A) and divided it with the ratio of binding to non-binding proteins (Figure 2A), as detailed in the Methods section.
In Figure 3B, we did the same for yeast proteins. The result in both cases is an almost monotonously decreasing function of protein disorder: i.e. normalized with binding propensity (which increases with increasing disorder) the propensity to bind a chaperone clearly decreases with increasing disorder.
The ratio of chaperone-binding and non-chaperone-binding proteins was divided by the ratio of protein-binding and non-binding proteins for each bin. Each bin contains the same number of proteins. The numbers on the X-axis indicate the upper values of the disorder range for each bin. (A) All E. coli proteins in SwissProt. (B) All yeast proteins in SwissProt.
Occurrence of Pfam domains in chaperone-binding and non-chaperone binding proteins
The results obtained thus far indicate that disordered proteins tend to avoid chaperones, whereas ordered proteins prefer chaperones as binding partners. Percentage disorder within a protein, however, does not adequately distinguish between proteins with or without globular domains, which are potential chaperone binding sites of a protein. To clarify on this point, we decided to select and observe the chaperone binding of those proteins, which have at least one globular domain. As the Pfam domain collection contains mostly globular proteins (85% of them contains 10% disorder at the maximum (unpublished results)), the presence of a Pfam domain should represent the feature decisive for the need of chaperone-binding. To confirm this, we analyzed all the proteins in pair-wise interaction by Blastp against all Pfam domains. We found that for chaperone-binding proteins in unicellular eukaryotes the ratio of Pfam-lacking (i.e. those proteins where no Pfam-domain match was found) and Pfam-containing proteins was 0.380±0.10 (65 over 176 proteins) but for non-chaperone-binding proteins this ratio was 0.504±0.05 (1174 over 2356 proteins). In metazoa proteins the ratio of Pfam-lacking and Pfam-containing proteins for chaperone-binding proteins was 0.271±0.09 (16 over 59 proteins), whereas for non-chaperone-binding the same ratio was 0.45±0.01 (4387 over 9734 proteins). (By a chi-square test to compare the different proportions for both taxonomic groups, we found that these differences in ratios did not achieve statistical significances, but suggested clear tendencies. The lack of strict statistical significance is due mostly to the large differences in the number of chaperone-binding and non- chaperone binding proteins.) Thus, these observations were in line with our expectation that chaperone-binding proteins tend to have more often globular part(s), such as a Pfam domain, which require chaperone binding to help fold. Probably due to limitations in the number of known chaperone-binding proteins, we could not observe such a favorable difference between the ratios in bacteria (chaperone-binding proteins: 0.107 [14 over 131 proteins]; non-chaperone-binding proteins: 0,063 [40 over 636 proteins]) However, the disorder for all the 14 chaperone-binding bacterial proteins without a Pfam domain is below 31% therefore they can easily contain a globular domain. This is further supported by the observation that the name of 13 of the 14 Swissprot proteins in question starts with a ‘y’, indicating a largely uncharacterized bacterial protein in SwissProt.
Ratio of exclusively chaperone-binding and non-exclusively chaperone-binding proteins
Our explanation for all previous data is that ordered proteins/domains require chaperones as binding partners to assist their folding, whereas disordered proteins/regions either do not need chaperones or they need them for some other aspect of their function. From the limited information of relevance in the literature, we expect it might be rather for helping to integrate into larger complexes. The corollary of this suggestion is that IDPs that bind a chaperone are more likely also to bind another partner, than ordered proteins. As there is considerable overlap between the chaperone-binding and non-chaperone binding proteins, we thought to address this issue by determining if there is any difference in the disorder of those proteins that bind only chaperones and those that bind both chaperones and other types of proteins. The results are shown in Figure 4. For both yeast (Figure 4A) and metazoa (Figure 4B), there is a decreasing number of exclusively chaperone-binding proteins with increasing disorder. In the case of metazoa, none of the proteins with more than 80% disorder are exclusively chaperone-binding. There is a similar tendency for yeast proteins, too, with somewhat lesser R-value.
The ratio of proteins which only bind chaperone to those which also bind non-chaperone partner(s). The numbers were counted for each disorder range (with increments of 5% disorder), then normalized to 100% for both the exclusively chaperone binding and non-exclusively chaperone binding proteins. In the last step the ratio of the two percentage numbers was calculated for each disorder range. A linear trendline is fitted to the data points—the equation of the trendline is indicated, with the resulting R2 value. (A) All yeast proteins. There are altogether 461 chaperone binding and 228 exclusively chaperone binding proteins in yeast. (B) All metazoa proteins. Altogether there are 497 chaperone binding proteins, of which 68 are exclusively chaperone binding.
The major finding of our analysis is that predicted disorder of proteins negatively correlates with binding to chaperone partners, i.e. IDPs in the cell tend to avoid being bound to chaperones. This statement applies to a prokaryote, E. coli, a unicellular eukaryote, S. cerevisiae, and also to metazoa. The effect may be correlated with the presence of ordered domains, as observed with Pfam domains, although due to scarcity of data in two systems we could not draw a general conclusion. Further, binding of chaperones to disordered proteins is frequently accompanied by binding to other proteins, which suggests that IDPs use chaperones not for folding, but for assistance with association with other proteins. Even in cases where statistical significance is low, our data strongly discredit the original hypothesis that IDPs would be preferentially bound and protected by chaperones. These observations have numerous ramifications, as discussed next.
The first implication is that the very week correlation of protein disorder with intracellular degradation rate , is apparently not a general consequence of protection of IDPs by chaperones. Because IDPs in vitro are orders of magnitude more sensitive to proteolysis than globular proteins, this observation demands some other, general explanation, such as protection by protein-protein interaction or tight control of proteolytic systems. In fact, many functions of IDPs directly invoke their involvement in protein-protein interactions ,,, and hub proteins with multiple interacting partners have an elevated level of disorder –. As a matter of fact, this may suggest that many interacting partners of proteins may also act in a compensatory or assisting fashion, given their potentially very high intracellular concentrations. Although this is not in the focus of the current work, our results might promote the idea of the extension and generalization of the chaperone concept. An additional point is that many intracellular proteases are known to be regulated and thus not to discriminately degrade their substrates. The mechanisms involve pro-enzyme activation (e.g. caspases), intracellular localization (e.g. lysosomal proteases) or ubiquitination (e.g. proteasome), among others. This might actually relieve chaperones from the duty of guarding IDPs, which might have been a key factor in the spread and functional success of IDPs.
Another pertinent issue is the structural ramifications of the noted preference of ordered proteins for chaperone partners. It is a commonplace that the 3D structure of a protein is determined by its amino acid sequence, but folding, in particular in the crowded intracellular environment of the cell, occasionally requires guidance by chaperones . This, however, should be reflected in their need of chaperones during folding, not in the fully folded state, studied in the high-throughput interaction studies referred to. Thus, their preference for chaperones must reflect their tendency to transiently unfold and recruit a chaperone to assist refolding. In the case of IDPs, current in vitro observations suggest that they need no assistance for folding, i.e. they can reach the native-state ensemble from a highly denatured state spontaneously. The observation that IDPs are often heat resistant, and remain fully functional after treatment by boiling temperatures, bear witness to this point ,,,. This holds true also for IDPs that are not fully disordered, but have short-range , and/or long-range – organization. As a first approximation, we may take this as an indication that a similar situation applies in vivo, i.e. IDPs spontaneously acquire their native ensemble of structures after synthesis. A key point here, however, is that chaperones might not only be needed for assisting proper folding, but also for preventing aggregation from a partially folded/misfolded state. It is thought the open and exposed character of IDPs makes them particularly vulnerable to aggregation, but their special amino acid composition itself counters the threat. In fact, IDPs are usually highly charged, they contain a high percentage of the structure-breaking Pro residue, and are low in hydrophobic residues, which all act against aggregation and subsequent amyloid formation . Further, they have special sequence features built in to prevent aggregation, as noted in the case of the polyGln region of huntingtin . Since IDPs do show some tendency to interact with chaperones, it seems appropriate to suggest that one prime reason for these interactions is to prevent amyloid formation. This has been explicitly stated in the case of the yeast prion Ure2 interacting with Hsp40 , α-synuclein interacting with Hsp70 , and expanded polyQ regions interacting with both Hsp40 and Hsp70 . Interestingly, in one case it has been suggested that the chaperone in fact does not interact with the IDP, but rather a prefibrillar intermediate, which may be a general phenomenon among other IDPs as well .
The final point that deserves closer inspection is the possible functional implications of chaperone binding of IDPs, given their lack of need of assistance for folding to a functional state. Two conceivable requirements are transport through physiological membranes and assistance for partner binding, i.e. assembly of complexes. In the case of transport through membranes, globular proteins partially unfold to a molten-globule state competent with translocation through the membrane and refold at the other side by the help of other chaperones. IDPs in principle do not need such help as they are already in a translocation-competent structural state . As to their binding to other partners, and the subsequent assembly of complexes, IDPs in fact often carry out their functions by protein-protein interactions ,, also shown by that the average disorder increases with increasing size of complexes . However, IDPs have been observed in vitro to be very effective in binding, primarily manifested in binding to the partner at an increased speed ,. Their avoidance of chaperones, in general, may be related to this. When they do bind chaperones, however, the reason might be that in vivo assembly of large complexes may be slowed by non-specific interactions, in the case of which chaperone assistance may be of help.
In conclusion, we report here that IDPs in general require less assistance from chaperones than ordered, globular proteins. The explanation of this negative preference probably stems from the fact that IDPs are rather autonomous in folding, and require little assistance in function. Their liability for amyloid-type aggregation, and involvement in the assembly of large complexes, however, do explain their occasional binding to chaperones. Further studies may address at the level of individual proteins if this is in fact the case.
Conceived and designed the experiments: PT. Performed the experiments: HH. Analyzed the data: HH. Wrote the paper: PT HH. Wrote Perl scripts to analyze the data: HH.
- 1. Dunker AK, Obradovic Z, Romero P, Garner EC, Brown CJ (2000) Intrinsic protein disorder in complete genomes. Genome Inform Ser Workshop Genome Inform 11: 161–171.
- 2. Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT (2004) Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J Mol Biol 337: 635–645.
- 3. Tompa P, Dosztanyi Z, Simon I (2006) Prevalent structural disorder in E. coli and S. cerevisiae proteomes. J Proteome Res 5: 1996–2000.
- 4. Tompa P (2002) Intrinsically unstructured proteins. Trends Biochem Sci 27: 527–533.
- 5. Tompa P (2005) The interplay between structure and function in intrinsically unstructured proteins. FEBS Lett 579: 3346–3354.
- 6. Dyson HJ, Wright PE (2005) Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol 6: 197–208.
- 7. Bell S, Klein C, Muller L, Hansen S, Buchner J (2002) p53 contains large unstructured regions in its native state. J Mol Biol 322: 917–927.
- 8. Sivakolundu SG, Bashford D, Kriwacki RW (2005) Disordered p27Kip1 exhibits intrinsic structure resembling the Cdk2/cyclin A-bound conformation. J Mol Biol 353: 1118–1128.
- 9. Radhakrishnan I, Perez-Alvarado GC, Dyson HJ, Wright PE (1998) formational preferences in the Ser133-phosphorylated and non-phosphorylated forms of the kinase inducible transactivation domain of CREB. FEBS Lett 430: 317–322.
- 10. Mark WY, Liao JC, Lu Y, Ayed A, Laister R, et al. (2005) Characterization of segments from the central region of BRCA1: An intrinsically disordered scaffold for multiple protein-protein and protein-DNA interactions? J Mol Biol 345: 275–287.
- 11. Galea CA, Pagala VR, Obenauer JC, Park CG, Slaughter CA, et al. (2006) Proteomic studies of the intrinsically unstructured mammalian proteome. J Proteome Res 5: 2839–2848.
- 12. Belle A, Tanay A, Bitincka L, Shamir R, O'Shea EK (2006) Quantification of protein half-lives in the budding yeast proteome. Proc Natl Acad Sci U S A 103: 13004–13009.
- 13. Tompa P, Prilusky J, Silman I, Sussman JL (2007) Structural disorder serves as a weak signal for intracellular protein degradation. Proteins. In press.
- 14. Dou F, Netzer WJ, Tanemura K, Li F, Hartl FU, et al. (2003) Chaperones increase association of tau protein with microtubules. Proc Natl Acad Sci U S A 100: 721–726.
- 15. Thorn DC, Meehan S, Sunde M, Rekas A, Gras SL, et al. (2005) Amyloid fibril formation by bovine milk kappa-casein and its inhibition by the molecular chaperones alpha(S)- and beta-casein. Biochemistry 44: 17027–17036.
- 16. Rekas A, Adda CG, Andrew Aquilina J, Barnham KJ, Sunde M, et al. (2004) Interaction of the molecular chaperone alphaB-crystallin with alpha-synuclein: Effects on amyloid fibril formation and chaperone activity. J Mol Biol 340: 1167–1183.
- 17. Korcsmaros T, Kovacs IA, Szalay MS, Csermely P (2007) Molecular chaperones: The modular evolution of cellular networks. J Biosci 32: 441–446.
- 18. Tompa P, Csermely P (2004) The role of structural disorder in the function of RNA and protein chaperones. FASEB J 18: 1169–1175.
- 19. Auluck PK, Chan HY, Trojanowski JQ, Lee VM, Bonini NM (2002) Chaperone suppression of alpha-synuclein toxicity in a Drosophila model for Parkinson's disease. Science 295: 865–868.
- 20. Klucken J, Shin Y, Masliah E, Hyman BT, McLean PJ (2004) Hsp70 reduces alpha-synuclein aggregation and toxicity. J Biol Chem 279: 25497–25502.
- 21. Gavin AC, Bosche M, Krause R, Grandi P, Marzioch M, et al. (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415: 141–147.
- 22. Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, et al. (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 440: 631–636.
- 23. Arifuzzaman M, Maeda M, Itoh A, Nishikata K, Takita C, et al. (2006) Large-scale identification of protein-protein interaction of Escherichia coli K-12. Genome Res 16: 686–691.
- 24. Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, et al. (2007) IntAct—open source resource for molecular interaction data. Nucleic Acids Res 35: D561–D565.
- 25. Tagari M, Tate J, Swaminathan GJ, Newman R, Naim A, et al. (2006) E-MSD: Improving data deposition and structure quality. Nucleic Acids Res 34: D287–D290.
- 26. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
- 27. Dosztanyi Z, Csizmok V, Tompa P, Simon I (2005) IUPred: Web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics 21: 3433–3434.
- 28. Dosztányi Z, Csizmok V, Tompa P, Simon I (2005) The pairwise energy content estimated from amino acid composition discriminates between folded and instrinsically unstructured proteins. J Mol Biol 347: 827–839.
- 29. Bateman A, Coin L, Durbin R, Finn RD, Hollich V, et al. (2004) The Pfam protein families database. Nucleic Acids Res 32: D138–D141.
- 30. Iakoucheva L, Brown C, Lawson J, Obradovic Z, Dunker A (2002) Intrinsic disorder in cell-signaling and cancer-associated proteins. J Mol Biol 323: 573–584.
- 31. Ekman D, Light S, Bjorklund AK, Elofsson A (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol 7: R45.
- 32. Haynes C, Oldfield CJ, Ji F, Klitgord N, Cusick ME, et al. (2006) Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput Biol 2: e100.
- 33. Dosztanyi Z, Chen J, Dunker AK, Simon I, Tompa P (2006) Disorder and sequence repeats in hub proteins and their implications for network evolution. J Proteome Res 5: 2985–2995.
- 34. Kim TD, Ryu HJ, Cho HI, Yang CH, Kim J (2000) Thermal behavior of proteins: Heat-resistant proteins and their heat-induced secondary structural changes. Biochemistry 39: 14839–14846.
- 35. Csizmok V, Szollosi E, Friedrich P, Tompa P (2006) A novel two-dimensional electrophoresis technique for the identification of intrinsically unstructured proteins. Mol Cell Proteomics 5: 265–273.
- 36. Fuxreiter M, Simon I, Friedrich P, Tompa P (2004) Preformed structural elements feature in partner recognition by intrinsically unstructured proteins. J Mol Biol 338: 1015–1026.
- 37. Csizmok V, Bokor M, Banki P, Klement É, Medzihradszky KF, et al. (2005) Primary contact sites in intrinsically unstructured proteins: the case of calpastatin and microtubule-associated protein 2. Biochemistry 44: 3955–3964.
- 38. Dedmon MM, Lindorff-Larsen K, Christodoulou J, Vendruscolo M, Dobson CM (2005) Mapping long-range interactions in alpha-synuclein using spin-label NMR and ensemble molecular dynamics simulations. J Am Chem Soc 127: 476–477.
- 39. Vise P, Baral B, Stancik A, Lowry DF, Daughdrill GW (2007) Identifying long-range structure in the intrinsically unstructured transactivation domain of p53. Proteins 67: 526–530.
- 40. Bhattacharyya A, Thakur AK, Chellgren VM, Thiagarajan G, Williams AD, et al. (2006) Oligoproline effects on polyglutamine conformation and aggregation. J Mol Biol 355: 524–535.
- 41. Lian HY, Zhang H, Zhang ZR, Loovers HM, Jones GW, et al. (2007) Hsp40 interacts directly with the native state of the yeast prion protein Ure2 and inhibits formation of amyloid-like fibrils. J Biol Chem 282: 11931–11940.
- 42. Dedmon MM, Christodoulou J, Wilson MR, Dobson CM (2005) Heat shock protein 70 inhibits alpha-synuclein fibril formation via preferential binding to prefibrillar species. J Biol Chem 280: 14733–14740.
- 43. Wacker JL, Zareie MH, Fong H, Sarikaya M, Muchowski PJ (2004) Hsp70 and Hsp40 attenuate formation of spherical and annular polyglutamine oligomers by partitioning monomer. Nat Struct Mol Biol 11: 1215–1222.
- 44. Dunker AK, Lawson JD, Brown CJ, Romero P, Oh JS, et al. (2001) Intrinsically disordered protein. J Mol Graphics Modelling 19: 26–59.
- 45. Hegyi H, Schad E, Tompa P (2007) Structural disorder promotes assembly of protein complexes. BMC Struct Biol 7: 65.
- 46. Dafforn TR, Smith CJ (2004) Natively unfolded domains in endocytosis: Hooks, lines and linkers. EMBO Rep 5: 1046–1052.