Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Engineering Proteins for Thermostability with iRDP Web Server

Abstract

Engineering protein molecules with desired structure and biological functions has been an elusive goal. Development of industrially viable proteins with improved properties such as stability, catalytic activity and altered specificity by modifying the structure of an existing protein has widely been targeted through rational protein engineering. Although a range of factors contributing to thermal stability have been identified and widely researched, the in silico implementation of these as strategies directed towards enhancement of protein stability has not yet been explored extensively. A wide range of structural analysis tools is currently available for in silico protein engineering. However these tools concentrate on only a limited number of factors or individual protein structures, resulting in cumbersome and time-consuming analysis. The iRDP web server presented here provides a unified platform comprising of iCAPS, iStability and iMutants modules. Each module addresses different facets of effective rational engineering of proteins aiming towards enhanced stability. While iCAPS aids in selection of target protein based on factors contributing to structural stability, iStability uniquely offers in silico implementation of known thermostabilization strategies in proteins for identification and stability prediction of potential stabilizing mutation sites. iMutants aims to assess mutants based on changes in local interaction network and degree of residue conservation at the mutation sites. Each module was validated using an extensively diverse dataset. The server is freely accessible at http://irdp.ncl.res.in and has no login requirements.

Introduction

Thermophiles and hyperthermophiles are organisms that grow at extreme temperatures. Enzymes from these organisms are inherently stable and active at high temperatures, offering a major industrial advantage over their mesophilic homologues with respect to their storage, resistance against chemical denaturants and risk of microbial contaminations. Thermal stability is an important parameter that determines economic feasibility of applying an enzyme in any industrial process. Understanding the molecular determinants of thermostability can not only provide useful insights into evolution of such enzymes but the application of these through rational protein engineering to existing mesophilic proteins can also lead to development of more efficient and thermally stable biocatalysts for various industrial applications [1].

Molecular determinants of protein thermostability

Studies have revealed several trends of residue preference towards thermostabilization of thermophilic proteins, such as lower content of uncharged polar residues, preference of arginine over lysine residues and higher charged residue contents [2]. Shortening of loop regions is a known mechanism of protein thermostabilization in hyperthermophilic proteins [35]. Various non-covalent interactions are known to play a vital role in thermostabilization of proteins [68]. Disulfide bridges are covalent interactions, which are known to provide stability to protein by entropic effect [911]. The presence of thermolabile residues and bonds involving asparagine and glutamine are known to introduce instability to the protein backbone by undergoing deamidation at elevated temperatures [12]. Residues in left-handed helical conformation upon mutation to glycine are known to contribute favorably to protein thermal stability [13, 14]. Marshall et al., 2002 have studied the interactions of α-helix dipole with side chains of sequentially charged residues and found it to contribute favorably to stability [15, 16]. Proline residues being conformationally most rigid are thought to provide stability to proteins by entropic effects [17, 18]. The hydrophobic effect is understood to be one of the primary driving forces of protein folding [19]. Decrease in hydrophobic surface area, as a stabilization mechanism has been studied in superoxide dismutase from S. acidocaldarius [20]. Bound metal is also vital for stability and functioning of many proteins [2123].

Target identification through comparative structural analysis

The rapid addition of protein structures to the PDB [24], has made manual analysis of combination of such a large number of factors extremely time-consuming and sometimes error-prone. Although a variety of computational tools such as WHAT IF [25], PIC [26] and Capture [27] are available for structural analysis, most of these are limited by their ability to analyse only a single structure at a time (S1 Table). These tools primarily focus on analysis of non-covalent interactions as stabilizing mechanisms ignoring most molecular determinants listed above. However, most protein engineering studies necessitate simultaneous analysis of several structural mechanisms amongst a vast set of protein structures for improved selection of target protein and potential mutation sites (Fig 1). In view of this, the iCAPS (in silico Comparative Analysis of Protein Structures) module was developed to simplify the comparison process for a large number of protein structures in terms of the features listed above known to affect protein stability (Fig 2). iCAPS aims to help the user compare a series of proteins in order to select a target protein most suitable for initiation of protein engineering studies.

thumbnail
Fig 1. The general workflow of most rational protein engineering problems.

https://doi.org/10.1371/journal.pone.0139486.g001

thumbnail
Fig 2. The three working modules of the iRDP web server implemented for rational engineering of proteins.

https://doi.org/10.1371/journal.pone.0139486.g002

Identification of potential sites for structural stabilization

Once a target protein is selected for engineering, the next task is to identify potential target mutation sites. The Suzuki group has conclusively proved entropic stabilization by proline insertion as a protein thermostabilization mechanism through their work on oligo 1,6 glucosidase from Bacillus cereus. Through this work, they were able to identify that proline insertion at second position of β-turns and N-cap of helix enhanced thermostability of the protein. This mechanism has been defined as the “The Proline Rule” [28, 29]. Loop stabilization by proline has been successfully implemented for thermostabilization of proteins such as cold shock protein, ubiquitin, ribonuclease Sa2 and guanyl specific ribonuclease Sa3, bacteriophage T4 lysozyme and human lysozyme [3032]. Proline insertion at helix N-cap has also been used to enhance the stability of proteins like alcohol dehydrogenase, α-parvalbumin and triosephosphate isomerase [18, 33, 34]. Studies on ribonuclease HI show that release of conformational strain caused due to left-handed helical residue Lys95 upon mutation to Gly, results in considerable increase in thermostability of the protein [13]. This strategy has been used to improve the stability of proteins such as Drosophila adapter protein Drk, barnase, lysozyme and Pin1 WW [3539]. In their classic work on bacteriophage T4 lysozyme, Matsumura et al., 1989, have not only elucidated the role of disulfide bridges in protein stability but have also shown that the effect of introduction of a combination of disulfide bridges on protein stability is additive in nature [9]. This mechanism has been used successfully for improving the stability of proteins like T4 lysozyme [40], subtilisin BPN [41], xylanase [42], lipase [43], lipase B [44] and glucose 1-dehydrogenase [45]. Currently computational tools such as CUPSAT [46], SDM [47], PopMusic [48], FoldX [49] and Rosetta Design [50] are available for predicting the effect of mutation on protein stability. However these tools require the user to input the mutations and do not suggest potential stabilizing mutations using specific strategies. Hence the iStability (in silico Analysis of Stability Change in Protein Structures) module was developed which not only aids in identification of stabilizing mutation sites through the application of protein design strategies described above for improvement of thermal stability but also assesses the stability of any mutant (Fig 2).

Evaluating potential thermostabilization sites using molecular interactions

Once a stabilizing mutation site is selected, it is important to evaluate the mutation in terms of its effects on neighboring residues, which is vital to any protein engineering experiment. Serrano et al., 1992, in their work with barnase enzyme have revealed that loss of buried salt bridges and hydrogen bonds due to mutations affects protein stability significantly [37]. Studies carried out on the arc repressor protein of bacteriophage P22 have shown deleterious effects of mutations on protein stability due to disruptions in hydrogen bonds and salt bridges [51]. Computational tools such as CUPSAT, SDM, PopMusic and Rosetta Design (S2 Table) are mainly of predictive nature. Although, users are usually informed of the effects of mutations on protein stability by these tools in form of stability scores, underlying details of interaction rearrangements at the mutation site are currently not provided. Along with the stability scores, the information regarding the change in interactions could provide a better evaluation of the mutations being considered.

Understanding this, we have developed a mutation evaluation tool called iMutants (in silico Comparative Analysis of Interactions in Protein Mutants), which assesses the change in local interactions at mutation sites through comparison of wild-type and mutant proteins (Fig 2). Evolutionary conservation analysis is crucial to successful protein design since highly conserved positions typically play important structural or functional roles, i.e. mutation could adversely affect protein function [52]. Therefore iMutants also evaluates the conserved nature of the mutation sites.

The three modules, iCAPS, iStability and iMutants, addressing these aspects of protein engineering problems, are integrated on a single platform in the form of iRDP (in silico Rational Designing of Proteins) web server presented in this work.

Material and Methods

The iRDP server is built on a Linux platform using R, Perl, HTML and PHP. The Bio3d [53] and iGraph [54] packages form the core of all iRDP modules. The vast analysis carried out by modules of iRDP server uses, both in-house developed scripts and established tools (S3 Table). Described below is the detailed workflow of each module in iRDP web server (S1 Fig).

In silico Comparative Analysis of Protein Structures (iCAPS)

Multiple structures serve as input to iCAPS. Input can be a list of PDB entries separated by commas, or files in valid PDB format can be uploaded. The user can select the structural features to be analyzed, modify cutoff values for calculation of non-covalent interactions and relative accessible surface area (ASA) before submitting the job. The relative ASA threshold is used to decide whether a residue is buried or exposed to solvent. The results page contains a unique job identification number for each job being submitted. Users can bookmark this page and return to view and retrieve the results later.

Analysis begins with primary structural features like amino acid composition, secondary structure content, information such as helix/strand/turn/coil composition and then proceeds to calculation of non-covalent interactions (Fig 2). The program DSSP [55] is used to detect secondary structures in the input proteins while NACCESS is used for estimation of residue solvent accessibility and accessible surface areas (ASA) [56]. The non-covalent interactions and disulfide bonds are identified using the standard criteria reported in literature (S4 Table). Users are provided with options to change the criteria of interaction calculations. In-house scripts are written for estimation of parameters such as proline residue distribution profile, thermolabile bond profile and helix dipole stabilization profile. Conformationally strained residues are identified by using Procheck [57] while β-turn and N-cap proline residues are identified using Promotif [58] and DSSP respectively. The program FindGeo has been employed for analysis of metal binding sites and geometry [59]. Gibbs free energy of unfolding is calculated using FoldX [49].

For the validation of iCAPS module, 16 thermophilic-mesophilic protein pairs were used (Table 1). Each pair was submitted to iCAPS module and raw values of various thermostability parameters were calculated. Raw values were first normalized by methods similar to that of Kumar et al., 2000 in order to estimate the percentage change of these parameters between mesophilic and thermophilic proteins [60]. Percentage change was calculated by using difference of normalized values between thermophilic and mesophilic protein, divided by the corresponding normalized value of mesophilic protein. For normalization of parameters such as total percentage of aromatic (Aro), uncharged polar (UP), proline (Pro), hydrophobic or aliphatic (ALI), charged (CHG) residues, total percentage of ion-pairs (IP), aromatic-aromatic (AAI), aromatic-sulphur (ASI), cation-pi (CPI), hydrogen bonding (HB), hydrophobic (HP) interactions, total percentage of conformationally strained residues (CS) and percentage of residues in loop regions (Loop), the raw values obtained were normalized using sequence length. In case of parameters such as total percentage of proline residues occurring at 2nd position of beta turns (Bt2P) and Ncap helix positions (NCap), normalization was carried out using total number of proline residues. Similarly the total percentage of dipole-stabilized helices parameter was normalized with total number of helices. In case of normalization for total percentage of thermolabile bonds (TL), raw values were normalized using total number of Asn and Gln residues. For normalization of parameters such as ratio of Nonpolar to Polar accessible surface areas (NP/P) and Arg to Lys ratio (R/K), raw values were directly considered for percentage change calculation. Since the protein families considered for analysis in the dataset were highly diverse in terms of sequence and structure, the normalization process focused on the pairs rather than the entire dataset.

thumbnail
Table 1. Comparative analysis of various thermostability factors among 16 thermophilic-mesophilic pairs of protein.

https://doi.org/10.1371/journal.pone.0139486.t001

In silico Analysis of Stability Change in Protein Structures (iStability)

The input to this module consists of providing a PDB code or uploading a valid PDB formatted file. The user must select any of the pre-defined protein design strategies (Fig 2) or provide their own mutations in the specified format. Once the design strategy is selected, the user has the choice of trying out four different stability prediction tools (FoldX [49], Auto-Mute [61], I-Mutant [62] and MUpro [63]), which are based on empirical potential energy functions or machine learning methods with provision to modify input. In case of FoldX, mutations are carried out and the total energy difference between the mutant and the corresponding wild-type is estimated. Positive values of energy difference reflects less stability. Auto-mute uses machine learning approach to carry out computational mutagenesis. A 3D-1D profile is generated for both the mutant and wild-type, which is further used in the generation of a vector difference profile. Here the environmental change (EC) scores quantify the differences generated for the mutant in terms of confidence in stability classification. I-Mutant implements support vector machines based on potential energy function to determine the stability. The ΔΔG value calculated uses the difference of Gibbs free energy (ΔG) of the mutant and wild-type. If the ΔΔG value is < 0 kcal/mol, the mutant is considered to be less stable than wild-type, while if the ΔΔG value is > 0 kcal/mol, the mutant is said to be more stable then wild-type. MUpro predicts how single-site amino acid mutations affect protein stability using the ΔΔG value, which is computed using support vector machines and neural networks. The confidence score generated measures the confidence of the prediction. If the score is < 0, it indicates decreased stability, whereas, score > 0 shows that the mutation increases stability compared to wild-type. The user can choose residue conservation analysis if required. If input is a PDB entry, then the evolutionary residue conservation score is derived from ConSurf-DB [64], on a scale of 1–9 (1 is an indication of least conserved/highly variable and 9 highly conserved/least variable). If input is a structure uploaded by the user, then the extracted sequence is used to search for homologs against the UniRef90 database using PSI-BLAST (2 iterations and e-value cutoff of 1) [65]. Weighted observed percentages from generated Position Specific Scoring Matrix (PSSM) are scaled from 1 to 9 as already defined and are presented as conservation scores.

For validation of iStability module, a total of 17, 6, 10 and 15 proteins were selected for the analysis of beta-turn proline insertion, N-cap proline insertion, conformational strain release and disulfide bond insertion strategies respectively (Table 2 and S7 Table). Protein structures were analyzed using iStability module by selecting each strategy and predictions obtained were compared with experimental observations.

thumbnail
Table 2. Validation of iStability using beta-turn proline insertion and conformational strain release strategy.

https://doi.org/10.1371/journal.pone.0139486.t002

in silico Comparative Analysis of Interactions in Protein Mutants (iMutants)

iMutants takes a single structure as input similar to iStability. The user must also provide mutations in the specified format. A large number of mutations can be analyzed simultaneously in iMutants. Each mutation must be provided in a separate line. For double or multiple mutants, mutations should be provided in a comma-separated format on a single line (S2 Fig). Users can modify interaction cutoffs and relative ASA value before submitting a job. Similar to iStability, an option is provided for residue conservation analysis at the mutation site.

For validation of iMutants module, a total of 51 mutations were analyzed (Table 3 and S8 Table) in arc repressor protein of bacteriophage P22 (PDB ID: 1ARR). Mutants were generated using MODELLER [66], energy minimized using steepest descent and finally the structure thus generated was compared with wild-type structure to analyze the change in interactions at the mutation site.

thumbnail
Table 3. iMutant analysis for the five highly unstable mutations in Arc repressor protein.

https://doi.org/10.1371/journal.pone.0139486.t003

Results and Discussion

The generalized workflow that can be implemented for most rational protein engineering problems has been described in Fig 1. The features and in silico implementation of this workflow in iRDP web server has been depicted in Fig 2.

Description and Validation of the Modules

Analysis of structure stabilization mechanisms using iCAPS.

iCAPS has been designed to carry out a comparative analysis of protein structures in terms of structural features and interactions that are known to contribute to thermodynamic stability. iCAPS supports investigation of 20 different stabilization mechanisms, as described below, estimating more than 250 parameters (S4 Table) analyzed simultaneously for a maximum of 100 structures.

  1. Amino acid composition: iCAPS generates a comparative summary for a set of proteins in terms of their amino acid composition and its property-wise classification into different categories like positively charged, negatively charged, uncharged polar and aromatic residues.
  2. Secondary structure information: Comparative summary generated for overall secondary structure (SS) content as well as the residue composition of each type of SS (Helix/Strand/Turn/Coil) of proteins, provides better understanding of the contribution of SS to protein thermostabilization.
  3. Non-covalent interactions: iCAPS calculates non-covalent interactions such as ion-pairs (IP), aromatic-aromatic interactions (AAI), aromatic-sulphur interactions (ASI), cation-π interactions (CPI), hydrogen bonds (HB) and hydrophobic interactions (HP). It also offers identification of interaction networks that are energetically more favorable compared to isolated interactions.
  4. Disulfide bridges: iCAPS identifies disulfide bridges in input protein structures and provides useful insights by classifying them in terms of their expected entropic effect (i.e. based on the number of residues between bridged Cys residues) while providing other details such as solvent accessibility and SS preference of Cys residues, revealing the contribution of these bonds to structural stabilization.
  5. Thermolabile bond profile: iCAPS studies spatial distributions of thermolabile bonds as potential target sites for stability enhancement in input structures involving asparagine and glutamine along with additional details such as SS preference and solvent accessible nature of the residues forming these bonds.
  6. Conformationally strained residue profile: Conformationally strained residue detection feature in iCAPS identifies conformationally strained residues in input structures, which could be considered for mutation to glycine for improving thermal stability of proteins. The module also provides additional information such as conformational geometry, SS preference and strain distance (distance between the Cβ and main chain oxygen atom) of the strained residues. While mutation of such residues to Gly remain the most established strategy, the void generated due to the lack of side chain in the Gly residue remains a viable concern [67]. In such cases it is advisable to explore non-Glycine substitutions using the customized mutation option in iStability.
  7. Helix dipole stabilization profile: The helix dipole stabilization feature of iCAPS identifies dipole-stabilized helices in input protein structures along with position-wise classification of dipole stabilizing charged residues.
  8. Proline residue profile: iCAPS reports the distribution of proline residues in various secondary structures along with specific identification of prolines occurring at the second position of beta-turns and at N-terminus of helices. Since the solvent exposed loops and intrinsically disordered regions of proteins are often found to be proline-rich [68], the secondary structure wise proline distribution provided warrants careful analysis.
  9. Accessible Surface Area analysis: iCAPS measures the total, main-chain, side-chain, polar and non-polar accessible surface areas of proteins. ASA analysis also classifies all 20 amino acids by their solvent accessible nature as buried or exposed.
  10. Metal binding analysis: The module identifies residues involved in metal binding sites along-with determination of metal coordination geometries.
  11. Estimation of Gibbs free energy of unfolding: Protein stabilization energies for input structures are computed by iCAPS using the FoldX energy function [49]. The total energy is considered as an approximation of overall stability of the protein. This comparative report gives a comprehensive overview of energies involving various structure stabilization mechanisms amongst proteins under study.

The above results are presented in a formatted web page. Besides this, for further down-stream analysis the user can download a zipped file containing all results in the form of tab-delimited text files. An extensive help file has been prepared and provided in the website which explains the importance of every parameter generated, along with relevant references.

Validation of iCAPS module.

For validation of iCAPS, a diverse non-redundant dataset of thermophilic-mesophilic (TS-MS) protein pairs, from organisms that are moderately thermophilic to hyperthermophilic as well as their mesophilic counterparts, were investigated (Table 1). The pairs comprising of structures having resolution ≤ 2.5 Å were selected from a diverse set of families (S5 Table). The selected TS-MS pairs were observed to be highly similar to each other with RMSD of the structures in the range of 0.69–1.68 Å while sequence identity in the range of 24–73% (S3 Fig). The thermophilic and mesophilic protein set among themselves were found to be highly dissimilar with the sequence identity ranging from 1–12% and 2–13% respectively (S3 Fig), demonstrating the diverse families considered for the analysis. In terms of structural diversity, 2 families were found to belong to all-alpha class, 3 to all beta class, 1 belonging to small proteins while the rest belonged to alpha-beta class according to the SCOP classification [69]. In most cases the oligomeric state of the pairs selected was found to be the same. Table 1 shows the percentage change values between TS-MS pairs with respect to various structural parameters estimated by iCAPS. The percentage change values can be correlated to the extent of the contribution of each of the factor towards thermostability of the proteins in the dataset. A positive value indicates higher occurrence of a particular parameter in thermophilic proteins while a negative value corresponds to higher occurrence in their mesophilic counterparts.

Comparative amino acid composition analysis revealed 13 families showing higher preference of charged (CHG) amino acids while 14 families displayed a lesser content of uncharged polar amino acids (UP) in the thermophilic proteins (Table 1). Of the 16 families, Ferredoxin from Bacillus thermoproteolyticus (1IQZ) was observed to show highest preference for charged residue content compared to its mesophilic partner from Clostridium acidurici. Similarly 3-Phosphoglycerate kinase from Geobacillus stearothermophilus (1PHP) showed lowest preference for UP content compared to its mesophilic homolog from Saccharomyces cerevisiae. This preference for charged residues compared to uncharged polar residues, a thermostabilization trend [70], also affected other parameters such as ion-pairs (IP), and the R/K ratio. It was found that 11 families had higher numbers of ion-pairs. The percentage change of ion-pairs was observed to be highest in case of Rubredoxin, a 53-residue protein. Rubredoxin from thermophilic Pyrococcus furiosus (1CAA), has 7 ion-pairs in its structure (5 ion-pairs form a network) while its mesophilic counterpart from Desulfovibrio vulgaris showed only one ion-pair. Similarly, 9 thermophilic proteins showed higher preference for Arginine than Lysine with highest preference observed in case of L-Lactate dehydrogenase enzyme family (1LDN). In terms of hydrogen bonding interactions (HB), 13 families contained a higher number of hydrogen bonds in the thermophilic set as compared to their mesophilic counterparts, thereby revealing hydrogen bonds as a contributing factor towards better protein stability. For this dataset aromatic residue content (Aro) and interactions involving aromatic amino acids (AAI, ASI and CPI) showed lower contribution towards stability. Hydrophobic residue content (ALI) and hydrophobic interactions (HP) were observed to be higher in 13 and 10 families of thermophilic proteins respectively. Among all pairs in the dataset, Methionine aminopeptidase from Pyrococcus furiosus (1XGS) showed highest hydrophobic interactions compared to its mesophilic counterpart from Escherichia coli. Reduction in hydrophobic surface area of a protein is a known thermostabilization mechanism. It was seen that in 11 cases the change in NP/P ratio was found to be negative (highest in case of Adenylate kinase family). While 7 families in the thermophilic set showed higher Pro content, in the current dataset only 6 and 7 thermophilic proteins respectively show beta-turn (Bt2P), NCap proline insertion parameters to be a contributing factor. Contribution by shortening of loop regions (Loop) in proteins towards thermostability was observed in 10 thermophilic proteins. In case of CheY protein family, loop percentage was observed to be lowest in case of thermophilic protein (1TMY) than its mesophilic partner. It was similarly observed that 12 thermophilic proteins contained fewer conformationally strained residues (CS), a factor contributing positively towards thermostability.

Although it was difficult to observe a generalized rule for protein thermostabilization, the analysis highlighted few parameters such as charged residue preference, increased ion-pairs and hydrogen bonding interactions, decreased non-polar accessible surface area, conformationally strained residues, and shortening of loops to contribute positively to thermostability of proteins in this dataset.

Identification of potential stabilizing mutations in a protein using iStability.

iStability offers in silico implementation of four experimentally established protein engineering strategies (Fig 2) for identification of possible stabilizing mutation sites. iStability also allows for analysis of any other structure stabilization mechanism based on user-defined mutations. Evolutionary conservation scores further help to decide the mutability of mutation sites, highly conserved sites should be considered for mutation with caution. For further downstream analysis, generated mutant structures can be downloaded. The identification of mutation sites along with stability predictions and residue conservation makes iStability a unique in silico protein-engineering tool.

For the insertion of disulfide bonds strategy, iStability invokes SSBOND software [71], which identifies and ranks residue pairs that on mutation to cysteines could form stable disulfide bridges. Based on identified residue pairs, disulfide bonds are inserted and their effect on stability is predicted. In case of entropic stabilization strategy by insertion of proline residues, iStability identifies the beta-turns in the protein containing non-proline residues at second position and helices containing non-proline residues at the N-cap position. Based on the stability prediction tool (Fig 2) selected, residues in the identified positions are mutated to proline. The mutant structures are then compared with the wild-type by the tool in order to generate a stability score. This stability score is then used to report if the stability of the mutant has been found to increase (I) or decrease (D) in comparison to the wild type. Similarly for the release of conformational strain strategy, iStability identifies conformationally strained residues, mutates them to glycine to release the strain and predicts their effects on stability. Apart from implementation of these strategies, iStability reads user-defined mutations through the customized mutation feature and predicts the mutant stability. The results of iStability constitute the stability score, stability prediction (I: increasing stability and D: decreasing stability) and conservation score of the residue being mutated.

Validation of iStability module.

For the validation of iStability module, a total of 81 mutations exploring the strategies implemented, from a diverse set of 40 structures (S6 Table) were analyzed for their stability effect, by selecting FoldX as stability prediction tool.

i. Improvement of protein thermostability by entropic reduction due to Proline introduction

Using iStability with default parameters, we have studied 28, second position β-turn proline insertions in 17 proteins from 13 organisms for which experimental stability results were available (Table 2). Of 28, iStability could accurately predict the stabilities for 22 β-turn insertions. In 20 cases, upon proline insertion an increase in stability was observed both experimentally and in iStability results. In the case of Protein G from Streptococcus sp. GX7805, the mutation K10P was predicted to decrease the stability by iStability which correlated with experimental results showing a decrease of 8.4°C in the Tm value of the mutant. For the G68P mutation in 2IMM, experimentally a significant decrease in stability was observed which iStability also predicted accurately. For three cases (A93P in 2RN2, G13P and A206P in 3MBP) showing near wild-type stability experimentally, iStability predicted an increase in stability. For 3 other cases (A48P in 1PGA, L15P in 1LVE and A21P in 1RTP), the module was unable to predict the stability of the mutants correctly as experimentally they were observed to have decreased stability whereas iStability predicted them to have increased stability.

11 proline insertions at the N-cap position of helices were analysed in 6 proteins from 5 organisms (S7 Table) for which experimentally determined stability results were available. Of the 11 mutations only 1 mutation, namely L316P carried out for Alcohol dehydrogenase was predicted inaccurately by iStability. The experimental results [18] for this mutant indicate the mutant to have higher stability (ΔTm: +10.8°C) than wild-type while iStability predicts a decreased stability for the mutant.

ii. Increasing protein thermostability through release of conformational strain by mutation to Glycine

A total 14 conformational strained residues in 10 proteins from 5 organisms (Table 2) were studied using iStability and the predictions were compared with experimentally validated results. The stability of 11 mutants was predicted accurately by iStability. Only 3 cases (R21G in 1LZ1, N30G in 1PIN and K136G in 1STN) were predicted incorrectly by the iStability module. Experimentally these three mutants were found to be thermostable [36, 39, 79] while iStability prediction shows a decreased stability.

iii. Reducing entropy for enhancement of thermostability by introduction of disulfide bridges

A set of 28 double Cysteine mutations (S7 Table) was studied for insertion of disulfide bonds for enhancement of protein stability in 15 proteins from 9 organisms. For this strategy, though iStability detected all the 28 residues pairs as potential insertion sites, stability was predicted correctly in 11 cases in comparison with experimentally determined stabilities. In one case (T72C, A471C in 3GLY), experimental evidence showed near wild-type stability while iStability predicted increased stability.

Of the total 81 predictions studied, 47 were true-positives (Both experiment and predictions showed increase of stability), 8 were true-negatives (Both experiment and predictions showed decrease of stability), 9 were false-positives (Experiment showed decrease while prediction showed increase of stability) and 17 were false-negative (Experiment showed increase while prediction showed decrease of stability). Thus the true-positive rate calculated was 0.84 while false-positive rate was 0.53. By using different FoldX energy cut-offs, neutral-states were incorporated in the prediction and ROC curve was generated (S9 Table and S4 Fig). While the true-positive rate and false-positive rates calculated actually test the accuracy of the underlying stability prediction program selected (FoldX), the values shown above also reflect the importance of using protein design strategies for better prediction of mutation sites. In those cases where iStability prediction differed from that observed experimentally, further analysis was carried out using iMutants. In case of A48P, the beta-turn proline insertion strategy in 1PGA, iMutants analysis revealed the loss of A48 (N)–(OD1) 46D hydrogen bond in mutant protein. This loss of hydrogen bond could result in decrease in stability observed experimentally. Similar changes in interactions near the mutation site were also noted in other cases, suggesting the need for further evaluation of the identified mutants using iMutants.

iStability currently relies on use of stability prediction tools that are based either on empirical potential energy functions or machine learning methods. Since these tools are based on defined training dataset, predicting mutations that are distant to the training dataset are a cause for concern.

Evaluating mutations through interaction framework and evolutionary residue conservation at mutation sites using iMutants.

A comparative local interaction profile generated from the detailed molecular interactions in a model is unique to iMutants. It offers a quantitative measure of structural changes in mutants, through loss or gain of interactions at the mutation site. The module provides a comparative interaction analysis of wild-type and mutant residues summarized in the form of a local interaction profile comprising a number of interactions and their networks. Hyperlinks provide details of the calculated interactions. iMutants also supplements the interaction profile with estimated evolutionary conservation scores of the wild-type residues being mutated. In addition, mutant structures generated can also be downloaded for further downstream analysis.

Validation of iMutants module.

The equilibrium stabilities of 51 mutants for the arc repressor protein of bacteriophage P22 (PDB ID: 1ARR) have been studied experimentally by Milla et al., 1994, using thermal and urea denaturation [51]. These 51 mutations were analysed using the iMutants module and the change in various non-bonded interactions was recorded. The mutations were divided into four groups as established by Milla et al., 1994 for analysis purposes. The first group consisted of 5 mutants (Table 3; V22A, I37A, V41A, F45A, E36A) that were experimentally determined to be highly unstable as they were unable to form dimers, and remained in an unfolded state. Of the five, the first four mutations showed a dramatic decrease of 5, 6, 3 and 4 hydrophobic interactions, respectively. These interactions affect the hydrophobic core of the protein and the loss of interactions coincides with the experimental instability observed. Since the E36A mutation involves alteration of a buried polar residue, iMutants recorded drastic loss of two ionic interactions, one ionic network along with one hydrogen bond, which was established by Milla et al., 1994 as a possible cause of instability of the mutant.

The next set analysed comprised 20 mutants (S8 Table), which experimentally exhibited reduced stability with tm values ranging from 30–50°C as compared to the wild-type protein (Tm: 57.9°C). Since three mutations R31A, R40A and R50A involved polar residues, iMutants recorded a loss of one ionic interaction and one ionic interaction network for R31A, two ionic interactions, two hydrogen bonds and one ionic interaction network for R40A and finally one ionic interaction for R50A mutants that could explain the instability of these mutants (Tm: 37.1°C, 31.2°C and 47.9°C, respectively). For mutants W14A, L21A, N29A, V33A and Y38A, changes in hydrogen bonding interactions were observed (S8 Table). The mutation W14A also showed a loss of two aromatic pair interactions, one aromatic pair network as well as five hydrophobic interactions. This change in interactions could thus explain the decrease of Tm to 31.5°C observed for this mutant. Mutants F10A, L12A, P15A, L19A, L21A, Y38A and M42A showed a loss of 6, 4, 2, 4, 1, 3, and 2 hydrophobic interactions respectively which could contribute to the instability observed.

The set of 25 mutants (S8 Table) analysed next displayed near wild-type stability experimentally with their tm ranging between 55–63°C. Most mutations in this set showed marginal or no change in their hydrogen bonding, ionic and hydrophobic interactions. P8A, the only mutant with increased stability (Tm: 74.1°C), showed a change in just one hydrophobic interaction in the iMutants analysis (S8 Table). The stabilization of this particular mutant could be due to the extension of β-sheets or relief of unfavourable packing interactions as postulated by Milla et al, 1994.

In summary, hydrogen bonds and hydrophobic interactions were observed to provide a sizable contribution towards stability of arc repressor protein. Complete interaction profiles and details of interactions for mutants along with their Tm values have been provided in S8 Table. Although the experimental evidence by Milla et al., 1994 focus solely on mutation of the residues to Ala, exploring mutations to non-Ala residues could also yield additional useful information.

iATMs (in silico Analysis of Thermally stable Mutants): An information resource.

The local interaction analysis approach of iMutants was extended to analyse experimentally validated mutations listed in the ProTherm database and is provided in the form of iATMs (in silico Analysis of Thermally stable Mutants), as a supplementary information resource to ProTherm [84]. Although ProTherm contains a vast resource of experimental information, no information is available describing the changes in structure and atomic interactions due to the mutations carried out. iATMs is organized in three sections based on the type of mutation as single, double or multiple. Within these, the sections are further classified into those containing crystal structures for both wild-type and mutant proteins and those where only wild-type crystal structures are available. Wherever wild-type and mutants structures were known, interaction profiles were generated using those structures. In cases where crystal structures for mutants were absent, generation of local interaction profiles was carried out using known wild-type and modelled mutant structure. Information provided in iATMs could provide a better understanding of correlation between experimental observations and interaction rearrangements due to mutations, leading to better application of derived knowledge towards efficient protein engineering.

Conclusion

Despite the availability of a large number of structural analysis tools, to the best of our knowledge there is currently no unified platform addressing the rational protein design problem. The web platform iRDP uniquely offers investigators a multi-faceted approach for carrying out rational protein engineering by integrating protein structure and mutation analysis tools. The modules of iRDP server can either be used separately for various independent analyses or as a systematic directed strategy encompassing the steps involved in rational protein engineering. Applications of modules do not limit themselves to the protein stability problem since the information generated comprises of diverse structural features, which can correlate with a wide range of properties in proteins. Investigations carried out using iRDP act as a guide for analysing varied structural features that relate to problems such as pH stability, protein active site analysis, crystallizability, analysis of frames from molecular dynamics simulations and protein structure-function relationships.

The future direction of the iRDP web server aspires towards implementation of sequence-based inputs complementing the existing structure-based input followed by visualization of interaction networks and mutation sites, thereby providing a better structural perspective. We welcome comments and corrections from users to further improve the iRDP server.

Supporting Information

S1 Fig. The workflow of the working modules implemented in the iRDP web server.

https://doi.org/10.1371/journal.pone.0139486.s001

(TIFF)

S2 Fig. The input mutation format for user-defined mutations in the iMutants module of the iRDP web server.

https://doi.org/10.1371/journal.pone.0139486.s002

(TIF)

S3 Fig. Box plot illustrating the % sequence identity among proteins in Thermophilic- Mesophilic (TS-MS) pairs, all TS proteins and all MS proteins.

High degree of homology observed between TS-MS protein pairs compared to all-TS and all-MS protein sets.

https://doi.org/10.1371/journal.pone.0139486.s003

(TIFF)

S4 Fig. The ROC curve for the iStability prediction based on validation dataset of 81 mutations.

The neutral-state cut-off values used are labelled near the data points. It is observed that the True-positive rate or sensitivity remains > 0.8 for neutral state cutoff range of 0 (no neutral state) to 2.5. The point shown at origin is just used for joining the lines between data points to the origin.

https://doi.org/10.1371/journal.pone.0139486.s004

(TIFF)

S1 Table. List of protein structure analysis tools.

https://doi.org/10.1371/journal.pone.0139486.s006

(PDF)

S2 Table. List of mutant stability prediction tools.

https://doi.org/10.1371/journal.pone.0139486.s007

(PDF)

S3 Table. List of tools used by iRDP web server for estimation of various structural parameters.

https://doi.org/10.1371/journal.pone.0139486.s008

(PDF)

S4 Table. List of various quantitative parameters (total 288) analyzed by the iCAPS module.

https://doi.org/10.1371/journal.pone.0139486.s009

(PDF)

S5 Table. Details of proteins considered in iCAPS validation.

https://doi.org/10.1371/journal.pone.0139486.s010

(PDF)

S6 Table. Details of proteins used for iStability and iMutants validation.

https://doi.org/10.1371/journal.pone.0139486.s011

(PDF)

S7 Table. Validation of iStability using proline insertion at Ncap of helix and disulfide bond insertion strategy.

https://doi.org/10.1371/journal.pone.0139486.s012

(PDF)

S8 Table. The iMutant analysis on 51 mutations in Arc Repressor protein of bacteriophage P22.

Local interaction profile represents number of various interactions and interaction networks of wild-type and mutant residues. The label corresponds to number of IP: ion-pair, IP_Net: ion- pair networks, AP: aromatic aromatic interaction, AP.Net: aromatic aromatic interaction network, AS: aromatic sulphur interactions, AS.Net: aromatic-sulphur interaction network, HB: hydrogen bonds, Disul: disulfide bonds, Cat-pi: cation-pi interactions, Cat-pi_Net: cation-pi interaction networks, Hphob: hydrophobic interactions. The—(hyphen) corresponds to no interaction or interaction networks detected. The Tm value corresponds to the Tm value of Mutant. The results can be accessed following the link http://irdp.ncl.res.in/cgi-bin/result_fetch_MutAna.php?ID=iMutcase.

https://doi.org/10.1371/journal.pone.0139486.s013

(PDF)

S9 Table. iStability validation analysis showing prediction parameters calculated for different neutral-state energy cut-offs.

https://doi.org/10.1371/journal.pone.0139486.s014

(PDF)

Acknowledgments

PP, MS and AG thank Council of Scientific and Industrial Research, New Delhi for Senior Research Fellowship. SKR is DST Ramanujan Fellow. This research is carried out under CSIR- NCL’s Centre of Excellence in Scientific Computation (CoESC). We also acknowledge computational facilities funded by Department of Biotechnology, Government of India.

Author Contributions

Conceived and designed the experiments: CGS PP MS SR. Performed the experiments: PP MS AG. Analyzed the data: PP MS. Wrote the paper: CGS PP MS SR.

References

  1. 1. Zamost B, Nielsen H, Starnes R. Thermostable enzymes for industrial applications. Journal of Industrial Microbiology. 1991;8(2):71–81.
  2. 2. Deckert G, Warren PV, Gaasterland T, Young WG, Lenox AL, Graham DE, et al. The complete genome of the hyperthermophilic bacterium Aquifex aeolicus. Nature. 1998;392(6674):353–8. pmid:9537320
  3. 3. Russell RJM, Ferguson JMC, Hough DW, Danson MJ, Taylor GL. The Crystal Structure of Citrate Synthase from the Hyperthermophilic Archaeon Pyrococcus furiosus at 1.9 Å Resolution. Biochemistry. 1997;36(33):9983–94. pmid:9254593
  4. 4. Thompson MJ, Eisenberg D. Transproteomic evidence of a loop-deletion mechanism for enhancing protein thermostability. Journal of molecular biology. 1999;290(2):595–604. pmid:10390356
  5. 5. Usher KC, De La Cruz AFA, Dahlquist FW, James Remington S, Swanson RV, Simon MI. Crystal structures of CheY from Thermotoga maritima do not support conventional explanations for the structural basis of enhanced thermostability. Protein science. 1998;7(2):403–12. pmid:9521117
  6. 6. Vieille C, Zeikus GJ. Hyperthermophilic enzymes: sources, uses, and molecular mechanisms for thermostability. Microbiology and molecular biology reviews: MMBR. 2001;65(1):1–43. Epub 2001/03/10. pmid:11238984; PubMed Central PMCID: PMC99017.
  7. 7. Vogt G, Woell S, Argos P. Protein thermal stability, hydrogen bonds, and ion pairs. Journal of molecular biology. 1997;269(4):631–43. pmid:9217266
  8. 8. Vogt G, Argos P. Protein thermal stability: hydrogen bonds or internal packing? Folding and Design. 1997;2:S40–S6. pmid:9269567
  9. 9. Matsumura M, Signor G, Matthews BW. Substantial increase of protein stability by multiple disulphide bonds. Nature. 1989;342(6247):291–3. Epub 1989/11/16. pmid:2812028.
  10. 10. Zhang T, Bertelsen E, Alber T. Entropic effects of disulphide bonds on protein stability. Nature Structural & Molecular Biology. 1994;1(7):434–8.
  11. 11. Betz SF. Disulfide bonds and the stability of globular proteins. Protein Science. 1993;2(10):1551–8. pmid:8251931
  12. 12. Robinson NE. Protein deamidation. Proceedings of the National Academy of Sciences of the United States of America. 2002;99(8):5283–8. Epub 2002/04/18. pmid:11959979; PubMed Central PMCID: PMCPmc122761.
  13. 13. Kimura S, Kanaya S, Nakamura H. Thermostabilization of Escherichia coli ribonuclease HI by replacing left-handed helical Lys95 with Gly or Asn. Journal of Biological Chemistry. 1992;267(31):22014–7. pmid:1331044
  14. 14. Kawamura S, Kakuta Y, Tanaka I, Hikichi K, Kuhara S, Yamasaki N, et al. Glycine–15 in the bend between two α-helices can explain the thermostability of DNA binding protein HU from Bacillus stearothermophilus. Biochemistry. 1996;35(4):1195–200.
  15. 15. Marshall SA, Morgan CS, Mayo SL. Electrostatics significantly affect the stability of designed homeodomain variants. Journal of molecular biology. 2002;316(1):189–99. pmid:11829512
  16. 16. Nicholson H, Becktel W, Matthews B. Enhanced protein thermostability from designed mutations that interact with alpha-helix dipoles. Nature. 1988;336(6200):651–6. pmid:3200317
  17. 17. Watanabe K, Masuda T, Ohashi H, Mihara H, Suzuki Y. Multiple Proline Substitutions Cumulatively Thermostabilize Bacillus Cereus ATCC7064 Oligo‐1, 6‐Glucosidase. European Journal of Biochemistry. 1994;226(2):277–83. pmid:8001545
  18. 18. Bogin O, Peretz M, Hacham Y, Burstein Y, Korkhin Y, Frolow F. Enhanced thermal stability of Clostridium beijerinckii alcohol dehydrogenase after strategic substitution of amino acid residues with prolines from the homologous thermophilic Thermoanaerobacter brockii alcohol dehydrogenase. Protein science. 1998;7(5):1156–63. pmid:9836874
  19. 19. Dill KA. Dominant forces in protein folding. Biochemistry. 1990;29(31):7133–55. pmid:2207096
  20. 20. Knapp S, Kardinahl S, Hellgren N, Tibbelin G, Schäfer G, Ladenstein R. Refined crystal structure of a superoxide dismutase from the hyperthermophilic archaeon< i> Sulfolobus acidocaldarius at 2.2 Å resolution. Journal of molecular biology. 1999;285(2):689–702.
  21. 21. Marg GA, Clark DS. Activation of glucose isomerase by divalent cations: evidence for two distinct metal-binding sites. Enzyme and microbial technology. 1990;12(5):367–73.
  22. 22. Kasumi T, Hayashi K, Tsumura N. Roles of magnesium and cobalt in the reaction of glucose isomerase from Streptomyces griseofuscus S–41. Agricultural and Biological Chemistry. 1982;46(1):21–30.
  23. 23. Smith CA, Toogood HS, Baker HM, Daniel RM, Baker EN. Calcium-mediated thermostability in the subtilisin superfamily: the crystal structure of Bacillus Ak. 1 protease at 1.8 Å resolution. Journal of molecular biology. 1999;294(4):1027–40. pmid:10588904
  24. 24. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic acids research. 2000;28(1):235–42. pmid:10592235
  25. 25. Vriend G. WHAT IF: a molecular modeling and drug design program. Journal of molecular graphics. 1990;8(1):52–6. pmid:2268628
  26. 26. Tina K, Bhadra R, Srinivasan N. PIC: protein interactions calculator. Nucleic acids research. 2007;35(suppl 2):W473–W6.
  27. 27. Gallivan JP, Dougherty DA. Cation-π interactions in structural biology. Proceedings of the National Academy of Sciences. 1999;96(17):9459–64.
  28. 28. Suzuki Y. A general principle of increasing protein thermostability. Proceedings of the Japan Academy Ser B: Physical and Biological Sciences. 1989;65(6):146–8.
  29. 29. Suzuki Y, Oishi K, Nakano H, Nagayama T. A strong correlation between the increase in number of proline residues and the rise in thermostability of five Bacillus oligo–1, 6-glucosidases. Applied microbiology and biotechnology. 1987;26(6):546–51.
  30. 30. Fu H, Grimsley GR, Razvi A, Scholtz JM, Pace CN. Increasing protein stability by improving beta‐turns. Proteins: Structure, Function, and Bioinformatics. 2009;77(3):491–8.
  31. 31. Herning T, Yutani K, Inaka K, Kuroki R, Matsushima M, Kikuchi M. Role of proline residues in human lysozyme stability: a scanning calorimetric study combined with X-ray structure analysis of proline mutants. Biochemistry. 1992;31(31):7077–85. pmid:1643041
  32. 32. Nicholson H, Tronrud D, Becktel W, Matthews B. Analysis of the effectiveness of proline substitutions and glycine replacements in increasing the stability of phage T4 lysozyme. Biopolymers. 1992;32(11):1431–41. pmid:1457724
  33. 33. Agah S, Larson JD, Henzl MT. Impact of proline residues on parvalbumin stability. Biochemistry. 2003;42(37):10886–95. pmid:12974622
  34. 34. Mainfroid V, Mande SC, Hol WG, Martial JA, Goraj K. Stabilization of human triosephosphate isomerase by improvement of the stability of individual α-helices in dimeric as well as monomeric forms of the protein. Biochemistry. 1996;35(13):4110–7. pmid:8672446
  35. 35. Bezsonova I, Singer A, Choy W-Y, Tollinger M, Forman-Kay JD. Structural comparison of the unstable drkN SH3 domain and a stable mutant. Biochemistry. 2005;44(47):15550–60. pmid:16300404
  36. 36. Jäger M, Dendle M, Kelly JW. Sequence determinants of thermodynamic stability in a WW domain—An all‐β‐sheet protein. Protein Science. 2009;18(8):1806–13. pmid:19565466
  37. 37. Serrano L, Kellis JT Jr, Cann P, Matouschek A, Fersht AR. The folding of an enzyme: II. Substructure of barnase and the contribution of different interactions to protein stability. Journal of molecular biology. 1992;224(3):783–804. pmid:1569557
  38. 38. Takano K, Ota M, Ogasahara K, Yamagata Y, Nishikawa K, Yutani K. Experimental verification of thestability profile of mutant protein'(SPMP) data using mutant human lysozymes. Protein engineering. 1999;12(8):663–72. pmid:10469827
  39. 39. Takano K, Yamagata Y, Yutani K. Role of non‐glycine residues in left‐handed helical conformation for the conformational stability of human lysozyme. Proteins: Structure, Function, and Bioinformatics. 2001;44(3):233–43.
  40. 40. Perry LJ, Wetzel R. Disulfide bond engineered into T4 lysozyme: stabilization of the protein toward thermal inactivation. Science. 1984;226(4674):555–7. pmid:6387910
  41. 41. Pantoliano MW, Ladner RC, Bryan PN, Rollence ML, Wood JF, Poulos TL. Protein engineering of subtilisin BPN': enhanced stabilization through the introduction of two cysteines to form a disulfide bond. Biochemistry. 1987;26(8):2077–82. pmid:3476160
  42. 42. Davoodi J, Wakarchuk WW, Carey PR, Surewicz WK. Mechanism of stabilization of< i> Bacillus circulans xylanase upon the introduction of disulfide bonds. Biophysical chemistry. 2007;125(2):453–61.
  43. 43. Han Z-l, Han S-y, Zheng S-p, Lin Y. Enhancing thermostability of a Rhizomucor miehei lipase by engineering a disulfide bond and displaying on the yeast cell surface. Applied microbiology and biotechnology. 2009;85(1):117–26. pmid:19533118
  44. 44. Le QA, Joo JC, Yoo YJ, Kim YH. Development of thermostable Candida antarctica lipase B through novel in silico design of disulfide bridge. Biotechnology and bioengineering. 2012;109(4):867–76. pmid:22095554.
  45. 45. Ding H, Gao F, Liu D, Li Z, Xu X, Wu M, et al. Significant improvement of thermal stability of glucose 1-dehydrogenase by introducing disulfide bonds at the tetramer interface. Enzyme and microbial technology. 2013;53(6–7):365–72. pmid:24315638.
  46. 46. Parthiban V, Gromiha MM, Schomburg D. CUPSAT: prediction of protein stability upon point mutations. Nucleic acids research. 2006;34(suppl 2):W239–W42.
  47. 47. Worth CL, Preissner R, Blundell TL. SDM—a server for predicting effects of mutations on protein stability and malfunction. Nucleic acids research. 2011;39(suppl 2):W215–W22.
  48. 48. Dehouck Y, Kwasigroch JM, Gilis D, Rooman M. PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality. BMC bioinformatics. 2011;12(1):151.
  49. 49. Guerois R, Nielsen JE, Serrano L. Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. Journal of molecular biology. 2002;320(2):369–87. pmid:12079393
  50. 50. Liu Y, Kuhlman B. RosettaDesign server for protein design. Nucleic acids research. 2006;34(suppl 2):W235–W8.
  51. 51. Milla ME, Brown BM, Sauer RT. Protein stability effects of a complete set of alanine substitutions in Arc repressor. Nature structural biology. 1994;1(8):518–23. Epub 1994/08/01. pmid:7664079.
  52. 52. Suemori A. Conserved and non-conserved residues and their role in the structure and function of p-hydroxybenzoate hydroxylase. Protein engineering, design & selection: PEDS. 2013;26(7):479–88. pmid:23766373.
  53. 53. Grant BJ, Rodrigues AP, ElSawy KM, McCammon JA, Caves LS. Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics. 2006;22(21):2695–6. pmid:16940322
  54. 54. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal, Complex Systems. 2006;1695(5).
  55. 55. Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):2577–637. Epub 1983/12/01. pmid:6667333.
  56. 56. Hubbard SJ, Thornton JM. Naccess. Computer Program, Department of Biochemistry and Molecular Biology, University College London. 1993;2(1).
  57. 57. Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. Journal of applied crystallography. 1993;26(2):283–91.
  58. 58. Hutchinson EG, Thornton JM. PROMOTIF—a program to identify and analyze structural motifs in proteins. Protein Science. 1996;5(2):212–20. pmid:8745398
  59. 59. Andreini C, Cavallaro G, Lorenzini S. FindGeo: a tool for determining metal coordination geometry. Bioinformatics. 2012;28(12):1658–60. pmid:22556364.
  60. 60. Kumar S, Tsai C-J, Nussinov R. Factors enhancing protein thermostability. Protein engineering. 2000;13(3):179–91. pmid:10775659
  61. 61. Masso M, Vaisman II, editors. Structure-based prediction of protein activity changes: Assessing the impact of single residue replacements. Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE; 2011: IEEE.
  62. 62. Capriotti E, Fariselli P, Casadio R. I-Mutant2. 0: predicting stability changes upon mutation from the protein sequence or structure. Nucleic acids research. 2005;33(suppl 2):W306–W10.
  63. 63. Cheng J, Randall A, Baldi P. Prediction of protein stability changes for single‐site mutations using support vector machines. Proteins: Structure, Function, and Bioinformatics. 2006;62(4):1125–32.
  64. 64. Goldenberg O, Erez E, Nimrod G, Ben-Tal N. The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures. Nucleic acids research. 2009;37(suppl 1):D323–D7.
  65. 65. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research. 1997;25(17):3389–402. pmid:9254694
  66. 66. Eswar N, Webb B, Marti-Renom MA, Madhusudhan M, Eramian D, Shen M-y, et al. Comparative protein structure modeling using Modeller. Current protocols in bioinformatics. 2006:5.6. 1–5.6. 30.
  67. 67. Borgo B, Havranek JJ. Automated selection of stabilizing mutations in designed and natural proteins. Proceedings of the National Academy of Sciences. 2012;109(5):1494–9.
  68. 68. Theillet F-X, Kalmar L, Tompa P, Han K-H, Selenko P, Dunker AK, et al. The alphabet of intrinsic disorder. Intrinsically Disordered Proteins. 2013;1(1):e24360.
  69. 69. Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. Journal of molecular biology. 1995;247(4):536–40. pmid:7723011
  70. 70. Chakravarty S, Varadarajan R. Elucidation of determinants of protein stability through genome sequence analysis. Febs Letters. 2000;470(1):65–9. pmid:10722847
  71. 71. Hazes B, Dijkstra BW. Model building of disulfide bonds in proteins with known three-dimensional structure. Protein engineering. 1988;2(2):119–25. pmid:3244694
  72. 72. Trevino SR, Schaefer S, Scholtz JM, Pace CN. Increasing protein conformational stability by optimizing β-turn sequence. Journal of molecular biology. 2007;373(1):211–8. pmid:17765922
  73. 73. Choi EJ, Mayo SL. Generation and analysis of proline mutants in protein G. Protein Engineering Design and Selection. 2006;19(6):285–9.
  74. 74. Ohage EC, Graml W, Walter MM, Steipe B, Steinbacher S. β‐Turn propensities as paradigms for the analysis of structural motifs to engineer protein stability. Protein science. 1997;6(1):233–41. pmid:9007995
  75. 75. Raffen R, Dieckman LJ, Szpunar M, Wunschl C, Pokkuluri PR, Dave P, et al. Physicochemical consequences of amino acid variations that contribute to fibril formation by immunoglobulin light chains. Protein Science. 1999;8(3):509–17. pmid:10091653
  76. 76. Allen MJ, Coutinho PM, Ford CF. Stabilization of Aspergillus awamori glucoamylase by proline substitution and combining stabilizing mutations. Protein engineering. 1998;11(9):783–8. pmid:9796827
  77. 77. Cammett TJ, Luo L, Peng Z-y. Design and Characterization of a Hyperstable p16 INK4a that Restores Cdk4 Binding Activity when Combined with Oncogenic Mutations. Journal of molecular biology. 2003;327(1):285–97. pmid:12614625
  78. 78. Jager M, Dendle M, Kelly JW. Sequence determinants of thermodynamic stability in a WW domain–-an all-beta-sheet protein. Protein science: a publication of the Protein Society. 2009;18(8):1806–13. pmid:19565466; PubMed Central PMCID: PMC2776968.
  79. 79. Stites WE, Meeker AK, Shortle D. Evidence for strained interactions between side-chains and the polypeptide backbone. Journal of molecular biology. 1994;235(1):27–32. pmid:8289248
  80. 80. Kim J, Brych SR, Lee J, Logan TM, Blaber M. Identification of a key structural element for protein folding within β-hairpin turns. Journal of molecular biology. 2003;328(4):951–61. pmid:12729767
  81. 81. Loewenthal R, Sancho J, Fersht AR. Histidine-aromatic interactions in barnase: Elevation of histidine pKa and contribution to protein stability. Journal of molecular biology. 1992;224(3):759–70. pmid:1569555
  82. 82. Arakawa T, Horan TP, Narhi LO, Rees DC, Schiffer SG, Holst PL, et al. Production and characterization of an analog of acidic fibroblast growth factor with enhanced stability and biological activity. Protein engineering. 1993;6(5):541–6. pmid:7692436
  83. 83. Predki PF, Agrawal V, Brünger AT, Regan L. Amino-acid substitutions in a surface turn modulate protein stability. Nature Structural & Molecular Biology. 1996;3(1):54–8.
  84. 84. Kumar MS, Bava KA, Gromiha MM, Prabakaran P, Kitajima K, Uedaira H, et al. ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions. Nucleic acids research. 2006;34(suppl 1):D204–D6.