Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Benchmarking free energy calculations: Analysis of single and double mutations across two simulation software platforms for two protein systems

  • Shivani Gupta,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Chemistry, Temple University, Philadelphia, Pennsylvania, United States of America

  • Qinfang Sun ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Software, Supervision, Validation, Visualization, Writing – review & editing

    qinfang.sun@temple.edu (QS); ronlevy@temple.edu (RML)

    Affiliations Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania, United States of America, Department of Chemistry, Temple University, Philadelphia, Pennsylvania, United States of America

  • Ronald M. Levy

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Validation, Visualization, Writing – review & editing

    qinfang.sun@temple.edu (QS); ronlevy@temple.edu (RML)

    Affiliations Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania, United States of America, Department of Chemistry, Temple University, Philadelphia, Pennsylvania, United States of America

Abstract

Mutation analysis of single and double mutations of proteins including the two widely studied proteins: Staphylococcus nuclease (S. nuclease) and T4 lysozyme, provides critical insights into their stability and fitness. Furthermore, such analysis facilitates a deeper understanding of the complex interplay between a protein's sequence, structural conformation, and functional output. Free Energy Perturbation (FEP) is an alchemical, all-atom molecular dynamics based computational approach that determines the free energy change ( from wild type to mutant states. Two widely adopted software platforms used for this purpose are Schrödinger and GROMACS. We have compared the results of FEP simulations for mutations using these platforms, employing the OPLS4 force field implemented in Schrödinger, and the Amber99SB-ILDN force field implemented in GROMACS for this work. For the 38 single mutants of S. nuclease, the Pearson r between the experimental and the calculated free energy change was 0.86 using Schrödinger and 0.87 using GROMACS. The reliability of the FEP method using the two software platforms with the specified force fields was further demonstrated by its performance on 24 single mutants of T4 lysozyme, yielding strong correlation between predicted and experimental values, with Pearson r of 0.80 for Schrödinger and 0.85 for GROMACS. Additionally, the computed folding free energy changes for 45 double mutations in S. nuclease using Schrödinger correlated well with both experimental measurements (Pearson r = 0.74) and previously reported GROMACS values (Pearson r = 0.71). Correspondingly, the nonadditivities of the double mutations derived using Schrödinger for these 45 double mutants of S. nuclease were also found to be in good agreement with the experimental values (Pearson r = 0.79), as well as with the previously reported GROMACS results (Pearson r = 0.61). A good correlation was also observed between computed values from Schrödinger and GROMACS, with a Pearson r of 0.71 for double mutants and 0.61 for their nonadditivities. Collectively, these findings establish the efficiency, accuracy, and reliability of Schrödinger FEP+ and GROMACS in predicting mutation-induced free energy changes in S. nuclease and T4 lysozyme. The integration of FEP based computational methodologies with experimental validation provides a framework for quantifying mutation-induced changes in protein thermostability, which can be used as a tool for protein engineering and design.

Introduction

Mutational analysis serves as a powerful tool for elucidating the complex interdependencies among protein sequence, structure, and function [1]. By systematically assessing the impact of specific amino acid substitutions on protein thermostability [2] and consequently on functional activity, researchers can uncover fundamental principles governing protein folding, molecular interactions, and evolutionary fitness [3,4]. This approach provides critical insights into the intrinsic trade-offs between stability and function, thereby enabling rational protein engineering and aiding the interpretation of pathogenic or disease associated variants.

The impact of mutations on protein thermostability has been extensively studied across diverse protein systems [59]. A protein's ability to remain properly folded and functionally active at elevated temperatures is governed by its thermostability, making it a critical parameter for understanding degradation, aggregation, and denaturation over time. Thermostable proteins are of significant industrial relevance due to their broad-spectrum applications. Their pivotal roles as biocatalysts in chemical synthesis, components in biofuel production, agents in food processing, and additives in detergents are inherently dependent on their structural and functional stability under thermal stress [10]. In the pharmaceutical industry, thermostable therapeutics such as monoclonal antibodies, enzymes, and cytokines exhibit enhanced stability during manufacturing, storage, and transportation, making them more suitable for clinical and commercial applications [11].

Accurate prediction of protein thermostability through computational approaches provides a potentially efficient complement to experimental techniques [12]. In this context, various physics based approaches such as Molecular Mechanics Generalized Born Surface Area (MM/GBSA) [13,14] and Free Energy Perturbation [1517] methods have gained attention. Unlike data driven machine learning models, which may be constrained by the quality, bias, or representativeness of their training datasets, these methods are founded on fundamental physical principles and provide predictive insights that are independent of empirical training data.

Free energy perturbation (FEP) [1821] is an alchemical free energy simulation method, firmly rooted in statistical thermodynamics that can be used to estimate free energy differences, provided sufficiently long simulation times, together with enhanced sampling techniques and an accurate potential energy function are employed. It is an explicit solvent, molecular dynamics (MD) based method, offering a framework for evaluating the thermodynamic impact of amino acid substitutions on protein stability. The term ‘alchemical’ refers to the generation of a series of nonphysical intermediate states connecting two physical end states typically the wild-type (initial) and mutant (final) forms, facilitating the estimation of the difference between the free energy to fold the wild-type and mutant proteins. The transformation from wild-type to mutant residues, for single mutants(SMs), has been performed in both the folded and unfolded states of the protein that follows a thermodynamic cycle, as illustrated in Fig 1 of the Methodology section. Similarly, the thermodynamic cycle representing double mutations and their corresponding nonadditivity has been depicted in Fig 2 of the Methodology section.

thumbnail
Fig 1. Thermodynamic Cycle to calculate Stability Change of Single Mutant T22C ().

The green color highlights wild-type residue and blue represents the mutated residue, with rest colored as red, being same in both states. The folding free energy change upon single amino acid mutation is .

https://doi.org/10.1371/journal.pone.0335829.g001

thumbnail
Fig 2. Double mutant cycle with three different pathways (blue, orange, and green) to calculate the double mutation free energy change and the corresponding nonadditivity (δ).

Here in , the Subscript WT = Reference/Initial state, Superscript A = Mutated/Final state and same for others. In principle free energy change from all three routes should be the same. If ≠ 0 then mutation A and B are nonadditive, highly correlated and exhibit a thermodynamic coupling.

https://doi.org/10.1371/journal.pone.0335829.g002

Staphylococcal nuclease (S. nuclease) is a small extracellular protein of 149 amino acid residue length with no cysteines, secreted by Staphylococcus aureus. Its ability to degrade both DNA and RNA in the presence of free Ca2+ ions, finds various applications in nucleic acid chemistry since 1956 [22,23]. Its simple solution behavior being soluble in both native and denatured states has made it a benchmark system used for studies on protein folding [24,25] and NMR spectroscopy [26]. The experimental values for both single and double mutants (DMs) in S. nuclease were sourced from the extensive work of Shortle et al in S. nuclease [2729]. A second benchmark protein we focus on in the present study is T4 lysozyme (164 residues) [30], Schellman and co-workers [31] have measured the entropy, enthalpy and free energy change for the wild-type protein and a large number of mutants [32]. A diverse set of experimentally characterized SMs for T4 lysozyme, along with their corresponding values indicating changes in protein stability, has been compiled from the literature [3341] and considered for the present study.

The present study investigates a total of 62 SMs across two proteins: S. nuclease (38 mutants) and T4 lysozyme (24 mutants), and 45 double mutations of S. nuclease using the Schrödinger (FEP+) and GROMACS packages. These codes are both widely used for studying free energy changes associated with protein-ligand binding in explicit solvent, and more recently for studying the effects of mutations on protein stability. There have been few efforts to directly compare the performance of these codes on the same benchmark systems.

This study provides the first systematic comparison of free energy change predictions obtained from the Schrödinger and GROMACS simulation platforms across a comprehensive set of single and double mutations in S. Nuclease and a set of single mutations in T4 lysozyme, for which extensive experimental stability data are available. Despite methodological distinctions and differences in the underlying potential energy functions between the two platforms, both yield results that are broadly consistent with experimental measurements and in close agreement with each other.

Methods

Dataset selection and preparation

SMs for two proteins under investigation: S. nuclease and T4 lysozyme were curated from the studies by Duan et al (2020) [42] and de Groot et al. (2021) [43]. The initial dataset comprised of 27 SMs of S. nuclease (PDB ID: 1EY0) and 26 SMs of T4 lysozyme (PDB IDs: 2LZM and 1L63), with all experimental measurements performed at pH 7 ± 1. An additional 18 SMs from the dataset associated with PDB: 1STN [43] were incorporated, augmenting the initial 27 mutations and resulting in a total of 45 single mutations analyzed for S. nuclease. After excluding three overlapping mutations, the final dataset comprised of 42 unique SMs. Among these, only 38 mutations were successfully simulated using the Schrödinger and GROMACS platforms, while the remaining four were omitted due to technical issues involving proline residues. Furthermore, a set of 45 DMs [43] for the S. nuclease was also analyzed. These DMs were simulated using only Schrödinger to estimate the free energy changes () and assess their corresponding nonadditive effects. For T4 lysozyme, although the initial dataset contained 26 SMs, two proline-related mutations were excluded due to similar technical constraints, resulting in a final set of 24 single mutations. No double mutant simulations were performed for T4 lysozyme owing to the lack of reliable experimental data for benchmarking. The experimental values were sourced from comprehensive studies by Shortle (for S. nuclease) [2729] and Matthews (for T4 lysozyme) [34,35,3741], as mentioned above, serving as benchmark for evaluating the calculated free energy changes from Schrödinger and GROMACS simulations across SM and DM datasets for these two proteins.

Before discussing the methodology for FEP calculations using Schrödinger and GROMACS, it is important to first understand the underlying thermodynamic cycle and the associated coupling parameter that are common to both platforms.

Description of the alchemical thermodynamic cycles

The alchemical thermodynamic cycles provide a conceptual framework for the transformations from the wild-type to single and double mutants, as illustrated in Fig 1 and Fig 2, respectively. The corresponding nonadditive effects associated with the DMs are also depicted in Fig 2. The transformations include both physical and alchemical (non-physical) states and since free energy is a state function, its value depends only on the initial and final states and hence independent of the pathway taken.

The objective is to compute the folding free energy change resulting from a single mutation, transitioning from the wild-type (WT) to the mutant (Mut) state. Rather than directly simulating the folding process for each state, which is computationally intensive, an alchemical approach has been employed. As illustrated in Fig 1, the physical pathways represent the folding of the WT and Mut state from their respective unfolded structure, with the associated folding free energy denoted as (path WT) and (path Mut) respectively. The alchemical pathways consist of the free energy associated with mutating WT to Mut in the folded state denoted as (path 1) and in the unfolded state denoted as (path 2). Since the free energy is a state function, the sum of the around the thermodynamic cycle is zero. Consequently, the effect of the mutation on protein stability () can be estimated using Equation 1, which relates the differences in folding free energies of the WT and mutant states via the alchemical transformations.

(1)

The unfolded state of the protein, for both the wild-type and mutant forms, is represented by a capped tripeptide, which preserves the original sequence environment by placing the residue of interest in the central position. The use of a capped tripeptide offers a more realistic and chemically accurate representation for alchemical transformations compared to an uncapped peptide, by mimicking the local backbone environment and minimizing end effects [44].

For DMs an additional thermodynamic cycle has been constructed, as shown in Fig 2. In this cycle, the transformation from the wild-type (WT) to the double mutant (Mut AB) final state can proceed via three distinct routes. In route 1, the transformation proceeds from WT to Mut A, followed by a second mutation to reach state Mut AB. In Route 2, the path goes from WT to Mut B, and then to Mut AB. Route 3 represents a direct transformation from WT to Mut AB, where both mutations A and B are introduced simultaneously. Since free energy is a state function, the total free energy change associated with the double mutation, denoted as ( in Equation 2), must be independent of the transformation path. Therefore, all three routes should theoretically yield identical values. However, in practice, Route 3 (the direct transformation) is employed because experimentally determined PDB structures for the mutated states A or B are not available

(2)

In the case of double mutations, the combined effect may deviate from the sum of the individual single mutation effects, a phenomenon referred to as nonadditivity ( in Equation 3). When 0, the two mutations are energetically coupled and therefore nonadditive. Conversely, if = 0, the mutations are additive and independent. Within the framework of alchemical FEP, the contribution from the unfolded state is assumed to be additive. Consequently, it cancels out during the calculation of nonadditivity, simplifying the analysis. As a result, the nonadditivity is evaluated solely based on the free energy differences between the single and double mutations in the folded state as formally represented in Equation 3.

(3)

Coupling parameter (λ).

In alchemical free energy simulations, the coupling parameter (λ) is a dimensionless variable used to denote a transition between two thermodynamic states commonly, state A (wild-type) and state B (mutant).18 As λ progresses from 0 to 1, the system's Hamiltonian is gradually modified in a series of steps from to . Mathematically, the interpolated Hamiltonian is represented in Equation 4.

(4)

This parameter governs the pattern of the transformation and serves as a foundational component for computing free energy differences using FEP methods.

With the theoretical framework established, the following sections provide a detailed description of the methodologies employed on each platform. First, we outline the workflow implemented in Schrödinger, followed by the methodology employed in GROMACS for FEP calculations.

FEP calculations using Schrödinger (FEP+)

The FEP+ module from the Schrödinger Release 2024−3 (FEP + , Schrödinger, LLC, New York, NY, 2024) was employed to perform FEP calculations. Initial protein structures for different proteins were retrieved from the Protein Data Bank (PDB) and prepared using Maestro's Protein Preparation Wizard. The crystal water molecules were retained, hydrogen atoms were added, and hydrogen-bonding networks were optimized. PropKa was used to predict the ionization states of residues at pH 7.0. Subsequently, the structures underwent restrained minimization of heavy atoms using the OPLS4 force field [45], and the process was terminated once the root mean square deviation (RMSD) of heavy atoms reached 0.3 Å. Each system was solvated using a cubic SPC water box (default water model), extending 5 Å from the protein in all directions. To simulate physiological conditions, counter ions Na⁺ or Cl⁻ were added to both folded and unfolded systems, followed by the addition of excess Na⁺ and Cl⁻ ions to achieve a final salt concentration of 0.15 M NaCl. All ions were randomly distributed throughout the simulation box. For unfolded models, capped tripeptides extracted from the crystal structures of the corresponding native proteins, containing the mutation site at the central position in the peptide were used [42]. The capping groups consisted of an acetyl group at the N-terminus and an N-methyl group at the C-terminus.

The protein FEP+ calculations were performed using a dual topology algorithm, in which multiple MD simulations were performed to transform the system from wild-type (WT) to the mutated state. During this alchemical transformation, the electrostatic and van der Waals interactions of the WT side chain gradually turned off, while those of the mutant side chain were simultaneously turned on. The mutated residue was included in the Replica Exchange Solute Tempering (REST2) region [46,47], which effectively raises the temperature for enhanced sampling of that residue. The FEP + /REST2 framework combines solute tempering with replica exchange to improve sampling of sidechain conformational flexibility near the mutation site, typically within 5 Å. The solvated systems were then relaxed and equilibrated using the default protocol of the Desmond Molecular Dynamics System (D. E. Shaw Research, New York, NY) [48] as implemented in Maestro. This protocol involves a series of energy minimizations and short MD simulations with restraints. Each perturbation was simulated over 16–24 λ windows (where λ denotes the alchemical intermediate states or coupling factor as explained above), depending on the mutation type. The charge conserving mutations were simulated using 16 λ windows while charge altering mutations used 24 λ windows. The λ schedule followed the default settings in Schrödinger FEP+. Electrostatic and van der Waals transformations were integrated across the same λ schedule. Each λ-window was simulated for 10 ns which was extended up to 100 ns to ensure convergence for challenging cases. For charge changing mutations, a co-alchemical water approach was employed to mitigate long range electrostatic artifacts. The electrostatic interactions were handled using the Particle Mesh Ewald (PME) approach [49,50] with a real-space cutoff of 1.0 nm. Van der Waals interactions were also gradually shifted to zero at this same cutoff distance. Final production simulations were conducted with the default ensemble using the Desmond engine and free energy differences between alchemical states were computed using Multistate Bennett Acceptance Ratio (MBAR) [51] in Schrödinger. Predicted FEP uncertainties were calculated using the cycle closure method [46], as implemented in Schrödinger’s Protein FEP + .

FEP calculations using GROMACS

The alchemical FEP calculations for both protein systems were also performed using GROMACS 2022.6 [52,53] which included 38 SMs from S. nuclease and 23 SMs from T4 lysozyme. The dual topology and the hybrid coordinate files for the mutated protein were generated using the pmx server [54]. Similarly, the tripeptide structures, extracted from the full length protein using Maestro, were processed by pmx server to obtain their corresponding dual topology representations and hybrid tripeptides. The AMBER99sb*ILDN force field [5557] has been applied for these transformation in both the cases as this force field has been widely adopted for FEP calculations, as supported by several studies [15,58]. The crystal waters from the protein structure were retained during topology processing of the hybrid protein. Each hybrid system (protein and tripeptide) was solvated in a TIP3P [59] water box with a 10 Å buffer. The counterions Na⁺ or Cl⁻ were added to both the systems to neutralize the charge and achieve the final salt concentration of 0.15 M, consistent with standard FEP simulation practices. The retention of crystallographic water and the use of 0.15 M salt concentration align with established FEP protocols in both Schrӧdinger and GROMACS frameworks.

Then the hybrid protein and tripeptide systems underwent energy minimization using the steepest descent algorithm across 27 λ windows (from λ = 0 to λ = 26). The λ schedule varied as 0.00 0.0125 0.025 0.0375 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 0.9625 0.975 0.9875 1.00 from 0 to 26 λ windows correspondingly. The 27 λ windows were partitioned into electrostatic decoupling (10 λ windows) and van der Waals decoupling (17 λ windows), with soft-core potentials applied to enable smooth interpolation of non-bonded interactions. This was followed by an additional minimization using the l-bfgs integrator for further relaxation. Subsequently, the systems were equilibrated under the NVT ensemble for 10 ps using the velocity-rescaling thermostat [60]. This step was followed by NPT equilibration ensemble, performed sequentially using the Berendsen and Parrinello-Rahman barostats [61] each for 30 ps. The electrostatic interactions were treated using the Particle Mesh Ewald (PME) method [49,50] and a real space cutoff of 1.0 nm was applied. The van der Waals interactions were gradually shifted to zero at the same cutoff distance (1.0 nm). All bonds involving hydrogen atoms were constrained using the LINCS [50] algorithm (order = 6). Following equilibration, replicas were simulated for 50 ns with NPT ensemble for both the hybrid protein and the tripeptide systems. The free energy differences between the wild-type and the mutant forms were computed using the Bennett Acceptance Ratio (BAR) [62,63] method, implemented via the gmx_bar module in GROMACS. Alchemical free energy calculations were also analyzed using the pymbar package based on ∂H/∂λ data obtained from GROMACS simulations. The robustness of the free energy estimates by MBAR was verified through two primary diagnostics: 1) To ensure reliable reweighting of configurations across states, ∂H/∂λ overlap matrices were generated for representative mutations. These matrices (S5 and S6 Figs) confirm appreciable overlap between adjacent λ-windows for both the S. nuclease (e.g., T41C, T41V) and T4 lysozyme (e.g., T59G, G77A) systems. 2) Convergence was assessed by monitoring the change of ΔG as a function of simulation time. Representative convergence plots (S7 and S8 Figs) for mutants such as L25I and T33V (S. nuclease) and S44A and D47A (T4 lysozyme) demonstrate that the free energy differences reach a stable plateau. This behavior indicates that convergence was achieved over the simulation time for both the tripeptide reference and the full protein. Predicted uncertainties were estimated by partitioning each simulation trajectory into three equal segments and calculating the standard deviation of the ΔΔG values across these sub-blocks.

In summary, to assess the impact of mutations on protein free energy landscapes, we performed a comparative analysis of single and double mutations FEP protocols implemented in GROMACS and Schrödinger’s FEP+ framework. Although both platforms utilize alchemical transformation methods, they differ markedly in system setup, topology handling, and sampling strategies. This study aims to evaluate the consistency and reliability of predicted values across these two distinct, widely used, computational platforms.

Results and discussion

Experimental determination of mutation-induced changes in protein thermostability,  typically using chemical or thermal denaturation techniques, remains the gold standard for validation. However, these methods are often time-consuming, resource-intensive, and low-throughput, making them more difficult for large-scale mutational screening. Moreover, while such methods yield valuable thermodynamic data, they offer limited insight into the atomic level interactions underlying protein stability changes. In contrast, computational methods such as FEP, which leverage all-atom MD simulations, potentially provide a faster, scalable, and cost-effective alternative provided they have sufficient accuracy. These methods not only enable high-throughput in silico mutagenesis but also deliver atomistic insights into stability mechanisms, allowing simulations of various protein conformational states to better understand folding landscapes and thermostability. Recent advances in high performance computing including GPU acceleration as well as improvements in force fields and simulation algorithms, have enhanced the predictive accuracy of these computational approaches.

In this work, we employed two widely used MD platforms: Schrödinger and GROMACS, for FEP simulations, each offering distinct advantages and limitations. GROMACS is an open-source, freely available MD engine noted for its flexibility, extensive customizability, and support for a broad range of force fields including AMBER, CHARMM, OPLS and GROMOS. However, its implementation for FEP workflows requires manual setup, command-line proficiency, and greater time investment, making it less accessible to non-expert users. In contrast, Schrödinger FEP + is a commercial, closed-source platform that supports only the modified OPLS force field. While it offers less flexibility, its user-friendly graphical interface, built-in FEP workflow as FEP+ module, and enhanced sampling methods such as REST (Replica Exchange with Solute Tempering) streamline the simulation process and significantly reduce setup complexity and computational overhead. Despite the widespread use of both platforms for FEP-based protein stability predictions, a systematic comparison of their performance using experimental data as a benchmark has been lacking. In this work, we address this gap by directly evaluating and comparing free energy changes for single and double mutations using Schrödinger and GROMACS against experimental thermostability data, thereby providing insights into their relative accuracy, usability, and computational efficiency.

The free energy changes () for 38 single mutations of S. nuclease were predicted using the two MD platforms: Schrödinger and GROMACS. These results, along with the values calculated via MBAR from the GROMACS data, are provided in Supporting Information (S1 Table). Correlation plots comparing experimentally determined free energy changes with predictions calculated using Schrӧdinger and GROMACS are shown in Fig 3A and 3B respectively. Both platforms demonstrated state-of-the-art accuracy in reproducing experimental measurements, with Pearson correlation coefficients of 0.86 for Schrödinger (Fig 3A) and 0.87 for GROMACS (Fig 3B) between calculated and experimental values, indicating strong linear agreement. Additionally, the mean absolute difference also referred to as the average unsigned error (AUE), was further calculated to evaluate predictive accuracy. The AUE values were found to be 0.99 kcal/mol for Schrödinger and 0.61 kcal/mol for GROMACS, both within the generally accepted accuracy threshold of ~1 kcal/mol for reliable free energy predictions. Notably, these AUE values are comparable to those reported for FEP+ protocol in predicting small molecule protein relative binding affinities [64,65], supporting the robustness of our results. Benchmarking results revealed strong agreement between computational predictions and experimental values. Specifically, the RMSE was 0.13 kcal/mol for GROMACS (via BAR) and 0.24 kcal/mol for Schrödinger. Additionally, a secondary check using MBAR on the GROMACS data resulted in an RMSE of 0.13 kcal/mol. Correlation was further evaluated using Kendall’s tau, which yielded values of 0.54 (GROMACS/BAR), 0.62 (Schrödinger), and 0.50 (GROMACS/MBAR). These statistical indicators, provided in S1 Table, highlight the high predictive accuracy of both platforms. The strong Pearson r of 0.87 between the previously reported [42] and our calculated folding free energy changes () highlights the reliability and robustness of these calculations, as shown in S1 Fig and S1 Table. The high correlation and low AUE observed in this study are consistent with previously reported FEP studies [42,66], where correlations between computed and experimental values ranged from 0.64 to 0.82, further validating the reliability of both platforms. Overall, these findings establish both Schrödinger and GROMACS as accurate and reliable tools for predicting protein thermostability changes induced by single mutations using FEP-based approaches.

thumbnail
Fig 3. Correlation plots of the free energy changes for 38 SMs for S. nuclease between the experimental values and calculated ones from (A) Schrödinger and (B) GROMACS.

Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S1 Table.

https://doi.org/10.1371/journal.pone.0335829.g003

The analysis was extended to evaluate folding free energy change for 45 DMs of S. nuclease represented as The values calculated using Schrödinger and previously reported values using GROMACS [43] alongside experimental measurements, which continue to serve as the benchmark, were summarized in S2 Table. The calculated using Schrödinger yielded a Pearson correlation coefficient of 0.74 when compared with the experimental data (Fig 4A). A similarly strong correlation (r = 0.71) was observed between the previously reported GROMACS values and experimental results, as shown in Fig 4B. The AUE was found to be 0.79 kcal/mol for Schrödinger and 0.76 kcal/mol for GROMACS, both within the generally accepted threshold of 1 kcal/mol. Moreover, we compared previously reported GROMACS values with those calculated using Schrödinger, yielding a Pearson correlation of 0.71 (S2 Fig). Comparable accuracy and low AUE values demonstrate that both Schrödinger (OPLS4 force field) and GROMACS (AMBER99sb*ILDN force field) gave consistent and reliable results for double mutations in the S. nuclease protein. The results demonstrate high predictive accuracy, reflected in low RMSE values of 0.13 kcal/mol for both GROMACS and Schrödinger when compared to experimental data. This strong agreement is further supported by Kendall’s tau (τ) values of 0.52 for GROMACS and 0.57 for Schrödinger. These metrics, provided in S2 Table, validate the reliability of both software platforms for benchmarking free energy calculations.

thumbnail
Fig 4. Correlation plots of the free energy changes for S. nuclease’s 45 DMs between the experimental values and calculated ones with (A) Schrödinger and (B) GROMACS.

Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S2 Table.

https://doi.org/10.1371/journal.pone.0335829.g004

In the case of DMs, the free energy changes can deviate significantly from the sum of the individual single mutations, a phenomenon known as nonadditivity (). The corresponding nonadditivities for the 45 DMs have been calculated employing the FEP+ protocol in Schrӧdinger. For comparison, previously reported free energy changes calculated using GROMACS [43] were included. Experimental values continue to serve as the benchmark for this study, consistent with the analyses of single and double mutations. Schrödinger predictions of nonadditivity yielded a Pearson r of 0.79 with experimental data (Fig 5A), whereas the previously reported GROMACS values showed a correlation of 0.63 (Fig 5B). The AUE was found to be 0.31 kcal/mol for Schrödinger and 0.35 kcal/mol for GROMACS relative to experimental values. Additionally, a Pearson r of 0.61 was observed for the nonadditivity values ) of 45 DMs of S. nuclease between previously reported GROMACS predictions and our calculated Schrödinger values, as illustrated in S3 Fig. All these nonadditivity values obtained from experimental measurements, GROMACS and Schrӧdinger calculations were tabulated in S3 Table. The calculated RMSE values were notably low, at 0.07 kcal/mol for GROMACS and 0.06 kcal/mol for Schrödinger relative to experimental data, indicating high predictive accuracy. As an additional metric, Kendall’s tau (τ) was found to be moderate, 0.48 for GROMACS and 0.44 for Schrödinger. These results, which support the benchmarking of both free energy calculation platforms, are summarized in S3 Table.

thumbnail
Fig 5. Correlation plots for the corresponding nonadditivity for S. nuclease’s 45 DMs between experimental values and calculated ones with (A) Schrödinger and (B) GROMACS.

Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S3 Table.

https://doi.org/10.1371/journal.pone.0335829.g005

We have extended our analysis further to another widely studied protein system, T4 lysozyme in order to gain more confidence in thermostability predictions using computational approaches. The dataset consists of 24 single mutations for T4 lysozyme protein, for which free energy changes () were calculated using both the FEP+ module of Schrödinger and the GROMACS. However, DMs were not included for the T4 lysozyme system, as experimental values were not available.

To assess the agreement between calculated and experimental values, correlation analyses were performed. Fig 6A shows the correlation between experimental values and those calculated using Schrödinger (Pearson r = 0.85), while Fig 6B presents the correlation between experimental values and GROMACS predictions (Pearson r = 0.80). The corresponding AUEs were 0.63 kcal/mol for Schrödinger and 0.55 kcal/mol for GROMACS. A direct comparison of computed values with Schrödinger and GROMACS yielded a Pearson r of 0.81 (Fig 6C). Furthermore, a Pearson r of 0.96 was observed between previously reported⁴² Schrödinger results and the values calculated using Schrödinger in this study (S4 Fig). All free energy changes: experimental, calculated via GROMACS and Schrödinger, and previously reported Schrödinger values were summarized in S4 Table. Collectively, these results demonstrate the effectiveness in modeling free energy changes arising from single mutations in T4 lysozyme [42] by GROMACS and Schrödinger. Additional performance metrics, including RMSE and Kendall’s tau, are provided in the sub-tables of S4 Table. The low RMSE values-0.13 kcal/mol for GROMACS (BAR), 0.16 kcal/mol for Schrödinger, and 0.13 kcal/mol for GROMACS (MBAR) demonstrate strong agreement with experimental data and high predictive accuracy. Similarly, Kendall’s tau values of 0.61, 0.68, and 0.61, respectively, further validate the benchmarking of these two software platforms for free energy calculations.

thumbnail
Fig 6. Correlation plots of the free energy changes for T4 lysozyme 24 SMs between the experimental and calculated values with (A) Schrödinger and (B) GROMACS and (C) Schrödinger vs GROMACS.

Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S4 Table.

https://doi.org/10.1371/journal.pone.0335829.g006

To investigate potential correlations, we analyzed prediction errors as a function of mutation type and location (S5-S6 Tables). Prediction accuracy was highest for mutations involving no change in side-chain size, with errors increasing significantly as side-chain bulk increased. Additionally, charged mutations exhibited larger errors than neutral ones. Location-wise, surface mutations generally yielded higher errors than buried residues.

Conclusion

FEP methods are increasingly employed to estimate changes in protein thermostability caused by mutations. In this study, we compared two widely used MD platforms: GROMACS and Schrödinger’s FEP + , to perform FEP-based calculations for SMs of S. nuclease and T4 lysozyme proteins. Additionally, FEP-based predictions were extended to DMs of S. nuclease, including the assessment of their corresponding nonadditivity effects. Our results showed strong correlation between experimental and computed free energy changes ( for 62 SMs in two different proteins using GROMACS and Schrödinger. Schrödinger accurately captured the free energy changes and their corresponding nonadditivities () for 45 DMs of S. nuclease protein, showing good correlation with experimental values. A strong Pearson correlation for free energy changes and nonadditivity for 45 DMs of S. nuclease, has also been observed between the previously reported GROMACS values and those calculated in this study using Schrödinger. The findings demonstrate the robustness of both Schrödinger and GROMACS in modeling single and double mutations, accurately predicting their free energy changes and nonadditivity. The strong correlation between experimental and computed free energy changes further highlights the consistency and accuracy of these platforms in capturing the thermodynamic effects of mutations.

The choice between two platforms depends on factors such as the complexity of the system under study, the cost of the software involved, preference towards a specific force field or enhanced sampling technique, and user familiarity. While FEP is not a substitute for experimental validation, it offers a rapid, scalable, and mechanistically informative approach to mutation analysis in proteins. Importantly, FEP enables atomic level insights into the effects of mutations, which are not directly accessible through experimental denaturation assays alone. Therefore, integrating FEP-based computational predictions (either by GROMACS or Schrӧdinger) with experimental measurements presents a powerful and complementary strategy for protein engineering and thermostability optimization.

Supporting information

S1 Table. Folding free energy changes in kcal/mol for 38 SMs of the S. nuclease protein.

The experimental values and previously reported values using Schrödinger or GROMACS alongside values calculated in this study using both platforms are provided for comparison.

https://doi.org/10.1371/journal.pone.0335829.s001

(PDF)

S1 Fig. Correlation plot of free energy changes () for 38 SMs in the S. nuclease protein, comparing previously reported values using Schrödinger or GROMACS with values calculated in this study using Schrödinger.

Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated.

https://doi.org/10.1371/journal.pone.0335829.s002

(PDF)

S2 Table. Folding free energy changes in kcal/mol for 45 DMs of the S. nuclease protein.

The experimental values and previously reported values using GROMACS alongside values calculated in this study using Schrödinger are shown for comparison.

https://doi.org/10.1371/journal.pone.0335829.s003

(PDF)

S2 Fig. Correlation plot of free energy changes () for 45 DMs in the S. nuclease protein, comparing previously reported values from GROMACS with values calculated in this study using Schrödinger.

Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.

https://doi.org/10.1371/journal.pone.0335829.s004

(PDF)

S3 Table. Nonadditivity values () in kcal/mol for 45 DMs of the S. nuclease protein.

The experimental data and previously reported GROMACS values are presented alongside values calculated in this study using Schrödinger for comparative analysis.

https://doi.org/10.1371/journal.pone.0335829.s005

(PDF)

S3 Fig. Correlation plot of nonadditivity () for 45 DMs in the S. nuclease protein, comparing previously reported values from GROMACS with values calculated in this study using Schrödinger.

Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.

https://doi.org/10.1371/journal.pone.0335829.s006

(PDF)

S4 Table. Folding free energy changes () in kcal/mol for 24 SMs of the T4 lysozyme protein.

The experimental values and previously reported values using Schrödinger alongside values calculated in this study using GROMACS and Schrӧdinger are provided for comparison.

https://doi.org/10.1371/journal.pone.0335829.s007

(PDF)

S4 Fig. Correlation plot of free energy changes () for 24 SMs in the T4 lysozyme protein, comparing previously reported values with calculated values from Schrödinger.

Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.

https://doi.org/10.1371/journal.pone.0335829.s008

(PDF)

S5 Fig. Overlap matrix plots for representative S. nuclease mutations (T41C and T41V) with all first off-diagonal entries well above 0.03, the suggested threshold.

Each of the four Figs consists of two panels: the upper panel displays the overlap for the tripeptide, while the lower panel displays the overlap for the full protein system.

https://doi.org/10.1371/journal.pone.0335829.s009

(PDF)

S6 Fig. Overlap matrix plots for representative T4 lysozyme mutants T59G and G77A with all first off-diagonal entries well above 0.03, the suggested threshold.

Each of the four Figs consists of two panels: the upper panel displays the overlap for the tripeptide, while the lower panel displays the overlap for the full protein system.

https://doi.org/10.1371/journal.pone.0335829.s010

(PDF)

S7 Fig. Free energy convergence for representative S. nuclease mutants L25I and T33V.

The plots show ΔΔG versus simulation time across four Figs. In each case, the top panel corresponds to the tripeptide simulation (unfolded state proxy) and the bottom panel corresponds to the full protein simulation (folded state).

https://doi.org/10.1371/journal.pone.0335829.s011

(PDF)

S8 Fig. Free energy convergence for representative T4 lysozyme mutants S44A and D47A.

The plots show ΔΔG versus simulation time across four Figs. In each case, the top panel corresponds to the tripeptide simulation (unfolded state proxy) and the bottom panel corresponds to the full protein simulation (folded state).

https://doi.org/10.1371/journal.pone.0335829.s012

(PDF)

S5 Table. Calculated free energy changes (kcal/mol) for 38SMs of S. nuclease using Schrödinger and GROMACS, compared against experimental values.

The mutations are categorized by charge, size, and location within the protein structure.

https://doi.org/10.1371/journal.pone.0335829.s013

(PDF)

S6 Table. Calculated free energy changes (kcal/mol) for 24 SMs of T4 lysozyme using Schrödinger and GROMACS, compared against experimental values.

The mutations are categorized by charge, size, and location within the protein structure.

https://doi.org/10.1371/journal.pone.0335829.s014

(PDF)

References

  1. 1. Soskine M, Tawfik DS. Mutational effects and the evolution of new protein functions. Nat Rev Genet. 2010;11(8):572–82. pmid:20634811
  2. 2. Nisthal A, Wang CY, Ary ML, Mayo SL. Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis. Proc Natl Acad Sci U S A. 2019;116(33):16367–77. pmid:31371509
  3. 3. Mannige R. Dynamic new world: refining our view of protein structure, function and evolution. Proteomes. 2014;2:128–53.
  4. 4. Dill KA, MacCallum JL. The protein-folding problem, 50 years on. Science. 2012;338(6110):1042–6. pmid:23180855
  5. 5. Pack SP, Yoo YJ. Protein thermostability: structure-based difference of amino acid between thermophilic and mesophilic proteins. J Biotechnol. 2004;111(3):269–77. pmid:15246663
  6. 6. Duan J, Nilsson L, Lambert B. Structural and functional analysis of mutations at the human hypoxanthine phosphoribosyl transferase (HPRT1) locus. Hum Mutat. 2004;23(6):599–611. pmid:15146465
  7. 7. Stefl S, Nishi H, Petukh M, Panchenko AR, Alexov E. Molecular mechanisms of disease-causing missense mutations. J Mol Biol. 2013;425(21):3919–36. pmid:23871686
  8. 8. Petukh M, Kucukkal TG, Alexov E. On human disease-causing amino acid variants: statistical study of sequence and structural patterns. Hum Mutat. 2015;36(5):524–34. pmid:25689729
  9. 9. Ponnuswamy PK, Muthusamy R, Manavalan P. Amino acid composition and thermal stability of proteins. Int J Biol Macromol. 1982;4:186–90.
  10. 10. Rigoldi F, Donini S, Redaelli A, Parisini E, Gautieri A. Review: Engineering of thermostable enzymes for industrial applications. APL Bioeng. 2018;2(1):011501. pmid:31069285
  11. 11. McConnell AD, Zhang X, Macomber JL, Chau B, Sheffer JC, Rahmanian S, et al. A general approach to antibody thermostabilization. MAbs. 2014;6(5):1274–82. pmid:25517312
  12. 12. Scarabelli G, Oloo EO, Maier JKX, Rodriguez-Granillo A. Accurate Prediction of Protein Thermodynamic Stability Changes upon Residue Mutation using Free Energy Perturbation. J Mol Biol. 2022;434(2):167375. pmid:34826524
  13. 13. Wang C, Zhao T, Li Y, Huang G, White MA, Gao J. Investigation of endosome and lysosome biology by ultra pH-sensitive nanoprobes. Adv Drug Deliv Rev. 2017;113:87–96. pmid:27612550
  14. 14. Srinivasan J, Miller J, Kollman PA, Case DA. Continuum solvent studies of the stability of RNA hairpin loops and helices. J Biomol Struct Dyn. 1998;16(3):671–82. pmid:10052623
  15. 15. Gapsys V, Michielssens S, Seeliger D, de Groot BL. Accurate and Rigorous Prediction of the Changes in Protein Free Energies in a Large-Scale Mutation Scan. Angew Chem Int Ed Engl. 2016;55(26):7364–8. pmid:27122231
  16. 16. Dang LX, Merz KM, Kollman PA. Free energy calculations on protein stability: Thr-157.fwdarw. Val-157 mutation of T4 lysozyme. J Am Chem Soc. 1989;111:8505–8.
  17. 17. Jespers W, et al. QresFEP: An automated protocol for free energy calculations of protein mutations. J Chem Theory Comput. 2019;15:5461–73.
  18. 18. Jorgensen WL, Ravimohan C. Monte Carlo simulation of differences in free energies of hydration. J Chem Phys. 1985;83:3050–4.
  19. 19. Beveridge DL, DiCapua FM. Free energy via molecular simulation: applications to chemical and biomolecular systems. Annu Rev Biophys Biophys Chem. 1989;18:431–92.
  20. 20. Pohorille A, Jarzynski C, Chipot C. Good practices in free-energy calculations. J Phys Chem B. 2010;114(32):10235–53. pmid:20701361
  21. 21. Zwanzig RW. High-Temperature Equation of State by a Perturbation Method. I. Nonpolar Gases. J Chem Phys. 1954;22:1420–6.
  22. 22. Flanagan JM, Kataoka M, Shortle D, Engelman DM. Truncated staphylococcal nuclease is compact but disordered. Proc Natl Acad Sci U S A. 1992;89(2):748–52. pmid:1731350
  23. 23. Shortle D. Staphylococcal nuclease: a showcase of m-value effects. Adv Protein Chem. 1995;46:217–47. pmid:7771319
  24. 24. Tucker PW, Hazen EE, Cotton FA. Staphylococcal nuclease reviewed: A prototypic study in contemporary enzymology. Mol Cell Biochem. 1979;23:131–41.
  25. 25. Anfinsen CB. Principles that govern the folding of protein chains. Science. 1973;181(4096):223–30. pmid:4124164
  26. 26. Jarde fzky O, Wade- Jarde fzky NG. Nuclear magnetic resonance as a structural method.
  27. 27. Shortle D, Stites WE, Meeker AK. Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. Biochemistry. 1990;29(35):8033–41. pmid:2261461
  28. 28. Green SM, Meeker AK, Shortle D. Contributions of the polar, uncharged amino acids to the stability of staphylococcal nuclease: evidence for mutational effects on the free energy of the denatured state. Biochemistry. 1992;31(25):5717–28. pmid:1610820
  29. 29. Green SM, Shortle D. Patterns of nonadditivity between pairs of stability mutations in staphylococcal nuclease. Biochemistry. 1993;32.
  30. 30. Alber, T. & Matthews, BW. Structure and thermal stability of phage T4 lysozyme. in Methods in Enzymology. 1987;154:511–33.
  31. 31. Schellman JA, Lindorfer M, Hawkes R, Grutter M. Mutations and protein stability. Biopolymers. 1981;20:1989–99.
  32. 32. Becktel WJ, Baase WA. Thermal denaturation of bacteriophage T4 lysozyme at neutral pH. Biopolymers. 1987;26(5):619–23. pmid:3297180
  33. 33. Cornish VW, Kaplan MI, Veenstra DL, Kollman PA, Schultz PG. Stabilizing and destabilizing effects of placing beta-branched amino acids in protein alpha-helices. Biochemistry. 1994;33(40):12022–31. pmid:7918421
  34. 34. Nicholson H, Tronrud DE, Becktel WJ, Matthews BW. Analysis of the effectiveness of proline substitutions and glycine replacements in increasing the stability of phage T4 lysozyme. Biopolymers. 1992;32(11):1431–41. pmid:1457724
  35. 35. Matsumura M, Becktel WJ, Matthews BW. Hydrophobic stabilization in T4 lysozyme determined directly by multiple substitutions of Ile 3. Nature. 1988;334(6181):406–10. pmid:3405287
  36. 36. Heinz DW, et al. Accommodation of amino acid insertions in an α-helix of T4 lysozyme. J Mol Biol. 1994;236:869–86.
  37. 37. Sun DP, Sauer U, Nicholson H, Matthews BW. Contributions of engineered surface salt bridges to the stability of T4 lysozyme determined by directed mutagenesis. Biochemistry. 1991;30(29):7142–53. pmid:1854726
  38. 38. Bell JA, Becktel WJ, Sauer U, Baase WA, Matthews BW. Dissection of Helix Capping in T4 Lysozyme by Structural and Thermodynamic Analysis of Six Amino Acid Substitutions at Thr 59. Biochemistry. 1992;31:3590–6.
  39. 39. Matthews BW, Nicholson H, Becktel WJ. Enhanced protein thermostability from site-directed mutations that decrease the entropy of unfolding. Proc Natl Acad Sci U S A. 1987;84(19):6663–7. pmid:3477797
  40. 40. Heinz DW, Baase WA, Matthews BW. Folding and function of a T4 lysozyme containing 10 consecutive alanines illustrate the redundancy of information in an amino acid sequence. Proc Natl Acad Sci U S A. 1992;89(9):3751–5. pmid:1570293
  41. 41. Nicholson H, Anderson DE, Dao-pin S, Matthews BW. Analysis of the interaction between charged side chains and the alpha-helix dipole using designed thermostable mutants of phage T4 lysozyme. Biochemistry. 1991;30(41):9816–28. pmid:1911773
  42. 42. Duan J, Lupyan D, Wang L. Improving the Accuracy of Protein Thermostability Predictions for Single Point Mutations. Biophys J. 2020;119(1):115–27. pmid:32533939
  43. 43. Werner M, Gapsys V, de Groot BL. One plus one makes three: triangular coupling of correlated amino acid mutations. J Phys Chem Lett. 2021;12:3195–201.
  44. 44. Markthaler D, Kraus H, Hansen N. Overcoming Convergence Issues in Free-Energy Calculations of Amide-to-Ester Mutations in the Pin1-WW Domain. J Chem Inf Model. 2018;58(11):2305–18. pmid:30339754
  45. 45. Lu C, et al. OPLS4: Improving force field accuracy on challenging regimes of chemical space. J Chem Theory Comput. 2021;17:4291–300.
  46. 46. Wang L, Deng Y, Knight JL, Wu Y, Kim B, Sherman W, et al. Modeling Local Structural Rearrangements Using FEP/REST: Application to Relative Binding Affinity Predictions of CDK2 Inhibitors. J Chem Theory Comput. 2013;9(2):1282–93. pmid:26588769
  47. 47. Wang L, Berne BJ, Friesner RA. On achieving high accuracy and reliability in the calculation of relative protein-ligand binding affinities. Proc Natl Acad Sci U S A. 2012;109(6):1937–42. pmid:22308365
  48. 48. Bowers KJ, Chow DE, Xu H, Dror RO, Eastwood MP, Gregersen BA, et al. Scalable Algorithms for Molecular Dynamics Simulations on Commodity Clusters. In: ACM/IEEE SC 2006 Conference (SC’06), 2006. 43–43.
  49. 49. Darden T, York D, Pedersen L. Particle mesh Ewald: An N ⋅log(N) method for Ewald sums in large systems. J Chem Phys. 1993;98:10089–92.
  50. 50. Essmann U, et al. A smooth particle mesh Ewald method. J Chem Phys. 1995;103:8577–93.
  51. 51. Shirts MR, Chodera JD. Statistically optimal analysis of samples from multiple equilibrium states. J Chem Phys. 2008;129(12):124105. pmid:19045004
  52. 52. Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC. GROMACS: fast, flexible, and free. J Comput Chem. 2005;26(16):1701–18. pmid:16211538
  53. 53. Kohnke B, Kutzner C, Grubmüller H. A GPU-Accelerated Fast Multipole Method for GROMACS: Performance and Accuracy. J Chem Theory Comput. 2020;16(11):6938–49. pmid:33084336
  54. 54. Gapsys V, de Groot BL. pmx Webserver: A User Friendly Interface for Alchemistry. J Chem Inf Model. 2017;57(2):109–14. pmid:28181802
  55. 55. Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, et al. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins. 2010;78(8):1950–8. pmid:20408171
  56. 56. Best RB, Hummer G. Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. J Phys Chem B. 2009;113(26):9004–15. pmid:19514729
  57. 57. Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins. 2006;65(3):712–25. pmid:16981200
  58. 58. Aldeghi M, Gapsys V, de Groot BL. Accurate Estimation of Ligand Binding Affinity Changes upon Protein Mutation. ACS Cent Sci. 2018;4(12):1708–18. pmid:30648154
  59. 59. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926–35.
  60. 60. Bussi G, Donadio D, Parrinello M. Canonical sampling through velocity rescaling. J Chem Phys. 2007;126(1):014101. pmid:17212484
  61. 61. Parrinello M, Rahman A. Polymorphic transitions in single crystals: A new molecular dynamics method. J Appl Phys. 1981;52:7182–90.
  62. 62. Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comput Phys. 1976;22:245–68.
  63. 63. Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamic integration. J Chem Phys. 2005;122(14):144107. pmid:15847516
  64. 64. Abel R, Wang L, Harder ED, Berne BJ, Friesner RA. Advancing Drug Discovery through Enhanced Free Energy Calculations. Acc Chem Res. 2017;50(7):1625–32. pmid:28677954
  65. 65. Abel R, Wang L, Mobley DL, Friesner RA. A Critical Review of Validation, Blind Testing, and Real- World Use of Alchemical Protein-Ligand Binding Free Energy Calculations. Curr Top Med Chem. 2017;17(23):2577–85. pmid:28413950
  66. 66. Steinbrecher T, Zhu C, Wang L, Abel R, Negron C, Pearlman D, et al. Predicting the Effect of Amino Acid Single-Point Mutations on Protein Stability-Large-Scale Validation of MD-Based Relative Free Energy Calculations. J Mol Biol. 2017;429(7):948–63. pmid:27964946