Figures
Abstract
Mutation analysis of single and double mutations of proteins including the two widely studied proteins: Staphylococcus nuclease (S. nuclease) and T4 lysozyme, provides critical insights into their stability and fitness. Furthermore, such analysis facilitates a deeper understanding of the complex interplay between a protein's sequence, structural conformation, and functional output. Free Energy Perturbation (FEP) is an alchemical, all-atom molecular dynamics based computational approach that determines the free energy change ( from wild type to mutant states. Two widely adopted software platforms used for this purpose are Schrödinger and GROMACS. We have compared the results of FEP simulations for mutations using these platforms, employing the OPLS4 force field implemented in Schrödinger, and the Amber99SB-ILDN force field implemented in GROMACS for this work. For the 38 single mutants of S. nuclease, the Pearson r between the experimental and the calculated free energy change
was 0.86 using Schrödinger and 0.87 using GROMACS. The reliability of the FEP method using the two software platforms with the specified force fields was further demonstrated by its performance on 24 single mutants of T4 lysozyme, yielding strong correlation between predicted and experimental
values, with Pearson r of 0.80 for Schrödinger and 0.85 for GROMACS. Additionally, the computed folding free energy changes for 45 double mutations in S. nuclease
using Schrödinger correlated well with both experimental measurements (Pearson r = 0.74) and previously reported GROMACS values (Pearson r = 0.71). Correspondingly, the nonadditivities
of the double mutations derived using Schrödinger for these 45 double mutants of S. nuclease were also found to be in good agreement with the experimental values (Pearson r = 0.79), as well as with the previously reported GROMACS results (Pearson r = 0.61). A good correlation was also observed between computed values from Schrödinger and GROMACS, with a Pearson r of 0.71 for double mutants and 0.61 for their nonadditivities. Collectively, these findings establish the efficiency, accuracy, and reliability of Schrödinger FEP+ and GROMACS in predicting mutation-induced free energy changes in S. nuclease and T4 lysozyme. The integration of FEP based computational methodologies with experimental validation provides a framework for quantifying mutation-induced changes in protein thermostability, which can be used as a tool for protein engineering and design.
Citation: Gupta S, Sun Q, Levy RM (2026) Benchmarking free energy calculations: Analysis of single and double mutations across two simulation software platforms for two protein systems. PLoS One 21(4): e0335829. https://doi.org/10.1371/journal.pone.0335829
Editor: Yong Wang, Zhejiang University College of Life Sciences, CHINA
Received: October 16, 2025; Accepted: March 7, 2026; Published: April 3, 2026
Copyright: © 2026 Gupta et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting information files.
Funding: This study was supported by the National Institutes of Health (NIH) Grant R35 GM132090. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors declare no competing interests.
Introduction
Mutational analysis serves as a powerful tool for elucidating the complex interdependencies among protein sequence, structure, and function [1]. By systematically assessing the impact of specific amino acid substitutions on protein thermostability [2] and consequently on functional activity, researchers can uncover fundamental principles governing protein folding, molecular interactions, and evolutionary fitness [3,4]. This approach provides critical insights into the intrinsic trade-offs between stability and function, thereby enabling rational protein engineering and aiding the interpretation of pathogenic or disease associated variants.
The impact of mutations on protein thermostability has been extensively studied across diverse protein systems [5–9]. A protein's ability to remain properly folded and functionally active at elevated temperatures is governed by its thermostability, making it a critical parameter for understanding degradation, aggregation, and denaturation over time. Thermostable proteins are of significant industrial relevance due to their broad-spectrum applications. Their pivotal roles as biocatalysts in chemical synthesis, components in biofuel production, agents in food processing, and additives in detergents are inherently dependent on their structural and functional stability under thermal stress [10]. In the pharmaceutical industry, thermostable therapeutics such as monoclonal antibodies, enzymes, and cytokines exhibit enhanced stability during manufacturing, storage, and transportation, making them more suitable for clinical and commercial applications [11].
Accurate prediction of protein thermostability through computational approaches provides a potentially efficient complement to experimental techniques [12]. In this context, various physics based approaches such as Molecular Mechanics Generalized Born Surface Area (MM/GBSA) [13,14] and Free Energy Perturbation [15–17] methods have gained attention. Unlike data driven machine learning models, which may be constrained by the quality, bias, or representativeness of their training datasets, these methods are founded on fundamental physical principles and provide predictive insights that are independent of empirical training data.
Free energy perturbation (FEP) [18–21] is an alchemical free energy simulation method, firmly rooted in statistical thermodynamics that can be used to estimate free energy differences, provided sufficiently long simulation times, together with enhanced sampling techniques and an accurate potential energy function are employed. It is an explicit solvent, molecular dynamics (MD) based method, offering a framework for evaluating the thermodynamic impact of amino acid substitutions on protein stability. The term ‘alchemical’ refers to the generation of a series of nonphysical intermediate states connecting two physical end states typically the wild-type (initial) and mutant (final) forms, facilitating the estimation of the difference between the free energy to fold the wild-type and mutant proteins. The transformation from wild-type to mutant residues, for single mutants(SMs), has been performed in both the folded and unfolded states of the protein that follows a thermodynamic cycle, as illustrated in Fig 1 of the Methodology section. Similarly, the thermodynamic cycle representing double mutations and their corresponding nonadditivity has been depicted in Fig 2 of the Methodology section.
The green color highlights wild-type residue and blue represents the mutated residue, with rest colored as red, being same in both states. The folding free energy change upon single amino acid mutation is .
Here in , the Subscript WT = Reference/Initial state, Superscript A = Mutated/Final state and same for others. In principle free energy change from all three routes should be the same. If
≠ 0 then mutation A and B are nonadditive, highly correlated and exhibit a thermodynamic coupling.
Staphylococcal nuclease (S. nuclease) is a small extracellular protein of 149 amino acid residue length with no cysteines, secreted by Staphylococcus aureus. Its ability to degrade both DNA and RNA in the presence of free Ca2+ ions, finds various applications in nucleic acid chemistry since 1956 [22,23]. Its simple solution behavior being soluble in both native and denatured states has made it a benchmark system used for studies on protein folding [24,25] and NMR spectroscopy [26]. The experimental values for both single and double mutants (DMs) in S. nuclease were sourced from the extensive work of Shortle et al in S. nuclease [27–29]. A second benchmark protein we focus on in the present study is T4 lysozyme (164 residues) [30], Schellman and co-workers [31] have measured the entropy, enthalpy and free energy change for the wild-type protein and a large number of mutants [32]. A diverse set of experimentally characterized SMs for T4 lysozyme, along with their corresponding values indicating changes in protein stability, has been compiled from the literature [33–41] and considered for the present study.
The present study investigates a total of 62 SMs across two proteins: S. nuclease (38 mutants) and T4 lysozyme (24 mutants), and 45 double mutations of S. nuclease using the Schrödinger (FEP+) and GROMACS packages. These codes are both widely used for studying free energy changes associated with protein-ligand binding in explicit solvent, and more recently for studying the effects of mutations on protein stability. There have been few efforts to directly compare the performance of these codes on the same benchmark systems.
This study provides the first systematic comparison of free energy change predictions obtained from the Schrödinger and GROMACS simulation platforms across a comprehensive set of single and double mutations in S. Nuclease and a set of single mutations in T4 lysozyme, for which extensive experimental stability data are available. Despite methodological distinctions and differences in the underlying potential energy functions between the two platforms, both yield results that are broadly consistent with experimental measurements and in close agreement with each other.
Methods
Dataset selection and preparation
SMs for two proteins under investigation: S. nuclease and T4 lysozyme were curated from the studies by Duan et al (2020) [42] and de Groot et al. (2021) [43]. The initial dataset comprised of 27 SMs of S. nuclease (PDB ID: 1EY0) and 26 SMs of T4 lysozyme (PDB IDs: 2LZM and 1L63), with all experimental measurements performed at pH 7 ± 1. An additional 18 SMs from the dataset associated with PDB: 1STN [43] were incorporated, augmenting the initial 27 mutations and resulting in a total of 45 single mutations analyzed for S. nuclease. After excluding three overlapping mutations, the final dataset comprised of 42 unique SMs. Among these, only 38 mutations were successfully simulated using the Schrödinger and GROMACS platforms, while the remaining four were omitted due to technical issues involving proline residues. Furthermore, a set of 45 DMs [43] for the S. nuclease was also analyzed. These DMs were simulated using only Schrödinger to estimate the free energy changes () and assess their corresponding nonadditive effects. For T4 lysozyme, although the initial dataset contained 26 SMs, two proline-related mutations were excluded due to similar technical constraints, resulting in a final set of 24 single mutations. No double mutant simulations were performed for T4 lysozyme owing to the lack of reliable experimental data for benchmarking. The experimental values were sourced from comprehensive studies by Shortle (for S. nuclease) [27–29] and Matthews (for T4 lysozyme) [34,35,37–41], as mentioned above, serving as benchmark for evaluating the calculated free energy changes from Schrödinger and GROMACS simulations across SM and DM datasets for these two proteins.
Before discussing the methodology for FEP calculations using Schrödinger and GROMACS, it is important to first understand the underlying thermodynamic cycle and the associated coupling parameter that are common to both platforms.
Description of the alchemical thermodynamic cycles
The alchemical thermodynamic cycles provide a conceptual framework for the transformations from the wild-type to single and double mutants, as illustrated in Fig 1 and Fig 2, respectively. The corresponding nonadditive effects associated with the DMs are also depicted in Fig 2. The transformations include both physical and alchemical (non-physical) states and since free energy is a state function, its value depends only on the initial and final states and hence independent of the pathway taken.
The objective is to compute the folding free energy change resulting from a single mutation, transitioning from the wild-type (WT) to the mutant (Mut) state. Rather than directly simulating the folding process for each state, which is computationally intensive, an alchemical approach has been employed. As illustrated in Fig 1, the physical pathways represent the folding of the WT and Mut state from their respective unfolded structure, with the associated folding free energy denoted as (path WT) and
(path Mut) respectively. The alchemical pathways consist of the free energy associated with mutating WT to Mut in the folded state denoted as
(path 1) and in the unfolded state denoted as
(path 2). Since the free energy is a state function, the sum of the
around the thermodynamic cycle is zero. Consequently, the effect of the mutation on protein stability (
) can be estimated using Equation 1, which relates the differences in folding free energies of the WT and mutant states via the alchemical transformations.
The unfolded state of the protein, for both the wild-type and mutant forms, is represented by a capped tripeptide, which preserves the original sequence environment by placing the residue of interest in the central position. The use of a capped tripeptide offers a more realistic and chemically accurate representation for alchemical transformations compared to an uncapped peptide, by mimicking the local backbone environment and minimizing end effects [44].
For DMs an additional thermodynamic cycle has been constructed, as shown in Fig 2. In this cycle, the transformation from the wild-type (WT) to the double mutant (Mut AB) final state can proceed via three distinct routes. In route 1, the transformation proceeds from WT to Mut A, followed by a second mutation to reach state Mut AB. In Route 2, the path goes from WT to Mut B, and then to Mut AB. Route 3 represents a direct transformation from WT to Mut AB, where both mutations A and B are introduced simultaneously. Since free energy is a state function, the total free energy change associated with the double mutation, denoted as ( in Equation 2), must be independent of the transformation path. Therefore, all three routes should theoretically yield identical
values. However, in practice, Route 3 (the direct transformation) is employed because experimentally determined PDB structures for the mutated states A or B are not available
In the case of double mutations, the combined effect may deviate from the sum of the individual single mutation effects, a phenomenon referred to as nonadditivity ( in Equation 3). When
0, the two mutations are energetically coupled and therefore nonadditive. Conversely, if
= 0, the mutations are additive and independent. Within the framework of alchemical FEP, the contribution from the unfolded state is assumed to be additive. Consequently, it cancels out during the calculation of nonadditivity, simplifying the analysis. As a result, the nonadditivity is evaluated solely based on the free energy differences between the single and double mutations in the folded state as formally represented in Equation 3.
Coupling parameter (λ).
In alchemical free energy simulations, the coupling parameter (λ) is a dimensionless variable used to denote a transition between two thermodynamic states commonly, state A (wild-type) and state B (mutant).18 As λ progresses from 0 to 1, the system's Hamiltonian is gradually modified in a series of steps from to
. Mathematically, the interpolated Hamiltonian is represented in Equation 4.
This parameter governs the pattern of the transformation and serves as a foundational component for computing free energy differences using FEP methods.
With the theoretical framework established, the following sections provide a detailed description of the methodologies employed on each platform. First, we outline the workflow implemented in Schrödinger, followed by the methodology employed in GROMACS for FEP calculations.
FEP calculations using Schrödinger (FEP+)
The FEP+ module from the Schrödinger Release 2024−3 (FEP + , Schrödinger, LLC, New York, NY, 2024) was employed to perform FEP calculations. Initial protein structures for different proteins were retrieved from the Protein Data Bank (PDB) and prepared using Maestro's Protein Preparation Wizard. The crystal water molecules were retained, hydrogen atoms were added, and hydrogen-bonding networks were optimized. PropKa was used to predict the ionization states of residues at pH 7.0. Subsequently, the structures underwent restrained minimization of heavy atoms using the OPLS4 force field [45], and the process was terminated once the root mean square deviation (RMSD) of heavy atoms reached 0.3 Å. Each system was solvated using a cubic SPC water box (default water model), extending 5 Å from the protein in all directions. To simulate physiological conditions, counter ions Na⁺ or Cl⁻ were added to both folded and unfolded systems, followed by the addition of excess Na⁺ and Cl⁻ ions to achieve a final salt concentration of 0.15 M NaCl. All ions were randomly distributed throughout the simulation box. For unfolded models, capped tripeptides extracted from the crystal structures of the corresponding native proteins, containing the mutation site at the central position in the peptide were used [42]. The capping groups consisted of an acetyl group at the N-terminus and an N-methyl group at the C-terminus.
The protein FEP+ calculations were performed using a dual topology algorithm, in which multiple MD simulations were performed to transform the system from wild-type (WT) to the mutated state. During this alchemical transformation, the electrostatic and van der Waals interactions of the WT side chain gradually turned off, while those of the mutant side chain were simultaneously turned on. The mutated residue was included in the Replica Exchange Solute Tempering (REST2) region [46,47], which effectively raises the temperature for enhanced sampling of that residue. The FEP + /REST2 framework combines solute tempering with replica exchange to improve sampling of sidechain conformational flexibility near the mutation site, typically within 5 Å. The solvated systems were then relaxed and equilibrated using the default protocol of the Desmond Molecular Dynamics System (D. E. Shaw Research, New York, NY) [48] as implemented in Maestro. This protocol involves a series of energy minimizations and short MD simulations with restraints. Each perturbation was simulated over 16–24 λ windows (where λ denotes the alchemical intermediate states or coupling factor as explained above), depending on the mutation type. The charge conserving mutations were simulated using 16 λ windows while charge altering mutations used 24 λ windows. The λ schedule followed the default settings in Schrödinger FEP+. Electrostatic and van der Waals transformations were integrated across the same λ schedule. Each λ-window was simulated for 10 ns which was extended up to 100 ns to ensure convergence for challenging cases. For charge changing mutations, a co-alchemical water approach was employed to mitigate long range electrostatic artifacts. The electrostatic interactions were handled using the Particle Mesh Ewald (PME) approach [49,50] with a real-space cutoff of 1.0 nm. Van der Waals interactions were also gradually shifted to zero at this same cutoff distance. Final production simulations were conducted with the default ensemble using the Desmond engine and free energy differences between alchemical states were computed using Multistate Bennett Acceptance Ratio (MBAR) [51] in Schrödinger. Predicted FEP uncertainties were calculated using the cycle closure method [46], as implemented in Schrödinger’s Protein FEP + .
FEP calculations using GROMACS
The alchemical FEP calculations for both protein systems were also performed using GROMACS 2022.6 [52,53] which included 38 SMs from S. nuclease and 23 SMs from T4 lysozyme. The dual topology and the hybrid coordinate files for the mutated protein were generated using the pmx server [54]. Similarly, the tripeptide structures, extracted from the full length protein using Maestro, were processed by pmx server to obtain their corresponding dual topology representations and hybrid tripeptides. The AMBER99sb*ILDN force field [55–57] has been applied for these transformation in both the cases as this force field has been widely adopted for FEP calculations, as supported by several studies [15,58]. The crystal waters from the protein structure were retained during topology processing of the hybrid protein. Each hybrid system (protein and tripeptide) was solvated in a TIP3P [59] water box with a 10 Å buffer. The counterions Na⁺ or Cl⁻ were added to both the systems to neutralize the charge and achieve the final salt concentration of 0.15 M, consistent with standard FEP simulation practices. The retention of crystallographic water and the use of 0.15 M salt concentration align with established FEP protocols in both Schrӧdinger and GROMACS frameworks.
Then the hybrid protein and tripeptide systems underwent energy minimization using the steepest descent algorithm across 27 λ windows (from λ = 0 to λ = 26). The λ schedule varied as 0.00 0.0125 0.025 0.0375 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 0.9625 0.975 0.9875 1.00 from 0 to 26 λ windows correspondingly. The 27 λ windows were partitioned into electrostatic decoupling (10 λ windows) and van der Waals decoupling (17 λ windows), with soft-core potentials applied to enable smooth interpolation of non-bonded interactions. This was followed by an additional minimization using the l-bfgs integrator for further relaxation. Subsequently, the systems were equilibrated under the NVT ensemble for 10 ps using the velocity-rescaling thermostat [60]. This step was followed by NPT equilibration ensemble, performed sequentially using the Berendsen and Parrinello-Rahman barostats [61] each for 30 ps. The electrostatic interactions were treated using the Particle Mesh Ewald (PME) method [49,50] and a real space cutoff of 1.0 nm was applied. The van der Waals interactions were gradually shifted to zero at the same cutoff distance (1.0 nm). All bonds involving hydrogen atoms were constrained using the LINCS [50] algorithm (order = 6). Following equilibration, replicas were simulated for 50 ns with NPT ensemble for both the hybrid protein and the tripeptide systems. The free energy differences between the wild-type and the mutant forms were computed using the Bennett Acceptance Ratio (BAR) [62,63] method, implemented via the gmx_bar module in GROMACS. Alchemical free energy calculations were also analyzed using the pymbar package based on ∂H/∂λ data obtained from GROMACS simulations. The robustness of the free energy estimates by MBAR was verified through two primary diagnostics: 1) To ensure reliable reweighting of configurations across states, ∂H/∂λ overlap matrices were generated for representative mutations. These matrices (S5 and S6 Figs) confirm appreciable overlap between adjacent λ-windows for both the S. nuclease (e.g., T41C, T41V) and T4 lysozyme (e.g., T59G, G77A) systems. 2) Convergence was assessed by monitoring the change of ΔG as a function of simulation time. Representative convergence plots (S7 and S8 Figs) for mutants such as L25I and T33V (S. nuclease) and S44A and D47A (T4 lysozyme) demonstrate that the free energy differences reach a stable plateau. This behavior indicates that convergence was achieved over the simulation time for both the tripeptide reference and the full protein. Predicted uncertainties were estimated by partitioning each simulation trajectory into three equal segments and calculating the standard deviation of the ΔΔG values across these sub-blocks.
In summary, to assess the impact of mutations on protein free energy landscapes, we performed a comparative analysis of single and double mutations FEP protocols implemented in GROMACS and Schrödinger’s FEP+ framework. Although both platforms utilize alchemical transformation methods, they differ markedly in system setup, topology handling, and sampling strategies. This study aims to evaluate the consistency and reliability of predicted values across these two distinct, widely used, computational platforms.
Results and discussion
Experimental determination of mutation-induced changes in protein thermostability, typically using chemical or thermal denaturation techniques, remains the gold standard for validation. However, these methods are often time-consuming, resource-intensive, and low-throughput, making them more difficult for large-scale mutational screening. Moreover, while such methods yield valuable thermodynamic data, they offer limited insight into the atomic level interactions underlying protein stability changes. In contrast, computational methods such as FEP, which leverage all-atom MD simulations, potentially provide a faster, scalable, and cost-effective alternative provided they have sufficient accuracy. These methods not only enable high-throughput in silico mutagenesis but also deliver atomistic insights into stability mechanisms, allowing simulations of various protein conformational states to better understand folding landscapes and thermostability. Recent advances in high performance computing including GPU acceleration as well as improvements in force fields and simulation algorithms, have enhanced the predictive accuracy of these computational approaches.
In this work, we employed two widely used MD platforms: Schrödinger and GROMACS, for FEP simulations, each offering distinct advantages and limitations. GROMACS is an open-source, freely available MD engine noted for its flexibility, extensive customizability, and support for a broad range of force fields including AMBER, CHARMM, OPLS and GROMOS. However, its implementation for FEP workflows requires manual setup, command-line proficiency, and greater time investment, making it less accessible to non-expert users. In contrast, Schrödinger FEP + is a commercial, closed-source platform that supports only the modified OPLS force field. While it offers less flexibility, its user-friendly graphical interface, built-in FEP workflow as FEP+ module, and enhanced sampling methods such as REST (Replica Exchange with Solute Tempering) streamline the simulation process and significantly reduce setup complexity and computational overhead. Despite the widespread use of both platforms for FEP-based protein stability predictions, a systematic comparison of their performance using experimental data as a benchmark has been lacking. In this work, we address this gap by directly evaluating and comparing free energy changes for single and double mutations using Schrödinger and GROMACS against experimental thermostability data, thereby providing insights into their relative accuracy, usability, and computational efficiency.
The free energy changes () for 38 single mutations of S. nuclease were predicted using the two MD platforms: Schrödinger and GROMACS. These results, along with the
values calculated via MBAR from the GROMACS data, are provided in Supporting Information (S1 Table). Correlation plots comparing experimentally determined free energy changes with predictions calculated using Schrӧdinger and GROMACS are shown in Fig 3A and 3B respectively. Both platforms demonstrated state-of-the-art accuracy in reproducing experimental measurements, with Pearson correlation coefficients of 0.86 for Schrödinger (Fig 3A) and 0.87 for GROMACS (Fig 3B) between calculated and experimental
values, indicating strong linear agreement. Additionally, the mean absolute difference also referred to as the average unsigned error (AUE), was further calculated to evaluate predictive accuracy. The AUE values were found to be 0.99 kcal/mol for Schrödinger and 0.61 kcal/mol for GROMACS, both within the generally accepted accuracy threshold of ~1 kcal/mol for reliable free energy predictions. Notably, these AUE values are comparable to those reported for FEP+ protocol in predicting small molecule protein relative binding affinities [64,65], supporting the robustness of our results. Benchmarking results revealed strong agreement between computational predictions and experimental values. Specifically, the RMSE was 0.13 kcal/mol for GROMACS (via BAR) and 0.24 kcal/mol for Schrödinger. Additionally, a secondary check using MBAR on the GROMACS data resulted in an RMSE of 0.13 kcal/mol. Correlation was further evaluated using Kendall’s tau, which yielded values of 0.54 (GROMACS/BAR), 0.62 (Schrödinger), and 0.50 (GROMACS/MBAR). These statistical indicators, provided in S1 Table, highlight the high predictive accuracy of both platforms. The strong Pearson r of 0.87 between the previously reported [42] and our calculated folding free energy changes (
) highlights the reliability and robustness of these calculations, as shown in S1 Fig and S1 Table. The high correlation and low AUE observed in this study are consistent with previously reported FEP studies [42,66], where correlations between computed and experimental
values ranged from 0.64 to 0.82, further validating the reliability of both platforms. Overall, these findings establish both Schrödinger and GROMACS as accurate and reliable tools for predicting protein thermostability changes induced by single mutations using FEP-based approaches.
Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S1 Table.
The analysis was extended to evaluate folding free energy change for 45 DMs of S. nuclease represented as The values calculated using Schrödinger and previously reported values using GROMACS [43] alongside experimental measurements, which continue to serve as the benchmark, were summarized in S2 Table. The calculated
using Schrödinger yielded a Pearson correlation coefficient of 0.74 when compared with the experimental data (Fig 4A). A similarly strong correlation (r = 0.71) was observed between the previously reported GROMACS values and experimental results, as shown in Fig 4B. The AUE was found to be 0.79 kcal/mol for Schrödinger and 0.76 kcal/mol for GROMACS, both within the generally accepted threshold of 1 kcal/mol. Moreover, we compared previously reported GROMACS values with those calculated using Schrödinger, yielding a Pearson correlation of 0.71 (S2 Fig). Comparable accuracy and low AUE values demonstrate that both Schrödinger (OPLS4 force field) and GROMACS (AMBER99sb*ILDN force field) gave consistent and reliable results for double mutations in the S. nuclease protein. The results demonstrate high predictive accuracy, reflected in low RMSE values of 0.13 kcal/mol for both GROMACS and Schrödinger when compared to experimental data. This strong agreement is further supported by Kendall’s tau (τ) values of 0.52 for GROMACS and 0.57 for Schrödinger. These metrics, provided in S2 Table, validate the reliability of both software platforms for benchmarking free energy calculations.
Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S2 Table.
In the case of DMs, the free energy changes can deviate significantly from the sum of the individual single mutations, a phenomenon known as nonadditivity (). The corresponding nonadditivities for the 45 DMs have been calculated employing the FEP+ protocol in Schrӧdinger. For comparison, previously reported free energy changes calculated using GROMACS [43] were included. Experimental values continue to serve as the benchmark for this study, consistent with the analyses of single and double mutations. Schrödinger predictions of nonadditivity yielded a Pearson r of 0.79 with experimental data (Fig 5A), whereas the previously reported GROMACS values showed a correlation of 0.63 (Fig 5B). The AUE was found to be 0.31 kcal/mol for Schrödinger and 0.35 kcal/mol for GROMACS relative to experimental values. Additionally, a Pearson r of 0.61 was observed for the nonadditivity values
) of 45 DMs of S. nuclease between previously reported GROMACS predictions and our calculated Schrödinger values, as illustrated in S3 Fig. All these nonadditivity values obtained from experimental measurements, GROMACS and Schrӧdinger calculations were tabulated in S3 Table. The calculated RMSE values were notably low, at 0.07 kcal/mol for GROMACS and 0.06 kcal/mol for Schrödinger relative to experimental data, indicating high predictive accuracy. As an additional metric, Kendall’s tau (τ) was found to be moderate, 0.48 for GROMACS and 0.44 for Schrödinger. These results, which support the benchmarking of both free energy calculation platforms, are summarized in S3 Table.
Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S3 Table.
We have extended our analysis further to another widely studied protein system, T4 lysozyme in order to gain more confidence in thermostability predictions using computational approaches. The dataset consists of 24 single mutations for T4 lysozyme protein, for which free energy changes () were calculated using both the FEP+ module of Schrödinger and the GROMACS. However, DMs were not included for the T4 lysozyme system, as experimental
values were not available.
To assess the agreement between calculated and experimental values, correlation analyses were performed. Fig 6A shows the correlation between experimental values and those calculated using Schrödinger (Pearson r = 0.85), while Fig 6B presents the correlation between experimental values and GROMACS predictions (Pearson r = 0.80). The corresponding AUEs were 0.63 kcal/mol for Schrödinger and 0.55 kcal/mol for GROMACS. A direct comparison of computed
values with Schrödinger and GROMACS yielded a Pearson r of 0.81 (Fig 6C). Furthermore, a Pearson r of 0.96 was observed between previously reported⁴² Schrödinger results and the
values calculated using Schrödinger in this study (S4 Fig). All free energy changes: experimental, calculated via GROMACS and Schrödinger, and previously reported Schrödinger values were summarized in S4 Table. Collectively, these results demonstrate the effectiveness in modeling free energy changes arising from single mutations in T4 lysozyme [42] by GROMACS and Schrödinger. Additional performance metrics, including RMSE and Kendall’s tau, are provided in the sub-tables of S4 Table. The low RMSE values-0.13 kcal/mol for GROMACS (BAR), 0.16 kcal/mol for Schrödinger, and 0.13 kcal/mol for GROMACS (MBAR) demonstrate strong agreement with experimental data and high predictive accuracy. Similarly, Kendall’s tau values of 0.61, 0.68, and 0.61, respectively, further validate the benchmarking of these two software platforms for free energy calculations.
Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated. The data plotted here is provided in S4 Table.
To investigate potential correlations, we analyzed prediction errors as a function of mutation type and location (S5-S6 Tables). Prediction accuracy was highest for mutations involving no change in side-chain size, with errors increasing significantly as side-chain bulk increased. Additionally, charged mutations exhibited larger errors than neutral ones. Location-wise, surface mutations generally yielded higher errors than buried residues.
Conclusion
FEP methods are increasingly employed to estimate changes in protein thermostability caused by mutations. In this study, we compared two widely used MD platforms: GROMACS and Schrödinger’s FEP + , to perform FEP-based calculations for SMs of S. nuclease and T4 lysozyme proteins. Additionally, FEP-based predictions were extended to DMs of S. nuclease, including the assessment of their corresponding nonadditivity effects. Our results showed strong correlation between experimental and computed free energy changes ( for 62 SMs in two different proteins using GROMACS and Schrödinger. Schrödinger accurately captured the free energy changes
and their corresponding nonadditivities (
) for 45 DMs of S. nuclease protein, showing good correlation with experimental values. A strong Pearson correlation for free energy changes and nonadditivity for 45 DMs of S. nuclease, has also been observed between the previously reported GROMACS values and those calculated in this study using Schrödinger. The findings demonstrate the robustness of both Schrödinger and GROMACS in modeling single and double mutations, accurately predicting their free energy changes and nonadditivity. The strong correlation between experimental and computed free energy changes further highlights the consistency and accuracy of these platforms in capturing the thermodynamic effects of mutations.
The choice between two platforms depends on factors such as the complexity of the system under study, the cost of the software involved, preference towards a specific force field or enhanced sampling technique, and user familiarity. While FEP is not a substitute for experimental validation, it offers a rapid, scalable, and mechanistically informative approach to mutation analysis in proteins. Importantly, FEP enables atomic level insights into the effects of mutations, which are not directly accessible through experimental denaturation assays alone. Therefore, integrating FEP-based computational predictions (either by GROMACS or Schrӧdinger) with experimental measurements presents a powerful and complementary strategy for protein engineering and thermostability optimization.
Supporting information
S1 Table. Folding free energy changes
in kcal/mol for 38 SMs of the S. nuclease protein.
The experimental values and previously reported values using Schrödinger or GROMACS alongside values calculated in this study using both platforms are provided for comparison.
https://doi.org/10.1371/journal.pone.0335829.s001
(PDF)
S1 Fig. Correlation plot of free energy changes (
) for 38 SMs in the S. nuclease protein, comparing previously reported values using Schrödinger or GROMACS with values calculated in this study using Schrödinger.
Pearson correlation coefficient (r) and coefficient of determination (R²) are indicated.
https://doi.org/10.1371/journal.pone.0335829.s002
(PDF)
S2 Table. Folding free energy changes
in kcal/mol for 45 DMs of the S. nuclease protein.
The experimental values and previously reported values using GROMACS alongside values calculated in this study using Schrödinger are shown for comparison.
https://doi.org/10.1371/journal.pone.0335829.s003
(PDF)
S2 Fig. Correlation plot of free energy changes (
) for 45 DMs in the S. nuclease protein, comparing previously reported values from GROMACS with values calculated in this study using Schrödinger.
Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.
https://doi.org/10.1371/journal.pone.0335829.s004
(PDF)
S3 Table. Nonadditivity values (
) in kcal/mol for 45 DMs of the S. nuclease protein.
The experimental data and previously reported GROMACS values are presented alongside values calculated in this study using Schrödinger for comparative analysis.
https://doi.org/10.1371/journal.pone.0335829.s005
(PDF)
S3 Fig. Correlation plot of nonadditivity (
) for 45 DMs in the S. nuclease protein, comparing previously reported values from GROMACS with values calculated in this study using Schrödinger.
Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.
https://doi.org/10.1371/journal.pone.0335829.s006
(PDF)
S4 Table. Folding free energy changes (
) in kcal/mol for 24 SMs of the T4 lysozyme protein.
The experimental values and previously reported values using Schrödinger alongside values calculated in this study using GROMACS and Schrӧdinger are provided for comparison.
https://doi.org/10.1371/journal.pone.0335829.s007
(PDF)
S4 Fig. Correlation plot of free energy changes (
) for 24 SMs in the T4 lysozyme protein, comparing previously reported values with calculated values from Schrödinger.
Pearson correlation coefficient (r) and coefficient of determination (R²) are shown.
https://doi.org/10.1371/journal.pone.0335829.s008
(PDF)
S5 Fig. Overlap matrix plots for representative S. nuclease mutations (T41C and T41V) with all first off-diagonal entries well above 0.03, the suggested threshold.
Each of the four Figs consists of two panels: the upper panel displays the overlap for the tripeptide, while the lower panel displays the overlap for the full protein system.
https://doi.org/10.1371/journal.pone.0335829.s009
(PDF)
S6 Fig. Overlap matrix plots for representative T4 lysozyme mutants T59G and G77A with all first off-diagonal entries well above 0.03, the suggested threshold.
Each of the four Figs consists of two panels: the upper panel displays the overlap for the tripeptide, while the lower panel displays the overlap for the full protein system.
https://doi.org/10.1371/journal.pone.0335829.s010
(PDF)
S7 Fig. Free energy convergence for representative S. nuclease mutants L25I and T33V.
The plots show ΔΔG versus simulation time across four Figs. In each case, the top panel corresponds to the tripeptide simulation (unfolded state proxy) and the bottom panel corresponds to the full protein simulation (folded state).
https://doi.org/10.1371/journal.pone.0335829.s011
(PDF)
S8 Fig. Free energy convergence for representative T4 lysozyme mutants S44A and D47A.
The plots show ΔΔG versus simulation time across four Figs. In each case, the top panel corresponds to the tripeptide simulation (unfolded state proxy) and the bottom panel corresponds to the full protein simulation (folded state).
https://doi.org/10.1371/journal.pone.0335829.s012
(PDF)
S5 Table. Calculated free energy changes (kcal/mol) for 38SMs of S. nuclease using Schrödinger and GROMACS, compared against experimental values.
The mutations are categorized by charge, size, and location within the protein structure.
https://doi.org/10.1371/journal.pone.0335829.s013
(PDF)
S6 Table. Calculated free energy changes (kcal/mol) for 24 SMs of T4 lysozyme using Schrödinger and GROMACS, compared against experimental values.
The mutations are categorized by charge, size, and location within the protein structure.
https://doi.org/10.1371/journal.pone.0335829.s014
(PDF)
References
- 1. Soskine M, Tawfik DS. Mutational effects and the evolution of new protein functions. Nat Rev Genet. 2010;11(8):572–82. pmid:20634811
- 2. Nisthal A, Wang CY, Ary ML, Mayo SL. Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis. Proc Natl Acad Sci U S A. 2019;116(33):16367–77. pmid:31371509
- 3. Mannige R. Dynamic new world: refining our view of protein structure, function and evolution. Proteomes. 2014;2:128–53.
- 4. Dill KA, MacCallum JL. The protein-folding problem, 50 years on. Science. 2012;338(6110):1042–6. pmid:23180855
- 5. Pack SP, Yoo YJ. Protein thermostability: structure-based difference of amino acid between thermophilic and mesophilic proteins. J Biotechnol. 2004;111(3):269–77. pmid:15246663
- 6. Duan J, Nilsson L, Lambert B. Structural and functional analysis of mutations at the human hypoxanthine phosphoribosyl transferase (HPRT1) locus. Hum Mutat. 2004;23(6):599–611. pmid:15146465
- 7. Stefl S, Nishi H, Petukh M, Panchenko AR, Alexov E. Molecular mechanisms of disease-causing missense mutations. J Mol Biol. 2013;425(21):3919–36. pmid:23871686
- 8. Petukh M, Kucukkal TG, Alexov E. On human disease-causing amino acid variants: statistical study of sequence and structural patterns. Hum Mutat. 2015;36(5):524–34. pmid:25689729
- 9. Ponnuswamy PK, Muthusamy R, Manavalan P. Amino acid composition and thermal stability of proteins. Int J Biol Macromol. 1982;4:186–90.
- 10. Rigoldi F, Donini S, Redaelli A, Parisini E, Gautieri A. Review: Engineering of thermostable enzymes for industrial applications. APL Bioeng. 2018;2(1):011501. pmid:31069285
- 11. McConnell AD, Zhang X, Macomber JL, Chau B, Sheffer JC, Rahmanian S, et al. A general approach to antibody thermostabilization. MAbs. 2014;6(5):1274–82. pmid:25517312
- 12. Scarabelli G, Oloo EO, Maier JKX, Rodriguez-Granillo A. Accurate Prediction of Protein Thermodynamic Stability Changes upon Residue Mutation using Free Energy Perturbation. J Mol Biol. 2022;434(2):167375. pmid:34826524
- 13. Wang C, Zhao T, Li Y, Huang G, White MA, Gao J. Investigation of endosome and lysosome biology by ultra pH-sensitive nanoprobes. Adv Drug Deliv Rev. 2017;113:87–96. pmid:27612550
- 14. Srinivasan J, Miller J, Kollman PA, Case DA. Continuum solvent studies of the stability of RNA hairpin loops and helices. J Biomol Struct Dyn. 1998;16(3):671–82. pmid:10052623
- 15. Gapsys V, Michielssens S, Seeliger D, de Groot BL. Accurate and Rigorous Prediction of the Changes in Protein Free Energies in a Large-Scale Mutation Scan. Angew Chem Int Ed Engl. 2016;55(26):7364–8. pmid:27122231
- 16. Dang LX, Merz KM, Kollman PA. Free energy calculations on protein stability: Thr-157.fwdarw. Val-157 mutation of T4 lysozyme. J Am Chem Soc. 1989;111:8505–8.
- 17. Jespers W, et al. QresFEP: An automated protocol for free energy calculations of protein mutations. J Chem Theory Comput. 2019;15:5461–73.
- 18. Jorgensen WL, Ravimohan C. Monte Carlo simulation of differences in free energies of hydration. J Chem Phys. 1985;83:3050–4.
- 19. Beveridge DL, DiCapua FM. Free energy via molecular simulation: applications to chemical and biomolecular systems. Annu Rev Biophys Biophys Chem. 1989;18:431–92.
- 20. Pohorille A, Jarzynski C, Chipot C. Good practices in free-energy calculations. J Phys Chem B. 2010;114(32):10235–53. pmid:20701361
- 21. Zwanzig RW. High-Temperature Equation of State by a Perturbation Method. I. Nonpolar Gases. J Chem Phys. 1954;22:1420–6.
- 22. Flanagan JM, Kataoka M, Shortle D, Engelman DM. Truncated staphylococcal nuclease is compact but disordered. Proc Natl Acad Sci U S A. 1992;89(2):748–52. pmid:1731350
- 23. Shortle D. Staphylococcal nuclease: a showcase of m-value effects. Adv Protein Chem. 1995;46:217–47. pmid:7771319
- 24. Tucker PW, Hazen EE, Cotton FA. Staphylococcal nuclease reviewed: A prototypic study in contemporary enzymology. Mol Cell Biochem. 1979;23:131–41.
- 25. Anfinsen CB. Principles that govern the folding of protein chains. Science. 1973;181(4096):223–30. pmid:4124164
- 26.
Jarde fzky O, Wade- Jarde fzky NG. Nuclear magnetic resonance as a structural method.
- 27. Shortle D, Stites WE, Meeker AK. Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. Biochemistry. 1990;29(35):8033–41. pmid:2261461
- 28. Green SM, Meeker AK, Shortle D. Contributions of the polar, uncharged amino acids to the stability of staphylococcal nuclease: evidence for mutational effects on the free energy of the denatured state. Biochemistry. 1992;31(25):5717–28. pmid:1610820
- 29. Green SM, Shortle D. Patterns of nonadditivity between pairs of stability mutations in staphylococcal nuclease. Biochemistry. 1993;32.
- 30.
Alber, T. & Matthews, BW. Structure and thermal stability of phage T4 lysozyme. in Methods in Enzymology. 1987;154:511–33.
- 31. Schellman JA, Lindorfer M, Hawkes R, Grutter M. Mutations and protein stability. Biopolymers. 1981;20:1989–99.
- 32. Becktel WJ, Baase WA. Thermal denaturation of bacteriophage T4 lysozyme at neutral pH. Biopolymers. 1987;26(5):619–23. pmid:3297180
- 33. Cornish VW, Kaplan MI, Veenstra DL, Kollman PA, Schultz PG. Stabilizing and destabilizing effects of placing beta-branched amino acids in protein alpha-helices. Biochemistry. 1994;33(40):12022–31. pmid:7918421
- 34. Nicholson H, Tronrud DE, Becktel WJ, Matthews BW. Analysis of the effectiveness of proline substitutions and glycine replacements in increasing the stability of phage T4 lysozyme. Biopolymers. 1992;32(11):1431–41. pmid:1457724
- 35. Matsumura M, Becktel WJ, Matthews BW. Hydrophobic stabilization in T4 lysozyme determined directly by multiple substitutions of Ile 3. Nature. 1988;334(6181):406–10. pmid:3405287
- 36. Heinz DW, et al. Accommodation of amino acid insertions in an α-helix of T4 lysozyme. J Mol Biol. 1994;236:869–86.
- 37. Sun DP, Sauer U, Nicholson H, Matthews BW. Contributions of engineered surface salt bridges to the stability of T4 lysozyme determined by directed mutagenesis. Biochemistry. 1991;30(29):7142–53. pmid:1854726
- 38. Bell JA, Becktel WJ, Sauer U, Baase WA, Matthews BW. Dissection of Helix Capping in T4 Lysozyme by Structural and Thermodynamic Analysis of Six Amino Acid Substitutions at Thr 59. Biochemistry. 1992;31:3590–6.
- 39. Matthews BW, Nicholson H, Becktel WJ. Enhanced protein thermostability from site-directed mutations that decrease the entropy of unfolding. Proc Natl Acad Sci U S A. 1987;84(19):6663–7. pmid:3477797
- 40. Heinz DW, Baase WA, Matthews BW. Folding and function of a T4 lysozyme containing 10 consecutive alanines illustrate the redundancy of information in an amino acid sequence. Proc Natl Acad Sci U S A. 1992;89(9):3751–5. pmid:1570293
- 41. Nicholson H, Anderson DE, Dao-pin S, Matthews BW. Analysis of the interaction between charged side chains and the alpha-helix dipole using designed thermostable mutants of phage T4 lysozyme. Biochemistry. 1991;30(41):9816–28. pmid:1911773
- 42. Duan J, Lupyan D, Wang L. Improving the Accuracy of Protein Thermostability Predictions for Single Point Mutations. Biophys J. 2020;119(1):115–27. pmid:32533939
- 43. Werner M, Gapsys V, de Groot BL. One plus one makes three: triangular coupling of correlated amino acid mutations. J Phys Chem Lett. 2021;12:3195–201.
- 44. Markthaler D, Kraus H, Hansen N. Overcoming Convergence Issues in Free-Energy Calculations of Amide-to-Ester Mutations in the Pin1-WW Domain. J Chem Inf Model. 2018;58(11):2305–18. pmid:30339754
- 45. Lu C, et al. OPLS4: Improving force field accuracy on challenging regimes of chemical space. J Chem Theory Comput. 2021;17:4291–300.
- 46. Wang L, Deng Y, Knight JL, Wu Y, Kim B, Sherman W, et al. Modeling Local Structural Rearrangements Using FEP/REST: Application to Relative Binding Affinity Predictions of CDK2 Inhibitors. J Chem Theory Comput. 2013;9(2):1282–93. pmid:26588769
- 47. Wang L, Berne BJ, Friesner RA. On achieving high accuracy and reliability in the calculation of relative protein-ligand binding affinities. Proc Natl Acad Sci U S A. 2012;109(6):1937–42. pmid:22308365
- 48. Bowers KJ, Chow DE, Xu H, Dror RO, Eastwood MP, Gregersen BA, et al. Scalable Algorithms for Molecular Dynamics Simulations on Commodity Clusters. In: ACM/IEEE SC 2006 Conference (SC’06), 2006. 43–43.
- 49. Darden T, York D, Pedersen L. Particle mesh Ewald: An N ⋅log(N) method for Ewald sums in large systems. J Chem Phys. 1993;98:10089–92.
- 50. Essmann U, et al. A smooth particle mesh Ewald method. J Chem Phys. 1995;103:8577–93.
- 51. Shirts MR, Chodera JD. Statistically optimal analysis of samples from multiple equilibrium states. J Chem Phys. 2008;129(12):124105. pmid:19045004
- 52. Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC. GROMACS: fast, flexible, and free. J Comput Chem. 2005;26(16):1701–18. pmid:16211538
- 53. Kohnke B, Kutzner C, Grubmüller H. A GPU-Accelerated Fast Multipole Method for GROMACS: Performance and Accuracy. J Chem Theory Comput. 2020;16(11):6938–49. pmid:33084336
- 54. Gapsys V, de Groot BL. pmx Webserver: A User Friendly Interface for Alchemistry. J Chem Inf Model. 2017;57(2):109–14. pmid:28181802
- 55. Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, et al. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins. 2010;78(8):1950–8. pmid:20408171
- 56. Best RB, Hummer G. Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. J Phys Chem B. 2009;113(26):9004–15. pmid:19514729
- 57. Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins. 2006;65(3):712–25. pmid:16981200
- 58. Aldeghi M, Gapsys V, de Groot BL. Accurate Estimation of Ligand Binding Affinity Changes upon Protein Mutation. ACS Cent Sci. 2018;4(12):1708–18. pmid:30648154
- 59. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926–35.
- 60. Bussi G, Donadio D, Parrinello M. Canonical sampling through velocity rescaling. J Chem Phys. 2007;126(1):014101. pmid:17212484
- 61. Parrinello M, Rahman A. Polymorphic transitions in single crystals: A new molecular dynamics method. J Appl Phys. 1981;52:7182–90.
- 62. Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comput Phys. 1976;22:245–68.
- 63. Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamic integration. J Chem Phys. 2005;122(14):144107. pmid:15847516
- 64. Abel R, Wang L, Harder ED, Berne BJ, Friesner RA. Advancing Drug Discovery through Enhanced Free Energy Calculations. Acc Chem Res. 2017;50(7):1625–32. pmid:28677954
- 65. Abel R, Wang L, Mobley DL, Friesner RA. A Critical Review of Validation, Blind Testing, and Real- World Use of Alchemical Protein-Ligand Binding Free Energy Calculations. Curr Top Med Chem. 2017;17(23):2577–85. pmid:28413950
- 66. Steinbrecher T, Zhu C, Wang L, Abel R, Negron C, Pearlman D, et al. Predicting the Effect of Amino Acid Single-Point Mutations on Protein Stability-Large-Scale Validation of MD-Based Relative Free Energy Calculations. J Mol Biol. 2017;429(7):948–63. pmid:27964946