Heuristic energy-based cyclic peptide design

doi:10.1371/journal.pcbi.1012290

Fig 1.

CyclicChamp workflow and peptide backbone annotations.

(a) There are four steps in CyclicChamp for designing stable cyclic peptides. (b) Ideal backbone bond lengths and bond angles are assumed in CyclicChamp. For backbone closure, we consider local coordinate systems at each atom i. There are three steps transforming from coordinate system i to i + 1.
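
The frame-to-frame transformation in panel (b) can be illustrated with homogeneous matrices. The sketch below is a minimal example, not the CyclicChamp implementation: it assumes the three steps are a rotation about the local x-axis by the torsion angle, a rotation about the local z-axis by the supplement of the bond angle, and a translation along the new x-axis by the bond length. The function names (`step`, `build_chain`) and the axis conventions are hypothetical. Under these assumptions, a chain is closed when the product of all steps returns to the identity.

```python
import numpy as np

def rot_x(a):
    """4x4 homogeneous rotation about the x-axis by angle a (radians)."""
    c, s = np.cos(a), np.sin(a)
    return np.array([[1, 0, 0, 0], [0, c, -s, 0], [0, s, c, 0], [0, 0, 0, 1.0]])

def rot_z(a):
    """4x4 homogeneous rotation about the z-axis by angle a (radians)."""
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0, 0], [s, c, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1.0]])

def trans_x(d):
    """4x4 homogeneous translation by d along the x-axis."""
    t = np.eye(4)
    t[0, 3] = d
    return t

def step(torsion, bond_angle, bond_length):
    """Assumed three-step transform from local frame i to frame i + 1."""
    return rot_x(torsion) @ rot_z(np.pi - bond_angle) @ trans_x(bond_length)

def build_chain(torsions, bond_angles, bond_lengths):
    """Return atom positions and the accumulated frame after the last step."""
    frame = np.eye(4)
    atoms = [frame[:3, 3].copy()]
    for t, a, d in zip(torsions, bond_angles, bond_lengths):
        frame = frame @ step(t, a, d)
        atoms.append(frame[:3, 3].copy())
    return np.array(atoms), frame

# Toy check: a planar hexagon (all torsions 0, angles 120 degrees, unit bonds)
# closes exactly, so the accumulated frame is the identity.
atoms, final_frame = build_chain([0.0] * 6, [np.deg2rad(120)] * 6, [1.0] * 6)
closed = np.allclose(final_frame, np.eye(4), atol=1e-9)
```

With fixed ideal bond lengths and bond angles, only the torsions remain free, so closure becomes a constraint on the torsion angles alone.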


Fig 2.

CyclicChamp computation time and design comparisons with Rosetta.

(a) The computation time required by CyclicChamp backbone sampling and stability validation (ClusterGen) grows approximately linearly with backbone size. FastDesign was faster for 20 and 24 residues than for 15 because sequence design was run on fewer backbones. (b) Total design time divided by the number of stable designs validated by the filtering method for 7 residues, ClusterGen for 15 residues, and reshaped ClusterGen for 20 and 24 residues. (c) Given equivalent computation time for backbone sampling, CyclicChamp generated 5 to 28 times as many cyclic backbones with sufficient H-bonds as Rosetta’s simple_cycpep_predict, which led to 2 to 11 times as many stable designs as Rosetta after stability validation.


Fig 3.

Stability analysis of 7-residue designs.

(a) Correlation plot between P_near values calculated by Rosetta simple_cycpep_predict and by our filtering method. (b) P_near value distributions within different energy bins (kcal/mol). Design counts are labeled on top of the bars. (c) Energy landscape comparisons for three cases that have noticeable differences in P_near values calculated by the two methods. Left: more extensive sampling of wells further from the designed state by the filtering method results in a lower computed value. Middle: more extensive sampling close to the designed state by the filtering method identifies a deeper minimum, raising the value. Right: more extensive sampling by the filtering method allows exploration in a low-RMSD region missed entirely by Rosetta, identifying energy wells and raising the value. (d) Top 7-residue designs () demonstrating smaller backbone root-mean-square radii with H-bond intersections. Representative designs having 0, 1, 2, 3, 5, and 6 H-bond intersections are drawn, along with their H-bond networks, where L- and D-amino acids are specified and arrows point from amide proton to carbonyl oxygen. The designed sequences are written in one-letter codes, with uppercase for L-amino acids and lowercase for D-amino acids.
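
The three cases in panel (c) follow from how a Boltzmann-weighted fold-propensity score such as Rosetta's P_near responds to the sampled states. The sketch below uses the standard P_near definition, the sum of exp(-RMSD²/λ²)·exp(-E/k_BT) over samples divided by the sum of exp(-E/k_BT). The function name and the default values of `lam` (Å) and `kbt` (kcal/mol) are placeholders chosen for illustration, not the settings used in the paper.

```python
import numpy as np

def p_near(energies, rmsds, lam=1.5, kbt=0.62):
    """Boltzmann-weighted fraction of sampled states near the designed state.

    energies: sampled energies (kcal/mol); rmsds: RMSD of each sample to the
    design (angstroms). lam and kbt are illustrative placeholder values.
    """
    e = np.asarray(energies, dtype=float)
    r = np.asarray(rmsds, dtype=float)
    # Shift energies so the largest Boltzmann weight is 1 (numerical stability).
    weights = np.exp(-(e - e.min()) / kbt)
    # Gaussian "nearness" of each sample to the designed state.
    nearness = np.exp(-(r / lam) ** 2)
    return float(np.sum(nearness * weights) / np.sum(weights))

# A deeper well far from the design lowers the score (left case in panel c);
# a deeper well at the design raises it (middle case).
far_well_deeper = p_near([-10.0, -12.0], [0.0, 5.0])
near_well_deeper = p_near([-10.0, -8.0], [0.0, 5.0])
```

Because the score depends only on the samples that were found, two samplers can report different values for the same design when one of them visits wells the other misses.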


Fig 4.

Comparison with Rosetta experimentally validated 7-residue designs [13].

(a,b) From our 513 designs, we find designs (colored in orange) for which the backbones align best with the Rosetta designs (light blue). The residues having different side-chains are marked in red. (c) Design 475 has an alternate low-energy structure (purple), leading to a low P_near value. The amino acid sequences are written in one-letter codes, with uppercase for L-amino acids and lowercase for D-amino acids.


Fig 5.

Stability analysis of 15-residue designs.

(a) Correlation plot between P_near values calculated by Rosetta simple_cycpep_predict and by our ClusterGen (left). The ClusterGen P_near values are also plotted against the backbone root-mean-square radii (right). (b) Energy landscape comparison for one case in which both methods obtain high values. (c) Energy landscape comparisons for three cases in which the two methods calculate significantly different values. (d) Top designs with bending backbones. The backbone atoms, prolines (in purple), and hydrophobic amino acids (ALA, ILE, LEU, VAL, MET, PHE, in orange) are shown. The backbone turn segments are enlarged. (e) Top designs with short alpha helices and consecutive i, i + 2 / i + 3 H-bonds. The designed sequences are written in one-letter codes, with uppercase for L-amino acids and lowercase for D-amino acids.
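
The backbone root-mean-square radius on the right-hand axis of panel (a) measures how compact a backbone is. A minimal sketch follows, assuming the usual unweighted definition (root-mean-square distance of the backbone atoms from their centroid); the paper may use a different atom selection or weighting, and the function name is hypothetical.

```python
import numpy as np

def rms_radius(coords):
    """Root-mean-square distance of atoms from their centroid.

    coords: (N, 3) array-like of atom positions in angstroms.
    """
    xyz = np.asarray(coords, dtype=float)
    centered = xyz - xyz.mean(axis=0)
    return float(np.sqrt((centered ** 2).sum(axis=1).mean()))

# Toy input: four points on a unit circle in the xy-plane.
ring = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0)]
radius = rms_radius(ring)
```

A smaller value indicates a more compact backbone, and a larger value a more extended ring.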


Fig 6.

Stability analysis of 20-residue designs.

(a) Correlation plot between P_near values calculated by Rosetta and our ClusterGen (left). The ClusterGen P_near values are plotted against the backbone root-mean-square radii (right). (b) Example energy landscape comparison. By selecting the lowest-energy structures (marked by a purple cross) as the native states, the ClusterGen landscapes were reshaped. (c) Six top designs with minor conformation changes between their initial target states and the lowest-energy structures. (d) Six low-energy structures (colored in green) that show major backbone conformation changes from their designed structures (red). These involve formation of a short helix or a compact bend. The amino acid sequences are written in one-letter codes, with uppercase for L-amino acids and lowercase for D-amino acids.


Fig 7.

Stability analysis of 24-residue designs.

(a) Correlation plot between P_near values calculated by Rosetta and ClusterGen (left). The ClusterGen P_near values are plotted against the backbone root-mean-square radii (right). (b) Example energy landscape comparison. By selecting the lowest-energy structures (marked by a purple cross) as the native states, the ClusterGen landscapes were reshaped. (c) Six top designs with minor conformation changes in their low-energy structures. (d) Six low-energy structures (colored in green) that show major backbone conformation changes from their designed structures (red). The amino acid sequences are written in one-letter codes, with uppercase for L-amino acids and lowercase for D-amino acids.


Fig 8.

Molecular dynamics simulation results of top designs.

(a) Backbone -atom RMSDs are calculated between the MD trajectory frames and our designed structures. Snapshots are shown for selected time points (designed structures in red, trajectory frames in blue). (b) REMD free energy surfaces. From the lowest free energy basins (marked by green boundaries), representative structures (colored in blue) are extracted from the histogram bins and aligned against our designed structures (red), with the population percentages in the minima labeled aside. The designed sequences are listed on the side.


Table 1.

Computational validation of top designs using REMD simulations.

For each simulation, the average RMSD is computed for backbone atoms using uncorrelated configurations sampled at temperature state 300 K. As a negative control, average RMSDs are also computed for randomly permuted sequences of each size. By comparing the free energy surfaces of the designed structures with those starting from alternative conformations, the validation results are categorized into computationally validated (the design state is reached in REMD simulations from different starting states), computationally suggestive (the design state is preserved in REMD simulations if started in the design state), and failed (the design state is not preserved in REMD simulations).
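
The average-RMSD column can be illustrated with a short sketch. It assumes each frame is optimally superimposed on the designed structure with the Kabsch algorithm before the deviation is measured. The function names are hypothetical, and the selection of uncorrelated frames and backbone atoms is left to the caller.

```python
import numpy as np

def kabsch_rmsd(mobile, reference):
    """RMSD between two (N, 3) coordinate sets after optimal superposition."""
    p = np.asarray(mobile, dtype=float)
    q = np.asarray(reference, dtype=float)
    # Remove translation by centering both sets on their centroids.
    p = p - p.mean(axis=0)
    q = q - q.mean(axis=0)
    # Optimal rotation from the SVD of the covariance matrix.
    u, _, vt = np.linalg.svd(p.T @ q)
    # Flip the last axis if needed so the result is a proper rotation.
    d = 1.0 if np.linalg.det(u @ vt) >= 0 else -1.0
    rotation = u @ np.diag([1.0, 1.0, d]) @ vt
    diff = p @ rotation - q
    return float(np.sqrt((diff ** 2).sum(axis=1).mean()))

def mean_rmsd(frames, reference):
    """Average Kabsch RMSD of a sequence of frames against one reference."""
    return float(np.mean([kabsch_rmsd(f, reference) for f in frames]))

# Toy check: a rotated and translated copy of a structure has RMSD ~ 0.
ref = np.array([[0, 0, 0], [1, 0, 0], [0, 2, 0], [0, 0, 3]], dtype=float)
rz90 = np.array([[0, -1, 0], [1, 0, 0], [0, 0, 1]], dtype=float)
moved = ref @ rz90.T + np.array([1.0, 2.0, 3.0])
avg = mean_rmsd([ref, moved], ref)
```

Comparing this average for a designed sequence against the same quantity for randomly permuted sequences gives the negative-control contrast described in the caption.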


Fig 9.

Structure predictions for macrocycles previously deposited in the PDB.

The best predictions (green) from the low-energy cluster centers are aligned to the PDB structures (orange), with RMSDs shown. PDB structures from the 2017 [13] and 2020 [25] Rosetta design papers are labeled in blue and black, respectively, and all others in pink.
