Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Interactive molecular dynamics in virtual reality for accurate flexible protein-ligand docking

  • Helen M. Deeks,

    Roles Formal analysis, Investigation, Writing – original draft

    Affiliations Intangible Realities Laboratory, School of Chemistry, University of Bristol, Bristol, England, United Kingdom, Department of Computer Science, University of Bristol, Bristol, England, United Kingdom, Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

  • Rebecca K. Walters,

    Roles Formal analysis, Investigation

    Affiliations Intangible Realities Laboratory, School of Chemistry, University of Bristol, Bristol, England, United Kingdom, Department of Computer Science, University of Bristol, Bristol, England, United Kingdom, Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

  • Stephanie R. Hare,

    Roles Formal analysis, Validation

    Affiliation Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

  • Michael B. O’Connor,

    Roles Software

    Affiliations Intangible Realities Laboratory, School of Chemistry, University of Bristol, Bristol, England, United Kingdom, Department of Computer Science, University of Bristol, Bristol, England, United Kingdom, Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

  • Adrian J. Mulholland ,

    Roles Supervision, Writing – review & editing

    adrian.mulholland@bristol.ac.uk (AJM); glowacki@bristol.ac.uk (DRG)

    Affiliation Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

  • David R. Glowacki

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    adrian.mulholland@bristol.ac.uk (AJM); glowacki@bristol.ac.uk (DRG)

    Affiliations Intangible Realities Laboratory, School of Chemistry, University of Bristol, Bristol, England, United Kingdom, Department of Computer Science, University of Bristol, Bristol, England, United Kingdom, Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, England, United Kingdom

Interactive molecular dynamics in virtual reality for accurate flexible protein-ligand docking

  • Helen M. Deeks, 
  • Rebecca K. Walters, 
  • Stephanie R. Hare, 
  • Michael B. O’Connor, 
  • Adrian J. Mulholland, 
  • David R. Glowacki
PLOS
x

Abstract

Simulating drug binding and unbinding is a challenge, as the rugged energy landscapes that separate bound and unbound states require extensive sampling that consumes significant computational resources. Here, we describe the use of interactive molecular dynamics in virtual reality (iMD-VR) as an accurate low-cost strategy for flexible protein-ligand docking. We outline an experimental protocol which enables expert iMD-VR users to guide ligands into and out of the binding pockets of trypsin, neuraminidase, and HIV-1 protease, and recreate their respective crystallographic protein-ligand binding poses within 5–10 minutes. Following a brief training phase, our studies shown that iMD-VR novices were able to generate unbinding and rebinding pathways on similar timescales as iMD-VR experts, with the majority able to recover binding poses within 2.15 Å RMSD of the crystallographic binding pose. These results indicate that iMD-VR affords sufficient control for users to carry out the detailed atomic manipulations required to dock flexible ligands into dynamic enzyme active sites and recover crystallographic poses, offering an interesting new approach for simulating drug docking and generating binding hypotheses.

1 Introduction

Computational researchers across a wide range of fields are becoming increasingly aware of their responsibility to explore low-cost simulation methodologies whose energy and hardware demands are environmentally sustainable. [1] Whilst the molecular dynamics (MD) approach to biomolecular simulation [25] and protein ligand-binding [612] has exploded in recent years, [13] the computational cost of MD-based approaches remains significant–a result of the fact that proteins are high-dimensional systems characterized by many local energy minima separated by a rugged landscape.

Building on previous work constructing interactive simulation frameworks (e.g., led by Brooks [1416], Wilson [17], Schulten [18, 19], and others [2022]), we have been exploring interactive molecular dynamics in virtual reality (iMD-VR) as a low-cost strategy for investigating biomolecular problems like protein-ligand binding. Part of the attraction of an MD-based approach is the fact that it can capture movement and flexibility of both the protein and the ligand. Our open-source iMD-VR framework Narupa [23] allows users to interactively visualise and manipulate the molecular dynamics of real-time simulations within a virtual environment with atomic-level precision. The iMD-VR approach recognizes that the neural nets of the human brain, trained over several aeons, offer extremely sophisticated machinery for efficiently undertaking tasks linked to 3D insight, spatial navigation, and manipulation, with energy demands that are a fraction of those required by sophisticated hardware-accelerated machine-learning frameworks. Narupa enables humans to utilize their 3D spatial awareness skills, and their ability to undertake ‘on-the-fly’ reasoning, to perform sophisticated operations on complex three-dimensional molecular structures, giving them the ability to move molecular systems between different regions of configuration space. In previous work, we have demonstrated that iMD-VR has acceleration benefits for various 3D molecular tasks compared to two-dimensional interfaces. [23, 24] The ability to physically reach out and manipulate simulated systems as if they were tangible objects gives users the opportunity to explore molecular transformations, mechanisms, and rare events that may otherwise be inaccessible using conventional high-performance computing (HPC) MD simulations. While we have previously demonstrated the potential of iMD-VR to generate dynamical pathways in small systems, it remains to be seen whether iMD-VR is sufficiently intuitive and enables enough control for researchers to undertake more complex tasks.

In this article, we focus on applying Narupa to unbinding and rebinding small molecules to proteins, as illustrated in Fig 1. Applying iMD-VR to protein-ligand systems requires moving a simulation between thermodynamically and kinetically distinct states (i.e., bound and unbound). This requires sophisticated three-dimensional spatial reasoning: the protein binding pocket needs to be located, and the ligand orientated such that key contacts to specific residues are properly recreated. We evaluate the utility of iMD-VR to facilitate such manipulations through applications to three protein-ligand systems of increasing complexity, and investigate the extent to which iMD-VR ‘experts’ and ‘novices’ were able to reversibly generate protein unbinding and rebinding pathways and recover crystallographic ligand poses for the systems shown in Fig 1. The results of the experimental protocol outlined herein show that iMD-VR users can quickly unbind and rebind ligands to proteins and recover experimental protein-ligand structures. For example, we show that an hour-long training session with an expert user enables novice iMD-VR users to recover the original binding poses for several different protein-ligand systems. We have established that the pathways generated by iMD-VR users are reproducible, and that the binding poses they recover are stable when used to initiate MD simulations over longer simulation timescales.

thumbnail
Fig 1. Interactive protein-ligand rebinding using iMD-VR.

The top panel shows a schematic of Narupa, the open-source multiperson iMD-VR framework used to carry out the studies described herein, showing two participants using handheld wireless controllers to manipulate a real-time MD simulation of neuraminidase and oseltamivir. More information on the Narupa setup is available in ref 15. The bottom panel shows a three-dimensional representation of the binding pockets of each of the three protein systems interactively undocked and redocked, each bound to a ligand. The trypsin graphic (A) highlights the network of hydrogen bond interactions from Asp189, Ser190, and Gly219 to benzamidine (based off PDB code 1S0R). The neuraminidase graphic (B) highlights the hydrogen bonds between oseltamivir and both the 150-loop and a positively-charged trio of arginine residues of neuraminidase (based off a mutated neuraminidase structure derived from PDB code 2QWK). The HIV-1 protease graphic (C) highlights the hydrogen bonds between the hydroxyl group of amprenavir with both Asp25A and the protonated form of Asp25B (based off PDB code 1HPV). For HIV-1 protease, the protein ‘flaps’ have been highlighted in purple.

https://doi.org/10.1371/journal.pone.0228461.g001

iMD-VR represents a relatively new approach to simulating drug binding, since it involves a fully (or predominantly) flexible dynamic system and also does not require predefining a CV along which to carry out biased molecular dynamics simulations. [25] Instead, the researcher simply ‘draws’ the desired pathway by manipulating a complex system between different states. This approach has the potential to accelerate the exploration of pathways through high-dimensional configuration space. Overall, our tests show that a well-designed iMD-VR setup enables expert and novice users alike to accelerate complex 3D state changes of biomolecular systems. Our results suggest that iMD-VR tools are sufficiently intuitive and provide adequate control to enable users to recover experimentally derived bound complexes. Moreover, the fact that the sampled pathways are reproducible suggests that they are not too far from the equilibrium ensemble.

2 Methods

2.1 System selection

The systems selected for the studies outlined herein include, at one extreme, those that have well-characterized binding modes, where a user with prior knowledge of the system can use iMD-VR to sample binding and unbinding pathways. At the other extreme, we sought to push the limits of the iMD-VR tools and examine more complex protein-ligand systems. The three systems we chose are shown in Figs 1 and 2 and are described below in order of increasing complexity.

thumbnail
Fig 2. Summary of iMD-VR docking tasks.

The ligands that were interactively unbound and rebound in a series of user tests, two for each of the three systems: Trypsin (A), Neuraminidase (B), and HIV-1 Protease (C). For each protein system, we devised two docking tasks, a training phase (where a trace representation of the ligand in the bound pose was present to guide users as shown in supplementary videos A–C in S1 File), and a testing phase, where no trace atoms were shown. For the testing tasks, parts of the ligand which interact with key residues have been highlighted in lilac. For proteins A and B, where the testing task ligand differs from training, the shared scaffold between the two ligands is highlighted in green.

https://doi.org/10.1371/journal.pone.0228461.g002

2.1.1 Trypsin.

The first protein-ligand system is the enzyme trypsin with a benzamidine ligand. Trypsin is a well-studied serine protease hallmarked by a catalytic triad consisting of histidine, serine and aspartic acid. As shown in Fig 1A, a secondary motif of the trypsin binding pocket is an additional aspartic acid residue (at position 189) that stabilises the positively charged amino acids, binding a peptide chain in a pose that allows for specific cleavage of the carboxyl-side chain in which these amino acids are present.[26] Asp189 is charged, partially-buried, and is surrounded by a water network that favours desolvation.[2729] Trypsin inhibitors can exploit this by mimicking positively-charged residues and forming an electrostatic contact with Asp189, an interaction further stabilized by a network of hydrogen bonds to Ser190 and Gly216 (Fig 1A).[30] The binding mode between trypsin and one of its inhibitors, benzamidine, has relatively fast kinetics and is therefore often applied as an initial use case for theoretical methods.[3134] From an iMD-VR perspective, the trypsin-benzamidine system represents a binding mode where users do not need to induce significant conformational changes in the protein, and the ligand has only one rotatable bond. In effect, a user need only move the benzamidine out of the binding pocket before placing it back, making trypsin a relatively simple test case for iMD-VR.

Fig 1A shows a three-dimensional binding mode of benzamidine in the secondary binding pocket of trypsin, illustrating how residues Asp189, Ser190, and Gly216 bind to ligands (based off PDB code 1S0R). Fig 2A shows the two-dimensional binding mode of two ligands, benzamidine and indole-amidine, into trypsin. Supplementary video A.1 in S1 File (https://vimeo.com/354833443) shows an example of an expert iMD-VR user unbinding and rebinding benzamidine from trypsin in iMD-VR.

2.1.2 Neuraminidase.

H7N9 neuraminidase is a glycoside enzyme found on the surface of influenza virions that catalyzes the hydrolysis of sialic acid residues, a process integral to influenza virus mutation.[35, 36] This strain was found in the influenza virus responsible for the 2013 bird flu pandemic.[37] The process of influenza virus replication begins with the virus attaching itself to the cell via the binding of hemagglutinin (a viral cell surface protein) to sialic acid, found on the end of glycoproteins attached to the cell membrane. Once anchored, the virus can enter the cell and hijack its machinery in order to replicate itself. After new viruses have been manufactured, neuraminidase proteins cleave sialic acid groups from cellular glycoproteins, breaking the anchor between the virus and the host cell, freeing the new viruses to infect other cells. In recent years, there has been extensive research on the discovery of inhibitors that mimic sialic acid, thereby preventing the cleavage of sialic acid residues by neuraminidase and obstructing the release of new viruses. [38, 39] Oseltamivir (Tamiflu) is a transition state analogue of sialic acid and licensed therapy for influenza that binds to a triad of arginine residues in the neuraminidase active site (Arg118, Arg292, and Arg371) via its carboxylate group. [40] In addition to this, a crucial loop opening and closing mechanism, known as the 150-loop motion, is involved in the unbinding and rebinding of oseltamivir (Fig 1B). [41, 42] From an iMD-VR perspective, the binding of oseltamivir to neuraminidase represents a more complex task than the trypsin-benzamidine system for two reasons: (1) Given that 150-loop movement is imperative to binding, this backbone motion needs to be carried out by the iMD-VR user; (2) With eight rotatable bonds, oseltamivir has considerably more conformational flexibility than benzamidine, making it more of a challenge to re-establish the correct bound configuration.

Fig 1B shows a three-dimensional binding mode of oseltamivir in the active site of neuraminidase, illustrating how the 150-loop and arginine trio binds to ligands (based off a mutated neuraminidase structure derived from PDB code 2QWK; details for how this was done can be found in ref [37]). Fig 2B shows the two-dimensional binding mode of two ligands, oseltamivir and zanamivir, into neuraminidase. Supplementary video B in S1 File shows an example of an expert iMD-VR user unbinding and rebinding oseltamivir from neuraminidase in Narupa.

2.1.3 HIV-1 protease.

HIV-1 protease is a viral aspartyl protease essential to the life cycle of HIV, responsible for cleaving precursor polypeptides into functional proteins, making it an attractive drug target for preventing HIV maturation. Structurally, HIV-1 protease is a homodimer that shares a single active site between two protein subunits, each of which contributes a catalytic aspartic acid. Mechanisms for HIV-1 protease cleavage of precursor proteins have been proposed [43]; however, it is broadly understood that the two aspartic acid residues each act as an acid or base respectively, activating a water molecule that then goes on to break peptide carbonyl bonds via nucleophilic attack. Sulfonamides are a class of drug licensed for the treatment of HIV that hydrogen bond to the catalytic aspartic residues and thus block protease activity [44], an example of which is amprenavir. Functionally, the two aspartic acids are thought to primarily exist in a monoprotonated state, especially when in the presence of an inhibitor [45], although it has been debated which tautomer favours amprenavir binding. [4650] The HIV-1 protease active site is gated by two beta-hairpin flaps that shift through a series of different conformational states before ligand binding. [51] Therefore, this task requires iMD-VR users to move these flaps into an open position to guide amprenavir out, and then carefully place the loops back without disrupting their secondary structure. From an iMD-VR perspective, the motion of these flaps, combined with the higher rotational flexibility of amprenavir, makes this a particularly challenging binding task. Fig 1C shows a three-dimensional binding mode of amprenavir in the active site of HIV-1 protease, illustrating how Asp25A and Asp25B bind amprenavir (based off PDB code 1HPV). The active site flaps are coloured in purple. Fig 2C shows the two-dimensional binding mode of amprenavir into HIV-1 protease. Supplementary video C in S1 File (https://vimeo.com/354834013) shows an example of an expert iMD-VR user unbinding and rebinding amprenavir from HIV-1 protease in Narupa.

2.2 Overview of binding and rebinding tasks

Using these candidate protein-ligand systems, we carried out iMD-VR tests. Users were split into expert and novice cohorts. Expert users had extensive experience using iMD-VR to interact with molecules and had some degree of familiarity with the protein-ligand systems described in this paper. The experts first tested the viability of using iMD-VR to generate ligand unbinding and rebinding pathways. Expert iMD-VR studies were carried out to establish whether the original crystallographic structure could be re-established for each task. Then, a cohort of novice users were asked to carry out a series of unbinding and rebinding tasks, in order to control for the experts’ familiarity with iMD-VR tools and the selected systems. This novice users had little experience using iMD-VR but did have a scientific background in biomolecular simulations.

2.2.1 Expert user tests.

Expert iMD-VR users were tasked with unbinding and rebinding three ligands from the binding pockets of three proteins: trypsin, neuraminidase, and HIV-1 protease, detailed in Section 2.1. Fig 1 shows a three-dimensional representation of each of the three experimentally derived poses that expert users were asked to recreate. In these tasks, experts were initially given a faint trace of the ligand atoms in the correct pose, providing a visual indication of where the ligand sits in the crystallographic binding pose. Experts could freely switch between interacting with either a single atom at a time or a group of atoms together (e.g., moving one atom of the ligand or the center of mass of the entire ligand at once). Experts could also add or remove positional restraints to atoms, such as when manipulating the active site flaps during the HIV-1 protease task. Supplementary videos A-C in S1 File show experts unbinding and rebinding small molecules from trypsin, neuraminidase and HIV-1 protease, and also show the guide trace for the position of the ligand atoms. [23]

2.2.2 Novice user tests.

Once the iMD-VR unbinding and rebinding pathways in Narupa had been established by the expert users, novice users were recruited to complete unbinding and rebinding tasks with each of the three protein systems. Novice users interacted with each system until they felt they had correctly established a binding pose, spending approximately five minutes on each task. The protein-ligand systems were given in order of increasingly complexity. Fig 2 gives a summary of tasks novice participants were asked to complete. We recruited a total of ten novice participants in two cohorts of five (‘cohort one’ and ‘cohort two’). Cohort one completed tasks described in 2.3.1–2.3.4. Cohort two completed tasks described in 2.3.1–2.3.4 and 2.3.6. More detailed demographic and task information can be found in the S1 File.

To ensure there was a comparable level of proficiency with the Narupa iMD-VR interface prior to attempting protein-ligand binding, novice participants were first placed in a simulation of C60 molecules and given an opportunity to familiarize themselves with the controls and the interaction, as was done in previous studies. [24] Additionally, before interacting with each of the three protein systems described in Section 2.1, we took advantage of the multiplayer feature of Narupa: novice users were placed in the same iMD-VR simulation as the expert user (as shown in the Fig 1 schematic), where they could see a fully three-dimensional representation of the bound poses shown in Fig 1. To ensure the novices had a comparable level of knowledge of each system, while co-habiting the simulation space, the iMD-VR expert gave a brief background on the system and highlighted the interactions shown in Fig 1.

Once the system had been introduced, participants were asked to complete a ‘training’ ligand unbinding-rebinding task. During the training phase, novices had a trace of the ligand atoms in the bound pose to help guide them. Next, participants were given a ‘testing’ ligand unbinding and rebinding task. A summary of the training and testing tasks for each of the three protein systems is shown in Fig 2. In these experiments, no trace atoms of the ligand were present, meaning that participants had to rely more on their chemical insight to find the correct binding pose. To control for learning effects during the testing phase of trypsin and neuraminidase binding, novices were asked to bind different ligands from those used during training. The alternative ligands were chosen on the basis of key interactions remaining between the ligand and protein in the training portion of the study (highlighted in lilac in Fig 2A and 2B) and sharing structural similarities (highlighted in green in Fig 2A and 2B). Due to a lack of availability of an appropriate alternative ligand to amprenavir, and because of the heightened complexity of the task owing to the number of rotatable bonds amprenavir contains, amprenavir was used for both the training and testing phases of the HIV-1 protease system. Novice users also had a more limited user interface, where they only had the option of interacting with a single atom at a time and could not apply or remove positional restraints.

2.3 iMD-VR simulation set up

The Narupa selection utility, shown in supplementary videos A–C in S1 File, allows users to select and apply visualization and interaction settings to a subset of atoms in a system. For all systems, the following selections were created: (a) protein backbone atoms, (b) key active site residues, (c) ligand atoms. Each system had some specific features, detailed below. Further simulation-specific details can be found in the S1 File.

2.3.1 Trypsin and benzamidine (expert task and novice training task).

This task (Fig 2A, ‘Training’) was completed by both experts and novices. Given the relatively simple binding mode between benzamidine and trypsin, where the protein does not need to be manipulated by the user, the entire protein backbone was held in place with a positional restraint (details can be found in the S1 File). To enable the user to visually identify the location of the active site in the protein, residues Asp189 and Ser190 were fully rendered and the backbone oxygen of Gly219 was shown, similar to the representation shown in Fig 1A. Additionally, supplementary video A.1 in S1 File shows the rendering scheme used for trypsin and benzamidine. For the remaining residues, only the protein backbone was rendered. A trace of benzamidine in the crystallographic pose was used as a visual indication of where the ligand binds, encouraging the user to re-establish the original bound pose as closely as possible.

2.3.2 Trypsin and indole-amidine (novice testing task).

Following the training phase, novices were asked to bind indole-amidine into trypsin (Fig 2A ‘Testing’). Indole-amidine was selected as an alternate ligand for novices to bind as it contains the benzamidine moiety and adopts the same binding mode to Asp189, Ser190 and Gly219. However, unlike benzamidine, indole-amidine has a non-symmetrical extended scaffold that needs to be correctly orientated. This testing task used the same protein restraints and rendering scheme as the training task with trypsin and benzamidine (Section 2.3.1). However, no trace atoms were shown.

2.3.3 Neuraminidase and oseltamivir (expert task and novice training task).

This task was completed by both experts and novices. Binding oseltamivir requires a small amount of manipulation of the protein, as the positions of the 150-loop residues over the ligand need to be carefully maintained. However, as no significant shift in the tertiary structure of neuraminidase is required, all backbone atoms were held with a positional restraint (details can be found in the S1 File). To aid in ensuring the 150-loop was correctly placed, two strategies were adopted. First, residues Asp151 and Arg152 of the 150-loop were fully rendered so the user could see their position. Second, a trace representation of the position of the 150-loop residues in the closed position was rendered so the user could see where they should be placed. Additionally, to show if a hydrogen bond between the carboxylic acid of oseltamivir and the arginine trio of the neuraminidase binding pocket had been established, Arg118, Arg292, and Arg371 were also fully rendered. For the remaining residues, only the backbone atoms were shown. A trace of the oseltamivir atoms in the correct position was used as a visual indication of where the ligand binds, ensuring that the original binding pose could be closely replicated. Supplementary video B in S1 File shows the rendering scheme used in Narupa for neuraminidase and oseltamivir.

2.3.4 Neuraminidase and zanamivir (novice testing task).

Following the training phase, novice users were asked to bind zanamivir to neuraminidase. Zanamavir was selected as the second binding task as it is a chemical analogue to oseltamivir, sharing the same ring scaffold, carboxylic acid group, and hydrogen-bonding contacts to the 150-loop. However, zanamivir does exhibit some structural differences from oseltamivir; therefore, this task requires the user to recognize the similarity in binding modes between the two drugs. As discussed below, we used this particular task to examine the results obtained if we removed backbone restraints entirely. The same protein rendering scheme as neuraminidase and oseltamivir was used (Section 2.3.3), however, no trace atoms were present.

2.3.5 HIV-1 protease and amprenavir (expert task).

Binding small molecules to HIV-1 protease requires a significant shift in the backbone atom positions, altering the structure from closed to open. During the simulation, the user could toggle a positional restraint force on the active site flaps, allowing them to be held in place once an open or closed conformation had been established. An additional strategy to aid in the opening and closing of HIV-1 protease was to render trace atoms of the protein backbone in the position of the active site flaps in both the closed position and the open position, giving the expert user a visual indication of the range of motion the loop has between the two states, as well as allowing the user to closely re-establish the original closed conformation once amprenavir had been redocked; details on how this was done can be found in the S1 File. Aside from the active site beta-hairpin flaps, which were manipulated by the expert user during the simulation, the protein backbone atoms were held by backbone restraints. Details on the positional restraints can be found in the S1 File. Key contact residues Asp25A and Asp25B were fully rendered; the remaining amino acids only had their backbone atoms shown. A trace atom representation of amprenavir was used as a visual indication of where the ligand binds, ensuring the original bound conformation could be re-established.

2.3.6 HIV-1 protease and amprenavir (novice testing and training tasks; cohort two only).

Novice users from cohort two, totalling five participants, were asked to unbind and rebind amprenavir twice, once with the aid of trace atoms and once without. All other backbone atoms in the protein were positionally restrained. To streamline the task, novices were not required to open and close the HIV-1 protease flaps prior to and after binding amprenavir, instead starting from an HIV-1 protease conformation that had been pre-opened by an expert using iMD-VR and held with a positional restraint. Details of this can be found in the S1 File. Both the testing and training binding tasks used the same rendering scheme as the expert user tests (Section 2.3.5)–the only difference between the two tasks was the presence of trace atoms.

2.3.7 Backbone restraints.

During iMD-VR, users can apply force to any atom in the system–including those integral to the protein tertiary or quaternary structure. In order to ensure that the protein tertiary structure required for binding was not distorted, a positional restraint was applied to the backbone atoms in the proteins. To evaluate the impact of these restraints, we removed the backbone restraints for cohort two during the neuraminidase and zanamivir testing phase. A comparison of task completion between cohort one and cohort two for undocking and redocking zanamivir from neuraminidase can be found in S3 Fig in S1 File. These results, discussed in further detail in the S1 File, show that careful iMD-VR users (expert and novice alike) were able to carry out binding and unbinding tasks without positional restraints. For analysis of iMD-VR trajectories, we observed that PCA carried out on iMD-VR trajectories which utilized positional restraints produced better results for the binding/unbinding pathways of interest, owing to diminished noise from the backbone fluctuations.

2.4 Analysis of iMD-VR results

For both the expert and novice interactive unbinding and rebinding tasks, trajectories of the protein-ligand system were recorded, taking a snapshot of the system every 0.25 ps. iMD-VR trajectories were analysed using an RMSD calculation protocol (described in the S1 File). The RMSD was used to: (1) analyse the expert trajectories to see how the system is changed from a bound to unbound and back to bound state; (2) analyse novice trajectories to identify how close a user got to recreating the starting pose; and (3) analyse whether the recovered binding poses were stable. Specifically, we selected an iMD-VR frame with an RMSD close to the starting coordinates and used it to initialize a longer timescale (200 nanoseconds) MD trajectory. Further details can be found in the S1 File.

To qualitatively assess the reproducibility of the binding pathways users explored using iMD-VR, we utilized a Principal Component Analysis (PCA) tool called PathReducer, [52] which takes as input an xyz file containing a series of molecular structures (in this case, an iMD-VR-generated trajectory) and outputs the set of principal coordinates which captures the most structural variance in the fewest coordinates. In effect, PathReducer collapses a 3N-dimensional (where N = number of atoms) trajectory into a visualisable path which spans two or three dimensions. Further details of this can be found in the S1 File.

3. Results and discussion

3.1 Expert unbinding and rebinding tasks

3.1.1 Trypsin.

Fig 3A shows the RMSD of the expert user-generated trajectory of benzamidine unbinding and rebinding to trypsin, alongside the top two and top three PCs along the trajectory path. The RMSD plot of benzamidine shows that the original binding pose was recovered by the expert user, as the RMSD between the ligand at the end of the trajectory and the beginning is very low. The PCA results for this system showed that the direction of PC1 (the PC that captures the most structural variance along the trajectory) corresponds predominantly to motion of the ligand away from the protein. The value of PC1 at the end of the trajectory is very similar to that at the beginning, which is more evidence that this user was able to replicate the original bound pose of the drug. PC2 and PC3 correspond to fluctuations of the backbone and side chains not in the active site of the protein, hence the trajectory not returning precisely to its initial value of these PCs. Supplementary animation A in S1 File (https://vimeo.com/354828618) shows the trajectory generated by an expert user (31 picoseconds of simulation in total). Fig 4A shows the RMSD of an expert-generated rebound trypsin-benzamidine complex over 200 nanoseconds of production MD, after solvation, minimization and equilibration as described in the S1 File, establishing that the iMD-VR produces stable poses for this system.

thumbnail
Fig 3. Recovering binding poses of three protein-ligand systems using iMD-VR.

Left: Ligand RMSD of iMD-VR generated trajectories by experts (compared to starting coordinates): (A) trypsin and benzamidine, (B) neuraminidase and oseltamivir, and (C) HIV-1 protease and amprenavir. For neuraminidase, the RMSD of the 150-loop residues is shown in blue. For HIV-1 protease, the RMSD of the active site flaps backbone atoms is shown in blue. Right: The top two (left plots) and top three (right plots) principal components (PC1-3) for representative trajectories of the three protein-ligand systems. Time is represented for each trajectory by the colour of the plot, starting at purple for the beginning of the trajectory and yellow for the end, passing through blue and green in between.

https://doi.org/10.1371/journal.pone.0228461.g003

thumbnail
Fig 4. Ligand RMSD over 200 nanoseconds of molecular dynamics.

Showing the RMSD of each of the three ligands (compared to the initial solvated, minimized and equilibrated bound coordinates) during 200 nanoseconds of molecular dynamics simulation. From top to bottom: (A) benzamidine (in trypsin), (B) oseltamivir (in neuraminidase), and (C) amprenavir in HIV-1 protease.

https://doi.org/10.1371/journal.pone.0228461.g004

Expert study subjects were able to unbind benzamidine in under 5 picoseconds and obtain the rebound pose within another 5 picoseconds (Fig 3A). Using our computing architecture, as described in the S1 File, we were able to achieve MD simulation rates of 4.45 picoseconds per minute of real time, meaning that benzamidine could be unbound and rebound on the scale of minutes. As such, this demonstrates the efficiency with simulations can be interactively pushed between two distinct states. Trypsin-benzamidine binding events have previously been observed to take nanoseconds of simulation time [32], and theoretical calculations of trypsin-benzamidine unbinding rate constants, koff, predict millisecond dissociation times [31, 33]. To unbind or rebind benzamidine, the user does not need to make any major conformational changes to the binding site of trypsin, so completing this task was a matter of applying sufficient force to break the relatively weak electrostatic contacts between benzamidine and Asp189 and continuing to apply this force until the ligand had fully dissociated from the protein (supplementary video A.1 in S1 File). On rebinding, the user had to take care to keep the orientation of benzamidine such that the charged amidine group is pointing towards the bottom of the pocket. If the user attempted to bind benzamidine with the benzyl group directed towards the positively-charged Asp189 residue the ligand would be repelled out of the binding pocket, demonstrating the real-time responsiveness of iMD-VR to energetically unfavourable input, as shown in Supplementary video A.2 in S1 File. The video shows how, after being guided out of the binding pocket and replaced in an incorrect orientation, the N1 and N2 of benzamidine fail to form electrostatic contacts to Asp189, and is subsequently repelled out of the binding pocket, moving towards the HE1 of His57 in Trypsin, where it remains in a transiently stable pose over at least 200 ps, as shown in S2-A1 Fig in S1 File. Supplementary video A.2 in S1 File shows the expert utilizing the 3D interface of Narupa to identify those amino acids with which the amidine group interacts with. This alternate binding mode, in which benzamidine favours the catalytic triad and polar residues surrounding the binding pocket is similar to previous observations by Buch et al. [32]. S2-A2 Fig in S1 File shows how it was possible for the expert user to apply continuous force to the benzamidine when rebinding, guiding it past the cluster of hydrophilic groups and into the protein core and moving it out of the protein via an alternate opening. Whilst the kinetic barriers for these pathways are likely to be very large; they nevertheless show the ability of iMD-VR to quickly enable exploration of new dynamical pathways.

3.1.2 Neuraminidase.

Fig 3B shows the RMSD of the expert-generated trajectory, alongside PathReducer-generated principal component plots, establishing that the original binding pose was recovered by the expert user. Similar to the trypsin system, PC1 is dominated by distances between atoms in the ligand and atoms in the protein, and so the variation along PC1 that can be seen along the trajectory path can be directly attributed to the removal of oseltamivir from the active site. PC2 and PC3 are again predominantly defined by distances between atoms in residues of the protein far from the active site, and thus fluctuations of the associated side chains lead to an offset in the pathway’s start and endpoints. Supplementary animation B in S1 File (https://vimeo.com/354829098) shows the trajectory generated by an expert user (58 picoseconds of simulation time in total). Fig 4B shows the RMSD of an expert-generated rebound oseltamivir-neuraminidase complex over 200 nanoseconds of production MD, after solvation, minimization and equilibration as described in the S1 File, establishing that the iMD-VR can produce stable poses for this system.

Using our computing architecture, as described in the S1 File, we were able to achieve MD simulation rates of 4.51 picoseconds per minute of simulation time, meaning that oseltamivir could be unbound and rebound on the scale of minutes of real time. Unbinding oseltamivir requires applying enough force to break any charged contacts or hydrogen bonds formed between the triad of positively charged arginine residues in the active site and the negatively charged carboxylate group of oseltamivir. Upon unbinding, the interactions from the amine group and carbonyl group of the drug and the two residues which comprise the 150-loop (Asp151 and Arg152 –see Fig 1B) were also broken, causing the loop residues to dynamically open as the drug leaves the active site as shown in supplementary video B in S1 File (https://vimeo.com/354833800). The backbone of the 150-loop did not dynamically open when the drug exited the active site as the backbone was held in place with restraints. When removing oseltamivir from the binding site, the molecule would occasionally flip itself over, losing the bound orientation. As such, when attempting to re-establish the correct binding pose, the user needed to be extremely careful to ensure that binding interactions between the drug and the 150-loop residues were successfully recreated, allowing the drug to flip itself back into the correct pose; if the user was too forceful, the drug would enter the active site in the incorrect orientation. A representation of oseltamivir ‘flipping’ as it is being removed from the neuraminidase binding pocket can be found in S2-B1 Fig in S1 File. We observed that when oseltamivir is back into the correct bound conformation, the two 150-loop residues will also move back into the original configuration without any input from the user, demonstrating how iMD-VR can dynamically reform energetically-favourable contacts.

3.1.3 HIV-1 protease.

Fig 3C shows the RMSD of the expert-generated trajectory, alongside generated principal component plots, establishing that the original binding pose was recovered by the expert-user. The PCA results in this case are unique, as the user had to move two flaps capping the active site prior to removal of the bound amprenavir (discussed in more detail below, see Fig 5). Each approximately linear segment of the PC plots corresponds to a conformational change event: First, one flap is moved away from the active site. Then the second flap is moved away from the active site. Finally, the ligand is removed from the active site. This process is then reversed to rebind the ligand. The plots show strikingly similar pathways for unbinding and rebinding, likely as the principal coordinates are each capturing a significant movement of the protein backbone and thus ignoring smaller residue fluctuations; comparatively, PC2 and PC3 in the trypsin and neuraminidase systems do not have the same starting and end points as they are ‘picking up’ protein fluctuations along the trajectory. Intuitively, PC1 is dominated by changes in distances corresponding to movement of the ligand out of the protein, PC2 corresponds predominantly to distances between the first ‘flap’ and other parts of the protein, and PC3 corresponds predominantly to distances between the second ‘flap’ and other parts of the protein. Supplementary animation C in S1 File (https://vimeo.com/354829412) shows the trajectory generated by an expert user (61 picoseconds simulation time in total). Fig 4A shows the RMSD of an expert-generated rebound HIV-1 protease-amprenavir complex over 200 nanoseconds of production MD, after solvation, minimization and equilibration as described in the S1 File. Some fluctuations to the RMSD value are observable. Inspection of the 200 nanosecond trajectory shows that the hydroxyl group ‘flips’ its orientation relative to the two catalytic aspartic acid residues, resulting in amprenavir adopting a slightly different binding pose throughout the simulation, albeit still remaining in the binding pocket of HIV-1 protease. Fig 5 shows a step-by-step breakdown of the process of unbinding and rebinding amprenavir from HIV-1 protease, establishing that the iMD-VR can produce stable poses for this system. In the apo form, the active site flaps of HIV-1 protease are thought to dynamically switch between various open and closed conformations [51]. Upon binding, the once-open flaps have a stabilizing effect by closing over the ligand. [53, 54] Experimental estimates of koff for amprenavir binding to WT HIV-1 protease correspond to an average binding residence time on the scale of seconds [55, 56], making it a difficult transition to simulate without some form of biasing. Using our computing architecture, as described in the S1 File, we were able to achieve MD simulation rates of 4.45 picoseconds per minute of simulation time, meaning that amprenavir could be unbound and rebound–including opening and closing the active site flaps—on the scale of minutes of real time.

thumbnail
Fig 5. Snapshots along the interactive unbinding and rebinding of amprenavir into HIV-1 protease.

The image in the upper left shows the bound “start” pose. (a)–(c) shown snapshots as the user unbinds amprenavir; (d) shows the unbound pose; and (e)–(g) shows snapshots as the user rebinds amprenavir. The corresponding RMSD time traces are shown in the middle plot, identical to that shown in Fig 3c.

https://doi.org/10.1371/journal.pone.0228461.g005

Additionally, the Narupa selection language enabled the expert user to remove positional backbone restraints and move the flaps into an open position, before replacing the restraints and extracting amprenavir away from the binding pocket. To rebind amprenavir, the hydrogen bonding to Asp25A and Asp25B was reformed and, once the stable binding configuration had been found, the user moved the beta-hairpin flaps back into the closed conformation, again replacing the restraints. We note that we initially performed this task with a tautomer of HIV-1 protease where neither Asp25 residues were protonated; in this state, amprenavir seemed to favour a slightly different binding pose as its hydroxyl group no longer acts as a hydrogen bond acceptor. To further investigate the difference in binding mode between the two tautomers of HIV-1 protease, both were placed in Narupa with all positional backbone restraints removed. The expert user observed both binding poses without interacting with the system over 125 picoseconds of simulation time. In the monoprotonated HIV-1 protease system, the hydroxyl group of amprenavir rigidly points towards the deprotonated aspartic acid residue. Conversely, when both catalytic residues are deprotonated, the hydroxyl group frequently flips between pointing at either aspartic acid, causing amprenavir to ‘sag’ in the active site (see S2-C1 Fig in S1 File). The positively charged hydrogen on the protonated aspartic acid appears to have a stabilising effect on the orientation of the hydroxyl group, whereas removing the hydrogen causes the hydroxyl group to more easily flip as one orientation is no longer electrostatically ‘penalized’ by the protonated residue. Nonetheless, it is worth highlighting that Narupa enables qualitative observation of such binding dynamics in real time, demonstrating that iMD-VR enables both manipulation of molecular dynamics and easy comprehension of distinct dynamical behaviour in systems that have only small structural differences.

3.2 Novice unbinding and rebinding tasks

Fig 6 shows the minimum achieved RMSD of each small ligand novices tried to dock in both ‘training’ and ‘testing’ phases. For fully flexible simulations like these, which explore a range of conformational space as a result of thermal fluctuations, there is an open question how to assess whether a particular simulation has successfully recovered the binding pose seen in the crystal structure. For example, in a recent review by Pagadala et al. [57] which evaluated several different docking software programs, they used a cutoff of 2 Å to determine whether or not a fully flexible docking program managed to find the ‘correct’ pose. This strategy makes sense for cases where the protein’s unbiased fluctuations span a range less than 2 Å. The solid boxes in the left hand panel of Fig 6 show the RMSD range (mean ±2σ) spanned by a non-interactive Narupa simulation run for 50 picoseconds initialized in the bound pose at 298 K. Fig 6 shows that the ±2σ range is less than 2 Å for all but the trypsin + indole-aminidine testing system. During user ‘training’ phases, the results from Fig 6 show that all of the user generated poses are either within ±2σ of the mean RMSD, or else have an RMSD which is within 2 Å of the crystal structure. During user ‘testing’ phases, where the binding guide was switched off and the ligand identity was different for trypsin and neuraminidase, the results show more scatter, exactly as we would expect. Nevertheless the results are encouraging. For trypsin, all of the novice-generated testing poses lie within ±2σ of the mean RMSD. For neuraminidase and HIV-1 protease, which are considerably more complicated tasks, the results are similarly encouraging: 4/5 of the neuraminidase testing poses are within 2.08 Å RMSD of the crystal structure; and 3/5 of the HIV-1 protease testing poses are within 2.15 Å RMSD of the crystal structure. Below we discuss the results for each system in further detail.

thumbnail
Fig 6. Minimum RMSD values for novice tasks.

(Left) Showing the minimum achieved RMSD of each small ligand novices tried to dock, where each participant is assigned a unique colour. Two ligands were docked for each system, in both ‘training’ and ‘testing’ phases; each ligand is denoted in Fig 2. To determine the extent to which user-identified poses aligned with the native RMSD, a non-interactive Narupa simulation was run for 50 picoseconds in the bound pose and the average RMSD (plus or minus two standard deviations) was calculated, shown as the purple box. The whiskers at the top of each box span 1 Angstrom. (Right) The pose corresponding to the minimum RMSD value shown is on the left for each participant, overlaid on top of one another. Testing tasks for the three are shown in the top row, training tasks are shown on the bottom.

https://doi.org/10.1371/journal.pone.0228461.g006

3.2.1 Trypsin.

Directly comparing the trypsin training and testing tasks for the same participant shows that task accomplishment was better for all participants (ranging between 1.0Å and 1.9Å lower) during the training task. Benzamidine, used during the training task, is a small ligand and users had the benefit of trace atoms (Fig 6), explaining the good performance on the task. During the testing task, Fig 6 shows that participants were all able to recreate the amidine contact to Asp189, correctly aligning the indole-amidine benzamidine moiety. The variation in RMSD can be attributed to the rest of the molecule, where users had no prior guidance; variation in the extended scaffold between users can be seen in Fig 2A. Therefore, task accomplishment for binding indole-amidine into trypsin could likely be improved by providing additional feedback to indicate poses that are energetically favourable.

3.2.2 Neuraminidase.

Fig 6 shows the minimum achieved RMSD for the neuraminidase training and testing tasks. All participants were able to recreate the original bound pose of oseltamivir in neuraminidase, getting within 1Å RMSD of the starting coordinates. The training results gave better minimum ligand RMSDs compared to the testing task, where were between 0.3Å and 3.2Å higher when the results between the training and testing task are compared for the same participant (Fig 6). For the testing task, four out of five participants were able to place zanamivir in the correct orientation, indicating that users recognized the shared scaffold between oseltamivir and zanamavir (highlighted in green in Fig 2B). In particular, there was very good alignment of polar moieties (e.g. the carboxylate group, the ring ether) between participants (Fig 6). However, variation arose from the flexible nature of zanamivir (whereas oseltamivir was very consistently aligned throughout the molecule when participants completed the task with the aid of trace atoms). This would suggest that an interface for completing drug binding tasks using iMD-VR could benefit from being able to give ligand-centric feedback, indicating whether functional groups are in an optimal geometry.

3.2.3 HIV-1 protease.

Fig 6C shows the minimum achieved RMSD for the HIV-1 protease training and testing tasks. Likely owing to the increased size of amprenavir and the number of rotatable bonds it has, both the training and test tasks for HIV-1 protease had a higher degree of variation in minimum RMSD achieved when directly comparing results between the two tasks for the same participant. Interestingly however, when rebinding amprenavir a second time without trace atoms to guide them (testing task), three participants were able to get an RMSD close to the training task, one of which was lower by 0.08Å. This may be a result of users remembering the correct binding pose but it is nonetheless encouraging that iMD-VR can be used to recreate poses of larger, torsionally complex molecules with limited iMD-VR training. In particular, recreation of the contact between the amprenavir hydroxyl group and Asp25B was generally good between both tasks (S4-C Fig in S1 File). For the training task, Baker-Hubbard hydrogen bond analysis [58] on the minimum RMSD pose confirms this all participants were able to establish this contact. In the testing task, 4/5 participants were able to orientate the hydroxyl group and subsequently form a hydrogen bond to the catalytic Asp25A/Asp25B catalytic residues. However, the HIV-1 protease ‘testing’ task still had the highest variation in minimum RMSD, as some participants had a poorer performance without having a trace of the correct binding pose in the active site (the highest difference in RMSD between training and testing tasks was 5.9Å). In the testing task, four participants were able to orientate the hydroxyl group and subsequently form a hydrogen bond to the catalytic Asp25A/Asp25B catalytic residues. However, when inspecting the two outliers, the overall scaffold of amprenavir is incorrectly orientated–regardless of whether the user had established the hydrogen bonding contact to Asp25A and Asp25B.

4. Conclusions & future directions

In this paper, we have outlined an experimental protocol for setting up an iMD-VR simulation for the purposes of interactively manipulating protein-ligand systems to recover experimentally derived bound poses using iMD-VR. Utilizing this protocol, we have carried out studies exploring iMD-VR as a strategy for interactively sampling the unbinding and rebinding of ligands from proteins. The iMD-VR strategy enables this process to be accelerated compared to the much larger timescales required in unbiased MD simulations. To the best of our knowledge, this study represents the first time that iMD-VR has been extended to studies of complex protein-ligand binding and unbinding dynamics.

Our results show that expert iMD-VR users are able to manipulate protein-ligand systems to sample bound and unbound states. We also assessed the extent to which novice iMD-VR uses were able to sample binding and unbinding pathways using a training phase followed by a testing phase. Overall, the training phase showed slightly better task accomplishment; however, binding was still generally successful for many of the users during the testing phase, with the majority of users able to get within 2.15 Å RMSD of the experimental crystal structure, which is a comparable level of accuracy to other fully flexible docking programs. [57] So as not to overwhelm participants with having to learn a new VR-rendering interface during training, we took the decision to simplify the protein representations as shown in Supplementary videos A–C in S1 File, showing a backbone representation along with a subset of those amino acids which are known to have key interactions with the ligands, and which make a large contribution to the overall binding free energy. Encouragingly, even though only a limited set of binding interactions were shown, participants were still able to accurately rebind the ligands studied herein, including amprenavir, whose size and flexibility makes the binding task particularly complicated. This raises an interesting question: Is it the quantity of rendered interactions (i.e., showing all possible interactions) or the quality of rendered interactions (i.e., showing a few of the most important interactions) which enable better results to be obtained during interactive docking tasks? At this stage, because we have not carried out the appropriate control experiment, our results do not conclusively show that focusing on a few key interactions enables better results, but this is an interesting avenue of future research which we intend to explore in future work. Nonetheless, these results are especially encouraging given that our cohort of novice users had a very limited time (less than 60 minutes) to be trained in both the interface and each of the three protein-ligand systems. The results suggest that the iMD-VR paradigm is sufficiently intuitive and affords adequate control to enable the sorts of detailed manipulations required to carry out VR-enabled atomistic docking in a fully flexible MD environment. Our ‘expert’ results show that iMD-VR users benefit from further training, given their familiarity with the interface and the specific protein systems.

Beyond the quantitative analysis described herein, these studies also show that iMD-VR can be used to reveal interesting qualitative features of protein-ligand unbinding, such as benzamidine not forming electrostatic contacts to Asp189 if it is incorrectly orientated and subsequently being repelled out of the trypsin binding pocket (Supplementary video A.2 in S1 File), or changes in the binding of amprenavir to tautomers of HIV-1 protease. In particular, the three-dimensional interface aided in the identification of key interactions between the protein and the ligand, as well as allowing the user to directly observe dynamical behaviours in real time.

Our iMD-VR framework enables users to quickly apply restraints to arbitrary subsets of atoms, which can be turned on and off ‘on-the-fly’. The flexible application of restraints enables the user to rapidly identify those regions where they would like to either explore or dampen conformational flexibility–e.g., in loop domains. As a training strategy, we have found that restraints enable less experienced iMD-VR users to familiarize themselves with a system without the risk of inadvertently damaging the tertiary structure. For expert iMD-VR users, restraints permit the sampling of sophisticated docking strategies that involve complex conformational changes, like that in HIV-1 protease. In future work, we intend to carry out more detailed studies on how to apply restraints so as to enable users to efficiently and accurately carry out binding tasks.

We also intend to develop additional strategies to help iMD-VR users quickly formulate and test binding hypotheses. For example, checkpoints will enable users to return the system to previous states visited earlier in the simulation, along with additional audio and visual feedback to indicate to the user how structural manipulation influences the overall energy of the system, and allowing the user to pause the simulation and carry out energy minimizations ‘on-the-fly’. For the purposes of streamlining these particular studies and not overwhelming novice users, we limited the studies described herein to a ‘backbone and key residue’ visualization scheme for the proteins. While such a representation aims to facilitate locating the active site and identifying key interactions, it makes it difficult to see unfavourable van der Waals interactions between the ligand and protein. In future work, we will examine the extent to which different rendering strategies influence task accomplishment. Such methods will help inform binding hypotheses in cases where the bound pose is unknown.

Unbiased simulation of protein ligand-binding typically occur on timescales of milliseconds or seconds [55, 56], and is therefore inaccessible to even the most sophisticated simulation architectures without recourse to a biasing or acceleration method. There is evidence that finding minimum energy pathways in hyperdimensional systems such as these is an “NP-hard” problem (for which no optimal method exists). [59] Having established herein that iMD-VR offers a reliable tool for experts and novices alike to generate accurate protein-ligand binding poses, we intend to carry out studies aimed at analysing the user-generated iMD-VR pathways. Specifically, we plan to analyse: (1) how closely user-generated iMD-VR pathways follow the system’s minimum free energy path (MFEP); and (2) methods where iMD-VR pathways may be used to recover free energies for association and dissociation. For example, the configurations and pathways generated from iMD-VR (shown in Fig 3) could serve as initial guesses for adaptive path-based free energy sampling algorithms like transition path sampling [60], umbrella sampling [61], path-based metadynamics [62], forward flux sampling, [63] string methods, [64] or boxed molecular dynamics (BXD). [65] In addition, the configurations generated during an iMD-VR run could be used to produce Markov state models for ligand binding systems [8], by seeding the model with initial conditions which are not trapped within metastable energy minima [34, 66]. Whilst there are a number of details which require further investigation in order to establish a reliable and integrated workflow for recovering free energies along user-generated iMD-VR pathways, such a framework will enable us to explore the differences in kinetics and thermodynamics between different ligands binding to the same protein target, or the free energy landscape of the same drug binding to different protein mutants, with potential applications to areas like antimicrobial resistance. More broadly, the 3D iMD-VR interface can be used to explore and sample cryptic binding configurations which may not be indicated in the crystal structure, offering exciting new opportunities for interactive drug design.

References

  1. 1. Nardi B., et al., Computing within limits. Commun. ACM, 2018. 61(10): p. 86–93.
  2. 2. McCammon J.A., Gelin B.R., and Karplus M., Dynamics of folded proteins. Nature, 1977. 267(5612): p. 585–590. pmid:301613
  3. 3. Levitt M., A simplified representation of protein conformations for rapid simulation of protein folding. Journal of Molecular Biology, 1976. 104(1): p. 59–107. pmid:957439
  4. 4. Patodia S., Bagaria A., and Chopra D., Molecular Dynamics Simulation of Proteins: A Brief Overview. Journal of Physical Chemistry & Biophysics, 2014. 4(6): p. 1.
  5. 5. Karplus M. and Kuriyan J., Molecular dynamics and protein function. Proceedings of the National Academy of Sciences of the United States of America, 2005. 102(19): p. 6679. pmid:15870208
  6. 6. Mortier J., et al., The impact of molecular dynamics on drug design: applications for the characterization of ligand–macromolecule complexes. Drug Discovery Today, 2015. 20(6): p. 686–702. pmid:25615716
  7. 7. Plattner N. and Noé F., Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models. Nature Communications, 2015. 6: p. 7653. pmid:26134632
  8. 8. Plattner N. and Noé F., Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models. Nature Communications, 2015. 6(1): p. 7653.
  9. 9. Gioia D., et al., Dynamic docking: a paradigm shift in computational drug discovery. Molecules, 2017. 22(11): p. 2029.
  10. 10. Zhao H. and Caflisch A., Molecular dynamics in drug design. European journal of medicinal chemistry, 2015. 91: p. 4–14. pmid:25108504
  11. 11. Amaro R.E., et al., Ensemble docking in drug discovery. Biophysical Journal, 2018. 114(10): p. 2271–2278. pmid:29606412
  12. 12. Śledź P. and Caflisch A., Protein structure-based drug design: from docking to molecular dynamics. Current opinion in structural biology, 2018. 48: p. 93–102. pmid:29149726
  13. 13. Hollingsworth S.A. and Dror R.O., Molecular Dynamics Simulation for All. Neuron, 2018. 99(6): p. 1129–1143. pmid:30236283
  14. 14. Ming, O.-Y., D.V. Beard, and F.P. Brooks, Force display performs better than visual display in a simple 6-D docking task, in IEEE Int. Conf. on Robotics and Automation. 1989, IEEE. p. 1462–1466.
  15. 15. Brooks F.P., et al., Project GROPE-Haptic displays for scientific visualization. ACM SIGGraph computer graphics, 1990. 24(4): p. 177–185.
  16. 16. Surles M.C., et al., Sculpting proteins interactively: continual energy minimization embedded in a graphical modeling system. Protein Sci, 1994. 3(2): p. 198–210. pmid:8003957
  17. 17. Atkinson W.D., et al., Computing with feeling. Computers & Graphics, 1977. 2(2): p. 97–103.
  18. 18. Stone, J.E., J. Gullingsrud, and K. Schulten, A system for interactive molecular dynamics simulation, in Proceedings of the 2001 symposium on Interactive 3D graphics. 2001, ACM. p. 191–194.
  19. 19. Grayson P., Tajkhorshid E., and Schulten K., Mechanisms of Selectivity in Channels and Enzymes Studied with Interactive Molecular Dynamics. Biophysical Journal, 2003. 85(1): p. 36–48. pmid:12829462
  20. 20. Dreher M., et al., Interactive Molecular Dynamics: Scaling up to Large Systems. Procedia Comp. Sci., 2013. 18: p. 20–29.
  21. 21. Luehr N., Jin A.G., and Martínez T.J., Ab initio interactive molecular dynamics on graphical processing units (GPUs). Journal of chemical theory and computation, 2015. 11(10): p. 4536–4544. pmid:26574246
  22. 22. Haag M.P., et al., Interactive Chemical Reactivity Exploration. ChemPhysChem, 2014. 15(15): p. 3301–3319. pmid:25205397
  23. 23. O’Connor M.B., et al., Interactive molecular dynamics in virtual reality from quantum chemistry to drug binding: An open-source multi-person framework. The Journal of Chemical Physics, 2019. 150(22): p. 220901. pmid:31202243
  24. 24. O’Connor M., et al., Sampling molecular conformations and dynamics in a multiuser virtual reality framework. Science Advances, 2018. 4(6): p. eaat2731.
  25. 25. Amabilino S., et al., Training Neural Nets To Learn Reactive Potential Energy Surfaces Using Interactive Quantum Chemistry in Virtual Reality. The Journal of Physical Chemistry A, 2019. 123(20): p. 4486–4499. pmid:30892040
  26. 26. Perona J.J., et al., Structural origins of substrate discrimination in trypsin and chymotrypsin. Biochemistry, 1995. 34(5): p. 1489–99. pmid:7849008
  27. 27. Schiebel J., et al., Intriguing role of water in protein-ligand binding studied by neutron crystallography on trypsin complexes. Nature Communications, 2018. 9(1): p. 3559. pmid:30177695
  28. 28. Polticelli F., et al., Structural determinants of trypsin affinity and specificity for cationic inhibitors. Protein Sci, 1999. 8(12): p. 2621–9. pmid:10631977
  29. 29. Yonetani Y., Water access and ligand dissociation at the binding site of proteins. The Journal of Chemical Physics, 2018. 149(17): p. 175102. pmid:30408972
  30. 30. Krieger M., Kay L.M., and Stroud R.M., Structure and specific binding of trypsin: comparison of inhibited derivatives and a model for substrate binding. J Mol Biol, 1974. 83(2): p. 209–30. pmid:4821871
  31. 31. Votapka L.W., et al., SEEKR: Simulation Enabled Estimation of Kinetic Rates, A Computational Tool to Estimate Molecular Kinetics and Its Application to Trypsin-Benzamidine Binding. J Phys Chem B, 2017. 121(15): p. 3597–3606. pmid:28191969
  32. 32. Buch I., Giorgino T., and De Fabritiis G., Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations. Proc Natl Acad Sci U S A, 2011. 108(25): p. 10184–9. pmid:21646537
  33. 33. Teo I., et al., Adaptive Multilevel Splitting Method for Molecular Dynamics Calculation of Benzamidine-Trypsin Dissociation Time. J Chem Theory Comput, 2016. 12(6): p. 2983–9. pmid:27159059
  34. 34. Doerr S. and De Fabritiis G., On-the-Fly Learning and Sampling of Ligand Binding by High-Throughput Molecular Simulations. J Chem Theory Comput, 2014. 10(5): p. 2064–9. pmid:26580533
  35. 35. Horimoto T. and Kawaoka Y., Influenza: lessons from past pandemics, warnings from current incidents. Nat Rev Microbiol, 2005. 3(8): p. 591–600. pmid:16064053
  36. 36. von Itzstein M., The war against influenza: discovery and development of sialidase inhibitors. Nat Rev Drug Discov, 2007. 6(12): p. 967–74. pmid:18049471
  37. 37. Woods C.J., et al., Computational assay of H7N9 influenza neuraminidase reveals R292K mutation reduces drug binding affinity. Sci Rep, 2013. 3: p. 3561. pmid:24356381
  38. 38. von Itzstein M., et al., Rational design of potent sialidase-based inhibitors of influenza virus replication. Nature, 1993. 363(6428): p. 418–23. pmid:8502295
  39. 39. Kim C.U., et al., Influenza neuraminidase inhibitors possessing a novel hydrophobic interaction in the enzyme active site: design, synthesis, and structural analysis of carbocyclic sialic acid analogues with potent anti-influenza activity. J Am Chem Soc, 1997. 119(4): p. 681–90. pmid:16526129
  40. 40. Shahrour N., The Role of Neuraminidase Inhibitors in the Treatment and Prevention of Influenza. J Biomed Biotechnol, 2001. 1(2): p. 89–90. pmid:12488615
  41. 41. Amaro R.E., et al., Mechanism of 150-cavity formation in influenza neuraminidase. Nat Commun, 2011. 2: p. 388. pmid:21750542
  42. 42. Wu Y., et al., Induced opening of influenza virus neuraminidase N2 150-loop suggests an important role in inhibitor binding. Sci Rep, 2013. 3: p. 1551. pmid:23531861
  43. 43. Brik A. and Wong C.H., HIV-1 protease: mechanism and drug discovery. Org Biomol Chem, 2003. 1(1): p. 5–14. pmid:12929379
  44. 44. Kim E.E., et al., Crystal structure of HIV-1 protease in complex with VX-478, a potent and orally bioavailable inhibitor of the enzyme. Journal of the American Chemical Society, 1995. 117(3): p. 1181–1182.
  45. 45. McGee T.D., Edwards J., and Roitberg A.E., pH-REMD simulations indicate that the catalytic aspartates of HIV-1 protease exist primarily in a monoprotonated state. J Phys Chem B, 2014. 118(44): p. 12577–85. pmid:25340507
  46. 46. Chen J., et al., Revealing origin of decrease in potency of darunavir and amprenavir against HIV-2 relative to HIV-1 protease by molecular dynamics simulations. Sci Rep, 2014. 4: p. 6872. pmid:25362963
  47. 47. Hou T. and Yu R., Molecular dynamics and free energy studies on the wild-type and double mutant HIV-1 protease complexed with amprenavir and two amprenavir-related inhibitors: mechanism for binding and drug resistance. J Med Chem, 2007. 50(6): p. 1177–88. pmid:17300185
  48. 48. Kar P. and Knecht V., Energetic basis for drug resistance of HIV-1 protease mutants against amprenavir. J Comput Aided Mol Des, 2012. 26(2): p. 215–32. pmid:22350569
  49. 49. Wittayanarakul K., Hannongbua S., and Feig M., Accurate prediction of protonation state as a prerequisite for reliable MM-PB(GB)SA binding free energy calculations of HIV-1 protease inhibitors. Journal of Computational Chemistry, 2008. 29(5): p. 673–685. pmid:17849388
  50. 50. Hosseini A., et al., Computational Prediction of HIV-1 Resistance to Protease Inhibitors. J Chem Inf Model, 2016. 56(5): p. 915–23. pmid:27082876
  51. 51. Mahanti M., et al., Flap Dynamics in Aspartic Proteases: A Computational Perspective. Chem Biol Drug Des, 2016. 88(2): p. 159–77. pmid:26872937
  52. 52. Hare, S., et al., Low Dimensional Representations Along Intrinsic Reaction Coordinates and Molecular Dynamics Trajectories Using Interatomic Distance Matrices. 2019.
  53. 53. Leonis G., et al., Computational studies of darunavir into HIV-1 protease and DMPC bilayer: necessary conditions for effective binding and the role of the flaps. J Chem Inf Model, 2012. 52(6): p. 1542–58. pmid:22587384
  54. 54. Zhu Z., Schuster D.I., and Tuckerman M.E., Molecular dynamics study of the connection between flap closing and binding of fullerene-based inhibitors of the HIV-1 protease. Biochemistry, 2003. 42(5): p. 1326–33. pmid:12564936
  55. 55. Dierynck I., et al., Binding kinetics of darunavir to human immunodeficiency virus type 1 protease explain the potent antiviral activity and high genetic barrier. J Virol, 2007. 81(24): p. 13845–51. pmid:17928344
  56. 56. Shuman C.F., et al., Elucidation of HIV-1 protease resistance by characterization of interaction kinetics between inhibitors and enzyme variants. Antiviral Res, 2003. 58(3): p. 235–42. pmid:12767471
  57. 57. Pagadala N.S., Syed K., and Tuszynski J., Software for molecular docking: a review. Biophysical reviews, 2017. 9(2): p. 91–102. pmid:28510083
  58. 58. Baker E.N. and Hubbard R.E., Hydrogen bonding in globular proteins. Progress in Biophysics and Molecular Biology, 1984. 44(2): p. 97–179. pmid:6385134
  59. 59. Hart W.E. and Istrail S., Robust proofs of NP-hardness for protein folding: general lattices and energy potentials. J. Comput. Biol., 1997. 4(1): p. 1–22. pmid:9109034
  60. 60. Bolhuis P.G., et al., TRANSITION PATH SAMPLING: Throwing Ropes Over Rough Mountain Passes, in the Dark. Annual Review of Physical Chemistry, 2002. 53(1): p. 291–318.
  61. 61. Kästner J., Umbrella sampling. Wiley Interdisciplinary Reviews: Computational Molecular Science, 2011. 1(6): p. 932–942.
  62. 62. Barducci A., Bonomi M., and Parrinello M., Metadynamics. Wiley Interdisciplinary Reviews: Computational Molecular Science, 2011. 1(5): p. 826–843.
  63. 63. Allen R.J., Valeriani C., and ten Wolde P.R., Forward flux sampling for rare event simulations. Journal of physics: Condensed matter, 2009. 21(46): p. 463102. pmid:21715864
  64. 64. Weinan E., Ren W., and Vanden-Eijnden E., Finite temperature string method for the study of rare events. J. Phys. Chem. B, 2005. 109(14): p. 6688–6693. pmid:16851751
  65. 65. O’Connor M., et al., Adaptive free energy sampling in multidimensional collective variable space using boxed molecular dynamics. Faraday Discussions, 2016. 195(0): p. 395–419.
  66. 66. Zimmerman M.I. and Bowman G.R., FAST Conformational Searches by Balancing Exploration/Exploitation Trade-Offs. Journal of Chemical Theory and Computation, 2015. 11(12): p. 5747–5757. pmid:26588361