Aberrant Ras activity is a hallmark of diverse cancers and developmental diseases. Unfortunately, conventional efforts to develop effective small molecule Ras inhibitors have met with limited success. We have developed a novel multi-level computational approach to discover potential inhibitors of previously uncharacterized allosteric sites. Our approach couples bioinformatics analysis, advanced molecular simulations, ensemble docking and initial experimental testing of potential inhibitors. Molecular dynamics simulation highlighted conserved allosteric coupling of the nucleotide-binding switch region with distal regions, including loop 7 and helix 5. Bioinformatics methods identified novel transient small molecule binding pockets close to these regions and in the vicinity of the conformationally responsive switch region. Candidate binders for these pockets were selected through ensemble docking of ZINC and NCI compound libraries. Finally, cell-based assays confirmed our hypothesis that the chosen binders can inhibit the downstream signaling activity of Ras. We thus propose that the predicted allosteric sites are viable targets for the development and optimization of new drugs.
Citation: Grant BJ, Lukman S, Hocker HJ, Sayyah J, Brown JH, McCammon JA, et al. (2011) Novel Allosteric Sites on Ras for Lead Generation. PLoS ONE 6(10): e25711. https://doi.org/10.1371/journal.pone.0025711
Editor: Rakesh K. Srivastava, The University of Kansas Medical Center, United States of America
Received: July 26, 2011; Accepted: September 8, 2011; Published: October 25, 2011
Copyright: © 2011 Grant et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by grants from National Institutes of Health, National Science Foundation, Howard Hughes Medical Institute and Center for Theoretical Biological Physics. SL thanks the Agency for Science, Technology and Research, Singapore for studentship. AG and HH acknowledge financial support from the University of Texas Medical School in Houston and the Gulf Coast Consortia Training in Pharmacological Sciences. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Ras proteins are key regulators of signaling pathways controlling normal cell proliferation and malignant transformation. Signal propagation through Ras is mediated by a regulated GTPase cycle that leads to active and inactive conformations, which differ significantly in their affinity for downstream effectors. Somatic point mutations that perturb the fidelity of this cycle can lead to constitutively active oncogenic Ras . Such mutants are found in about a third of all human tumors where they contribute to the deregulation of cell growth, tumor invasiveness and new blood vessel formation . Germline Ras mutations are also frequently expressed in patients suffering from a group of related developmental disorders, referred to collectively as neuro-cardio-facial-cutaneous syndrome , , . These disorders share a variable degree of mental retardation, cardiac defects, craniofacial dysmorphism, and short stature . Therapies that target Ras proteins and the signaling pathways under their regulations are thus of major importance for human health.
Considerable effort has been directed towards inhibiting Ras processing enzymes and major components of Ras signaling pathways. Inhibitors of farnesyl and palmitoyl transferases  have been investigated for their potential to attenuate C-terminal lipid modification of Ras required for correct plasma membrane localization and subsequent signaling. A drawback of such inhibitors is their poor selectivity as they likely affect many lipid-modified proteins. Furthermore, the most promising farnesyltransferase inhibitors failed to achieve their intended goal of disrupting Ras membrane-binding . Indeed the most frequently mutated Ras isoforms in human tumors (K-Ras and N-Ras) were found to undergo alternative prenylation and remain oncogenically active . Attempting to inhibit the function of K-Ras and N-Ras by using a combination of prenylation inhibitors failed because of the very high toxicity associated with the required combination therapy . Indeed, it is likely that the lack of toxicity associated with farnesyltransferase inhibitors in isolation is due to their inability to inhibit the functions of the endogenous Ras isoforms essential for normal cell viability. Another approach has involved inhibitors of the upstream protein kinase regulators and downstream effectors of Ras, for example, receptor tyrosine kinase inhibitors and components of the RAF-MAPK pathway , . However, because Ras proteins are activated by a myriad of stimuli and utilize a multitude of downstream effectors, a particular kinase inhibitor will likely impair only a subset of Ras functions leading to potentially limited therapeutic benefits.
The development of small-molecule inhibitors that directly target Ras is highly desirable but has proven to be a major challenge. Notable issues include the limited bioavailability of drugs that target highly polar active sites, such as the nucleotide-binding site of Ras, also known as the switch region . Additional selectivity and toxicity issues arise from the highly conserved nature of this switch region across small G-proteins and the wider P-loop NTPase superfamily. Furthermore, Ras signaling involves a tightly regulated network of multiple positive and negative regulators with a specific spatiotemporal organization on cellular membranes . It is the balance of these positive and negative regulators that ultimately determines the fraction of GTP-bound active and GDP-bound inactive Ras. Currently it is not clear if normal and aberrant Ras have distinct plasma-membrane organizations that can lead to differential accessibility to downstream effectors and/or upstream exchange factors. These complications may explain, at least in part, why Ras-binders that have promising anti-cancer activity in pre-clinical models failed in clinical trials .
The development of compounds with selectivity for K-Ras over H-Ras would be particularly desirable. Silencing of K-Ras by siRNA , , miRNA ,  or antisense K-Ras  has been shown to result in reversal of transformed phenotypes and suppression of tumorigenicity in human cancer cells. Studies of the three major H-, K-, and N-Ras isoforms  suggest that differential membrane-organization may underlie the association of these highly similar proteins with different diseases . Similarly, recent evidence from experimental ,  and computational efforts ,  suggests that oncogenic and normal Ras proteins harbor distinct dynamic properties that may lead to differences in membrane binding . Of special note are long-range coupled motions between the conserved N-terminal lobe1 (residues 1–86, which includes the switch region) and the more variable membrane-interacting lobe2  (Fig. 1). Such dynamic behaviors are not always captured in crystallographic structures. For example, while the overall structures of wild-type H-Ras and the oncogenic variant G12V-H-Ras are very similar, the latter exhibits enhanced dynamics  and is more susceptible to adopt an active-like conformation . Here we ask if such motional differences can be exploited to develop new allosteric inhibitors that selectively target a specific malfunctioning Ras protein.
The switches 1 (blue) and 2 (red), that form the nucleotide-binding site, are situated in the N-terminal lobes. GTP (yellow) is shown as sticks.
In subsequent sections we describe the application of an integrated bioinformatics approach that couples fragment, grid and ligand based binding-site mapping methods with virtual screening (VS) of small molecule libraries against an ensemble of distinct Ras conformers. Four non-nucleotide binding pockets were identified in available crystal structures and molecular dynamics simulated conformers. Simulations reveal that three of these distal pockets have the potential to be allosterically linked to the switch region. Furthermore, a number of lead compounds predicted to bind these allosteric sites were tested experimentally and found to exhibit inhibitory activity in Ras-expressing cancer cells.
Results and Discussion
A structural ensemble scheme to account for conformational diversity
Conformational transitions are essential for modulating Ras signaling activity. These transitions have the potential to expose transient surface pockets and allosteric sites of significance for drug discovery. Indeed the merit of accounting for receptor flexibility in ligand design , , , , , especially in the development of allosteric inhibitors , , is now well recognized. In the current work, we combine analysis of multiple conformationally distinct crystallographic structures with molecular dynamics (MD) trajectory analysis. This ensemble approach differs from earlier virtual screening studies on single crystallographic structures ,  by accounting for the intrinsic conformational mobility of Ras proteins.
Ras crystal structure representatives were selected based on a previously developed protocol for the analysis of inter-conformer relationships in families of protein structures , , , . Principal component analyses (PCA) and RMSD based clustering of available structures indicate the existence of distinct Ras conformational groupings (Fig. S1). It has been noted previously that the PCA based groupings correlate with the nature of the bound nucleotide or the presence of a mutation in the vicinity of the nucleotide-binding site , . Considering these differences in bound nucleotide and mutation, as well as isoform, seven representative structures, termed the crystallographic ensemble, were used for further analysis (Table S1). Additional conformers were derived from MD simulations performed on the K-Ras (the most frequently mutated isoform in a wide range of cancers , ). Conformers from a total of 120 ns multi-copy simulations performed in the presence of GTP and GDP were subjected to PCA and RMSD clustering analysis similar to that performed on the available crystal structures (see Methods). In this manner, eight representative MD ensemble conformers were appended to the crystallographic ensemble and used for binding site mapping and small molecule docking studies described below.
Ensemble binding site mapping
A combination of fragment, grid and ligand based methods was employed to identify potential small molecule binding sites on our ensemble of Ras structures. The common practice in computer aided structure based drug design is to aim for inhibitors that compete with an enzyme's cognate substrate . However, in the current work, we deliberately focused on the identification of potential allosteric small molecule binding sites that are distal from the nucleotide-binding site (see Introduction). Taking the union of identified sites from a combination of different methods increases our chances of mapping all such sites at the expense of considering potentially more spurious sites .
Fragment based pocket analysis
We first used a fragment based approach (FTMAP) that is based on the ideas behind screening small organic fragments by NMR and X-ray crystallography . FTMAP correlates pocket druggability with their propensity to bind clusters of small organic compounds. Fig. 2 displays the mapping results across our ensemble of Ras crystal structures. To further characterize the location of each hot spot, an ensemble normalized probe occupancy value was calculated and assigned to each residue of the catalytic domain (Fig. 3). As expected, high probe occupancy values were obtained for residues around the nucleotide-binding site (red in Figs. 2 and 3 and Movie S1), consistent with the existence of this binding site in all conformers. Three additional pockets, displaying low and high occupancies, were also identified (termed p1, p2 and p3 and rendered in pink, green and blue respectively in Figs. 2 and 3 and Movie S1).
Sites identified include the nucleotide-binding site (red) and new potential binding sites p1, p2 and p3 (in pink, green and blue respectively, see also Fig. 3).
Sites with low probe occupancy values identify regions on the protein surface where potential interaction sites are exposed only in a subset of conformations (reflecting the intrinsic dynamic nature of these binding pockets). This includes pockets p1 and p2 that reside on either side of the highly mobile helix α2. Residues from α2 and β-strands 1–3 (in lobe1) line p1, whilst p2 lies between helices α2 and α3 at the interface between the two lobes (see Table 1). The large relative displacement of α2 between GDP- and GTP- like conformers effectively modulates the accessibility of these two pockets. These nucleotide-associated displacements effectively link the conformations of p1 and p2 to that of the nucleotide-binding site. Of particular note is Tyr71 that lies at the center of pocket p1 blocking ligand binding in the crystal structures of GDP-H-Ras and H-Ras-G12V. During simulations this residue is observed to reorient so as to no longer block p1 pocket accessibility (Fig. S2).
The most distal non active site pocket, p3, resides in lobe2 approximately 25 Å from the nucleotide-binding site. Residues from loop 7 and the C-terminal end of helix 5 line this pocket which has a mostly hydrophobic interior but is surrounded by polar residues K104, S106, D108 and H166. Pocket p3 can vary in size in the ensemble, due predominately to small displacements of loop 7 residues. Indeed loop7 and helix 5 are observed to adopt distinct conformations in the GTP-, GDP-, and intermediate ensemble structures. However, pocket p3 is observed in all ensemble conformers in contrast to pockets p1 and p2 that are inaccessible in some members of the ensemble. Considering the dynamic nature of loop 7, α5 and α2 that line pockets p1–3, we hypothesized that VS with our ensemble of conformers may allow us to find new ligand poses that would not be observed with any single crystallographic structure. Finally, we note that the loop 7 comprising pocket p3 has been shown to undergo correlated motions with the active site , , .
Grid energy based pocket analysis
In addition to fragment based mapping we employed grid energy based pocket identification with the AutoLigand package . AutoLigand searches the exterior surface space of a protein for contiguous volume envelopes with potentially favorable interaction energies for small molecules. Applying this method across the Ras structural ensemble highlighted two of the three previous FTMAP identified binding sites, p1 and p3 (Table 1). An additional pocket, here termed p3b was identified immediately adjacent to pocket p3. Both pockets p3 and its extension p3b share residues from loop 7 and the C-terminal end of helix α5. However p3b extends toward loop 5 and p3 toward loop 9 (Fig. 4). Pocket p3b is rich in charged residues including E162 that provides a negatively charged site at the bottom of the p3b pocket. Pocket p3b is significantly smaller and shallower than pockets p1–3 as highlighted by FTMAP (Table 1 and Fig. 4), which likely explains why it did not feature in fragment based (described above) or ligand based (described below) site identification.
Ligand based pocket analysis
As a final step in our characterization of potential binding sites we performed a blind docking of known small-molecule Ras binding molecules. The basic premise of this approach is that a ligand frequently targets a given site if that site offers an energetic or steric advantage over any other site , . A library of putative ligands that were shown to bind to Ras at unknown sites from high throughput bioassay data were “blind-docked” onto the entire surface of a single K-Ras crystal structure (see methods). By counting the number of compounds that target a given site, weighted by the frequency with which a given compound targets the site, we found two prominent binding sites (Table 1 and Fig. 4): one that coincides with pocket p3 (targeted by ∼80% of compounds) and a second novel pocket, p4, behind the flexible effector binding loop (targeted by ∼3% of the compounds). To ensure that this dramatic preference for pocket p3 is not biased by the choice of a specific small molecule set, we blind-docked 20 top-ranking compounds derived from a site-directed VS of the NCI and ZINC libraries against the entire crystallographic- and MD-ensembles (to be discussed). We found that these compounds tend to preferentially bind pocket p3, and to a lesser extent to p4. For instance, of the top-ranking poses, 12 out of the 14 ZINC compounds bind to p3 and the other 2 at p4. Similarly, 2 out of the 6 NCI compounds preferentially target p3 while the remaining 4 target pocket p4. Overall, it is noteworthy that three distinct procedures applied to a different set of ligand libraries and fragment probes highlighted the same potential pockets.
Ensemble virtual screening against the novel binding sites
Compounds from the NCIDS II and Zinc drugs-now subset were docked against the identified pockets in each of our ensemble structures using the Schrödinger Glide package , . We next re-scored compounds ranked as top 20 using eight different scoring functions (see Methods). A total of 195 and 256 top scoring unique compounds were obtained from the screening against the crystallographic and MD ensemble, respectively. A total of 56 of the top-scoring compounds (ranked within the top 20) were able to target the same site in both the crystallographic- and MD-ensembles. The MD-ensemble yielded 26 additional compounds not detected in the screening against the crystallographic-ensemble. Furthermore, some compounds that scored poorly in the crystallographic-ensemble scored well in the MD-ensemble. Such an improvement of the rankings and the appearance of 26 additional compounds as top scorers in the MD-ensemble is largely due to conformers that lie in-between the major GTP- and GDP-clusters (Fig. S3). This result highlights the benefit of utilizing the MD-based relaxed complex scheme in addition to the traditional approach of relying on one or a few average X-ray structures , . It is important to also note that the crystallographic ensemble is dominated by H-Ras structures (both wild-type and mutant), whereas the MD-ensemble contains only K-Ras structures. It is possible that some of the 26 ligands that appear only in the MD-ensemble may be K-Ras-selective. For subsequent analysis, we focused on the 56 compounds that scored high in both the crystallographic- and MD-ensembles as well as the 26 compounds that are unique to the MD-ensemble. After Lipinski-filtering and visual inspection 19 ligands were selected as promising leads and submitted for experimental testing (Table 2).
It is well known that Ras can activate the Raf/Mek/Erk signaling pathway. In order to select an appropriate cell line to evaluate the inhibitory properties of the candidate compounds in vitro, we determined basal Ras and ERK1/2 activity in several glioblastoma cell lines via Ras pull down assays and Western blot analysis, respectively. Results for four well characterized glioblastoma cell lines are shown in Fig. 5A. Among the lines examined, the U251 cells  were found to have the highest level of basal GTP bound Ras and this was associated with high basal downstream ERK1/2 phosphorylation (Fig. 5A). We employed this cell line to test the ability of the candidate inhibitor compounds to block the Ras-ERK pathway. We first determined the effect of each of four lead candidate compounds on ERK1/2 activation. Compounds were initially tested at a concentration of 10 µM and cells were treated for 24 hrs to allow adequate time to achieve decreases in steady state Ras and ERK activity levels. As shown in Fig. 5B, immunoblot analysis demonstrated that compounds 662796, 643000 and 117028 inhibited ERK1/2 phosphorylation in U251 glioblastoma cells. ERK1/2 phosphorylation levels from four separate experiments were quantitated and compounds 662796, 643000 and 117028 shown to elicit significant (50 to 75%) decreases in ERK1/2 activation. We next examined the ability of these compounds to decrease Ras activity in U251 cells (Fig. 5C). Ras pull down assays were performed to assess GTP bound (active) Ras. Two of the three compounds that suppressed ERK1/2 phosphorylation (643000 and 117028) also inhibited Ras activation in U251 cells. These data suggest that the ability of compounds 643000 and 117028 to decrease ERK1/2 phosphorylation derives from inhibition of Ras activity rather than nonspecific inhibitory effects on the ERK activation cascade downstream of Ras. We subsequently examined the dose dependence of Ras inhibition by compounds 643000 and 117028. Both compounds decreased basal Ras activity in a dose-dependent manner (Fig. 5C). For compound 64300, the half maximal inhibitory concentration appeared to be approximately 10–30 µM and nearly complete inhibition was achieved. For compound 117028, the maximal decrease achieved was approximately 40%. This may reflect the fact that there is a presence of highly active Ras guanine nucleotide exchange factors in this glioblastoma cell line. Nonetheless the ability of these non-optimized, computationally selected Ras inhibitory compounds to decrease the amount of constitutively active Ras, and its downstream target ERK, in this glioblastoma cell line suggests that this strategy holds considerable promise for limiting aberrant cancer cell growth mediated through this pathway.
(A,top). The U138, U251, U373 and U343 glioblastoma cells were serum-starved for 48 hours and active Ras was precipitated with GST-fused Ras-binding domain of Raf-1 (Raf-1 RBD)-agarose beads and probed with anti-Ras antibody. (A,bottom) Lysates were immunoblotted for P-ERK1/2 or total ERK1/2. Data were quantitated and averaged from 4 independent experiments and presented as means ± SEM of the ratio of active Ras to total Ras (top) or the ratio of phosphorylated ERK1/2 to total ERK1/2 (bottom). *P<0.05 for U251 vs U138, U373 and U343, one-way ANOVA. (B) Serum-starved U251 glioblastoma cells were treated with either vehicle (DMSO) control or 10 mM of the indicated compound for 24 hours. Cell lysates were immunoblotted for P-ERK1/2 or total ERK1/2. Immunoblots are representative of four independent experiments which were quantitated and shown graphically as means ± SEM; *P<0.05 vs. vehicle control, one-way ANOVA. (C) Cells (U251) were serum starved and subsequently treated with compound 643000 (left) or compound 117028 (right) at the indicated concentrations for 24 hours. Ras activity was assessed using the pull down assay. Immunoblots are representative of four independent experiments, which were quantitated and plotted as percent of vehicle control. Values are means ± SEM; *P<0.05 vs. vehicle control for compound 643000; **P<0.01 vs. vehicle control for compound 117028.
A proposed mechanism of action of the predicted ligands
The most prominent putative ligand binding site, p3, as well as its extension p3b, are located on lobe 2 near the C-terminus that leads to the membrane-interacting hypervariable region. Given our earlier MD and cell-biological studies that found that membrane-binding by the full length H-Ras , ,  is allosterically modulated by the bound nucleotide, it is tempting to speculate that ligands targeting the predicted allosteric sites may interfere with Ras' attachment to the plasma membrane. It is also possible, however, that ligands that target these sites could allosterically modulate the conformation of the canonical switch region. This is consistent with the observed long-range coupled motions of loop 7 with the active site , , , . Furthermore, a study combining crystallography, paramagnetic relaxation enhancement and calorimetry found that divalent ion-cyclen binding at a site equivalent to p3 allosterically stabilizes an effector-loop conformation with a weak potential for effector binding , . Similarly, the current ligands may selectively stabilize GDP-like or transient structures and thereby decrease the pool of GTP-bound Ras through a population-shift mechanism . Although further investigations are required to determine if the predicted ligands bind at the predicted sites, it is encouraging that they reduce ERK phosphorylation (activation) in U251 cells simultaneous with lowering the level of GTP-loaded Ras. Assuming that the compounds bind to Ras as predicted, we propose that they act by allosterically modulating membrane binding or, more likely, the catalytic site architecture.
Encouragingly, the micromolar anti-Ras activities observed in our initial experiments suggest that the predicted ligands may indeed serve as viable starting points for the lead generation against deregulated Ras signaling. Of special interest in terms of specificity are allosteric ligands targeting pocket p3 and p3b near the C-terminus and distal from the active site. The close proximity of these binding sites, and the fact that some compounds bind across multiple sites, provide an opportunity for linked-fragment drug design . The bridging compounds can be used as scaffolds to discover new drugs with enhanced affinity and specificity. Obviously, the ligands identified in this work might not have high-affinity (tens of µM was required in our cell-based assays), or the required level of selectivity to be effective inhibitors. However, they can serve as crucial starting points for future development of new therapeutics against deregulated Ras signaling for which no selective inhibitors are available.
A major goal of the current work was to identify and validate novel drug binding sites on the Ras catalytic domain. Using a battery of computational techniques, that incorporate global conformational fluctuations into the hit identification process, we have shown that there are at least four potential allosteric pockets on the Ras catalytic domain. One of these novel sites near the C-terminus, termed p3, is of particular interest for inhibitor development. It is the most frequently targeted pocket both during our blind docking and VS exercises. That this pocket is involved in correlated motions with the active site  suggests that ligand binding at this pocket may allosterically modulate the active site architecture, as shown in other proteins , , . This is consistent with the report that divalent ion-cyclen binding at p3 stabilizes a conformation that has weak effector-binding potential . Thus, we propose that pocket p3 is a viable target against which leads can be generated to serve as starting points for the development and optimization of new drugs.
Structural ensemble generation
Available Ras crystallographic structures and MD simulation snapshots (see below) were analyzed using the Bio3D package . Details of this procedure have been described previously , , briefly, structural superposition was performed on the invariant “core” as defined by Grant et al. . Next, the Cartesian coordinates of equivalent Cα atoms were used to define the elements of a covariance matrix. The covariance matrix was then diagonalized to derive principal components with their associated variances. The crystallographic and MD conformers were projected into the sub-space defined by PC1-3 where the maximum variation of the conformational distribution was observed (see ,  for further details). Structures then underwent average-linkage hierarchical clustering according to the pairwise distances obtained from their projection onto the first three principal components obtained from analysis of available crystal structures. Clustering based on pairwise Cα atom RMSD yielded similar major clusters. Inspection of the resulting dendogram was used to partition structures into seven to eight dominant groups (ranked according to their populations). The closest structure to the average structure from each cluster, in terms of RMSD, was chosen as a representative for further binding site mapping and virtual screening analysis.
Identification of novel binding sites
We used the FTMap method of Brenke and co-workers to highlight regions on the Ras catalytic domain surface that have the potential to bind the highest number of small molecular probes . Both crystal structures and each cluster representative from MD were subject to fragment mapping. Hot-spot residues (those that comprise prominent fragment binding sites) across all structures were analyzed with the Bio3D package. A residue was assumed to be in contact with a probe molecule if any two heavy atoms from the probe and residue were within 5.0 Å.
To further characterize novel binding sites we used AutoLigand from the AutoDockTools (ADT) package . AutoLigand predicts possible ligand-binding sites by searching for a contiguous envelope with the specified volume of 70 favorable energy points that represent potential atomic centers for ligand atoms. We applied AutoLigand to scan each conformer in the structural ensemble. High affinity binding pockets in each of the Ras conformers were identified and the common residues lining those sites were determined using Bio3D. As a positive control, AutoLigand and FTMAP both successfully identified the nucleotide-binding site in all ensemble conformers.
An additional “blind docking” ligand based search with known Ras binders in public databases was used to determine if there are sites on Ras that are frequently visited by a library of small molecule compounds. It was shown that such blind docking procedures correctly identify binding sites in 80% of cases but fail to discriminate between true ligands and decoys . We used AutoDock 4.2  to “blind dock” a library of 267 putative ligands from the PubChem Bioassy database. These compounds were shown to bind to Ras at unknown sites. The PubChem Bioassy database has potential Ras inhibitors comprising 267 compounds as determined by a high throughput fluorescent-GTP binding assay (AID database accession code: 759). Compounds inducing at least 20% reduction in fluorescence relative to the ligand-free control were termed potential inhibitors in the high throughput assay. For our VS study, the ligands were prepared with ADT by adding polar hydrogen atoms and calculating the Gasteiger charges and torsions.
AutoDock 4.2 was used to dock the ligands onto the entire surface of the K-Ras crystal structure (PDB code 3GFT) with a grid dimension large enough to cover the entire protein (143×149×155 Å, with 0.375 Å grid spacing). The bound GTP-analogue phosphoaminophosphonic acid-guanylate ester (GNP) was kept to exclude ligands that may target the catalytic site. A population size of 400, a 2 Å cutoff for clustering, 1.5×108 energy evaluations and 256 hybrid Lamarckian genetic algorithm runs were used. The predicted poses were analyzed in terms of their potential to form hydrogen bonds and van der Waals contacts with the protein. Hydrogen bonds were defined by a donor-acceptor distance of less than 3.5 Å and a donor-hydrogen-acceptor angle between 150° and 180°. Van der Waals contacts were defined by a carbon-carbon distance cutoff of 5.0 Å. The relative importance of target-sites was determined by ranking the docked compounds according to their empirical inhibition constant after eliminating compounds that did not consistently bind at the same site and those violating Lipinski's rule . Pocket volumes were calculated with Povme (POcketVolumeMEasurer) program  by first aligning all ensemble conformers.
Molecular dynamics simulation
Simulations of K-Ras were carried out with bound Mg2+GTP and Mg2+GDP, both modeled from a high-resolution crystal structure of K-Ras (PDB code 2PMX). Protonation states for all titratable residues were determined using PDB2PQR . The LEaP module of AMBER 10 package  was used for the addition of missing hydrogen atoms. The systems were neutralized by addition of counter ions at pH 7 and solvated with TIP3P waters with the buffering distance of 10 Å. All simulations were performed with the AMBER 10 package  and the ff99SB force field  and previously developed parameters for guanine nucleotides .
Energy minimization was carried out with decreasing constraints on the heavy atom positions, followed by constant volume heating to 300 K for ∼10 ps, and a 200ps constant temperature and constant pressure (1 atm) equilibration. The production phase involved multi-copy unrestrained runs for enhanced sampling and improved statistics. Three independent 20 ns production runs with different initial velocities were carried out on each system resulting in a cumulative simulation time of 120 ns. The integration time step was 2 fs with bonds involving hydrogen atoms restrained by SHAKE . Short-range non-bonded interactions were truncated at 10 Å while long-range electrostatic interactions were evaluated with the Particle-Mesh Ewald method . All simulation analysis was performed with the Bio3D package.
Compounds in the NCIDS II and ZINC drugs-now subset satisfying a Tanimoto score threshold of 60% were docked into each of the seven and eight representative conformers of the crystallographic- and MD-ensembles, respectively. Each protein conformer was first prepared with the Protein Preparation Wizard of the Schrodinger program suite (Fig. S4). Bond orders were assigned, and hydrogen bond networks, rotating hydroxyl and thiol hydrogens were optimized. A grid enclosing each binding site of interest was generated using Receptor Grid Generation.
Each compound was prepared using Ligprep of Schrodinger using the OPLS_2005 force field at pH 7.0+−2.0 with ionization states generated by Epik 1.6. To enhance ligand diversity, for each ligand up to 32 stereoisomers and 8 low energy ring conformations were generated, resulting in a total of 5,267 and 39,490 3D structures for NCIDSII and ZINC, respectively. Then triple steps virtual screenings, encompassing high throughput virtual screening, standard precision, and extra precision were performed using Glide , .
Each docked pose was re-scored with Accelrys Discovery Studio using PLP(1 and 2) , Jain , PMF , PMF04  and Ludi (1, 2, and 3) , ,  scoring functions. Although absolute scores given by the different scoring methods cannot be compared, the relative rankings of each compound with the different scoring methods provide useful information. Thus, compounds ranked as top 20 with each scoring function were combined to build a new library for each binding site. Visual analysis and filtering for drug-likeness by Lipinski's rule of fives ,  (no more than five hydrogen bond donors, 10 hydrogen bond acceptors, low molecular weight not greater than 500 g/mol, and a logP not greater than 5), was applied to further reduce the number of compounds suggested for experimental testing.
ERK1/2 Phosphorylation Assay
Glioblastoma cells were serum-starved for 24 hours in DMEM containing 0.1% BSA and subsequently treated or not for 24 hours with compounds obtained from the NCIDS II and ZINC small molecule database. Cells were washed and lysed in buffer containing 50 mM Tris HCL (pH 7.4), 100 mM NaCl, 5 mM MgCl2, 0.1% Triton X100, 10% glycerol, 1 mM phenylmethylsulfonylfluoride, 1 mM DTT, 0.1 mM sodium orthovanadate, 10 µg/mL leupeptin, and 10 mg/mL aprotinin. Lysates were clarified by centrifugation and protein content was measured. Cell lysates were solubilized by boiling for 10 minutes in Leammli buffer. Proteins were resolved by SDS-PAGE analysis and membranes were probed with an anti-phospho-ERK1/2 antibody (Cell Signaling). Total ERK1/2 was determined by immunoblotting with anti-ERK1/2 polyclonal antibody (Cell Signaling). Proteins were visualized using enhanced chemiluminescence and quantitated using gel documentation software (Alpha Innotech Corp., San Leandro, CA).
Ras Pull-Down Assay
Ras activation assays were performed following the affinity precipitation protocol provided by the manufacturer (RAs pull-down assay kit: Upstate Biotechnology lake Placid, NY). Briefly, lysates were incubated with Raf-1 RBD for 45 min at 4°C and then centrifuged to pellet the agarose beads. Agarose beads were washed, and pellets were resuspended in 2× Laemmli sample buffer and boiled for 5 min. The supernatant was collected and cellular proteins were resolved by SDS-PAGE and analyzed by immunoblotting with an anti-Ras antibody (Cell Signaling).
Heat map clustering of Ras structures in the PC1 to PC3 planes. Structure labels are colored by nucleotide state (red for GTP and green for GDP). Major conformational groupings are indicated by the orange labels and corresponding marginal dendrograms (see text for details).
Pocket p1 dynamics. Residues lining P1 (see Table 1) are shown in red surface. Tyr71 (magenta) re-orients to enable access to P1. Probability density of Chi1 side chain dihedral angles of Tyr 71 shows distinct orientations.
Projection of representative conformers (red) of K-Ras MD ensemble (blue) on the first two dominant principal components obtained from the analysis of crystallographic ensemble. MD conformers lie in between two major crystal clusters (gray) associated with GTP-bound (PC1: −7 to 0) and GDP-bound (PC1: 15 to 20). The representative conformers were identified based on RMSD and PCA based clustering, see text for details.
A multi-level computational approach for the identification of small molecules that bind to novel allosteric sites on Ras. MD, molecular dynamics; NCIDS II, National Cancer Institute diversity set II; HTVS, high throughput virtual screening; SP, standard precision; XP, extra precision.
Representative structure obtained from analyzing the Ras crystallographic ensemble.
Ensemble fragment mapping results highlight three non-nucleotide binding sites. Representative Ras crystal structure conformers (gray protein cartoon) are shown along with the nucleotide-binding site (red molecular surface representation) and new potential binding sites p1, p2 and p3 (in pink, green and blue molecular surface representations respectively). Also shown is an average Ras conformer where relative chain thickness and color scale (red: high, gray: low) represent residue wise probe occupancy values (see text for details).
We thank the National Science Foundation (NSF) Supercomputer Centers, the Center for Theoretical Biological Physics (CTBP) and National Biomedical Computation Resource for computational resources. Dr. Frank Furnari, Ludwig Institute of Cancer Research, La Jolla, CA, kindly provided the U251 human glioma cell line.
Conceived and designed the experiments: BG AG JM. Performed the experiments: BG AG JM SL HH JS. Analyzed the data: BG AG JM SL HH JS JHB. Contributed reagents/materials/analysis tools: BG AG JM SL HH JS JHB. Wrote the paper: BG AG JM SL HH JS JHB.
- 1. Karnoub AE, Weinberg RA (2008) Ras oncogenes: split personalities. Nat Rev Mol Cell Biol 9: 517–531.
- 2. Gorfe AA (2010) Mechanisms of allostery and membrane attachment in Ras GTPases: implications for anti-cancer drug discovery. Curr Med Chem 17: 1–9.
- 3. Gremer L, Merbitz-Zahradnik T, Dvorsky R, Cirstea IC, Kratz CP, et al. (2011) Germline KRAS mutations cause aberrant biochemical and physical properties leading to developmental disorders. Hum Mutat 32: 33–43.
- 4. Rauen KA, Schoyer L, McCormick F, Lin AE, Allanson JE, et al. (2010) Proceedings from the 2009 genetic syndromes of the Ras/MAPK pathway: From bedside to bench and back. Am J Med Genet A 152A: 4–24.
- 5. Cirstea IC, Kutsche K, Dvorsky R, Gremer L, Carta C, et al. (2010) A restricted spectrum of NRAS mutations causes Noonan syndrome. Nat Genet 42: 27–29.
- 6. Schubbert S, Shannon K, Bollag G (2007) Hyperactive Ras in developmental disorders and cancer. Nat Rev Cancer 7: 295–308.
- 7. Sebti SM, Hamilton AD (2000) Farnesyltransferase and geranylgeranyltransferase I inhibitors in cancer therapy: important mechanistic and bench to bedside issues. Expert Opin Investig Drugs 9: 2767–2782.
- 8. Kohl NE, Omer CA, Conner MW, Anthony NJ, Davide JP, et al. (1995) Inhibition of farnesyltransferase induces regression of mammary and salivary carcinomas in ras transgenic mice. Nat Med 1: 792–797.
- 9. James GL, Goldstein JL, Brown MS (1995) Polylysine and CVIM sequences of K-RasB dictate specificity of prenylation and confer resistance to benzodiazepine peptidomimetic in vitro. J Biol Chem 270: 6221–6226.
- 10. Lobell RB, Omer CA, Abrams MT, Bhimnathwala HG, Brucker MJ, et al. (2001) Evaluation of farnesyl:protein transferase and geranylgeranyl:protein transferase inhibitor combinations in preclinical models. Cancer Res 61: 8758–8768.
- 11. Roberts PJ, Der CJ (2007) Targeting the Raf-MEK-ERK mitogen-activated protein kinase cascade for the treatment of cancer. Oncogene 26: 3291–3310.
- 12. Bollag G, Freeman S, Lyons JF, Post LE (2003) Raf pathway inhibitors in oncology. Curr Opin Investig Drugs 4: 1436–1441.
- 13. Hardy JA, Wells JA (2004) Searching for new allosteric sites in enzymes. Curr Opin Struct Biol 14: 706–715.
- 14. Cox AD, Channing JD (2002) Ras family signaling. Cancer Biol Ther 1: 599–606.
- 15. Brummelkamp TR, Bernards R, Agami R (2002) Stable suppression of tumorigenicity by virus-mediated RNA interference. Cancer Cell 2: 243–247.
- 16. Fleming JB, Shen GL, Holloway SE, Davis M, Brekken RA (2005) Molecular consequences of silencing mutant K-ras in pancreatic cancer cells: justification for K-ras-directed therapy. Mol Cancer Res 3: 413–423.
- 17. Chen X, Guo X, Zhang H, Xiang Y, Chen J, et al. (2009) Role of miR-143 targeting KRAS in colorectal tumorigenesis. Oncogene 28: 1385–1392.
- 18. Zhao WG, Yu SN, Lu ZH, Ma YH, Gu YM, et al. (2010) The miR-217 microRNA functions as a potential tumor suppressor in pancreatic ductal adenocarcinoma by targeting KRAS. Carcinogenesis 31: 1726–1733.
- 19. Aoki K, Ohnami S, Yoshida T (2005) Suppression of pancreatic and colon cancer cells by antisense K-ras RNA expression vectors. Methods Mol Med 106: 193–204.
- 20. Barbacid M (1987) ras genes. Annu Rev Biochem 56: 779–827.
- 21. Abankwa D, Gorfe AA, Inder K, Hancock JF (2010) Ras membrane orientation and nanodomain localization generate isoform diversity. Proc Natl Acad Sci U S A 107: 1130–1135.
- 22. Abankwa D, Hanzal-Bayer M, Ariotti N, Plowman SJ, Gorfe AA, et al. (2008) A novel switch region regulates H-ras membrane orientation and signal output. EMBO J 27: 727–735.
- 23. Gorfe AA, Grant BJ, McCammon JA (2008) Mapping the nucleotide and isoform-dependent structural and dynamical features of Ras proteins. Structure 16: 885–896.
- 24. Lukman S, Grant BJ, Gorfe AA, Grant GH, McCammon JA (2010) The distinct conformational dynamics of K-Ras and H-Ras A59G. PLoS Comput Biol 6:
- 25. Abankwa D, Gorfe AA, Hancock JF (2008) Mechanisms of Ras membrane organization and signalling: Ras on a rocker. Cell Cycle 7: 2667–2673.
- 26. Futatsugi N, Tsuda M (2001) Molecular dynamics simulations of Gly-12→Val mutant of p21(ras): dynamic inhibition mechanism. Biophys J 81: 3483–3488.
- 27. Carlson HA, McCammon JA (2000) Accommodating Protein Flexibility in Computational Drug Design. Mol Pharmacol 57: 213–218.
- 28. Teague SJ (2003) Implications of protein flexibility for drug discovery. Nat Rev Drug Discov 2: 527–541.
- 29. Lin JH, Perryman AL, Schames JR, McCammon JA (2002) Computational drug design accommodating receptor flexibility: the relaxed complex scheme. J Am Chem Soc 124: 5632–5633.
- 30. Amaro RE, Baron R, McCammon JA (2008) An improved relaxed complex scheme for receptor flexibility in computer-aided drug design. J Comput Aided Mol Des 22: 693–705.
- 31. Cheng LS, Amaro RE, Xu D, Li WW, Arzberger PW, et al. (2008) Ensemble-based virtual screening reveals potential novel antiviral compounds for avian influenza neuraminidase. J Med Chem 51: 3878–3894.
- 32. Laine E, Goncalves C, Karst JC, Lesnard A, Rault S, et al. (2010) Use of allostery to identify inhibitors of calmodulin-induced activation of Bacillus anthracis edema factor. Proc Natl Acad Sci U S A 107: 11277–11282.
- 33. Chang L, Miyata Y, Ung PM, Bertelsen EB, McQuade TJ, et al. (2011) Chemical screens against a reconstituted multiprotein complex: myricetin blocks DnaJ regulation of DnaK through an allosteric mechanism. Chem Biol 18: 210–221.
- 34. Gao Y, Dickerson JB, Guo F, Zheng J, Zheng Y (2004) Rational design and characterization of a Rac GTPase-specific small molecule inhibitor. Proc Natl Acad Sci U S A 101: 7618–7623.
- 35. Jayakanthan M, Wadhwa G, Mohan T, Arul L, Balasubramanian P, et al. (2009) Computer-aided drug design for cancer-causing H-ras p21 mutant protein. Lett Drug Des Discov 6: 14–20.
- 36. Grant BJ, Rodrigues AP, ElSawy KM, McCammon JA, Caves LS (2006) Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics 22: 2695–2696.
- 37. Grant BJ, McCammon JA, Caves LS, Cross RA (2007) Multivariate analysis of conserved sequence-structure relationships in kinesins: coupling of the active site and a tubulin-binding sub-domain. J Mol Biol 368: 1231–1248.
- 38. Downward J (2003) Targeting RAS signalling pathways in cancer therapy. Nat Rev Cancer 3: 11–22.
- 39. Friday BB, Adjei AA (2005) K-ras as a target for cancer therapy. Biochim Biophys Acta 1756: 127–144.
- 40. Wong CF, McCammon JA (2003) Protein flexibility and computer-aided drug design. Annu Rev Pharmacol Toxicol 43: 31–45.
- 41. Perot S, Sperandio O, Miteva MA, Camproux AC, Villoutreix BO (2010) Druggable pockets and binding site centric chemical space: a paradigm shift in drug discovery. Drug Discov Today 15: 656–667.
- 42. Brenke R, Kozakov D, Chuang GY, Beglov D, Hall D, et al. (2009) Fragment-based identification of druggable ‘hot spots’ of proteins using Fourier domain correlation techniques. Bioinformatics 25: 621–627.
- 43. Grant BJ, Gorfe AA, McCammon JA (2009) Ras conformational switching: simulating nucleotide-dependent conformational transitions with accelerated molecular dynamics. PLoS Comput Biol 5: e1000325.
- 44. Harris R, Olson AJ, Goodsell DS (2008) Automated prediction of ligand-binding sites in proteins. Proteins 70: 1506–1517.
- 45. Wang Q, Pang YP (2007) Preference of small molecules for local minimum conformations when binding to proteins. PLoS One 2: e820.
- 46. Henrich S, Salo-Ahen OM, Huang B, Rippmann FF, Cruciani G, et al. (2010) Computational approaches to identifying and characterizing protein binding sites for ligand design. J Mol Recognit 23: 209–219.
- 47. Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, et al. (2004) Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem 47: 1739–1749.
- 48. Halgren TA, Murphy RB, Friesner RA, Beard HS, Frye LL, et al. (2004) Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. J Med Chem 47: 1750–1759.
- 49. Schames JR, Henchman RH, Siegel JS, Sotriffer CA, Ni H, et al. (2004) Discovery of a novel binding trench in HIV integrase. J Med Chem 47: 1879–1881.
- 50. Furnari FB, Lin H, Huang HS, Cavenee WK (1997) Growth suppression of glioma cells by PTEN requires a functional phosphatase catalytic domain. Proc Natl Acad Sci U S A 94: 12479–12484.
- 51. Gorfe AA, Babakhani A, McCammon JA (2007) H-ras protein in a bilayer: interaction and structure perturbation. J Am Chem Soc 129: 12280–12286.
- 52. Buhrman G, Holzapfel G, Fetics S, Mattos C (2010) Allosteric modulation of Ras positions Q61 for a direct role in catalysis. Proc Natl Acad Sci U S A 107: 4931–4936.
- 53. Spoerner M, Graf T, Konig B, Kalbitzer HR (2005) A novel mechanism for the modulation of the Ras-effector interaction by small molecules. Biochem Biophys Res Commun 334: 709–713.
- 54. Rosnizeck IC, Graf T, Spoerner M, Trankle J, Filchtinski D, et al. (2010) Stabilizing a weak binding state for effectors in the human ras protein by cyclen complexes. Angew Chem Int Ed Engl 49: 3830–3833.
- 55. Grant BJ, Gorfe AA, McCammon JA (2010) Large conformational changes in proteins: signaling and other functions. Curr Opin Struct Biol.
- 56. Szczepankiewicz BG, Liu G, Hajduk PJ, Abad-Zapatero C, Pei Z, et al. (2003) Discovery of a potent, selective protein tyrosine phosphatase 1B inhibitor using a linked-fragment strategy. J Am Chem Soc 125: 4087–4096.
- 57. Formaneck MS, Ma L, Cui Q (2006) Reconciling the “old” and “new” views of protein allostery: a molecular simulation study of chemotaxis Y protein (CheY). Proteins 63: 846–867.
- 58. McClendon CL, Friedland G, Mobley DL, Amirkhani H, Jacobson MP (2009) Quantifying Correlations Between Allosteric Sites in Thermodynamic Ensembles. J Chem Theory Comput 5: 2486–2502.
- 59. Gsponer J, Christodoulou J, Cavalli A, Bui JM, Richter B, et al. (2008) A coupled equilibrium shift mechanism in calmodulin-mediated signal transduction. Structure 16: 736–746.
- 60. Fukunishi Y, Nakamura H (2011) Prediction of ligand-binding sites of proteins by molecular docking calculation for a random ligand library. Protein Sci 20: 95–106.
- 61. Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, et al. (1998) Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J Comp Chem 19: 1639–1662.
- 62. Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46: 3–26.
- 63. Durrant JD, de Oliveira CA, McCammon JA (2011) POVME: an algorithm for measuring binding-pocket volumes. J Mol Graph Model 29: 773–776.
- 64. Dolinsky TJ, Nielsen JE, McCammon JA, Baker NA (2004) PDB2PQR: an automated pipeline for the setup, execution, and analysis of Poisson-Boltzmann electrostatics calculations. Nucleic Acids Res 32: W665–W667.
- 65. Case DA, Darden TA, Cheatham TE III, Simmerling CL, Wang J, et al. (2008) AMBER 10. University of California, San Francisco.
- 66. Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, et al. (2006) Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 65: 712–725.
- 67. Meagher KL, Redman LT, Carlson HA (2003) Development of polyphosphate parameters for use with the AMBER force field. J Comput Chem 24: 1016–1025.
- 68. van Gunsteren WF, Berendsen HJ (1977) Algorithms for macromolecular dynamics and constraint dynamics. Molecular Physics 34: 1311–1327.
- 69. Darden T, York D, Pedersen L (1993) Particle mesh Ewald-an NlogN method for Ewald sums in large systems. J Chem Phys 98: 10089–10092.
- 70. Gehlhaar DK, Verkhivker GM, Rejto PA, Sherman CJ, Fogel DB, et al. (1995) Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. Chem Biol 2: 317–324.
- 71. Jain AN (1996) Scoring noncovalent protein-ligand interactions: a continuous differentiable function tuned to compute binding affinities. J Comput Aided Mol Des 10: 427–440.
- 72. Muegge I, Martin YC (1999) A general and fast scoring function for protein-ligand interactions: a simplified potential approach. J Med Chem 42: 791–804.
- 73. Muegge I (2006) PMF scoring revisited. J Med Chem 49: 5895–5902.
- 74. Bohm HJ (1994) The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure. J Comput Aided Mol Des 8: 243–256.
- 75. Bohm HJ (1992) LUDI: rule-based automatic design of new substituents for enzyme inhibitor leads. J Comput Aided Mol Des 6: 593–606.
- 76. Bohm HJ (1998) Prediction of binding constants of protein ligands: a fast method for the prioritization of hits obtained from de novo design or 3D database search programs. J Comput Aided Mol Des 12: 309–323.
- 77. Pickett SD, McLay IM, Clark DE (2000) Enhancing the hit-to-lead properties of lead optimization libraries. J Chem Inf Comput Sci 40: 263–272.