Systematic identification of protein-drug interaction networks is crucial to correlate complex modes of drug action to clinical indications. We introduce a novel computational strategy to identify protein-ligand binding profiles on a genome-wide scale and apply it to elucidating the molecular mechanisms associated with the adverse drug effects of Cholesteryl Ester Transfer Protein (CETP) inhibitors. CETP inhibitors are a new class of preventive therapies for the treatment of cardiovascular disease. However, clinical studies indicated that one CETP inhibitor, Torcetrapib, has deadly off-target effects as a result of hypertension, and hence it has been withdrawn from phase III clinical trials. We have identified a panel of off-targets for Torcetrapib and other CETP inhibitors from the human structural genome and map those targets to biological pathways via the literature. The predicted protein-ligand network is consistent with experimental results from multiple sources and reveals that the side-effect of CETP inhibitors is modulated through the combinatorial control of multiple interconnected pathways. Given that combinatorial control is a common phenomenon observed in many biological processes, our findings suggest that adverse drug effects might be minimized by fine-tuning multiple off-target interactions using single or multiple therapies. This work extends the scope of chemogenomics approaches and exemplifies the role that systems biology has in the future of drug discovery.
Both the cost to launch a new drug and the attrition rate during the late stage of the drug discovery and development process are increasing. Torcetrapib is a case in point, having been withdrawn from phase III clinical trials after 15 years of development and an estimated cost of US $800 M. Torcetrapib represents a new class of therapies for the treatment of cardiovascular disease; however, clinical studies indicated that Torcetrapib has deadly side-effects as a result of hypertension. To understand the origins of these adverse drug reactions from Torcetrapib and other related drugs undergoing clinical trials, we introduce a systematic strategy to identify off-targets in the human structural proteome and investigate the roles of these off-targets in impacting human physiology and pathology using biochemical pathway analysis. Our findings suggest that potential side-effects of a new drug can be identified at an early stage of the development cycle and be minimized by fine-tuning multiple off-target interactions. The hope is that this can reduce both the cost of drug development and the mortality rates during clinical trials.
Citation: Xie L, Li J, Xie L, Bourne PE (2009) Drug Discovery Using Chemical Systems Biology: Identification of the Protein-Ligand Binding Network To Explain the Side Effects of CETP Inhibitors. PLoS Comput Biol 5(5): e1000387. https://doi.org/10.1371/journal.pcbi.1000387
Editor: Ruth Nussinov, National Cancer Institute, United States of America and Tel Aviv University, Israel
Received: January 22, 2009; Accepted: April 13, 2009; Published: May 15, 2009
Copyright: © 2009 Xie et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the National Institutes of Health (grant GM078596). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Identification of protein-ligand interaction networks on a proteome-wide scale is crucial to address a wide range of biological problems such as correlating molecular functions to physiological processes and designing safe and efficient therapeutics . Recent protein-ligand interaction studies have revealed that protein targets involved in entirely different pharmacology can bind similar small molecule drugs –. Large scale mapping of polypharmacology interactions indicates that drug promiscuity is a common phenomenon across the proteome . It has been found that approximately 35% of known drugs or leads were active against more than one target. Moreover, a significant number of promiscuous compounds (approximately 25%) have observed activity in completely different gene families. Such drug promiscuity presents both opportunities and challenges for modern drug discovery. On one hand, it is possible to develop high-efficacy drugs by inhibiting multiple targets  or to reposition existing drugs to treat different diseases ,; on the other hand, the off-target effect may result in adverse drug reactions that account for around one-third of drug failures during development . As a result, there is increasing interest in the identification of multiple targets associated with a phenotype  and in developing combinatorial therapies to boost clinical efficacy . Chemogenomics has emerged as a new discipline to systematically establish target relationships based on the structural and biological similarity of their ligands , –. However, the success of chemogenomics depends on the availability of bioactivity data for the receptors and their associated ligands. For new drug targets, such data are either insufficient or unavailable. Further, the adverse drug reaction may involve receptors that are not well characterized. Complementary to chemogenomics methods, we have developed a chemical systems biology approach to identifying off-target binding networks through their ligand binding sites. The method requires 3D-structure information for the protein but not the ligand, thereby extending the scope of existing chemogenomics approaches. Moreover, the identified off-target binding network is integrated with the reconstructed biological pathways so that the effect of the drug on the biological system can be understood at the system level. In brief (see Methods for further details), our chemical systems biology approach proceeds as follows: 1) The ligand binding site of the primary target is extracted or predicted from a 3D experimental structure or homology model and characterized by a geometric potential . 2) Off-target proteins with a similar ligand binding site to the primary target are identified across the human structural genome using a Sequence Order Independent Profile-Profile Alignment (SOIPPA) . The atomic details of the interactions between the drug and the putative off-targets from step 2 are characterized using protein-ligand docking methods. Based on a normalized docking score the high-ranking off-targets are further investigated. 4) The identified panel of off-targets is subject to structural and functional cluster analysis and incorporated into a network that includes multiple metabolic, signal transduction, and gene regulation pathways. The first and second steps have been implemented in the software package SMAP, available from http://funsite.sdsc.edu.
In this paper, we apply this strategy to identify and analyze a panel of unknown off-targets for Cholesteryl Ester Transfer Protein (CETP) inhibitors. CETP inhibitors represent a new preventive therapy for cardiovascular disease through raising HDL cholesterol. However, clinical studies have revealed that one of the CETP inhibitors, Torcetrapib, has deadly off-target effects as a result of hypertension – and consequently was withdrawn from phase III clinical trial. In contrast to Torcetrapib, another CETP inhibitor JTT-705 does not have unwanted side-effects that increases blood pressure . In addition, JTT-705 is able to block cell proliferation and angiogenesis through Ras and P38 kinase pathways . As will be shown, the multiple off-targets of these CETP inhibitors identified here are involved in both positive and negative control of stress regulation and immune response through an interconnected metabolic, signal transduction and gene regulation network. Our predictions are strongly correlated to the observed clinical and in vitro observations, providing a molecular explanation for the difference in side-effect profiles of these two CETP inhibitors. These findings suggest that adverse drug reactions might be modulated by the fine-tuning of the off-target binding network and exemplify the role of systems biology in the future of drug discovery.
CETP off-target binding network computed for the human structural genome
The ligand binding site of CETP (PDB id: 2OBD) is assumed to be a long tunnel interacting with two cholesteryl oleates (2OB) and two 1,2-dioleoyl-Sn-glycero-3-phosphocholines (PCW) molecules in the native state (Fig. S1), however, the exact location of inhibitor binding is unknown. Docking studies using the software Surflex , eHits  and AutoDock  indicate that the CETP inhibitors are able to bind to all four sites, with a slight preference for the pocket occupied by PCW. Thus, all four sites were used to search for the off-target binding sites of CETP inhibitors.
Although only approximately 15% of human proteins have known 3D structures deposited in the Protein Data Bank (PDB)  , the structural coverage of the human proteome increases to 57% if homologous proteins are included (e-value less than 1.0e-3 and aligned sequence lengths greater than 30 residues using a Blast  search). The structural coverage is reduced to around 40% if the aligned length is greater than 120 residues (Fig. S2). After removing structures with redundant sequences (sequence identity = 100%), 5,985 structures and models from the PDB were selected for off-target search by SMAP. Besides bactericidal/permeability increasing protein (PDB Id: 1ewf) that is classified in the same fold and Pfam  family as CETP (FATCAT  p-value = 1.26e-11, RMSD = 4.53), 273 off-fold structures are found with similar binding sites to CETP (SMAP p-value less than 1.0e-3). Reverse virtual screening of the 273 structures against JTT-705, the smallest CETP inhibitor, was carried out with Surflex  and eHits  (see Methods) to detect the binding capability of these proteins. To reduce the impact of protein flexibility, the complex structure, whenever available in PDB, is used for docking. Proteins that have steric crashes with JTT-705 were removed from the list and a panel of CETP off-targets consisting of 204 structures was constructed for further study as shown in Table S1. The majority of these off-targets have binding sites that match to one of the two sites that are adjacent to PCW in CETP. Excluding cytochrome P450s that bind drugs promiscuously, most of the putative off-targets are involved in lipid/fatty acid transport or binding, signal transduction pathways and immune response. Based on both SMAP p-values and docking scores (p-value<1.0e-3, Surflex score>3.50 and eHiTs score<−4.50), six classes of structure were consistently found at the top of the list: CD1B like antigen recognition domains (CD1B); nuclear hormone receptor ligand binding domains (NR); lipid transport proteins (LPTP); fatty acid binding proteins (FABP); EF hand-like calcium binding proteins (EF); and heme binding proteins (HEME). The first four classes of proteins are able to bind cognate ligands similar to those that bind to CETP, such as fatty acids, lipoproteins, and lipids . Although these putative off-targets do not have detectable global structural similarities to CETP according to their CE Z-scores (Fig. S3), they have local structural similarity and are related to each other, forming an interconnected off-target network. As shown in Fig. 1, 76% of the putative off-targets (154/204) form the three largest clusters. The largest helix bundle cluster includes NR, EF, HEME and other proteins (Fig. S4). In this paper, we focus on the six selected classes of proteins and demonstrate how they correlate to the clinical findings. Other putative off-targets are subject to on-going computational and experimental studies.
Structural characterization of CETP off-targets
Most of the predicted ligand binding sites of CD1B, LPTP, and FABP have a similar topology to that of CETP. The drug molecule binds to a cavity formed by anti-parallel beta-sheets and capped by other structural components such as a helix. The others, NR, EF, and HEME all have alpha-helical architectures that are completely different from the secondary structure surrounding the binding site of CETP. These differences illustrate the necessity of tools like SMAP that can find local structural similarities even when global similarity is non-existent. From a functional perspective, it is not surprising that lipid binding proteins act as off-targets for CETP inhibitors since they are required to bind similar cognate hydrophobic ligands such as PCW. It is noteworthy that glycolipid transfer protein, one of the lipid binding proteins, has significant structural similarity to nuclear hormone receptors. For example, the FATCAT  p-value is 1.77e-3 when comparing one glycolipid transfer protein (PDB id: 1TFJ) with that of retinoid X receptor (PDB id: 1YOW), but the RMSD is 9.82 Å for a rigid superimposition. However, if the components of these structures are allowed to twist, the RMSD drops to 2.57 Å when the helices surrounding the binding site are well aligned (Fig. S5). The structural similarity between glycolipid transfer protein and other all-helical proteins increases confidence in our result that the lipid-activated nuclear receptor (NR) is one of the major off-targets of CETP inhibitors.
Functional correlation of CETP with off-targets
We searched for possible functional correlations between CETP and the putative off-targets using the iHOP  literature network (http://www.ihop-net.org/UniPub/iHOP/in?dbrefs_1=NCBI_LOCUSLINK__ID|1071). Several top-ranked off-targets appear in the same sentences with each other more than 3 times in the literature. They include phospholipid transfer proteins, nuclear receptors, including PPAR, major histocompatibility complex class II that is similar to CD1B, apolipoprotein A-1, and angiotension I converting enzyme.
The functional similarity between CETP and the off-targets is further quantitatively measured using gene ontology (GO) relationships found with the FunSimMat web server  (http://funsimmat.bioinf.mpi-inf.mpg.de/index.php). From 204 off-targets, 148 structures had annotated GO terms and 94 structures had detectable similarities with a Resnik score  larger than 0.0. Among these 94 structures, lipid transport/binding proteins, CD1B, and nuclear hormone receptors were ranked top, followed by globin-like, EF hand-like and other proteins (Table S2).
Binding affinity similarity between CETP and off-targets
To further support our off-target predictions we conducted docking studies on CETP and the identified off-targets, which also provides insights into the molecular mechanisms of off-target binding. It has been established that the binding affinity calculated from docking programs is not necessarily reliable –. When using an energy-based scoring function, the errors come predominantly from the inaccurate parameterization of the individual energy terms. We find that the docking scores for CETP and its putative off-targets are linearly dependent on the number of carbon atoms on the docked molecules because the hydrophobic term dominates the scoring (Fig. S6). Based on this observation we developed a procedure to minimize the systematic error in the scoring function. Rather than considering the raw docking score we used the z-score to represent the relative binding affinity. The z-score is derived from a large number of random drug-like molecules and is dependent on both the number of carbon atoms in the ligand and the nature of the protein binding site. A large negative z-score indicates a high probability of true binding. Based on this procedure, the normalized docking scores (NDS) of the six classes of off-targets are listed in Table 1. These data indicate that binding of CETP inhibitors to putative off-targets is indeed statistically significant. Furthermore, the vector distance of the carbon atom size dependent average docking score for CETP and the majority of off-targets is less than 1.0 (Table S3). This implies that the ligands are able to bind to CETP and to the off-targets with similar binding affinities, since their predicted binding affinity differences are less than 1.0, which is the standard deviation of docking scores (see Methods). Finally, the correlation of ligand binding profiles between CETP and its off-targets  are relatively high (Table S3 and Fig. S7).
Importantly, the binding profiles for the three CETP inhibitors (Torcetrapib, Anacetrapib, and JTT-705) are different from each other across the panel of off-targets. JTT-705 is the most promiscuous inhibitor. In contrast, Torcetrapib failed to dock into some of the off-targets, and Anacetrapib is suitable to be docked into the least number of off-targets. The difference between their off-target binding profiles can be partly explained by their different complexity  and sizes. The molecular volumes of JTT-705, Torcetrapib and Anacetrapib are 407.31, 498.42, and 527.28 Å3, respectively. As shown in Table 1 the estimated volume of the off-target binding pockets varies greatly. Thus, the smallest ligand, JTT-705, can be accommodated in all of these pockets, but the larger-sized Torcetrapib and Anacetrapib are difficult to fit into the smaller sized pockets. It could be argued that the failure in docking Torcetrapib and Anacetrapib into the smaller sized pockets is because the induce fit of the receptor is not explicitly modeled. However, for most of the NRs, both antagonist and agonist conformations are tested. Thus it is less likely that the unfitness of Torcetrapib and Anacetrapib for some of the off-targets is a result of not specifically considering induced fit in the docking calculation. The different off-target binding profiles of these CETP inhibitors have significant implications for the observed side-effects, as discussed subsequently.
Incorporation of the off-target binding network into biological pathways
By incorporating the predicted off-targets into biological pathways it is possible for us to correlate the predicted off-target interactions with the observed pleotropic effects of Torcetrapib, Anacetrapib and JTT-705. Among them, the negative effect of Torcetrapib on blood pressure in phase III clinical trials could be deduced. Also deducible was an explanation for the increased death from infection and cancer . Conversely, JTT-705 has gotten encouraging safety results from phase II clinical trials and no side-effects of hypertension have been observed thus far. Similar positive results are observed for Anacetrapib during phase I clinical trials. It should be noted that at this time that JTT-705 and Anacetrapib are in clinical trials involving only a small number of patients during short term studies. Results from long term studies are needed to confirm the absence of negative effects for these two drugs. In addition, JTT-705 is found to be able to block cell proliferation and angiogenesis through Ras and P38 kinase pathways . To illustrate these findings, using a survey of the literature, we constructed a hierarchical biological network that connects drugs, off-targets, pathways and clinical observations. Using this network we could explore the implications of administering CETP inhibitors on different pathways through their interactions with corresponding off-targets (Fig. S8). The network consists of several interconnected metabolic, signal transduction, and gene regulation pathways. Each component of the network is separately shown in Fig. 2, Fig. 3, Fig. 4, and Fig. S9, and is discussed in detail in the following sections. It is notable that several predicted off-targets, especially the nuclear hormone receptors, are essential components in the network, involved in both positive and negative controls of several cellular systems. Nuclear hormone receptors are known as lipid-activated transcription factors that play key roles in lipid metabolism, inflammatory processes and the hormone system. The regulatory controls of our predicted nuclear hormone receptors are on pathways involved in hypertension, inflammation and cancer development. Torcetrapib, Anacetrapib and JTT705 showed different binding affinities to these receptors and thus different clinical outcomes resulting from the combinational responses of these receptors in related pathways.
The red, purple, and blue lines between inhibitors and off-targets indicate strong, relatively strong, and weak binding affinities, respectively. The brown and black lines between off-targets and pathways or clinical indications represent positive and negative regulation, respectively. A. Regulation control of nuclear hormone receptors on RAAS system. B. Binding profile of Torcetrapib on nuclear hormone receptors. C. Binding profile of Anacetrapib on nuclear hormone receptors. D. Binding profile of JTT-705 on nuclear hormone receptors.
The color and line schema are the same as those in Fig. 2. A. Regulation control of nuclear hormone receptors on inflammatory system. B. Binding profiles of Torcetrapib on nuclear hormone receptors. C. Binding profile of Anacetrapib on nuclear hormone receptors. D. Binding profile of JTT-705 on nuclear hormone receptors.
The color and line schema are the same as those in Fig. 2. A. Regulation control of nuclear hormone receptors on cancer system. B. Binding profiles of Torcetrapib on nuclear hormone receptors. C. Binding profile of Anacetrapib on nuclear hormone receptors. D. Binding profile of JTT-705 on nuclear hormone receptors.
Combinatorial control of nuclear hormone receptors in hypertension.
As shown in Fig. 2, the effects of the three CETP inhibitors on blood pressure can be explained through their influence on the Renin-Angiotension-Aldosterone System (RAAS), the main system for blood pressure regulation. When the RAAS system is too active, blood pressure becomes dangerously high. Several nuclear receptors highly ranked in our off-target list are involved in the regulation of this system, including proxisome proliferator-activated receptor (PPAR), retinoid X receptor (RXR), liver X receptor (LXR) and Vitamin D receptor (VDR) . As positive regulators, activation of PPAR, RXR and LXR increases gene expression of angiotensinogenase and then up-regulates RAAS, resulting in high blood pressure and increased aldosterone secretion. In contrast, VDR has a negative control on RAAS. Activation of VDR will balance the up-regulation effect of PPAR, RXR and LXR on blood pressure.
According to the normalized docking scores (Table 1), Torcetrapib, Anacetrapib and JTT-705 show distinctly different binding profiles to these nuclear receptors, consistent with their differing involvement in hypertension. To avoid inaccuracy in the docking calculation, three different categories (strong, relatively strong and weak) were used to estimate the binding affinity, instead of direct comparison of individual docking scores. It is noteworthy that the weak binding (large positive normalized docking score) is due mainly to the steric crashes between the CETP inhibitors and the receptor. As a result the inhibitors cannot fit into the binding pockets of these receptors. Interaction between the three CETP inhibitors and nuclear receptors are shown in Fig. 2 with different colors illustrating the type of interaction. When the three CETP inhibitors are docked as agonists to these nuclear hormone receptors the stronger binding affinity to the panel of positive regulators (PPAR, LXR and RXR) indicates stronger up-regulation of RAAS and higher risk of hypertension; stronger binding affinity to the negative regulator VDR implies a lower risk of hypertension. It is clearly shown in Fig. 2 that JTT-705 has relatively strong binding affinity not only to the positive regulators but also as an agonist to the negative regulator, implying the ability of JTT-705 to exhibit a balanced positive/negative control over RAAS and consequently a lesser chance to cause hypertension. In contrast to JTT-705, Torcetrapib binds more strongly to the active conformations of the positive regulators and leads to increased blood pressure through up-regulation of RAAS. Limited by its larger size, Anacetrapib can only bind to RXR and PPARδ in their active conformations and PPARα and VDR in their inactive conformations (listed in Table 1), even though it is in the same structural class as Torcetrapib. Thus, Anacetrapib has less effect on both the positive and negative control of blood pressure and its negative effect on blood pressure regulation may be less than Torcetrapib. Even though the detailed mechanism for the side-effect of hypertension caused by Torcetrapib is still unknown and the different binding profiles of the three CETP inhibitors needs experimental verification, our observations are consistent with the current clinic trial data from the three CETP inhibitors and the predicted off-targets provides information for future use in drug optimization.
Combinatorial control of nuclear hormone receptors in inflammation.
The effects of CETP inhibitors on inflammation are shown in Fig. 3. Activation of nuclear hormone receptors such as PPAR, LXR, GCR and RXR regulates gene expression associated with inflammation through different mechanisms , and consequently reduces the inflammatory response. For example, NF-κB plays a key role in regulating the immune response to infection. PPARα/γ, LXRα/β and GCR can block the NF-κB pathway by directly binding to AP1 and NF-κB –, acting downstream of NF-κB binding to DNA , or by competing for limited amounts of co-activators . There are other examples to show the PPAR induced trans-repression of inflammatory response genes ,. PPARγ/δ can also function as transcriptional regulators of monocyte phenotypic differentiation by promoting expression of target genes involved in M2 macrophage function thereby activating M2 macrophage so as to generate anti-inflammatory products –. Thus, the overall picture of activation of these nuclear hormone receptors involved in inflammatory response suggests that they have interesting anti-inflammatory effects. The binding profiles of Torcetrapib, Anacetrapib and JTT-705 to these nuclear hormone receptors (listed in Table 1 and shown in Fig. 3) indicate that JTT-705 has a broader control over the inflammation system to reduce the inflammatory response.
The regulatory effect of nuclear hormone receptors on cancer.
NF-κB regulates genes involved in cell proliferation and cell survival and hence is an interesting drug target in cancer treatment. Inhibition of NF-κB can potentially halts tumor progression and eliminate tumors ,. As discussed above, activation of PPARα/γ, LXRα/β and GCR will block the NF-κB pathway and thus prevent cancer. Of the three CETP inhibitors only JTT-705 is predicted to bind to these receptors and hence have the ability to control cell proliferation and tumor progression (Fig. 4).
Recent experiments have shown that PPARα and PPARγ can induce extracellular signal-regulated kinase (Erk) and/or p38 phosphorylation and then activate the MAPK/Erk signaling pathway ,. This pathway is involved in the action of most nonnuclear oncogenes and participates in cancer development . Interestingly, JTT-705 was shown to block cell proliferation through the activation of the p38 MAPK pathway , but the mechanism for how JTT-705 induces p38 MAPK activation is still unclear. Our results suggest a possible hypothesis (Fig. 5). JTT-705 could trigger the p38 MAPK pathway through its interaction with PPARα/γ and thus has the potential to prevent cell proliferation and cancer.
Regulatory effects of other identified off-targets.
Effects of PPAR and RXR are also regulated by fatty acid binding proteins (FABP) . FABP can function as an intracellular chaperone to transport fatty acids and drugs into the nucleus and directly interact with PPAR . The cooperation between FABP and PPAR will enhance the activities of PPAR in gene transcription regulation ,. FABP can also interact with hormone-sensitive lipases to potentially modulate their catalytic activity and thereby integrates several signaling networks that control inflammatory response potentially through the JNK/inhibitor of kappa kinase (IKK) and IKK–nuclear factor-κB (NF-κB) pathway . According to the calculated docking scores, only JTT-705 can bind to FABP and further regulate hypertension and inflammation.
Another type of highly ranked off-target, CD1, can also be directly related to the side- effect of infection through its function as an antigen-presenting protein in the immune system. T cells will recognize antigens presented by CD1 proteins and activate a cell-mediated immune response against microbial infections . Docking results show that all three drugs have a strong binding affinity to CD1, suggesting an impact on antimicrobial immunity and host response to infection.
Other putative off-targets such as ubiquinol-cytochrome-c reductases, globin-like proteins, EF hand-like calcium binding proteins (EFs), and LPTP are also directly or indirectly associated with hypertension, inflammation, and/or cancer. Recent studies suggest that ubiquinol-cytochrome-c reductase expression is indirectly regulated by steroid hormones in response to hypertension . Further, as one of the key protein components in the Q-cycle , it contributes to the regulation of cell death and repair  and may also be related to cancer and infection. It is interesting that hemoglobin has been found in non-hematopoietic organs such as the kidney acting as an anti-oxidative defense agent . It is also involved in the activation of KCl cotransporter activity ,, which may affect the regulation of blood pressure. EFs modulate vascular function through Ca2+ homeostasis and nitric oxide. It has also been observed that the lack of S100A1 (an EF protein) expression could lead to hypertension . EFs also have effects on transcription factors. They not only indirectly regulate the activity of transcription factors through their phosphorylation/dephosphorylation in response to Ca2+ levels but also directly control the transcriptional activity of the tumor suppressor p53 through interactions with its regulatory sequences . It is not surprising that LPTP is one of CETP's off-targets because they bind to the same or similar cognate ligands and are involved in lipid metabolism. However, the biological functions of phosphatidylinositol/phosphatidylcholine transfer proteins (PITPs) have not been well characterized ,. They may play a role in dense-core vesicles exocytosis, which regulates heart rate and blood pressure through the release of noradrenaline and adrenaline . Interactions between the three CETP inhibitors and these predicted off-targets show potential additional contributions to the side-effects of hypertension, inflammation and cancer.
In summary, most of the putative off-targets for CETP inhibitors are involved in interconnected lipid metabolism and signaling networks which activate or mediate various biological process such as hypertension, stress regulation , immune response  and cell death . Our predications are consistent with current clinical studies on all three CETP inhibitors, highlighting the interrelationship of multiple biological processes involved in hypertension, infection and cancer. These results call for further experimental validation.
Roles of combinatorial control in modulation of side-effects of CETP inhibitors
In vitro, in vivo and clinical studies indicate that CETP inhibitors exhibit pleotropic effects in humans through the interaction with unknown off-targets. We have identified a panel of proteins that likely bind to CETP inhibitors leading to the observed clinical indications. The putative off-target interactions are consistent with existing experimental data and provide insights into the molecular mechanisms of the side-effect profile of CETP inhibitors. Drug promiscuity depends not only on the similarity of ligand binding pockets in the related proteins but also the complexity of the drug itself . In general, smaller molecules are able to bind more targets. The same trend has been predicted for CETP inhibitors; the smallest JTT-705 is the most promiscuous and the largest, Anacetrapib, is the least promiscuous. However, in contrast to conventional wisdom that implies the more specific the binding the lesser the side-effects, the most promiscuous inhibitor, JTT-705, does not cause the side-effect of hypertension that is observed in the more specific Torcetrapib. Considering the regulation of blood pressure by NRs, it is possible that JTT-705 acts as an antagonist of NRs to down-regulate aldosterone. However, our results suggest that CETP inhibitors prefer binding to the agonist rather than the antagonist conformation of the NR. Experimental evidence also implies that JTT-705 actually activates NR to mediate Ras and p38 kinase pathways . Thus, it is more likely that the side-effect of CETP inhibitors is modulated by a combination of biological controls involved in many physiological processes such as cell proliferation , inflammation and hypertension. In other words, JTT-705 is involved in activation of NRs that contribute to both positive and negative controls of aldosterone. Although Torcetrapib is more specific and binds less off-targets than JTT-705, it only activates those NRs that up-regulate RAAS resulting in hypertension. To fully understand how small molecules can modulate physiological or pathological processes through such combinatorial control, it is necessary to simulate the dynamic properties of the biological system. To this end, it is a critical first step to identify all of the putative molecular receptors involved in the biological process and to connect them into a logical integrated protein-ligand interaction network.
Advantages and limitations of the methodology
The chemical systems biology approach developed here is limited by available protein structures that currently only cover approximately 50% of the human proteome, although the structural coverage of the human proteome will steadily increase with progress in structural genomics  and conventional structure determination. As a result, some potential off-targets may be missed because they are not included in the screening. In addition to establishing functional relationships between proteins using their sequences, structures and functional sites, there are significant efforts to relate drug targets to their ligands through chemical genomics analysis . However, the chemical genomics approach is restricted by the availability of bioactivity data. When exploring off-targets that cover the whole human proteome, this limitation becomes obvious since only a small number of target families explored by pharmaceutical companies are in the bioactivity database . Thus our method is complementary to existing chemical genomics approaches. Drug-target networks will be greatly expanded by combining chemical genomics data and a structural genome-wide off-target analysis. Several studies have attempted to extend the target-based method to the domain-based model through similar sequence motifs or global structures . In this study we further expand the scope of the chemical genomics approach beyond sequence and fold similarity by searching for similar ligand binding sites. Hence a ligand binding site-based approach will provide an ever improving way to generate a candidate list of proteins participating in interconnected biochemical pathways and to establish their relationships to biological processes. It is hoped that these approaches will eventually provide the foundation for the in silico simulation of the influence of small molecules on biological systems. In the interim it is noted that the analysis of incomplete networks is still invaluable in making new discoveries in biomedicine as exemplified by several recent studies ,.
Besides SMAP used in this study, a number of web servers for ligand binding site search are available, for example, SiteEngine , SitesBase ,, CavBase –, SuMo , PdbSiteScan , eF-Site ,, pvSOAR , and pevoSOAR . Compared with these servers, SMAP has several distinguishing features making it particularly suitable for identifying off-targets on a structural genome-wide scale. First, SMAP does not require prior knowledge of both the location and the boundary of the ligand binding site. Instead, whole proteins are scanned to find the most similar local patch in the spirit of local sequence alignment such as the Smith-Waterman algorithm . This feature makes SMAP appropriate for practical problems since typically the boundary of the ligand binding site is not clearly defined or depends on the ligand in the complex structure. Second, SMAP integrates geometric, evolutionary and physical information into a unified similarity score akin to a sequence alignment score. However, unlike conventional sequence alignment, the SMAP alignment is sequence order independent; a necessary requirement when comparing local binding sites. Third, because SMAP uses the reduced structure representation, it is not sensitive to structural uncertainty and flexibility. Thus SMAP can be applied to homology models and handle flexible ligand binding sites. Finally, we have developed a probability model to efficiently estimate the statistical significance of the binding site similarity. The model allows us to reliably identify similar ligand binding sites in a high throughput fashion. Despite these advantages of SMAP, it is expected that the best results will come from the combination of different tools as demonstrated by many studies in bioinformatics and molecular modeling.
Despite the success of ligand binding search algorithms in protein function prediction and drug design , , , , , , – currently no algorithm can retrieve all of the binding sites that bind a cognate ligand such as ATP. However, in the context of searching for off-targets of drug molecules, the actual number of false negatives may be limited based on the nature of the drug. False negatives in the ligand binding site search are due mainly to large conformational changes of the ligand and corresponding physical and geometric changes in the binding site. Most existing drugs are designed to selectively inhibit an exquisite target. They are more rigid and less adaptable to the changing environment of the binding site than the cognate ligand. For example, a protein kinase ATP competitive inhibitor is designed to inhibit only the ATP binding site of the protein kinase, not that of other superfamilies such as P-loop hydrolases. On the other hand, although rational drug design may take the same cognate ligand binding site into account, it rarely explores the cross-reactivity between binding sites that are not naturally designed for the same cognate ligand but are able to bind the same drug. Studies by others have shown that the drug binding site can be considered as a negative image of the drug to screen compound database  or vice versa to model the drug binding site . Hence ligand binding site similarity search is a valuable tool to identify off-targets that accommodates only the drug molecule but not necessarily all proteins that bind to the same cognate ligand across gene families. In general, the chemical systems biology approach developed in this paper is specific in identifying potential off-targets for drug-like molecules and could be used in concert with experimental design employing in vitro screening, in vivo screening and clinical trials.
Implications for drug discovery and development
Even with the current limited structural coverage of the human proteome, our predications are able to provide a testable hypothesis as to the suitability of a lead compound prior to conducting a clinical trial. Thus our findings have implications for drug discovery and development. In contrast to the conventional drug discovery process in which drug leads are optimized to reduce promiscuous binding, the possible combinatorial control of aldosterone regulation by CETP inhibitors suggests that adverse drug effects can be minimized through fine tuning of multiple off-target interactions. Although it is desirable for a drug to bind the primary target in a highly specific way, this is difficult to achieve considering the inherent similarity among protein binding pockets within and across gene families. Moreover, many biological process involve combinatorial control to provide redundancy and homeostasis . In such cases it becomes very difficult to modulate the systems behavior by inhibiting or activating only one single target protein. Thus, a multiple-target approach  and combination therapy  have been actively pursued to boost clinical efficacy in the treatment of diseases such as cancer and diabetes. However, these combined approaches are rarely systematic with the purposeful intent of developing therapeutics that bind to a primary target to treat the disease, but at the same time are considered to bind to desirable off-targets that modulate side-effects. In some cases this combined goal is achieved serendipitously as would seem to be the case for JTT-705. Instead of using a single molecule, it may be more feasible to use multiple components to treat a disease state and at the same time to reduce drug side-effects. Different from conventional combination therapy where all of components target disease related proteins, here only a subset of the molecules are directly therapeutic, other molecules serve the purpose of reducing side-effects by targeting non-disease related proteins. We speculate that many drugs which failed due to off-target effects can be rescued by this target-off-target combination therapy. For example, it is expected that the side-effect of Torcetrapib can be reduced by introducing molecules that binds to molecular components involved in the negative control of aldosterone regulation. Such therapies can be only rationally designed by exploring the system properties of the biological network.
Binding site similarity search on a genome scale
5,985 structures or models that cover approximately 57% of the human proteome were searched against CETP (PDB id: 2obd) ligand binding sites using the sequence order independent profile-profile alignment (SOIPPA) algorithm . A new statistical model was introduced to the original approach to estimate the significance of the alignment score . In brief, the alignment score for a given alignment length is fitted to an extreme value distribution (EVD):(1)Where:(2)where S is the raw SOIPPA similarity score. μ and σ are fitted to the logarithm of N, which is the alignment length between two proteins:(3)(4)Six parameters a, b, c, d, e, and f are 5.963, −15.523, 21.690, 3.122, −9.449, and 18.252 for the McLachlan similarity matrix used in this study, respectively.
Using this statistical model, 276 off-targets are identified with p-values less than 1.0e-3.
Reverse screening of the human structural proteome
The putative 276 off-targets are subject to further investigation using more computationally intensive protein-ligand docking. After removing three structures with the same fold as CETP, JTT-705, the smallest CETP inhibitor, is docked to the remaining 273 structures using two commonly used fast docking programs, Surflex 2.1  (default setting) and eHits 6.2  (fastest setting). 69 structures with a Surflex docking score smaller than 0.0 or an eHits score larger than 0.0 are considered to be difficult to fit JTT-705 due to significant steric crashes (and hence the other two inhibitors based on size) and are removed from the putative off-target list. The remaining 204 structures are subject to further investigation using the docking software AutoDock4.0  and other more computationally intense methods as described below.
Global structure similarity network of off-targets
An all-against-all global structural similarity analysis between the 204 putative off-targets was computed using CE . A graph is constructed with each of the structures as a node. An edge is formed between two nodes if their CE z-score is larger than 4.0 (a superfamily level similarity) .
Volume of the binding pocket
Normalized docking score
Drug-like molecules are downloaded from ZINC (http://zinc.docking.org) . From this database, six sets of molecules are randomly selected with a fixed number, 5, 10, 15, 20, 25 and 29 carbon atoms, respectively; each set includes 100 molecules. These molecules are docked to CETP and its putative off-targets using eHiTs  and AutoDock4.0 . The correlation of the docking score to the number of carbon atoms is derived from linear regression for each of the protein receptors. From the linear fitting curve, the average docking score for molecules with a certain number of carbon atoms can be estimated.
Based on the fitted average docking score, a normalized docking score DS is calculated as a z-score:(5)Where Si is the raw docking score for the molecule with i carbon atoms, μi is the fitted average docking score for the number of carbon atoms i, σ is the standard deviation, which is not dependent on the size of molecules and is approximately 1.0 in all cases.
Vector distance of the average docking score
The vector distance of the average docking score D between CETP and its off-targets is calculated from the average values of the docking scores for randomly selected molecules with fixed numbers of 5, 10, 15, 20, 25 and 29 carbon atoms as follows:(6)where SCETP and Soff are the average values of carbon atom size dependent docking scores to CETP and its off-targets, respectively.
In this case study, we identify a panel of off-targets of CETP inhibitors using a chemical systems biology approach. All of the identified off-targets belong to different protein superfamilies from the primary target, but are structurally and functionally related, being mainly involved in lipid metabolism, immune response and signaling networks. Among them, CD1, nuclear hormone receptors and lipid transport proteins are the most likely off-targets with highly consistent results from multiple resources including functional correlation, ligand binding site similarity, hydrophobic scales, and predicted binding affinities. Moreover, the elucidated off-target effects from these proteins are strongly correlated to clinical and in vitro observations. Their combinatorial control of biological process plays a key role in the modulation of the adverse drug effect of CETP inhibitors. This study demonstrates that a chemical systems biology approach, which systematically explores protein-ligand interactions on a genome-wide scale and incorporates them into biological pathways, will provide us with valuable clues as to the molecular basis of cellular function. At the same time, it will help to transform the conventional single-target-single-drug drug discovery process to a new multi-target-multi-molecule paradigm.
Four endogenous ligands in the CETP complex structure (PDB id: 2OBD).
(0.09 MB DOC)
Structural coverage of the human proteome vs. alignment length between the protein sequence and the structural template.
(0.06 MB DOC)
CE Z-score distributions of putative off-targets.
(0.06 MB DOC)
Structural clusters of helix-like proteins.
(0.07 MB DOC)
Global structure similarity between glycolipid transport protein (PDB: 1tfj) and nuclear hormone receptor ligand binding domain (PDB: 1yow).
(0.17 MB DOC)
Regression curves of eHiTs score for CETP and its off-targets dependent on the number of carbon atoms for a) 2obd, b) 1yow, c) 1y0s, d) 2p54, e) 1zeo, and f) 1ie8.
(0.38 MB DOC)
Correlation of eHiTS score between CETP and its off-targets binding with random ligands with different sizes. a) 1yow; b) 1y0s; c) 2p54; d) 1zeo; e) 1ie8; f) 1tfj.
(2.04 MB DOC)
Correlation of the off-target interaction network of CETP inhibitors with the clinical indication through interconnected biological pathways.
(0.23 MB DOC)
The different regulation effects of Torcetrapib, Anacetrapib and JTT-705 on hypertension, inflammation and cancer through combinational control of other identified off-targets.
(0.10 MB DOC)
Putative off-targets of CETP inhibitors across the human structural genome identified from the off-target pipeline SMAP.
(0.05 MB DOC)
GO based similarity between CETP and off-targets.
(0.03 MB DOC)
We appreciate the constructive suggestions of the anonymous reviewers and the editor in improving the manuscript. We thank Ms. Lyn Jia for her assistance in making figures.
Conceived and designed the experiments: Lei Xie PEB. Performed the experiments: Li Xie JL Lei Xie. Analyzed the data: Li Xie Lei Xie. Contributed reagents/materials/analysis tools: Lei Xie. Wrote the paper: Li Xie Lei Xie PEB.
- 1. Kuhn M, Campillos M, González P, Jensen LJ, Bork P (2008) Large-scale prediction of drug-target relationships. FEBS Lett 582: 1283–1290.
- 2. Weber A, Casini A, Heine A, Kuhn D, Supuran CT, et al. (2004) Unexpected nanomolar inhibition of carbonic anhydrase by COX-2-selective celecoxib: new pharmacological opportunities due to related binding site recognition. J Med Chem 47: 550–557.
- 3. Keiser MJ, Roth BL, Armbruster BN, Ernsberger P, Irwin JJ, et al. (2007) Relating protein pharmacology by ligand chemistry. Nat Biotechnol 25: 197–206.
- 4. Xie L, Wang J, Bourne PE (2007) In silico elucidation of the molecular mechanism defining the adverse effect of selective estrogen receptor modulators. PLoS Comp Biol 3: e217.
- 5. Paolini GV, Shapland RH, van Hoorn WP, Mason JS, Hopkins AL (2006) Global mapping of pharmacological space. Nat Biotechnol 24: 805–815.
- 6. Zimmermann GR, Lehár J, Keith CT (2007) Multi-target therapeutics: when the whole is greater than the sum of the parts. Drug Discov Today 12: 34–42.
- 7. O'Connor KA, Roth BL (2005) Finding new tricks for old drugs: an efficient route for public-sector drug discovery. Nat Rev Drug Discov 4: 1005–1014.
- 8. Ashburn TT, Thor KB (2004) Drug repositioning: identifying and developing new uses for existing drugs. Nat Rev Drug Discov 3: 673–683.
- 9. Kennedy T (1997) Managing the drug discovery /development interface. Drug Discov Today 2: 436–444.
- 10. Fitzgerald JB, Schoeberl B, Nielsen UB, Sorger PK (2006) Systems biology and combination therapy in the quest for clinical efficacy. Nat Chem Biol 2: 458–466.
- 11. Campillos M, Kuhn M, Gavin AC, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321: 263–266.
- 12. Bender A, Young DW, Jenkins JL, Serrano M, Mikhailov D, et al. (2007) Chemogenomic data analysis: prediction of small-molecule targets and the advent of biological fingerprint. Comb Chem High Throughput Screen 10: 719–731.
- 13. Jacoby E (2006) Chemogenomics: drug discovery's panacea? Mol Biosyst 2: 218–220.
- 14. Mestres J (2004) Computational chemogenomics approaches to systematic knowledge-based drug discovery. Curr Opin Drug Discov Devel 7: 304–313.
- 15. Rognan D (2007) Chemogenomic approaches to rational drug design. Br J Pharmacol 152: 38–52.
- 16. Savchuk NP, Balakin KV, Tkachenko SE (2004) Exploring the chemogenomic knowledge space with annotated chemical libraries. Curr Opin Chem Biol 8: 414–417.
- 17. Wuster A, Madan Babu M (2008) Chemogenomics and biotechnology. Trends Biotechnol 26: 252–258.
- 18. Hert , Keiser , Irwin , Oprea , Shoichet (2008) Quantifying the relationships among drug classes. J Chem Inf Model 48: 755–765.
- 19. Xie L, Bourne PE (2007) A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand binding sites. BMC Bioinformatics 8: S9.
- 20. Xie L, Bourne PE (2008) Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments. Proc Natl Acad Sci U S A 105: 5441–5446.
- 21. Barter PJ, Caulfield M, Eriksson M, Grundy SM, Kastelein JJP, et al. (2007) Effects of torcetrapib in patients at high risk for coronary events. N Engl J Med 357: 2109–2122.
- 22. Forrest MJ, Bloomfield D, Briscoe RJ, Brown PN, Cumiskey A-M, et al. (2008) Torcetrapib-induced blood pressure elevation is independent of CETP inhibition and is accompanied by increased circulating levels of aldosterone. Br J Pharmacol 154: 1465–1473.
- 23. Howes LG, Kostner K (2007) The withdrawal of torcetrapib from drug development: implications for the future of drugs that alter HDL metabolism. Expert Opin Investig Drugs 16: 1509–1516.
- 24. Joy TR, Hegele RA (2008) The failure of torcetrapib: what have we learned? Br J Pharmacol 154: 1379–1381.
- 25. Kontush A, Guérin M, Chapman MJ (2008) Spotlight on HDL-raising therapies: insights from the torcetrapib trials. Nat Clin Pract Cardiovasc Med 5: 329–336.
- 26. Miura S, Matsuo Y, Kawamura A, Saku K (2005) JTT-705 blocks cell proliferation and angiogenesis through p38 kinase/p27(kip1) and Ras/p21(waf1) pathways. Atherosclerosis 182: 267–275.
- 27. Jain AN (2007) Surflex-Dock 2.1: robust performance from ligand energetic modeling, ring flexibility, and knowledge-based search. J Comput Aided Mol Des 21: 281–306.
- 28. Zsoldos Z, Reid D, Simon A, Sadjad SB, Johnson AP (2007) eHiTS: a new fast, exhaustive flexible ligand docking system. J Mol Graph Model 26: 198–212.
- 29. Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, et al. (1998) Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J Comput Chem 19: 1639–1662.
- 30. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235–242.
- 31. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
- 32. Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, et al. (2008) The Pfam protein families database. Nucleic Acids Res 36: D281–D288.
- 33. Ye Y, Godzik A (2003) Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19: ii246–ii255.
- 34. Bensinger SJ, Tontonoz P (2008) Integration of metabolism and inflammation by lipid-activated nuclear receptors. Nature 454: 470–477.
- 35. Fernández JM, Hoffmann R, Valencia A (2007) iHOP web services. Nucleic Acids Res 35: W21–W26.
- 36. Schlicker A, Albrecht M (2008) FunSimMat: a comprehensive functional similarity database. Nucleic Acids Res 36: D434–D439.
- 37. Resnik P (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Artif Intell Res 11: 95–130.
- 38. Wang R, Lu Y, Wang S (2003) Comparative evaluation of 11 scoring functions for molecular docking. J Med Chem 46: 2287–2303.
- 39. Ferrara P, Gohlke H, Price DJ, Brooks CLI, Klebe G (2004) Assessing scoring functions for protein-ligand interactions. J Med Chem 47: 3032–3047.
- 40. Warren GL, Andrews CW, Capelli A-M, Clarke B, LaLonde J, et al. (2006) A critical assessment of docking programs and scoring functions. J Med Chem 49: 5912–5931.
- 41. Hopkins AL, Mason JS, Overington JP (2006) Can we rationally design promiscuous drugs? Curr Opin Struct Biol 16: 127–136.
- 42. Kuipers I, van der Harst P, Navis G, van Genne L, Morello F, et al. (2008) Nuclear hormone receptors as regulators of the renin-angiotensin-aldosterone system. Hypertension 51: 1442–1448.
- 43. Khovidhunkit W, Kim M-S, Memon RA, Shigenaga JK, Moser AH, et al. (2004) Effects of infection and inflammation on lipid and lipoprotein metabolism: mechanisms and consequences to the host. J Lipid Res 45: 1169–1196.
- 44. Delerive P, De Bosscher K, Besnard S, Vanden Berghe W, Peters JM, et al. (1999) Peroxisome proliferator-activated receptor alpha negatively regulates the vascular inflammatory gene response by negative cross-talk with transcription factors NF-kappa B and AP-1. J Biol Chem 274: 32048–32054.
- 45. Chung SW, Kang BY, Kim SH, Pak YK, Cho D, et al. (2000) Oxidized low density lipoprotein inhibits interleukin-12 production in lipopolysaccharide-activated mouse macrophages via direct interactions between peroxisome proliferator-activated receptor-gamma and nuclear factor-kappa B. J Biol Chem 275: 32681–32687.
- 46. Karin M, Chang L (2001) AP-1-glucocorticoid receptor crosstalk taken to a higher level. J Endocrinol 169: 447–451.
- 47. Nissen RM, Yamamoto KR (2000) The glucocorticoid receptor inhibits NF kappa B by interfering with serine-2 phosphorylation of the RNA polymerase II carboxy-terminal domain. Genes Dev 14: 2314–2329.
- 48. Sternberg EM (2006) Neural regulation of innate immunity: a coordinated nonspecific host response to pathogens. Nat Rev Immunol 6: 318–328.
- 49. Joseph SB, Castrillo A, Laffitte BA, Mangelsdorf DJ, Tontonoz P (2003) Reciprocal regulation of inflammation and lipid metabolism by liver X receptors. Nat Med 9: 213–219.
- 50. Li M, Pascual G, Glass CK (2000) Peroxisome proliferator-activated receptor gamma-dependent repression of the inducible nitric oxide synthase gene. Mol Cell Biol 20: 4699–4707.
- 51. Pascual G, Fong AL, Ogawa S, Gamliel A, Li AC, et al. (2005) A SUMOylation-dependent pathway mediates transrepression of inflammatory response genes by PPAR-gamma. Nature 437: 759–763.
- 52. Lee CH, Curtiss LK (2003) Transcriptional repression of atherogenic inflammation: Modulation by PPAR delta (vol 302, pg 453, 2003). Science 302: 1153.
- 53. Odegaard JI, Ricardo-Gonzalez RR, Goforth MH, Morel CR, Subramanian V, et al. (2007) Macrophage-specific PPAR gamma controls alternative activation and improves insulin resistance. Nature 447: 1116–U1112.
- 54. Vats D, Mukundan L, Odegaard JI, Zhang L, Smith KL, et al. (2006) Oxidative metabolism and PGC-1 beta attenuate macrophage-mediated inflammation. Cell Metabolism 4: 255–255.
- 55. Gosset P, Charbonnier AS, Delerive P, Fontaine J, Staels B, et al. (2001) Peroxisome proliferator-activated receptor gamma activators affect the maturation of human monocyte-derived dendritic cells. Eur J Immunol 31: 2857–2865.
- 56. Escarcega RO, Fuentes-Alexandro S, Garcia-Carrasco M, Gatica A, Zamora A (2007) The transcription factor nuclear factor-kappa B and cancer. Clin Oncol (R Coll Radiol) 19: 154–161.
- 57. Stathopoulos GT, Zhu Z, Everhart MB, Kalomenidis I, Lawson WE, et al. (2006) Nuclear factor-kappaB affects tumor progression in a mouse model of malignant pleural effusion. Am J Respir Cell Mol Biol 34: 142–150.
- 58. Gardner OS, Dewar BJ, Earp HS, Samet JM, Graves LM (2003) Dependence of peroxisome proliferator-activated receptor ligand-induced mitogen-activated protein kinase signaling on epidermal growth factor receptor transactivation. J Biol Chem 278: 46261–46269.
- 59. Pozzi A, Ibanez MR, Gatica AE, Yang S, Wei S, et al. (2007) Peroxisomal proliferator-activated receptor-alpha-dependent inhibition of endothelial cell proliferation and tumorigenesis. J Biol Chem 282: 17685–17695.
- 60. Chang L, Karin M (2001) Mammalian MAP kinase signalling cascades. Nature 410: 37–40.
- 61. Furuhashi M, Hotamisligil GS (2008) Fatty acid-binding proteins: role in metabolic diseases and potential as drug targets. Nat Rev Drug Discov 7: 489–503.
- 62. Wolfrum C, Borrmann CM, Borchers T, Spener F (2001) Fatty acids and hypolipidemic drugs regulate peroxisome proliferator-activated receptors alpha- and gamma-mediated gene expression via liver fatty acid binding protein: a signaling path to the nucleus. Proc Natl Acad Sci U S A 98: 2323–2328.
- 63. Schachtrup C, Emmler T, Bleck B, Sandqvist A, Spener F (2004) Functional analysis of peroxisome-proliferator-responsive element motifs in genes of fatty acid-binding proteins. Biochem J 382: 239–245.
- 64. Tan N-S, Shaw NS, Vinckenbosch N, Liu P, Yasmin R, et al. (2002) Selective cooperation between fatty acid binding proteins and peroxisome proliferator-activated receptors in regulating transcription. Mol Cell Biol 22: 5114–5127.
- 65. Ulrichs T, Porcelli SA (2000) CD1 proteins: targets of T cell recognition in innate and adaptive immunity. Rev Immunogenet 2: 416–432.
- 66. Huynh H, Servant N, Chalifour LE (2007) Ubiquinol-cytochrome-c reductase 7.2 kDa protein of mitochondrial complex III is steroid-responsive and increases in cardiac hypertrophy and hypertension. Can J Physiol Pharmacol 85: 986–996.
- 67. Trumpower BL (1990) The protonmotive Q-cycle: energy transduction by coupling of proton translocation to electron-transfer by the cytochrome-bc1 complex. J Biol Chem 265: 11409–11412.
- 68. Mitchell P (1975) The protonmotive Q cycle: a general formulation. FEBS Lett 59: 137–139.
- 69. Skulachev VP (1998) Cytochrome c in the apoptotic and antioxidant cascades. FEBS Lett 423: 275–280.
- 70. Nishi H, Inagi R, Kato H, Tanemoto M, Kojima I, et al. (2008) Hemoglobin is expressed by mesangial cells and reduces oxidant stress. J Am Soc Nephrol 19: 1500–1508.
- 71. Romero JR, Suzuka SM, Romero-González GV, Nagel RL, Fabry ME (2001) K:Cl cotransport activity is inhibited by HCO3− in knockout mouse red cells expressing human HbC. Blood Cells Mol Dis 27: 69–70.
- 72. Romero JR, Suzuka SM, Nagel RL, Fabry ME (2004) Expression of HbC and HbS, but not HbA, results in activation of K-Cl cotransport activity in transgenic mouse red cells. Blood 103: 2384–2390.
- 73. Pleger ST, Harris DM, Shan C, Vinge LE, Chuprun JK, et al. (2008) Endothelial S100A1 modulates vascular function via nitric oxide. Circ Res 102: 786–794.
- 74. Ikura M, Osawa M, Ames JB (2002) The role of calcium-binding proteins in the control of transcription: structure to function. Bioessays 24: 625–636.
- 75. Phillips SE, Vincent P, Rizzieri KE, Schaaf G, Bankaitis VA, et al. (2006) The diverse biological functions of phosphatidylinositol transfer proteins in eukaryotes. Crit Rev Biochem Mol Biol 41: 21–49.
- 76. Routt SM, Bankaitis VA (2004) Biological functions of phosphatidylinositol transfer proteins. Biochem Cell Biol 82: 254–262.
- 77. Sugita S (2008) Mechanisms of exocytosis. Acta Physiol (Oxf) 192: 185–193.
- 78. Kültz D (2005) Molecular and evolutionary basis of the cellular stress response. Annu Rev Physiol 67: 225–257.
- 79. Yaqoob P (2003) Lipids and the immune response: from molecular mechanisms to clinical applications. Curr Opin Clin Nutr Metab Care 6: 133–150.
- 80. Cristea IM, Degli Esposti M (2004) Membrane lipids and cell death: an overview. Chem Phys Lipids 129: 133–160.
- 81. Danielpour D, Song K (2006) Cross-talk between IGF-I and TGF-beta signaling pathways. Cytokine Growth Factor Rev 17: 59–74.
- 82. Xie L, Bourne PE (2005) Functional coverage of the human genome by existing structures, structural genomics targets, and homology models. PLoS Comp Biol 1: e31.
- 83. Strömbergsson H, Kryshtafovych A, Prusis P, Fidelis K, Wikberg JES, et al. (2006) Generalized modeling of enzyme-ligand interactions using proteochemometrics and local protein substructures. Proteins 65: 568–579.
- 84. Shulman-Peleg A, Nussinov R, Wolfson HJ (2004) Recognition of functional sites in protein structures. J Mol Biol 339: 607–633.
- 85. Gold ND, Jackson RM (2006) SitesBase: a database for structure-based protein-ligand binding site comparisons. Nucleic Acids Res 34: D231–234.
- 86. Gold ND, Jackson RM (2006) A searchable database for comparing protein-ligand binding sites for the analysis of structure-function relationships. J Chem Inf Model 46: 736–742.
- 87. Schmitt S, Kuhn D, Klebe G (2003) A new method to detect related function among proteins independent of sequence and fold homology. J Mol Biol 323: 387–406.
- 88. Kuhn D, Weskamp N, Schmitt S, Hullermeier E, Klebe G (2006) From the similarity analysis of protein cavities to the functional classification of protein families using cavbase. J Mol Biol 359: 1023–1044.
- 89. Weskamp N, Kuhn D, Hullermeier E, Klebe G (2004) Efficient similarity search in protein structure databases by k-clique hashing. Bioinformatics 20: 1522–1526.
- 90. Jambon M, Imberty A, Deleage G, Geourjon C (2003) A new bioinformatic approach to detect common 3D sites in protein structures. Proteins 52: 137–145.
- 91. Ivanisenko VA, Pintus SS, Grigorovich DA, Kolchanov NA (2004) PDBSiteScan: a program for searching for active, binding and posttranslational modification sites in the 3D structures of proteins. Nucleic Acids Res 32: W549–W554.
- 92. Kinoshita K, Furui J, Nakamura H (2001) Identification of protein functions from a molecular surface database, eF-site. J Struct Funct Genomics 2: 9–22.
- 93. Kinoshita K, Nakamura H (2003) Identification of protein biochemical functions by similarity search using the molecular surface database eF-site. Protein Sci 12: 1589–1595.
- 94. Binkowski TA, Freeman P, Liang J (2004) pvSOAR: detecting similar surface patterns of pocket and void surfaces of amino acid residues on proteins. Nucleic Acids Res 32: W555–W558.
- 95. Tseng YY, Dundas J, Liang J (2009) Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns. J Mol Biol 387: 451–464.
- 96. Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147: 195–197.
- 97. Binkowski TA, Adamian L, Liang J (2003) Inferring functional relationships of proteins from local sequence and spatial surface patterns. J Mol Biol 332: 505–526.
- 98. Laskowski RA, Watson JD, Thornton JM (2005) Protein function prediction using local 3D templates. J Mol Biol 351: 614–626.
- 99. Watson JD, Laskowski RA, Thornton JM (2005) Predicting protein function from sequence and structural data. Curr Opin Struct Biol 15: 275–284.
- 100. Fukunishi Y, Kubota S, Kanai C, Nakamura H (2006) A virtual active compound produced from the negative image of a ligand-binding pocket, and its application to in-silico drug screening. J Comput Aided Mol Des 20: 237–248.
- 101. Tanrikulu Y, Schneider G (2008) Pseudoreceptor models in drug design: bridging ligand- and receptor-based virtual screening. Nat Rev Drug Discov 7: 667–677.
- 102. Tortora G, Bianco R, Daniele G (2004) Strategies for multiple signalling inhibition. J Chemother 16: (Suppl 4)41–43.
- 103. Xie L, Xie L, Bourne PE (2009) A unified statistical model to support local sequence order independent similarity searching for ligand binding sites and its application to genome-based drug discovery. Bioinformatics. In press.
- 104. Shindyalov IN, Bourne PE (1998) Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng 9: 739–747.
- 105. Dundas J, Zheng O, Tseng J, Binkowski B, Turpaz Y, et al. (2006) CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated resiudes. Nucleic Acids Res 34: W116–W118.
- 106. Shoichet BK, Irwin JJ (2005) ZINC—a free database of commercially available compounds for virtual screening. J Chem Inf Model 45: 177–182.