The dose response curve is the gold standard for measuring the effect of a drug treatment, but is rarely used in genomic scale transcriptional profiling due to perceived obstacles of cost and analysis. One barrier to examining transcriptional dose responses is that existing methods for microarray data analysis can identify patterns, but provide no quantitative pharmacological information. We developed analytical methods that identify transcripts responsive to dose, calculate classical pharmacological parameters such as the EC50, and enable an in-depth analysis of coordinated dose-dependent treatment effects. The approach was applied to a transcriptional profiling study that evaluated four kinase inhibitors (imatinib, nilotinib, dasatinib and PD0325901) across a six-logarithm dose range, using 12 arrays per compound. The transcript responses proved a powerful means to characterize and compare the compounds: the distribution of EC50 values for the transcriptome was linked to specific targets, dose-dependent effects on cellular processes were identified using automated pathway analysis, and a connection was seen between EC50s in standard cellular assays and transcriptional EC50s. Our approach greatly enriches the information that can be obtained from standard transcriptional profiling technology. Moreover, these methods are automated, robust to non-optimized assays, and could be applied to other sources of quantitative data.
Transcriptional profiling is arguably the most powerful hypothesis-free method for investigating biological effects of drugs—so why do the experiments typically use outmoded single-dose designs? Such single-dose experiments will co-mingle effects that can occur with different potency (e.g., effects on the known target versus effects on additional undesired targets). Single-dose experiments have little comparability to the dose-response bioassays, which are now used throughout the drug discovery processes. One reason for the disparity between experimental approaches is that existing analytical methods for dose-response bioassays can't cope with the dimensionality of microarray data: a typical bioassay is optimized for one response, then used to run a screen against thousands of compounds; whereas transcriptional profiling measures thousands of non-optimized responses to a single compound. Conversely, existing methods for microarray data analysis can identify patterns, but provide no quantitative dose-response information. To overcome these problems, we developed novel algorithms and visualization methods that allow anyone to apply transcriptional profiling as a conventional dose-response assay. The approach provides far more information than limited-dose designs, yet is economical (12 arrays/compound). With this new analytical framework, it is now possible to identify distinct transcriptional responses at distinct regions of the dose range, to link these impacts to biological pathways, and to make realistic connections to drug targets and to other bioassays.
Citation: Ji R-R, de Silva H, Jin Y, Bruccoleri RE, Cao J, He A, et al. (2009) Transcriptional Profiling of the Dose Response: A More Powerful Approach for Characterizing Drug Activities. PLoS Comput Biol 5(9): e1000512. doi:10.1371/journal.pcbi.1000512
Editor: Greg Tucker-Kellogg, Lilly Singapore Centre for Drug Discovery, Singapore
Received: April 8, 2009; Accepted: August 20, 2009; Published: September 18, 2009
Copyright: © 2009 Ji et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by Bristol-Myers Squibb. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors are current and previous employees of Bristol-Myers Squibb, who is the manufacturer of dasatinib (one of the compounds included in the study).
The necessity of dose information in interpreting drug effects has been recognized since the 16th century, when Paracelsus observed: “All things are poison, and nothing is without poison: the dose alone makes a thing not poison” . Today, dose-response models are routinely used to evaluate drug effects in biochemical and cell-based assays. Pharmacological parameters such as the widely used EC50 value (half-maximal Effective Concentration) are central to any discussion of drug activities. In contrast, transcription profiling experiments are typically performed using replicate treatments at one dose, and effects are identified by analysis of variance . Single-dose experiments cannot distinguish effects that have different potencies, and they limit the utility of expression data relative to other bioassays. This is regrettable given the many applications of transcriptional profiling in drug discovery –.
There is no inherent reason for transcription profiling not to use the dose-response designs seen in every other area of chemical biology . Transcript levels are known to exhibit dose-responsive behavior in response to ligands, toxins and pharmacological agents –. Compound:target interaction at a single site that follows the law of mass action is reflected by the sigmoidal dose response seen in many bioassays . Although the algorithms used to quantify such dose responses in optimized bioassays are not ideal for microarray data, they have been used successfully to identify dose-responsive transcripts in two studies ,,.
While transcriptional responses are typically controlled through second messengers, it can be shown mathematically  and empirically  that when intermediate steps have the same characteristics, the sigmoidal response is preserved. An important corollary of these properties is that if a compound binds with distinct potencies to multiple targets, multiple biological responses will occur, with EC50 values corresponding to the target-binding EC50. Transcriptional profiling provides an informative genome-wide view of biological responses , thus obtaining quantitative dose-response information for transcript responses has obvious application in characterizing compounds that have high potential to interact with multiple targets. For example, establishing selectivity of kinase inhibitors across the human kinome continues to be difficult .
We describe analysis of transcription profiling studies of the dose responses to four kinase inhibitors: imatinib, nilotinib, dasatinib and PD0325901. Imatinib is a relatively selective  clinical ABL inhibitor; nilotinib is a similar but more potent second-generation compound . Dasatinib is a highly potent clinical ABL inhibitor that has additional activities on Src family  and receptor tyrosine kinases . PD0325901 is a non-ATP competitive inhibitor of MEK, a threonine/tyrosine kinase . Like most pharmaceutical agents, these compounds bind a single site on their target and elicit sigmoidal dose responses in biochemical , and cellular – assays. We developed novel methods to efficiently identify the transcripts that exhibit a sigmoidal dose response, and to visualize and further characterize groups of coordinated transcriptional responses. These analyses allow comparisons of potency, establish connection to other cellular assays, and provide insight into the mechanism and selectivity of compounds.
Identification of dose-responsive transcripts
Experimental data were obtained from the human lung cell line A549, treated for four hours with imatinib, nilotinib, dasatinib and PD0325901. This timeframe allows both down- and up-regulated transcripts to be identified, since there is detectable mRNA turnover for the majority of transcripts in four hours . For the ABL inhibitors, a 20 hour treatment was also performed. Treatments used 12 concentrations from 170 pM to 30 µM (a three-fold dilution series covering a six logarithmic range). This dose regimen is modeled on other dose-response bioassays, and is amenable to identifying responses with EC50s in the interval between 0.54 nM and 10 µM. Each of the 22,215 probesets on an Affymetrix HG-133A array was treated as an assay for the response of its corresponding transcript; the 12 intensity values for each treatment constituted the assay data. Only sigmoidal dose responses were evident in a hierarchical cluster analysis of the data (Text S1).
A typical sigmoidal dose response curve is defined by an equation with four unknowns corresponding to minimal response (A), maximal response (B), EC50 (C), and slope (D) . To identify dose-responsive transcripts, we developed a grid search-based algorithm named Sigmoidal Dose Response Search (SDRS). For each probeset, the SDRS algorithm tests a series of candidate EC50 values (C) across the dose range. The goodness of fit at every grid search dose is measured by an F-statistic, calculated as the ratio between mean square of regression and mean square of error. When the data for a given probeset fits a sigmoidal dose response, its F-statistic plot has an inverted ‘V’ shape (Figure 1A), and the values of A, B, C and D that generate the maximal F are the best fitting model (Figure 1B), and C approximates the true EC50 for the assay data. Given the normality of residuals (Text S1 and Table S9), the statistic follows an F-distribution, and F-tables can be used to establish significance. A probeset is designated as a ‘response transcript’ if its maximal F-statistic is larger than the critical value for P<0.05 (see Methods) In this work, this criterion for a response transcript is used for benchmarking to other algorithms, and to retrieve gene sets for study after their significance has been established with methods that employ multiple test corrections.
(A) SDRS analysis of intensity data for probeset 205016_at (Transforming Growth Factor Alpha), following a four hour treatment with dasatinib. The plot shows the maximal F-statistic obtained for this probeset at each C value evaluated by SDRS. The critical value of the F-distribution at P<0.05 is indicated. The global maximal F was obtained from fit of experimental data to a sigmoidal dose response model with the values of A, B, C, D shown. (B) The experimental data for probeset 205016_at, and the curve obtained using the optimal model parameters established by SDRS in the equation shown. (C) Schematic diagram of the data analysis flow. The SDRS report (left) contains the calculated pharmacological parameters for each probeset, and can be used to identify individual response transcript of interest. This report is qualitatively similar to the output of an iterative algorithm. The SDRS Summary Doses output (right) is unique to the grid search method. This dataset is used for the further approaches shown, which characterize and compare the overall transcriptional responses. A more detailed flowchart of SDRS is provided in Text S1.
The performance of SDRS compared favorably to XLfit (Text S1 and Table S1), software that implements the Levenberg-Marquardt algorithm . Using SDRS, response transcripts were identified for the four kinase inhibitors in the seven treatments described (Table S2).
As diagrammed in Figure 1C, one output of SDRS is qualitatively similar to that of an iterative algorithm: each probeset has a predicted EC50, P value and fold-change (Table S2). However, SDRS also generates an F-statistic for every probeset at each grid search dose level. This output, which is unique to the grid search method, allowed us to characterize and compare the coordinated transcriptional dose responses using the novel approaches described below.
Characterization of the transcriptional dose response
SDRS identifies the subset of transcripts that exhibit sigmoidal dose response behavior, and estimates the potency of the effect. While each probeset is an independent assay, transcript levels respond to the treatment's effect on a limited number of biological targets. At doses where the treatment impacts a target, one might identify coordinated sets of transcription responses. Identifying more than one coordinated set of responses that occur at distinct doses within a treatment series would point to multi-target pharmacology of a drug. However, if one has only a single EC50 value per probeset, it is impossible to identify such coordinated responses without imposing binning criteria, which are inherently arbitrary and not amenable to statistical evaluation.
The SDRS output allows an alternative, statistically rigorous means to identify coordinated transcriptional responses. We exploited the fact that SDRS provides a list of the F-statistic for every probeset at each grid search dose. (To simplify presentation, we use only the F-statistic lists from ‘Summary Doses’: a log-evenly distributed subset of the SDRS search doses). A false discovery rate (FDR) correction was applied to each Summary Dose list; this multiple test correction effectively removes spurious response transcripts (Text S1 and Table S8). A bar chart of the results for a given FDR revealed ‘peaks’ at regions of the dose range, indicative of coordinated transcriptional responses (Figure 2A). Since a single FDR criterion also represents an arbitrary limitation of analysis, we computed the numbers of response transcripts corresponding to FDRs from 1% to 35%.
(A) Bar chart displaying the number of probesets whose F-statistic for goodness of fit to a sigmoidal curve passes a 10% FDR criterion at each SDRS Summary Dose. The results shown are from a four hour treatment with PD0325901. These results are also presented as the tenth row of the heatmap in (B). (B–C) Heatmaps showing the number of probesets whose F-statistic for goodness of fit to a sigmoidal curve passes the given FDR criterion at each SDRS Summary Dose. In (B), the horizontal axis contains the SDRS Summary Doses, and the vertical axis contains one row for each of the 35 FDR criteria applied. Panel (C) is rotated 90° to clarify its relationship to panel (D). The results shown are from a four hour treatment with PD0325901 (B) or a 20 hour treatment with imatinib (C). (D) The negative logarithm of P values from Fisher's exact tests that compare response transcripts for PD0325901 and imatinib. Values displayed are the lowest P value observed in tests performed between the (maximally) 35 lists of probesets that underlie each dose in panels (B) and (C). The horizontal and vertical axes contain the SDRS Summary Doses for PD0325901 (four hours) and imatinib (20 hours), respectively. The two boxed regions indicate shared transcript responses discussed in the text.
We devised a novel and effective visualization for this data: a heat map where columns correspond to the Summary Doses, each row is a histogram of results at a given FDR, and intensity reflects the number of response transcripts passing the FDR criterion. Thus, the data presented in Figure 2A is the tenth row of the heat map in Figure 2B. As with a histogram, ‘peaks’ were evident at distinct regions of the dose range (Figure 2B, 2C and Text S1). These peaks reflect the likelihood that a coordinated transcriptional response is occurring around a given dose. For example, the MEK-specific inhibitor PD0325901 induced two separate peaks of transcriptional responses at four hours (Figure 2B). The first peak centered at 0.5 nM and contained 80 response transcripts at 35% FDR. A second peak centered at 6 µM contained seven response transcripts. Examination of loci with multiple response transcripts suggested that the width of these peaks reflects the precision with which an EC50 for a single target can be determined (Table S3). Of the three ABL inhibitors, dasatinib induced the most potent and populous transcriptional response at both time points, consistent with its multi-kinase activity ,.
Comparison of treatments using their transcriptional dose responses
Drug treatments are typically compared by evaluating the overlap between the two lists of regulated transcripts. The SDRS output allows a similar but more granular evaluation, since one can see whether transcripts are regulated with the same potency in each treatment. At each Summary Dose there is a list of probesets that may be sorted based on F-score and truncated based on FDR criteria. To compare the data from two treatments, we used a Fisher's exact test on each possible pair of Summary Doses, incrementing the response transcript lists from 1% to 35% FDR (see Methods). The resulting grid of P values is effectively visualized as a heat map. Comparison of a treatment to itself produced low P values at points on the diagonal (Text S1). If two distinct treatments generate the same response transcripts but with different EC50s, low P values occur at points off the diagonal. For example, 4 hour treatment with PD0325901 gave a peak of transcriptional responses centered around 6 µM (Figure 2B). Imatinib affected an overlapping set of response transcripts with EC50s around 500 nM. Conversely, a set of transcripts with EC50s around 1 nM for PD0325901 had EC50s over 1 µM for imatinib (Figure 2C,D and Text S1).
Connecting transcriptional dose response to effects on pathways and processes
The heat maps in Figure 2B,C and Text S1 provide an overview of transcriptional responses to a compound. To understand the underlying mechanism, the responding genes need to be mapped to cellular pathways. This typically involves evaluating the overlap between the regulated genes and lists of genes representing biological processes . Results can be prioritized by P-value, usefulness of the process grouping and relevance to possible targets. The SDRS output allows a similar but more granular evaluation of pathway impact, since one can see the dose at which the cellular process is implicated. For each treatment, the FDR-corrected response transcript list at each Summary Dose was tested for overlap with lists of genes connected to processes in the KEGG  and Gene Ontology (GO, ) databases (see Methods and Table S4). For processes meeting a P<0.001 criterion, impact on a discrete pathway at a distinct region of the dose range was used to prioritize the following examples.
For PD0325901, plotting the results of this analysis (Table S4, Figure 3A) indicated that the two peaks of transcriptional responses observed in Figure 2B represented distinct biological effects. PD0325901 specifically inhibits cellular MEK activity with an IC50 of around 1 nM in both Ras/Raf-mutant and wild-type cell lines . The response transcripts with EC50s around this dose were enriched for components of the MAP kinase pathway, which contains MEK (Figure 3B, upper panel, Table S5) and the Jak-STAT pathway (Figure 3B, middle panel), which is reported to be indirectly affected by MEK function –. These response transcripts included MAP kinase phosphatases (DUSP1, DUSP4, DUSP5, and DUSP6)  and STAT regulators (IFNGR1, OSMR) and targets (BCL2L1, CCND1, MYC, LIF, IL15, SOCS5, SOCS6) – (Figure 3B,C). In contrast, the response transcripts with EC50s around 6 µM comprised 5 genes (CYP1A1, CYP1B1, ALDH1A3, ALDH3A2, and DHCR24; Figure 3B lower panel, Figure 3C), all mapped to the P450-mediated xenobiotic metabolism and tryptophan metabolism pathways. This transcriptional response was also provoked by treatment with imatinib at around 500 nM (Figure 2D, and Table S4), and could reflect a xenobiotic response to these compounds, or another shared target activity.
(A) Plot of significance values obtained from Fisher's exact tests performed between the lists of gene loci corresponding to response transcripts (1 to 35% FDR), and lists of gene loci representing the six biological pathways indicated. PD0325901 causes ~50% inhibition of cellular MEK activity in the dose region indicated . (B) Signal intensity for 48 response transcripts (in rows) identified by SDRS, across twelve doses of PD0325901 (columns) at 4 hours. Top panel contains transcripts related to MAPK signaling (KEGG:hsa04010). Middle panel contains transcripts related to Jak-STAT signaling (KEGG:hsa04630). Bottom panel contains transcripts mapped to the pathways for metabolism of xenobiotics by cytochrome P450 (KEGG:hsa00980) and tryptophan metabolism (KEGG:hsa00380). For visualization, RMA signal intensity data was reverse-logged and scaled from zero (blue) to one (red) for each probeset. (C) Experimental data for probesets corresponding to DUSP4 (204014_at, EC50 0.9 nM, left panel), annotated to the pathway for MAPK signaling (KEGG:hsa04010), and CYP1A1 (205749_at, EC50 6.0 µM, right panel), annotated to the pathway for metabolism of xenobiotics by cytochrome P450 (KEGG:hsa00980), with dose response curves obtained from the optimal model parameters established by SDRS.
Dasatinib had a complex transcriptional response across the dose range (Figure 4A), reflecting its potent impact on numerous kinases . Combining dose information with pathway analysis allows scientists to make realistic connections between these target kinases and responses. For example, at Summary Doses from 34–128 nM there is an enrichment of transcripts for genes in the TGF-β signaling pathway, including the ID repressor family and SMADs (Figure 4B,C). However since dasatinib's cellular activity on TGF-β family kinases is in the micromolar range (, shown in Figure 4A) they are not the relevant target. Instead, hypotheses should include the kinases that dasatinib does inhibit in this dose region: some are known to impact the TGF-β pathway, for example Src . Another example is the impact on MAP kinase signaling at higher concentrations of dasatinib (Figure 4B,D). Dasatinib's impact was distinct from that of PD0325901 by both automated comparison (Text S1), and by direct analysis of the response transcript lists (Table S5). PD0325901 predominantly affected loci assigned to the ‘classical’ MAP kinase pathway containing its target, MEK. Dasatinib regulated transcripts for a different set of loci in the ‘classical’ pathway, where it targets EGFR and Raf kinases. Dasatinib also regulated transcripts for loci in the KEGG p38/Jnk pathway, where it targets p38α, MLTK (ZAK), and TGFβR2. The potency of the transcript responses was consistent with known potencies for these targets (,, see Figure 4A and Table S5).
(A) Heatmap showing the number of probesets whose F-statistic for goodness of fit to a sigmoidal curve passes the given FDR criterion at each SDRS Summary Dose. Known kinase targets of dasatinib are shown below, with the dose interval for their binding EC50 in cells . (B) Significance values obtained from Fisher's exact tests performed between the lists of gene loci corresponding to response transcripts (1 to 35% FDR), and lists of gene loci representing the three biological pathways indicated. (C) Experimental data for probesets corresponding to ID3 (207826_s_at; EC50 17.5 nM) and SMAD2 (203077_s_at; EC50 10.7 nM) annotated to the TGF-beta signaling pathway (KEGG:hsa04350), with the dose response curves obtained from the optimal model parameters established by SDRS. (D) Experimental data for probesets corresponding to HSP1A1 (200799_at; EC50 3.9 µM) and MAP2K6 (205698_s_at; EC50 3.6 µM) annotated to the MAPK signaling pathway (KEGG:hsa04010), with the dose response curves obtained from the optimal model parameters established by SDRS.
For imatinib, dose-dependent effects have been invoked to explain discrepancies between pre-clinical studies . In examining imatinib's transcriptional responses at 20 hours, we used the efficacious clinical plasma concentration of 3 µM  as a reference point. Of 245 response transcripts for imatinib (10% FDR, >1.5-fold change, Table S2), only 44 have EC50s<3 µM (Figure 5A, C). The remaining 201 response transcripts have EC50s>3 µM (Figure 5B). The 201 response transcripts include 35 for endoplasmic reticulum-localized proteins such as XBP1 (Figure 5C), supporting observations that in vitro treatments with imatinib at 5 µM affect the function of this compartment . Pathway analysis identified enrichment of transcripts for the seven KEGG pathways (P<0.0001) shown in Figure 5D. Transcripts with EC50s>3 µM had enrichment for several pathways affecting lipid metabolism, indicating distinct biological impacts as the dose increases beyond the required clinical range.
(A, B) Signal intensity for 245 SDRS response transcripts (FDR10%, >1.5-fold change; in rows) across twelve doses of imatinib (columns) following 20 hours of treatment. (A) contains the subset of 44 response transcripts with EC50<3 µM. (B) contains the subset of 201 response transcripts with EC50>3 µM. For visualization, RMA signal intensity data was reverse-logged and scaled from zero (blue) to one (red) for each probeset. (C) Experimental data for probesets corresponding to Factor V (204713_s_at; EC50 824 nM) and XBP1 (200670_at, EC50 4.2 µM), with the dose response curves obtained from the optimal model parameters established by SDRS. (D) Significance values obtained from Fisher's exact tests performed between the lists of gene loci corresponding to response transcripts (1 to 35% FDR), and lists of gene loci representing the seven KEGG biological pathways indicated.
Submicromolar doses of imatinib, dasatinib and nilotinib did not produce shared effects on transcription (Text S1), indicating that their shared potent inhibition of cellular targets such as ABL and PDGFR , does not provoke transcriptional responses in the A549 cell line. In the micromolar dose range, comparison of transcriptional responses indicated shared effects at 20 hours (Text S1). Pathway analysis indicated both shared and compound-specific effects in this dose range (Text S1).
Connecting transcriptional dose responses to other dose response assays
Pathway analysis found that a significant number of genes involved in the cell cycle (KEGG pathway hsa04110) were represented in the transcriptional responses to dasatinib and nilotinib but not imatinib at 20 hours (Figure 6A and Table S4). The broad peak for dasatinib reflects impact at two distinct dose regions, based on the distribution of EC50s for individual response transcripts (Table S6). The first group of 44 response transcripts has EC50s around 10 nM and includes the DNA helicase complex, cyclins and CDKs (Figure 6B,C). The second group of 35 response transcripts has EC50s in the micromolar range and includes known transcriptional targets of p53: GADD45 (Figure 6C), CDKN1A and Stratifin , PCNA and MDM2. Pathway analysis also confirmed dasatinib's impact on DNA replication (KEGG:hsa03030) and p53 signaling (KEGG:hsa04115); Table S4). By contrast, nilotinib impacts the cell cycle pathway in one dose region: 62/63 of the response transcripts have EC50s in the micromolar range (Table S6). While nilotinib also regulates transcripts for the DNA helicase complex, cyclins and CDKs, it does not have a significant impact on the p53 signaling pathway (Figure 6B,C).
(A) Significance values obtained from Fisher's exact tests performed between the lists of gene loci corresponding to response transcripts at each SDRS Summary Dose, and lists of gene loci implicated in cell cycle regulation (KEGG:hsa04110). (B) Probeset signal intensity data from a 20 hour treatment with dasatinib (upper panel) or nilotinib (lower panel) at the doses indicated. Rows contain data for 35 response transcripts that correspond to genes implicated in cell cycle regulation (KEGG:hsa04110). For visualization, RMA signal intensity data was reverse-logged and scaled from zero (blue) to one (red) for each probeset. (C) Signal intensity data for CCNB2 (202705_at; EC50 14.8 nM for dasatinib and 9.3 µM for nilotinib) and GADD45B (207574_s_at; EC50 1.7 µM for dasatinib, no curve fit for nilotinib), upon treatment with dasatinib or nilotinib for 20 hours at the doses indicated. The dose-response curves obtained from the optimal model parameters established by SDRS are shown. (D) Luminescence assay for ATP content performed on cell cultures treated with dasatinib, nilotinib or imatinib for 96 hours at the doses indicated. (E) Percentage of the treated cell population in S phase (3n DNA content) in cell cultures treated with dasatinib or nilotinib for 23 hours at the doses indicated. The full data set is in Table S7.
To compare these findings from transcriptional profiling with typical cell-based assays for proliferation, we treated our A549 cell line with the 12-point dose range of dasatinib, imatinib, and nilotinib and assayed the ATP content of samples (a surrogate for cell number) at 96 hours, and the DNA content of dasatinib and nilotinib samples (a surrogate for cell-cycle stage) at 23 hours. The results reflected our findings from pathway analysis: dasatinib and nilotinib inhibited proliferation and decreased the S phase population, whereas imatinib did not (Figure 6D,E). Importantly, the potency in these conventional measures of cell cycle impact agreed with the EC50s of transcripts for the DNA helicase complex, cyclins and CDKs (EC50s around 10 nM for dasatinib and 6 to 9 µM for nilotinib).
This paper is the first description of a systematic application of genome-wide transcriptional profiling as a traditional dose-response assay. Since most compounds act on multiple targets with different potencies, target-specific effects of a compound may not be distinguished by the limited dose selection of a typical transcriptional profiling experiment. We show that with a dose-response study design that uses just 12 arrays per compound, one can use the existing technology in a more informative way, and establish connection to other cellular dose-response assays.
We present new algorithms and visualization methods that allow one to identify, compare, and characterize transcriptional responses. Sigmoidal dose responses are usually identified by iterative nonlinear regression methods . SDRS applies nonlinear regression in a grid search, and performs equally well. It is robust against natural variability, and will be amenable to identifying dose responses in other sources of quantitative data. The most important benefit of SDRS over iterative regression or clustering methods is obtaining the fitting statistic (F) across the dose range for each transcript. This provides a moving window to evaluate the transcriptome's coordinated responses across the dose range. The full set of F-statistics permitted the further methods we present, which easily characterize and compare the overall transcriptional dose response with statistical rigor.
Our results demonstrate that transcription profiling has many of the properties of traditional dose-responsive bioassays that have been used for decades . The ability to combine dose information from diverse bioassays with dose-dependent pathway analysis proves valuable in connecting transcriptional responses with targets. For example, the MEK-specific inhibitor PD0325901 produced a significant transcriptional response at the known cellular potency for MEK inhibition , and had no further effect on transcription until micromolar doses. In contrast, the transcriptional response to dasatinib had multiple EC50s, consistent with its known activities on multiple targets . Nonetheless pathway analysis allowed us to map cellular processes to distinct regions of the dose range, and connect them to likely kinase targets identified in other dose-response cellular assays . Connection to discrete kinase inhibition events can be refined by numerous methods: kinases not expressed in the experimental cell line can be excluded from lists of hits from biochemical assays, and comparisons can be made with transcript response profiles for compounds that have overlapping target spectra, or with profiles generated following siRNA ablation of kinase targets.
The power of a dose response study design stems from the ability to rigorously compare pharmacological parameters across assays . For example, we show that PD0325901 affects STAT-regulated transcripts with an EC50 of 1 nM, the same potency as its cellular activity on MEK , and this connectivity provides compelling support for earlier reports that the ERKs are STAT kinases . We also show connectivity between the EC50s for transcriptional effects on cell cycle genes by dasatinib and nilotinib, and their EC50s in conventional cell cycle and proliferation assays. Such connectivity should be applicable across diverse cell lines: while it is always possible for drug potency to be modulated by extraneous factors, studies of these kinase inhibitors across multiple cell lines (e.g. ,) support the biochemical prediction that a single site-binder has a defined potency for its target.
Transcriptional dose responses can separate the biological effects of a multi-target compound. Whereas a microarray experiment at a single micromolar dose should identify dasatinib's impact on both DNA replication and p53 signaling, only a dose-response design revealed that the impact on the p53 pathway occurs at a micromolar dose, and thus is irrelevant to dasatinib's nanomolar anti-proliferative potency. Transcriptional dose responses also allow a more meaningful comparison to clinical parameters such as plasma concentrations observed in treated patients (200 nM for dasatinib ; up to 3 µM for imatinib ; 3.6 µM for nilotinib ). With regard to imatinib, our observation of numerous additional transcriptional responses as dosing increases through the micromolar range supports the assertion  that dosing level is critical in evaluating the relevance of in vitro assays or pre-clinical models to imatinib's clinical effects.
There are some limitations to the current study. First, it is impossible to measure all possible dose-responsive treatment effects in a single cell line, as not all targets are functional. In the non-ABL dependent cell line A549, we cannot evaluate and compare the on-target activity of the three clinical ABL-inhibitors. Second, the pathway analysis is limited by the quality of annotation . Third, not all dose responses fit a single sigmoidal model . The clustering analysis we apply for quality control occasionally reveals treatments in which groups of transcripts show sigmoidal induction at low doses but have sigmoidal down-regulation at high doses or vice versa (Text S1), presumably due to action on a second, counteractive target or process. Such behavior could be routinely identified and quantified by substituting the data model in the SDRS algorithm, permitting the subsequent analytical approaches described in this work.
In summary, we have developed new methods that enable interpretation of transcriptome behavior, including dissection of dose-dependent activities, fine differentiation between compounds, and connection with other biochemical and cellular assays. This analytical method has application at all the points of drug development where transcription profiling is currently used. In early discovery, we have compared transcriptional effects of target knockdown to the dose responses for lead compounds. As lead compounds are developed, we routinely compare the dose-dependent effects of diverse chemotypes to identify the on-target biology. Development candidates can be more clearly differentiated from a first-in-class compound, and the relative potency of an off-target activity is valuable information when assessing its importance. On transition to the clinic, transcriptional biomarkers have added validity when they are connected to target biology by a clear dose response. Investigations of approved drugs have successfully used transcriptional profiling to clarify biology (e.g. ); such studies can only be facilitated by a more precise linkage between dose and effects. Ultimately, dose response strategies could be combined with a compendium database of response profiles ,, enabling rapid cellular categorization of new compounds.
Imatinib Cat.# PKI-IMTB-010 was purchased from Biaffin GmbH & Co KG (Cat.# PKI-IMTB-010). Nilotinib, PD-0325901 and dasatinib were provided in pure form (> = 98%) by Bristol-Myers Squibb Chemistry Division. Compounds stocks in DMSO (10 mM) were stored at −20°C and diluted in culture media before addition to cells.
Cell viability assay
Assays were performed in triplicate. A549 cells were seeded in the interior 48 wells of 96-well plates at a density of 1×104/cm2 in 180 µl of media, 4 hours prior to treatment with DMSO vehicle or a 11-point dose range of compound (30 µM with 3-fold dilutions down to 0.51 nM; final concentration of DMSO vehicle was 0.5% for all treatments) for 96 hours. ATP content was assayed using the CellTiter-Glo Assay (Promega, Madison, WI) with a Victor plate luminometer (Wallac, Turku, Finland).
FACS analysis of DNA content
A549 cells were seeded in 6-well cell plates at a density of 1×105/ml in 2 ml of media, 16 hours prior to treatment with DMSO vehicle or a 12-point dose range of compound (30 µM with 3-fold dilutions down to 0.17 nM; final concentration of DMSO vehicle was 0.5% for all treatments) for 23 hours. Both attached and detached cells were recovered, fixed with 0.25% ultrapure formaldehyde (Polysciences #04018) in dPBS (Ca2+ Mg2+-free; Invitrogen #14190) followed by 80% methanol. Cells were stained with dPBS/1%BSA containing propidium iodide (5 µg/ml; Sigma #P4864) and RNAse (1 µg/ml) for 30 minutes at RT in the dark. The samples were run on the FACSCanto with Diva 6.1.1software (Becton Dickinson), and data was analyzed using FlowJo 8.5.3. The experiment was performed twice (Table S7).
Cell treatment and Affymetrix Gene Chip analysis
All handling was performed in 96-well format. Positions of the 12 levels of each treatment were randomized using an experimental design that prevented row or column effects being confounded with dose effect. A549 cells were cultured at 37°C in RPMI1640 media containing 10% heat-inactivated Fetal Bovine Serum (Mediatech, Manassas, VA). Cells were seeded at 1.7×105/cm2 16 hours prior to treatment with vehicle or a 12-point dose range of compound (30 µM with 3-fold dilutions down to 0.17 nM; final concentration of DMSO vehicle was 0.5% for all treatments). Cells were lysed with 1× Nucleic Acid Purification Lysis Solution (Applied Biosystems, Foster City, CA) at 4 hours or 20 hours. Total RNA was extracted using the Prism 6100 (Applied Biosystems, Foster City, CA), purified by RNAClean Kit (Agencourt Bioscience Corporation; Beverly, MA), and evaluated on a 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA). cRNA preparation and hybridization on HT-U133A 96-array plates followed manufacturer's protocols (Affymetrix, Santa Clara, CA). The CEL files were analyzed with the robust multi-array analysis (RMA) algorithm , obtained from www.bioconductor.org. Following quality control and removal of non-expressed probesets (Text S1), RMA values were reverse-logged (base 2) for use in sigmoidal dose response curve fitting.
Sigmoidal dose response search (SDRS) algorithm
See also Supplementary Methods (Text S1). A standard four parameter dose response model was used to model gene expression changes in response to varying compound concentration: Y = A+(B−A)/(1+(X/C)D), where Y is the signal intensity value, X is compound concentration, C is the EC50, D is the slope factor, and A and B correspond to the signal intensity at low and high plateau of the curve, respectively. The approach can be viewed as a grid search, where a series of 542 values for C, distributed across the experimental dose range, are tested for every probeset on the array. For each probeset, ranges for A and B are based on signal levels in the treatment dataset. At each value of C tested, the algorithm evaluates 10,240 models against the experimental data. Goodness of fit is measured by an F-statistic: F = MSR/MSE where MSR is the mean square of the variance explained by the model and MSE is the mean square of error). For every probeset, at every C tested, the highest F-statistic and the corresponding A, B, D parameters are recorded. Given the normal distribution of residuals, the F-statistic follows an F-distribution, F(p-1, n-p), where n is the number of experimental dose points and p is the number of parameters in the model (i.e. 4: A, B, C, D). Note that the number of dose points is the most important influence on the degrees of freedom. A probeset was designated as fitted to a sigmoidal curve and corresponded to a ‘response transcript’ if its global maximal F-statistic (i.e. best fit) was larger than the critical F (95% significance level, i.e. the F-distribution table was consulted for 95 percentile with numerator degree of freedom of p-1 and denominator degree of freedom of n-p). For each response transcript, the values of A, B, C and D that gave rise to the maximal F-statistic define the optimal model and the predicted EC50. Thus the estimated EC50 presented in the SDRS report is selected from 542 possible values for C.
Multiple test correction
After SDRS, each probeset is associated with an F-statistic at each of the 542 test values of C. For use in further analytical methods including data visualization, the results at a subset of 79 log-evenly distributed values for C were selected as ‘Summary Doses’ (i.e. data reduction from 542 lists to 79 lists). Each F-statistic was converted to the associated P-value. For each Summary Dose list the number of response transcripts (i.e. probesets) whose P value passed an FDR cutoff was calculated, using 1% increments from FDR = 1% to FDR = 35%, resulting in a 35×79 matrix. This FDR correction used the Simes procedure, which employs a series of linearly increasing critical values  and has been shown to control the FDR at pre-specified levels for independent test statistics .
Treatment comparison and pathway analysis
All comparisons of lists were based on Fisher's exact test (FET) using the right test, which evaluates the significance of the intersection between two lists for positive association i.e. an enrichment of elements of list A in list B or vice versa . Comparisons between two compounds were performed at each possible pair of Summary Doses (one from each compound), using the lists generated by the FDR procedure described above. (Note that there are as many as 35 distinct probeset lists for each compound at each of the 79 Summary Dose values). For each of the Summary Dose pairs, the lowest P value from the (maximally) 35×35 FETs was retained. The resulting 79×79 matrix is visualized as a heat map, where the depth of the color is proportional to the negative logarithm (base 10) of the P value.
Pathway analysis was performed using the lists generated by the FDR procedure described above. Probesets were consolidated to single gene loci to eliminate redundancy. (Note there are as many as 35 distinct gene sublists at each of the 79 Summary Doses). Each such gene list was evaluated by FET against a pathway gene list, and the lowest P value from the (maximally) 35 comparisons at each of the 79 Summary Dose values was retained. For comparisons that met a P<0.001 criterion, the resulting 79-point dataset for each pathway of interest was plotted to examine significant enrichment for pathway genes as a function of the dose range.
The microarray dataset is available at ArrayExpress, E-TABM-585.
Supplementary data and methods
(1.20 MB PDF)
(5.98 MB XLS)
SDRS reports for treatments
(8.27 MB ZIP)
Loci with multiple response transcripts
(0.03 MB XLS)
(1.51 MB ZIP)
KEGG 04010 MAPK loci
(0.05 MB XLS)
KEGG 04110 cell cycle response transcripts
(0.06 MB XLS)
FACS analysis of cell cycle effects
(0.15 MB XLS)
SDRS reports for control datasets
(8.64 MB ZIP)
SDRS residuals for imatinib 4 hr
(5.06 MB ZIP)
We thank Carmen Raventos-Suarez for assistance in data analysis, Charles A. Tilford for supporting pathway analysis tools, and Tai W. Wong, Francis Lee, and James Burke for critical review of the manuscript.
Performed the experiments: HdS JC AH WH BP. Analyzed the data: RRJ YJ KHO PRM. Contributed reagents/materials/analysis tools: RRJ REB PSK IMN MIC NOS. Wrote the paper: RRJ PRM. Developed the analytical methods described in the manuscript: RRJ. Designed the experiments: HdS. Revised the manuscript: YJ REB KHO NOS. Conceived transcriptional dose response concept: MGN.
- 1. Paracelsus (1564) Dritte defensione, von der Beischreibung der neuen Rezepte. Sieben Defensiones, Verantwortung über etliche Verunglimpfungen seiner Mißgönner. Cologne: Arnold Byrckmann.
- 2. Shi L, Reid LH, Jones WD, Shippy R, Warrington JA, et al. (2006) The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat Biotechnol 24: 1151–1161.
- 3. Stegmaier K, Ross KN, Colavito SA, O'Malley S, Stockwell BR, et al. (2004) Gene expression-based high-throughput screening(GE-HTS) and application to leukemia differentiation. Nat Genet 36: 257–263.
- 4. Parker RA, Flint OP, Mulvey R, Elosua C, Wang F, et al. (2005) Endoplasmic reticulum stress links dyslipidemia to inhibition of proteasome activity and glucose transport by HIV protease inhibitors. Mol Pharmacol 67: 1909–1919.
- 5. Boehm C, Newrzella D, Herberger S, Schramm N, Eisenhardt G, et al. (2006) Effects of antidepressant treatment on gene expression profile in mouse brain: cell type-specific transcription profiling using laser microdissection and microarray analysis. J Neurochem 97: Supplement 144–49.
- 6. Fanton CP, Rowe MW, Moler EJ, Ison-Dugenny M, De Long SK, et al. (2006) Development of a screening assay for surrogate markers of CHK1 inhibitor-induced cell cycle release. J Biomol Screen 11: 792–806.
- 7. Kamb A, Wee S, Lengauer C (2007) Why is cancer drug discovery so difficult? Nat Rev Drug Discov 6: 115–120.
- 8. Wang XD, Reeves K, Luo FR, Xu LA, Lee F, et al. (2007) Identification of candidate predictive and surrogate molecular markers for dasatinib in prostate cancer: rationale for patient selection and efficacy monitoring. Genome Biol 8: R255.
- 9. Inglese J, Auld DS, Jadhav A, Johnson RL, Simeonov A, et al. (2006) Quantitative high-throughput screening: A titration-based approach that efficiently identifies biological activities in large chemical libraries. Proceedings of the National Academy of Sciences 103: 11473–11478.
- 10. Tian H, Cao L, Tan Y, Williams S, Chen L, et al. (2004) Multiplex mRNA assay using electrophoretic tags for high-throughput gene expression analysis. Nucleic Acids Res 32: e126.
- 11. Muller PY, Netscher T, Frank J, Stoecklin E, Rimbach G, et al. (2005) Comparative quantification of pharmacodynamic parameters of chiral compounds (RRR- vs. all-rac-alpha tocopherol) by global gene expression profiling. J Plant Physiol 162: 811–817.
- 12. O'Grady M, Raha D, Hanson B, Bunting M, Hanson G (2005) Combining RNA interference and kinase inhibitors against cell signalling components involved in cancer. BMC Cancer 5: 125.
- 13. Balakrishnan N, editor. (1991) Handbook of the Logistic Distribution:. CRC Press. 601 p.
- 14. Varma R, Hector S, Greco WR, Clark K, Hawthorn L, et al. (2007) Platinum drug effects on the expression of genes in the polyamine pathway: time-course and concentration-effect analysis based on Affymetrix gene expression profiling of A2780 ovarian carcinoma cells. Cancer Chemother Pharmacol 59: 711–723.
- 15. Brun YF, Varma R, Hector SM, Pendyala L, Tummala R, et al. (2008) Simultaneous modeling of concentration-effect and time-course patterns in gene expression data from microarrays. Cancer Genomics Proteomics 5: 43–53.
- 16. Black JW, Leff P (1983) Operational models of pharmacological agonism. Proc R Soc Lond B Biol Sci 220: 141–162.
- 17. Lamb J (2007) The Connectivity Map: a new tool for biomedical research. Nat Rev Cancer 7: 54–60.
- 18. Fabian MA, Biggs WH 3rd, Treiber DK, Atteridge CE, Azimioara MD, et al. (2005) A small molecule-kinase interaction map for clinical kinase inhibitors. Nat Biotechnol 23: 329–336.
- 19. Quintas-Cardama A, Kantarjian H, Cortes J (2007) Flying under the radar: the new wave of BCR-ABL inhibitors. Nat Rev Drug Discov 6: 834–348.
- 20. Lombardo LJ, Lee FY, Chen P, Norris D, Barrish JC, et al. (2004) Discovery of N-(2-chloro-6-methyl-phenyl)-2-(6-(4-(2-hydroxyethyl)-piperazin-1-yl)-2-methylpyrimidin-4-ylamino)thiazole-5-carboxamide(BMS-354825), a dual Src/Abl kinase inhibitor with potent antitumor activity in preclinical assays. J Med Chem 47: 6658–6661.
- 21. Karaman MW, Herrgard S, Treiber DK, Gallant P, Atteridge CE, et al. (2008) A quantitative analysis of kinase inhibitor selectivity. Nat Biotechnol 26: 127–132.
- 22. Wang JY, Wilcoxen KM, Nomoto K, Wu S (2007) Recent advances of MEK inhibitors and their clinical progress. Curr Top Med Chem 7: 1364–1378.
- 23. Rix U, Hantschel O, Durnberger G, Remsing Rix LL, Planyavsky M, et al. (2007) Chemical proteomic profiles of the BCR-ABL inhibitors imatinib, nilotinib and dasatinib reveal novel kinase and non-kinase targets. Blood: 4055–4063.
- 24. Smith CK, Windsor WT (2007) Thermodynamics of Nucleotide and Non-ATP-Competitive Inhibitor Binding to MEK1 by Circular Dichroism and Isothermal Titration Calorimetry. Biochemistry 46: 1358–1367.
- 25. Akhmetshina A, Dees C, Pileckyte M, Maurer B, Axmann R, et al. (2008) Dual inhibition of c-abl and PDGF receptor signaling by dasatinib and nilotinib for the treatment of dermal fibrosis. FASEB J 22: 2214–2222.
- 26. Brown A, Carlson T, Loi C-M, Graziano M (2007) Pharmacodynamic and toxicokinetic evaluation of the novel MEK inhibitor, PD0325901, in the rat following oral and intravenous administration. Cancer Chemotherapy and Pharmacology 59: 671–679.
- 27. Xie S, Wang Y, Liu J, Sun T, Wilson MB, et al. (2001) Involvement of Jak2 tyrosine phosphorylation in Bcr-Abl transformation. Oncogene 20: 6188–6195.
- 28. Lam L, Pickeral O, Peng A, Rosenwald A, Hurt E, et al. (2001) Genomic-scale measurement of mRNA turnover and the mechanisms of action of the anti-cancer drug flavopiridol. Genome Biology 2: research0041.0041–research0041.0011.
- 29. Marquardt DW (1963) An Algorithm for Least-Squares Estimation of Nonlinear Parameters. SIAM Journal on Applied Mathematics 11: 431–441.
- 30. Khatri P, Draghici S (2005) Ontological analysis of gene expression data: current tools, limitations, and open problems. Bioinformatics 21: 3587–3595.
- 31. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, et al. (2008) KEGG for linking genomes to life and the environment. Nucl Acids Res 36: D480–484.
- 32. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25: 25–29.
- 33. Solit DB, Garraway LA, Pratilas CA, Sawai A, Getz G, et al. (2006) BRAF mutation predicts sensitivity to MEK inhibition. Nature 439: 358–362.
- 34. Decker T, Kovarik P (2000) Serine phosphorylation of STATs. Oncogene 19: 2628–2637.
- 35. Li H, Gade P, Xiao W, Kalvakolanu DV (2007) The interferon signaling network and transcription factor C/EBP-beta. Cell Mol Immunol 4: 407–418.
- 36. O'Neill LA (2006) Targeting signal transduction as a strategy to treat inflammatory diseases. Nat Rev Drug Discov 5: 549–563.
- 37. Jeffrey KL, Camps M, Rommel C, Mackay CR (2007) Targeting dual-specificity phosphatases: manipulating MAP kinase signalling and immune responses. Nat Rev Drug Discov 6: 391–403.
- 38. Hebenstreit D, Horejs-Hoeck J, Duschl A (2005) JAK/STAT-dependent gene regulation by cytokines. Drug News Perspect 18: 243–249.
- 39. Murray PJ (2007) The JAK-STAT signaling pathway: input and output integration. J Immunol 178: 2623–2629.
- 40. Xu Y, Ikegami M, Wang Y, Matsuzaki Y, Whitsett JA (2007) Gene expression and biological processes influenced by deletion of Stat3 in pulmonary type II epithelial cells. BMC Genomics 8: 455–460.
- 41. Bantscheff M, Eberhard D, Abraham Y, Bastuck S, Boesche M, et al. (2007) Quantitative chemical proteomics reveals mechanisms of action of clinical ABL kinase inhibitors. Nat Biotechnol 25: 1035–1044.
- 42. Buettner R, Mesa T, Vultur A, Lee F, Jove R (2008) Inhibition of Src Family Kinases with Dasatinib Blocks Migration and Invasion of Human Melanoma Cells. Mol Cancer Res 6: 1766–1774.
- 43. Han MS, Chung KW, Cheon HG, Rhee SD, Yoon C-H, et al. (2009) Imatinib Mesylate Reduces Endoplasmic Reticulum Stress and Induces Remission of Diabetes in db/db Mice. Diabetes 58: 329–336.
- 44. Picard S, Titier K, Etienne G, Teilhet E, Ducint D, et al. (2007) Trough imatinib plasma levels are associated with both cytogenetic and molecular responses to standard-dose imatinib in chronic myeloid leukemia. Blood 109: 3496–3499.
- 45. Kerkela R, Grazette L, Yacobi R, Iliescu C, Patten R, et al. (2006) Cardiotoxicity of the cancer therapeutic agent imatinib mesylate. Nat Med 12: 908–916.
- 46. Taylor WR, Stark GR (2001) Regulation of the G2/M transition by p53. Oncogene 20: 1803–1815.
- 47. Berkson J (1953) A statistically precise and relatively simple method of estimating the bio-assay with quantal response, based upon the logistic function. Journal of the American Statistical Association 48: 565–599.
- 48. Neubauer M, Ross-Macdonald P (2007) The Application of Transcriptional Profiling in Model Connectivity and Lead Assessment in Drug Discovery. In: Macor J, editor. Annual Reports in Medicinal Chemistry:. Academic Press. pp. 417–430.
- 49. Luo FR, Yang Z, Camuso A, Smykla R, McGlinchey K, et al. (2006) Dasatinib (BMS-354825) Pharmacokinetics and Pharmacodynamic Biomarkers in Animal Models Predict Optimal Clinical Exposure. Clin Cancer Res 12: 7180–7186.
- 50. Kantarjian H, Giles F, Wunderle L, Bhalla K, O'Brien S, et al. (2006) Nilotinib in Imatinib-Resistant CML and Philadelphia Chromosome-Positive ALL. N Engl J Med 354: 2542–2551.
- 51. Bindslev N (2004) A homotropic two-state model and auto-antagonism. BMC Pharmacol 4: 11.
- 52. Hughes TR, Marton MJ, Jones AR, Roberts CJ, Stoughton R, et al. (2000) Functional discovery via a compendium of expression profiles. Cell 102: 109–126.
- 53. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, et al. (2003) Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 31: e15.
- 54. Simes RJ (1986) An improved Bonferroni procedure for multiple tests of significance. Biometrika 73: 751–754.
- 55. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: a Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society B 57: 289–300.
- 56. Agresti A (1992) A Survey of Exact Inference for Contingency Tables. Statist Sci 7: 131–153.