Figures
Abstract
In the search for new pharmaceutical leads, especially with DNA-binding molecules or genome editing methods, the issue of side and off-target effects have always been thorny in nature. A particular case is the investigation into the off-target effects of N-methylpyrrole-N-methylimidazole polyamides, a naturally inspired class of DNA binders with strong affinity to the minor-groove and sequence specificity, but at < 20 bases, their relatively short motifs also insinuate the possibility of non-unique genomic binding. Binding at non-intended loci potentially lead to the rise of off-target effects, issues that very few approaches are able to address to-date. We here report an analytical method to infer off-target binding, via expression profiling, based on probing the relative impact to various biochemical pathways; we also proposed an accompanying side effect prediction engine for the systematic screening of candidate polyamides. This method marks the first attempt in PI polyamide research to identify elements in biochemical pathways that are sensitive to the treatment of a candidate polyamide as an approach to infer possible off-target effects. Expression changes were then considered to assess possible outward phenotypic changes, manifested as side effects, should the same PI polyamide candidate be administered clinically. We validated some of these effects with a series of animal experiments, and found agreeable corroboration in certain side effects, such as changes in aspartate transaminase levels in ICR and nude mice post-administration.
Citation: Lin J, Krishnamurthy S, Yoda H, Shinozaki Y, Watanabe T, Koshikawa N, et al. (2019) Estimating genome-wide off-target effects for pyrrole-imidazole polyamide binding by a pathway-based expression profiling approach. PLoS ONE 14(4): e0215247. https://doi.org/10.1371/journal.pone.0215247
Editor: Chakrabhavi Dhananjaya Mohan, University of Mysore, INDIA
Received: December 28, 2018; Accepted: March 28, 2019; Published: April 9, 2019
Copyright: © 2019 Lin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Expression microarray datasets are available at the NCBI Gene Expression Omnibus (NCBI GEO) database (accession numbers GSE86599 for polyamide 1 and GSE124462 for polyamides 2 and 3). Other relevant data are within the manuscript and its Supporting Information files.
Funding: This work was supported by Grant-in-Aids for Scientific Research B and for Young Scientists B from Japan Society for the Promotion of Science (JP17H03602 to H.N. and JP17K15047 to J.L.) as well as Japan Agency for Medical Research and Development (AMED, JP17cm0106510, JP17ck0106263 and JP17ck0106356 to H.N., 15656919 to A.T. and JP18ck0106422 to H.N. and J.L.). Additional support was provided in part by Takeda Science Foundation (A.T.), Princess Takamatsu Cancer Research Fund (H.N.) and Grant-in-Aid for Scientific Research on Innovative Areas via the Platform for Supporting Cohort Study and Biospecimen Analysis (H.N. and J.L.) from the Ministry of Education, Culture, Sports, Science and Technology of Japan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
N-methylpyrrole-N-methylimidazole (PI) polyamides have been frequently discussed as viable alternatives for “undruggable” oncological targets otherwise inaccessible at the protein level, such as KRAS [1–3]. Their ability to interact with the DNA minor-groove binders with unrivaled sequence recognition translates to their distinctive ability to disrupt transcription, and proper functionalization with histone modifiers or alkylating agents transform these molecules to programmable epigenetic switches and anticancer agents in vivo [4–10]. A troubling issue for PI polyamides, however, is their relatively short motifs, leading to multiple genomic binding sites; lengthening a PI polyamide beyond its typical working range of 8–10 bases will reduce the number of genomic binding sites at the penalty of reducing its chemical affinity to the DNA minor groove due to increased structural rigidity [11, 12]; intriguingly, most PI polyamide, despite the presence of multiple binding sites, led to little toxicity [13–15]. It would therefore imply that nonunique binding typically did not lead to ill biological effects, further complicating our understanding on the subject matter. Nevertheless, to improve the clinical prospects of PI polyamides, the evaluation of these so-called “off-target effects” would be unavoidable in a similar manner as genome editing methods such as CRISPR/Cas. Yet, even before we could approach the problem, we were met with a difficult question: for PI polyamides, how would one define the effect of off-target binding for PI polyamides?
In the most general sense, binding to any genomic location other than its intended site is technically “off-target.” Several studies have elected to use next-generation sequencing (NGS) methods such as Chem-seq [16–18] to approach the question of off-target binding. While Chem-seq do provide the means of identifying binding sites in the genomic space, in addition to functionalizing a candidate PI polyamide with an affinity tag for enrichment, the method still requires the coupling of expression profiling to characterize the effect of genomic binding. Moreover, binding at an unintended site in certain situations may be more or less inconsequential; for instance, binding in genes downstream of the intended target may do very little since the process is already disrupted. To understand the effect of off-target binding, therefore, would necessitate one to perform additional analysis and process the expression profiling data beyond the conventional before/after fold-change calculations to elucidate or distinguish these effects.
Common knowledge suggest that chemical stimuli can environmental or oxidative stress homeostasis, leading to the alteration of expression profiles as a response to those perturbations. Consequently, while some genes are certainly to be downregulated upon the administration of a PI polyamide, it is entirely possible that those changes are as part of the homeostatic responses rather than “off-target” binding; how does one differentiate “true” off-target binding then? We can reformulate the question as follows: if a polyamide suppresses its primary target by some extent, is that suppression significant when compared to non-targets? Now, the natural response would be to compare changes to the primary target with some measures of significance testing with the rest of the targets in a background population; this nonetheless gets complicated, as gene expressions are also affected by innate feedback, homeostasis, regulatory mechanisms and complex secondary effects in vivo. By recognizing that most genes participate in various pathways that lead to phenotypic changes, these pathways can more natural units of analysis to evaluate off-target binding. The null hypothesis then is whether the effect onto “on-target” pathways differs from other “off-target” pathways. To evaluate these changes, NGS methods are also no longer required; expression microarrays in this regard are viable alternatives in exploring a potential candidate PI polyamide’s off-target effects, since one can simultaneously monitor changes in the genome at a reasonable cost through with nearly 50,000 data points per run.
With this testable hypothesis in mind, we devised a computational method to evaluate the relative impact to unintended pathways (“background”), compared to pathways with target participation (“foreground”). By evaluating the impact to a pathway as a mean change of all participating genes, we could search the background to identify genes relatively downregulated compared to the intended target to estimate the extent of erroneous binding. We here propose the use of expression microarrays to evaluate the effect of off-target binding and possible side effects via a pathway-based approach. The effectiveness of our analysis is further validated in mouse experiments. This is the first report of a method specifically aimed to address the complicated issue of off-target binding for PI polyamides, and hopefully these algorithms and methodologies may help accelerate the much-needed effort in motif-specific DNA binders in the field of chemical biology and precision medicine.
Materials and methods
Summary of polyamide synthesis
Polyamides 1–4 were synthesized piecewise following a Fmoc-based solid-phase peptide synthesis protocol on a PSSM-8 peptide synthesizer (Shimadzu) with a custom operating system and functionalized by indole-seco-CBI [19]. Purification of the product was performed using high-performance liquid chromatography system on a LC-20 (Shimadzu) using a 10 mm × 150 mm Gemini-NX3u 5-ODS-H reverse-phase column (Phenomenex). Separation was achieved with a linear gradient of acetonitrile in 0.1% acetic acid in Milli-Q water (typically 0–100%) at a flow rate of 10 ml m-1 with detection at 310 nm. Collected fractions were analyzed by a LC-MS2020 LC-MS system (Shimadzu). Conjugation of indole-seco-CBI was performed by a reaction of respective PIP backbone with NH2-indole-seco-CBI in N-methylpyrrolidone in the presence of 1-ethyl-3-(3-dimethylaminopropyl)carbodiimidyl hydrochloride. Reaction mixtures were then purified by liquid chromatography with a linear gradient of in 0.1% acetic acid in water (typically 30–75%), and the final products were subsequently lyophilized. Mass information (ESI-MS): polyamide 1, C100H110ClN34O18S, m/z calcd [M+2H]2+ 1071.95, found 1072.00; [M+3H]3+ 714.94, found 715.25; polyamide 2, C89H92ClN31O16, m/z calcd [M+H]+ 1886.70, found 1886.50; calcd [M+2H]2+ 943.86, found 944.05; calcd [M+3H]3+ 629.57, found 629.45; polyamide 3, C88H91ClN32O16, m/z calcd [M+H]+ 1888.32, found 1888.95; polyamide 4, C89H92ClN31O16, m/z calcd [M+2H]2+ 943.86, found 944.15; calcd [M+3H]3+ 629.57, found 630.10.
Cell cultures
Cells were maintained in a humidified environment at 37°C and 5% CO2 in their respective media and nutrient supplements (S1 Table) in addition to 1% penicillin/streptomycin. Reagents used for cultures were purchased from Gibco.
Microarray data processing
104–105 cells per condition were plated in a 6-well microtiter plate for overnight attachment prior to treatment of candidate polyamides or 0.05% dimethyl sulfoxide (DMSO). After RNA extraction with RNeasy Plus Mini Kit (Qiagen), replicated samples (2 × 2 for polyamides 1 and 2; 3 technical replicates for 3) were labelled with RNA Spike-In Kit and analyzed on SurePrint G3 Human GE 8x60K V2 microarrays per Agilent Technologies’ recommendations. Data acquisition was performed on an Agilent SureScan microarray scanner and analysis were completed with custom R codes utilizing the limma package [20]. Our procedure automatically searched for a suitable offset to correct for background effects and normalize replicate between arrays and determined expressions by empirical Bayes estimations in terms of fold changes from the difference between DMSO and each PI polyamide treatment. Spots with matching RefSeq mRNA & ncRNA identifiers, as well as Ensembl accession codes if applicable, were filtered. For the purpose of comparison, genes with promoter region (defined as 1000 bp upstream of transcription start site) binding were also annotated with corresponding mRNA expressions. Tissue-specific expression changes (as defined by the logarithm of mean tumor expressions compared to normal tissue expressions) from GENT [21] were extracted for the genes considered to be “on-target” to use as weights in the computation.
Pathway analysis
Gene-pathway data were retrieved from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [22]. After selecting a cell line as control (in the case of polyamide 2, SiHA), expression data in the form of a 6-column table detailing the chromosomal position of the gene, a corresponding RefSeq symbol and accession number followed by the expression in terms of logFC) were recomputed to determine the relative change in expression (RFC) across the test and control cell lines. These RFC’s were then mapped to a specific pathway (PI3K-Akt signaling pathway, hsa04151) based on pathway-gene information retrieved from KEGG; each clustered gene on the KEGG pathway map was then assigned a color based on their respective RFC; for those with RFC < -0.5, red; RFC > 0.5, green; orange otherwise. Binding site information was obtained from either Chem-seq experiment (polyamide 1) or based on searching the number of motif occurrences in the hg19 reference genome according to the order of methylpyrrole and methylimidazole pairings. The core pathway (genes immediately upstream of PIK3CA and downstream of Akt) was visualized using PathVisio [23].
Side effect prediction
Drug-adverse reaction data were obtained from DrugBank and SIDER following previously published procedures [24, 25] to generate a final matrix of drug-adverse reaction matrix. Gene expression datasets containing the selected drug candidates found in the matrix were manually retrieved from the GEO database (S2 Table) and those expressions were processed as training sets as predictors for each of the adverse reactions included in the matrix. Side effects occurring in fewer than 10 of the training set drugs were omitted. Expressions for primary target genes (e.g. KRAS for 1 and PIK3CA for 2) were numerically scaled with recorded tumor/normal expression ratios from GENT in their respective cancer origins prior to side effect prediction (see S3 Table for an example for PIK3CA). Random forest via the R ranger implementation with 1000 trees per ensemble was used for data training to generate a side effect prediction model for polyamides 1–3. Codes used for pathway analysis and building side effect prediction models were organized into pipoft, an R package available on GitHub (jlincbio/pipoft).
Animal experiments
Approval from the animal care and ethics committee of Chiba Cancer Center Research Institute were obtained and experiments were performed following strict guidelines to minimize animal suffering. Mice were housed in a designated restricted-access facility staffed by independent Japan Animal Care (JAC) personnel and provided with sterile water and CRF-1 Certified Diet (Oriental Yeast) feed pellets over the course of experimentation. Physical examinations and non-interactive visual checks were routinely performed to ensure subjects’ well-being until sacrifice by cervical dislocation. Collection of blood specimens were performed under anesthesia by minute amounts of ether, to effect. Invasive procedures and sacrifice were performed by a licensed veterinarian (A.T.). Female Balb/c nude mice (Charles River Laboratories) were subcutaneously inoculated with 106 ME-180 cervical cancer cells. Upon reaching a critical tumor size of ~ 50–70 mm3, the mice were intraperitoneally injected with DMSO (n = 4) or 3 mg/kg·week polyamide 2 (n = 5) and sacrificed after the tumors reached 2 × 103 mm3. For polyamide 1, sera were collected from mice treated with DMSO, 0.3 or 3 mg/kg of polyamide 1 at 3 and 36 days. To evaluate the effect of 10 mg/kg polyamide 4, we first performed a pilot experiment with two C57BL/6J mice to assess the need for an additional cohort of 4 × 2 ICR mice (Charles River Laboratories) for monitoring changes in appetite as well as AST, ALT, blood glucose levels. Mice were housed in individual cages so the amount of feed remaining in the food compartment could be measured in 24 h periods during the course of polyamide 4 treatment. The test and control groups were injected with a dose of 10 mg/kg polyamide 4 or DMSO in sterile phosphate-buffered saline solutions once a week. After sacrifice on day 16, liver tissues (C57BL/6J mice, CLEA Japan) and serum samples (ICR mice) were collected for histological examination and biochemistry panels. 150 μL of blood per mouse was incubated with 10 μL coagulant (Wako) at room temperature for 0.5 h before centrifugation at 16 × 103 rpm for 15 m for serum collection. Samples were stored at –80°C until analysis. Quantification of AST, ALT, CRE and blood glucose levels were performed by Starlight Biotech Inc. (Tokyo, Japan) or the Japan Mouse Clinic at RIKEN (Tsukuba, Japan) in the case of the pilot experiment with polyamide 4.
Cytotoxicity assays and correlation to GNA14
Cells (S4 Table) were seeded at a final density of 3 × 103 cells in a 96-well microtiter plate (Nunc) and incubated overnight for attachment. Upon treatments of various concentrations of polyamide 4 or 0.125% DMSO for 48 h, 10 μL WST-8 reagent (Dojindo) was added to each sample and incubated at 37°C for 2 h to assess the rate of cell proliferation. Absorbances were measured at 450 nm on a Corona MTP-310 microplate reader, with IC50 values calculated using GraphPad Prizm. GNA14 expressions were obtained from compiled U133A values in GENT [21].
Statistical analysis
Two sample t-tests, unless otherwise specified, were used to assess the null hypothesis that there were no differences in the sample means between gene expressions or marker levels. For gene expressions, we assessed significance against a predefined α-level of 0.05; for the mouse experiments, as sample sizes were n < 10 per experiment, we adopted the one-tailed α-level of 0.10, such that fewer than one out of the entire cohort may be a false positive at any time. Power analyses were performed with the pwr package in R [26, 27] to ensure that the adaptation of the new alpha level did not impact the assessment of significance (criteria: p < 0.10, statistical power > 90% and within 15% of the same metric under p < 0.05).
Results and discussion
Pathway-based approach to evaluating off-target effects
We utilized PI polyamides 1–3 (Fig 1A) targeting KRAS G12D/V mutations, PIK3CA E545K mutation and a third candidate which sought to suppress MYCN amplification as its mode of action [28, 29] to determine whether the newly formed hypothesis sufficiently addressed the question of determining the extent of off-target effects at the pathway level; polyamide 4, an additional unbiotinylated version of polyamide 1, was used for a follow-up validation experiment in vivo. All three polyamides were functionalized with indole-seco-CBI, an alkylating moiety; while polyamide 1 had an additional biotinyl moiety as it was originally developed for NGS applications, we previously [16] found no discernible differences compared to the non-biotinylated counterpart in microarray studies.
(A) Structure of candidate PI polyamides: 1, designed to target G12D/V KRAS mutants; 2, a polyamide seeking to bind E545K PIK3CA mutation; 3, a polyamide meant to suppress MYCN amplifications; 4, unbiotinylated version of polyamide 1 used for in vivo validation experiments. N-methylpyrrole subunits, blue; N-methylimidazole subunits, red. (B) Scheme of automated systematic Analysis; (C) concept of pathway-based evaluation.
After administrating these polyamides to their respective target cell lines (Fig 1B), we determined changes in expression (logFC) systematically and linked gene-pathway information available in KEGG to calculate the relative impact for all genes participating in the catalogued pathways. We used the relative percentile rank of logFC and measured their deviations from the median to provide a method to evaluate whether these differences were statistically meaningful. The incorporation of binding information (either from NGS results such as Chem-seq or by motif-based computational estimation), allowed us to determine the extent of “off-target” binding (Fig 1C). This procedure allowed us to separate severely affected genes from “undiscernible” genes that might have been affected by PI polyamide treatment but did not overall behave much differently from other members in the same pathway.
This method allowed us to estimate the relative effect of our candidate PI polyamides to their respective targets (Table 1). Polyamide 2, for instance, was estimated to have a larger number of off-target genes in SiHA, a cervical cancer cell line expressing wild-type PIK3CA; while this polyamide was capable of disrupting mutant PIK3CA preferentially, the cytotoxicity was still in the nanomolar range, the high number of off-target genes would provide an explanation for this observation. We found expressions to correlate roughly in mutant PIK3CA cell lines (ME180 and CaSki, Fig 2A), and the three cell lines shared a number of identical genes deemed “off-target” (Fig 2B). To further clarify how “on-target” polyamide 2 was, we expanded the algorithm to consider the relative differences between SiHA and other mutant PIK3CA cell lines. Since PIK3CA had been frequently asserted to be the primary driver gene in cervical cancer, assuming that most other mutations manifested little effect on the cancer phenotypes, we used SiHA as the “baseline” and computed the extent of which the other two cell lines deviated in terms of expressions in the PI3K-Akt signaling pathway. We saw a downward shift of this relative expression difference for PIK3CA as well as several downstream elements (Fig 2C and S1 and S2 Figs), for instance AKT, PTEN and p53 in the case of ME180 cells (by comparing SiHA/ME180 and CaSki/ME180 pairings). Several downstream elements of PIK3CA were invariant in their responses in SiHA and CaSki cells, suggesting that they were little affected by the binding of polyamide 2, allowing us to disregard quantitatively the effects to those elements and their subsequently participating pathways.
(A) Correlations between changes in expression (logFC) after treatment of 10 nM of polyamide 2 against DMSO across ME180, CaSki and SiHA cells; numerical values above each pairing indicate the pairwise Spearman’s rank correlation coefficient. (B) Distribution of the relative percentile rank (RPR) for on-target and off-target genes expressed as ME180/CaSki (red) or SiHA (blue) pairings. PIK3CA (on-target) or ACTB (control) is expressed in black. (C) Differential comparison of changes in expression across different cell lines in the PI3K-Akt pathway (KEGG accession: hsa04151), with ME180 as the baseline; colors indicate the level of RFC (< -0.5, red; > 0.5, green), with divided boxes indicating nonuniform RFC behavior between CaSki/ME180 (left) and SiHA/ME180 (right) pairings. PIP3, phosphatidylinositol-3,4,5-triphosphate. Full pathway illustrations can be found as S1 and S2 Figs.
Inferencing potential side effects of PI polyamides
We now expand the discussion on off-target binding to understand the potential phenotypic changes that may be more clinically relevant and easier to conceptualize: side effects. The ability to eliminate PI polyamide candidates with more adverse effects a priori could beneficially improve the design and testing process and provide a way to monitor animal experiments to track and observe these small but critical shifts in phenotype. While various side-effect prediction methods exist [24, 25, 30, 31], most had been developed with protein-level based inhibitors in mind and focused largely on modelling changes through metabolic networks; since PI polyamides’ primary mode of action was DNA minor-groove binding, we thought that perhaps an approach utilizing gene expression profiling from microarray data would better reflect the direct effect more meaningfully. An expression-based prediction strategy also allowed us to account for the hypothetical possibility that certain gene targets could have a wide range of expression distributions in different tissues by allowing for separate microarray datasets to be evaluated or directly adjusting for expression discrepancies by scaling factors derived from tissue-specific information. We chose to implement this approach by testing different machine-learning methods such as support vector machines and neural networks with a publicly available set of cell-line expression arrays under various drug treatments coupled with available side effect information as training data. Upon testing, we found random forests to be the most robust; the method itself required merely a fraction of computing time compared to neural networks at the current size of drug-adverse reaction arrays and had lower prediction errors compared to support vector machines. Predictions across different cell lines of the same cancer type, i.e. SW480 vs. LS180 of colorectal cancer, or the ME180 vs. CaSki of cervical cancer (S3–S5 Figs) also were more consistently correlated compared to raw expressions. We could also systematically identify key elements leading to certain side effects and evaluate the reliability of our predictions (Fig 3A). The predictor models were sufficiently robust, with a large number of side effects yielding AUCs greater than 0.70 (an example is provided in Fig 3B).
(A) comparison of gene expressions and its relative importance in the prediction model; vertical axis, Z-normalized mean expressions for a particular gene in training data sets; horizontal axis, importance in terms of mean Gini index; top 100 most important genes were highlighted in red. (B) sample receiver operating characteristic curve for the side effect of increased coughing; FPR, false positive rate; TPR, true positive rate; AUC, area under curve (0.750). (C) Hierarchical clustering of common potential side effects of polyamides 1–3 from expression profiling; Cell lines and the corresponding polyamide are indicated in parentheses; side effects with likelihood scores are indicated in increasing shades of red. Arrows and asterisks indicate AST (*) and ALT (**) increases. (D) AST and ALT levels of ME180 xenograft in Balb/c nude mice after 30–35 d of exposure to 3 mg/kg polyamide 2 (cyan) or DMSO (0 mg/kg; orange). Statistical significance is assessed by two-sample t-test; error bars indicate mean ± SD.
Side effect predictions with animal experiments
Throughout our testing of the three polyamides, several side effects were common across different cell lines and polyamides (Fig 3C and S5 Table); a large number of the side effects were qualitative (neuropathy, for instance), and barring actual patient-based testing there were few ways to validate these predictions. As the prediction model suggested that polyamide 2 could potentially alter aspartate and alanine transaminase levels (AST and ALT, respectively) among other qualitative side effects, we tried to confirm these quantifiable changes in concurrent in vivo experiment with ME180 xenografts in Balb/c nude mice and 3 mg/kg polyamide 2, a dose an order of magnitude below the estimated maximum tolerated dose (MTD) of 30 mg/kg based on our previous studies of other polyamides. Inspections of AST and ALT levels in these mice revealed increases supporting the likelihood of our hypothesis (Fig 3D; two-sample t-test p = 0.365 and p = 0.175 for AST and ALT, respectively), albeit with the difference in sample means above the threshold for significance. We also evaluated whether to reject the null hypothesis that there was no difference in the sample variance between the control and test groups with two-tailed F test (p = 0.1056 and p = 0.0869 for AST and ALT, respectively).
To confirm this prediction in other systems, we then performed a set of similar analyses with polyamide 4 (the unbiotinylated version of 1) against KRAS G12D/V mutations. We first computationally explored potential candidates prone to off-target binding, such as the type and number of off-target genes, as well as their possible connections to the primary target. We found several genes to be affected as a result of off-target binding (S6 Table). These results mirrored some of the previously established links [32–34] between KRAS and EEA1, an endosomal marker as well as functional changes in GNA14. We tested a number of cancer cell lines for their sensitivity to polyamide 4 and were able to observe some correlation to GNA14 levels (Fig 4A and S4 Table); from this result, certain links likely existed between off-target binding to GNA14 and the resultant cytotoxic response. With recent reports that GNA14 mutations could be associated to liver abnormalities such as hepatic small vessel neoplasm [35], we then suspected that the accidental disruption of GNA14 could lead to, and potentially explain, the manifestation of liver-related effects. As results with C57BL/6J mice over the course of two weeks from a pilot experiment suggested marginal changes in ALT levels (Fig 4B) under the influence of polyamide 4, we set up a differently structured side effect experiment with a follow-up cohort of ICR mice (n = 8) at 10 mg/kg polyamide 4, a dosage level approximately half the order of magnitude from the MTD. In this experiment, we monitored changes in appetite, a predicted side effect, as well as AST, ALT and blood glucose (GLU) levels. We found AST levels to fluctuate more appreciably compared to ALT or GLU, although changes in ALT level were significant while AST marginal (Fig 4C; GLU, an unpredicted side effect, was determined to be invariant). This trend was markedly consistent in mice treated with different polyamides (Figs 3D, 4B and 4C) as similar to earlier results. As a consequence of the small cohort size (n < 10), we decided to assess significance at the alpha level of p < 0.10, or at most one per ten subjects could be a false positive; generally, elevating the alpha level had no negative effect on the statistical power during testing. The mice were responsive to polyamide treatment and displayed changes in the distribution of ALT levels in the cohort. Additionally, on days immediately following an injection, mice administered with polyamide 4 tended to exhibit reduced food intake (Fig 4D, black arrows) while changes in body weight were insignificant across the two groups.
(A) Correlation of cytotoxicity of polyamide 4 to cell lines of varying KRAS codon 12 mutation status and their respective GNA14 expression. Cytotoxicity is expressed as the IC50 concentration determined by cell proliferation assay; red points indicate cell lines with non G12D/V KRAS mutation (“off-target”); black points indicate cell lines with G12D/V KRAS mutations. R2 for correlation model [IC50] = a0 + a1 [GNA14]: 0.2463; with a1 = 1.8635 (p = 0.0116). For reference, R2 for correlation model [IC50] = a0 + a1 [KRAS]: 0.04128, with a1 = -0.0430 (p = 0.3300). (B) ALT levels of Balb/c nude mice after 3 or 36 d of exposure to polyamide 4; n = 4 per test condition; orange, 0.3 mg/kg polyamide 4; cyan, 3.0 mg/kg. (C) Serum ALT, AST and glucose levels of ICR mice as well as (D) changes in appetite over time in ICR mice treated with polyamide 4 (n = 4 each; 0 mg/kg, orange; 10 mg/kg, cyan); red timepoints in the horizontal axis indicate points of injection. Error bars indicate mean ± SD; statistical significance is assessed by two-sample t-test; ‘*’, one-tailed p < 0.10. Arrows in (D) indicate days in which treatment of between 0 or 10 mg/kg of polyamide 4 showed statistical significance.
These increases echoed earlier findings that certain, but not all, configurations of PI polyamides could induce increases in these aminotransferases [14] when amine substitutions were introduced to the γ-turn subunit. Considering that previous reports found the toxicity to be highly dose-limited and inconclusive on the contribution of polyamide structure on toxicity, an expression profiling-based approach would be more reliable for estimating the side effect of a polyamide due to potential off-target effects. Increases in blood creatinine level, a side effect predicted not to occur with polyamide 2, were found to be negative (~0.1–0.2 mg/dl), once again placing the noted aminotransferase increases in focus. ALT and AST levels recorded in these subjects tended to fall within the normal range of 25–60 U/L and ~ 50–100 U/L, respectively [36], suggesting that liver function was most likely also normal, well in corroboration with a previous examination of liver tissues in animal subjects [28]. Hierarchical clustering of the side effects additionally suggested no close relations to other liver-related side effects, and even at 10 mg/kg polyamide 4, elevations in AST and ALT did not correlate with liver damage upon histological examination (S6 Fig). Further model optimizations factoring in considerations such as concentration and dosage lengths could help further differentiate effects driven by the pharmacodynamics of PI polyamides in tumor and normal tissues. While cell-level observations may not fully translate to higher-order animal results, this procedure has the benefit of being able to screen out quickly the less desirable PI polyamide candidates. In lead development, initial decisions to proceed with certain candidates are frequently generated based on cell-level experiments, and our method can be implemented in-line of other experiments with relative ease.
Conclusion
Most developmental pipelines encounter a similar problem: it is not always certain whether the investment in preclinical studies will attribute to the eventual success for a marketable product. PI polyamides, while highly versatile and by their mechanistic nature tunable, also face this. By evaluating candidate polyamides with expression microarrays and this systematic method can greatly improve the throughput and meaningfully shorten the initial lead selection and optimization period. We understand how changing the structure of PI polyamides may affect its binding affinity, and how these molecules’ relative hydrophobicity can have on its tumor retention [37]; yet we still know very little about how these molecules will influence the outward phenotypes, especially when binding motifs are concerned. The lack of progress in understanding the off-target effect of PI polyamides can limit the growth of PI polyamide research and the translation to bedside applications. With our first report on the subject matter, we hope to facilitate the push of PI polyamides to bedside applications and accelerate the field of precision medicine.
Supporting information
S1 Fig. Full PI3K-Akt pathway for CaSki/ME180 cell lines upon administration of polyamide 2.
Levels of relative fold change across cell lines (RFC) as indicated: yellow, |RFC| < 0.5; red, RFC < -0.5; green, RFC > 0.5.
https://doi.org/10.1371/journal.pone.0215247.s001
(TIF)
S2 Fig. PI3K-Akt pathway for SiHA/ME180 cell lines upon administration of polyamide 2.
Levels of relative fold change across cell lines (RFC) as indicated: yellow, |RFC| < 0.5; red, RFC < -0.5; green, RFC > 0.5.
https://doi.org/10.1371/journal.pone.0215247.s002
(TIF)
S3 Fig. Correlation of expressions and side effect predictions of polyamide 1 in LS180 and SW480 colorectal cancer cells.
Above, green: correlation of gene expressions in LS180 (horizontal axis) and SW480 (vertical axis); below, purple: correlation of predicted side effect scores in LS180 (horizontal) and SW480 (vertical).
https://doi.org/10.1371/journal.pone.0215247.s003
(TIF)
S4 Fig. Correlation of expressions and predicted side effects of polyamide 2 in ME180, CaSki and SiHA cervical cancer cells.
Above diagonal, correlations of expressions (green); below diagonal, pairwise correlation of side effect scores (purple). Cell lines are labelled diagonally across the vertical and horizontal axes.
https://doi.org/10.1371/journal.pone.0215247.s004
(TIF)
S5 Fig. Correlation of expressions and side effect predictions of polyamide 3 in CHP134, KELLY, MC-IXC and SK-N-AS cell lines.
Above diagonal, correlations of expressions (green); below diagonal, pairwise correlation of side effect scores (purple). Cell lines are labelled diagonally across the vertical and horizontal axes.
https://doi.org/10.1371/journal.pone.0215247.s005
(TIF)
S6 Fig. Histological staining of the skin and liver from polyamide-treated C57BL/6J Mice.
Left half, skin; right half, liver tissue staining. DMSO (top half) is labeled 0 mg/kg; bottom, polyamide 4 at the dose of 10 mg/kg.
https://doi.org/10.1371/journal.pone.0215247.s006
(TIF)
S1 Table. List of cells and culture conditions.
ATCC, American Type Culture Collection; ECACC, European Collection of Authenticated Cell Cultures; RIKEN, RIKEN BioResource Research Center Cell Bank; NCI, National Cancer Institute Division of Cancer Treatment and Diagnosis Tumor Repository; MEM, Eagle’s Minimum Essential Medium; DMEM, Dulbecco’s Modified Eagle’s Medium; RPMI-1640, Roswell Park Memorial Institute medium 1640; IMDM, Iscove’s Modified Dulbecco’s Medium; FBS, fetal bovine serum; Gln, glutamine (if supplemented).
https://doi.org/10.1371/journal.pone.0215247.s007
(PDF)
S2 Table. Training expression datasets for the side effect prediction model.
https://doi.org/10.1371/journal.pone.0215247.s008
(PDF)
S3 Table. Relative distribution of PIK3CA in various tumor and normal tissues.
Values computed from average tissue-type U133 Plus 2 PIK3CA expressions in GENT as of August 2018; logFC indicates the log ratio of tumor vs. normal expressions for a particular tissue.
https://doi.org/10.1371/journal.pone.0215247.s009
(PDF)
S4 Table. Summary of cytotoxicity of polyamide 4 and GNA14 expressions in various cell lines.
KRAS G12, mutation status for codon 12 in KRAS; GNA14, expression of GNA14 in U133A datasets compiled by the Gene Expression Across Normal and Tumor Tissue Database. Class, classification of cell lines into “Off-Target” or “G12D/V” in Fig 4A.
https://doi.org/10.1371/journal.pone.0215247.s010
(PDF)
S5 Table. Common potential side effects of polyamides 1–3 from metabolic gene expression profiling.
Predicted scores are on a scale of 0–1; synd., syndrome; inc., increase; dec., decrease; did., disorder. The table is illustrated as a heatmap in Fig 3B.
https://doi.org/10.1371/journal.pone.0215247.s011
(PDF)
S6 Table. Potential off-targets of polyamide 1 in LS180 colorectal cancer cell line.
RPR, mean relative percentile rank of specified gene in all participating pathways; p0, t-test p-value under H0: RPR = 0.5000; p1, t-test p-value under H0: RPROff-Target = RPRTarget. Significance evaluated against a predefined threshold of p < 0.05 (log10 p < -1.3010); N/S, not significant; N/A, not evaluated; [a]number of pathways evaluated for KRAS (“foreground”): 68; number of background pathways: 244.
https://doi.org/10.1371/journal.pone.0215247.s012
(PDF)
Acknowledgments
We would like to thank Ms. Yuki Kaiho, Nina Matsuo and Yuki Nakamura as well as Drs. Takahiro Inoue, Kiriko Hiraoka and Yasutoshi Tatsumi at Chiba Cancer Center Research Institute for their experimental assistance. Dr. Tony Kuo at AIST Artificial Intelligence Research Center (AIST-AIRC) along with Drs. Etyan Ruppin and Itay Shaked at Tel Aviv University also provided insights and support during the initial period. High-performance computing was provided by the Institute of Medical Science at the University of Tokyo and AIST-AIRC. J.L. held a visiting appointment at AIST-AIRC for the duration of this project.
References
- 1. Makley LN, Gestwicki JE. Expanding the number of 'druggable' targets: non-enzymes and protein-protein interactions. Chem Biol Drug Des. 2013;81(1): 22–32. pmid:23253128
- 2. Ledford H. Cancer: The Ras renaissance. Nature. 2015;520(7547): 278–280. pmid:25877186
- 3. Cox AD, Fesik SW, Kimmelman AC, Luo J, Der CJ. Drugging the undruggable RAS: Mission possible? Nat Rev Drug Discov. 2014;13(11): 828–851. pmid:25323927
- 4. Meier JL, Yu AS, Korf I, Segal DJ, Dervan PB. Guiding the design of synthetic DNA-binding molecules with massively parallel sequencing. J Am Chem Soc. 2012;134(42): 17814–17822. pmid:23013524
- 5. Alagarswamy K, Shinohara KI, Takayanagi S, Fukuyo M, Okabe A, Rahmutulla B, et al. Region-specific alteration of histone modification by LSD1 inhibitor conjugated with pyrrole-imidazole polyamide. Oncotarget. 2018;9(50): 29316–29335. pmid:30034620
- 6. Pandian GN, Nakano Y, Sato S, Morinaga H, Bando T, Nagase H, et al. A synthetic small molecule for rapid induction of multiple pluripotency genes in mouse embryonic fibroblasts. Sci Rep. 2012;2: 544. pmid:22848790
- 7. Anandhakumar C, Li Y, Kizaki S, Pandian GN, Hashiya K, Bando T, et al. Next-generation sequencing studies guide the design of pyrrole-imidazole polyamides with improved binding specificity by the addition of beta-alanine. Chembiochem. 2014;15(18): 2647–2651. pmid:25371287
- 8. Pandian GN, Sato S, Anandhakumar C, Taniguchi J, Takashima K, Syed J, et al. Identification of a small molecule that turns ON the pluripotency gene circuitry in human fibroblasts. ACS Chem Biol. 2014;9(12): 2729–2736. pmid:25366962
- 9. Taylor RD, Chandran A, Kashiwazaki G, Hashiya K, Bando T, Nagase H, et al. Selective Targeting of the KRAS Codon 12 Mutation Sequence by Pyrrole-Imidazole Polyamide seco-CBI Conjugates. Chem Eur J. 2015;21(42): 14996–15003. pmid:26306751
- 10. Syed J, Chandran A, Pandian GN, Taniguchi J, Sato S, Hashiya K, et al. A Synthetic Transcriptional Activator of Genes Associated with the Retina in Human Dermal Fibroblasts. Chembiochem. 2015;16(10): 1497–1501. pmid:25900774
- 11. Chenoweth DM, Meier JL, Dervan PB. Pyrrole-imidazole polyamides distinguish between double-helical DNA and RNA. Angew Chem Int Ed Engl. 2013;52(1): 415–418. pmid:22987334
- 12. Swalley SE, Baird EE, Dervan PB. A Pyrrole-Imidazole Polyamide Motif for Recognition of Eleven Base Pair Sequences in the Minor Groove of DNA. Chem Eur J. 1997;3(10): 1600–1607.
- 13. Mishra R, Watanabe T, Kimura MT, Koshikawa N, Ikeda M, Uekusa S, et al. Identification of a novel E-box binding pyrrole-imidazole polyamide inhibiting MYC-driven cell proliferation. Cancer Sci. 2015;106(4): 421–429. pmid:25611295
- 14. Yang F, Nickols NG, Li BC, Szablowski JO, Hamilton SR, Meier JL, et al. Animal toxicity of hairpin pyrrole-imidazole polyamides varies with the turn unit. J Med Chem. 2013;56(18): 7449–7457. pmid:24015881
- 15. Liu K, Fang L, Sun H, Pan Z, Zhang J, Chen J, et al. Targeting Polo-like Kinase 1 by a Novel Pyrrole-Imidazole Polyamide-Hoechst Conjugate Suppresses Tumor Growth In Vivo. Mol Cancer Ther. 2018;17(5): 988–1002. pmid:29483218
- 16. Lin J, Hiraoka K, Watanabe T, Kuo T, Shinozaki Y, Takatori A, et al. Identification of Binding Targets of a Pyrrole-Imidazole Polyamide KR12 in the LS180 Colorectal Cancer Genome. PLoS ONE. 2016;11(10): e0165581. pmid:27798693
- 17. Chandran A, Syed J, Taylor RD, Kashiwazaki G, Sato S, Hashiya K, et al. Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing. Nucleic Acids Res. 2016;44(9): 4014–4024. pmid:27098039
- 18. Erwin GS, Grieshop MP, Bhimsaria D, Eguchi A, Rodriguez-Martinez JA, Ansari AZ. Genome-wide Mapping of Drug-DNA Interactions in Cells with COSMIC (Crosslinking of Small Molecules to Isolate Chromatin). J Vis Exp. 2016(107): e53510. pmid:26863565
- 19. Lajiness JP, Boger DL. Asymmetric synthesis of 1,2,9,9a-tetrahydrocyclopropa[c]benzo[e]indol-4-one (CBI). J Org Chem. 2011;76(2): 583–587. pmid:21192653
- 20. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7): e47. pmid:25605792
- 21. Shin G, Kang TW, Yang S, Baek SJ, Jeong YS, Kim SY. GENT: gene expression database of normal and tumor tissues. Cancer Inform. 2011;10: 149–157. pmid:21695066
- 22. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45(D1): D353–D361. pmid:27899662
- 23. Kutmon M, van Iersel MP, Bohler A, Kelder T, Nunes N, Pico AR, et al. PathVisio 3: an extendable pathway analysis toolbox. PLoS Comput Biol. 2015;11(2): e1004085. pmid:25706687
- 24. Shaked I, Oberhardt MA, Atias N, Sharan R, Ruppin E. Metabolic Network Prediction of Drug Side Effects. Cell Syst. 2016;2(3): 209–213. pmid:27135366
- 25. Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, et al. Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci U S A. 2007;104(6): 1777–1782. pmid:17267599
- 26.
Cohen J. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Routledge; 1988.
- 27. Champely S. pwr: Basic Functions for Power Analysis. R package version 1.2–2 ed2018.
- 28. Hiraoka K, Inoue T, Taylor RD, Watanabe T, Koshikawa N, Yoda H, et al. Inhibition of KRAS codon 12 mutants using a novel DNA-alkylating pyrrole-imidazole polyamide conjugate. Nat Commun. 2015;6: 6706. pmid:25913614
- 29. Yoda H, Inoue T, Shinozaki Y, Lin J, Watanabe T, Koshikawa N, et al. Direct Targeting of MYCN Gene Amplification by Site-Specific DNA Alkylation in Neuroblastoma. Cancer Res. 2019;79(4): 830–840. pmid:30584073
- 30. Cheng F, Li W, Wang X, Zhou Y, Wu Z, Shen J, et al. Adverse drug events: database construction and in silico prediction. J Chem Inf Model. 2013;53(4): 744–752. pmid:23521697
- 31. Stanfield Z, Coskun M, Koyuturk M. Drug Response Prediction as a Link Prediction Problem. Sci Rep. 2017;7: 40321. pmid:28067293
- 32. Gelabert-Baldrich M, Soriano-Castell D, Calvo M, Lu A, Vina-Vilaseca A, Rentero C, et al. Dynamics of KRas on endosomes: involvement of acidic phospholipids in its association. FASEB J. 2014;28(7): 3023–3037. pmid:24719356
- 33. Lubeseder-Martellato C, Alexandrow K, Hidalgo-Sastre A, Heid I, Boos SL, Briel T, et al. Oncogenic KRas-induced Increase in Fluid-phase Endocytosis is Dependent on N-WASP and is Required for the Formation of Pancreatic Preneoplastic Lesions. EBioMedicine. 2017;15: 90–99. pmid:28057438
- 34. Lim YH, Bacchiocchi A, Qiu J, Straub R, Bruckner A, Bercovitch L, et al. GNA14 Somatic Mutation Causes Congenital and Sporadic Vascular Tumors by MAPK Activation. Am J Hum Genet. 2016;99(2): 443–450. pmid:27476652
- 35. Joseph NM, Brunt EM, Marginean C, Nalbantoglu I, Snover DC, Thung SN et al. Frequent GNAQ and GNA14 Mutations in Hepatic Small Vessel Neoplasm. Am J Surg Pathol. 2018;42(9): 1201–1207. pmid:29975248
- 36. Sher Y, Hung M. Blood AST, ALT and UREA/BUN Level Analysis. Bio-protocol. 2013;3(19): e931.
- 37. Inoue T, Shimozato O, Matsuo N, Mori Y, Shinozaki Y, Lin J, et al. Hydrophobic structure of hairpin ten-ring pyrrole-imidazole polyamides enhances tumor tissue accumulation/retention in vivo. Bioorg Med Chem. 2018;26(9): 2337–2344. pmid:29622411