Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Estimating genome-wide off-target effects for pyrrole-imidazole polyamide binding by a pathway-based expression profiling approach

  • Jason Lin ,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft, Writing – review & editing

    linjason@chiba-cc.jp (JL); hnagase@chiba-cc.jp (HN)

    Affiliations Laboratory of Cancer Genetics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan, Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Koto-ku, Tokyo, Japan

  • Sakthisri Krishnamurthy ,

    Contributed equally to this work with: Sakthisri Krishnamurthy, Hiroyuki Yoda

    Roles Data curation, Investigation

    Affiliation Laboratory of Innovative Cancer Therapeutics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Hiroyuki Yoda ,

    Contributed equally to this work with: Sakthisri Krishnamurthy, Hiroyuki Yoda

    Roles Data curation, Investigation

    Affiliation Laboratory of Innovative Cancer Therapeutics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Yoshinao Shinozaki,

    Roles Investigation

    Affiliation Laboratory of Cancer Genetics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Takayoshi Watanabe,

    Roles Investigation

    Affiliation Laboratory of Innovative Cancer Therapeutics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Nobuko Koshikawa,

    Roles Resources, Supervision

    Affiliation Laboratory of Cancer Genetics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Atsushi Takatori,

    Roles Data curation, Funding acquisition, Investigation, Resources

    Affiliation Laboratory of Innovative Cancer Therapeutics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

  • Paul Horton,

    Roles Resources, Supervision

    Affiliations Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, Institute of Medical Informatics, National Cheng Kung University, Tainan, Taiwan

  • Hiroki Nagase

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    linjason@chiba-cc.jp (JL); hnagase@chiba-cc.jp (HN)

    Affiliation Laboratory of Cancer Genetics, Chiba Cancer Center Research Institute, Chuo-ku, Chiba, Japan

Abstract

In the search for new pharmaceutical leads, especially with DNA-binding molecules or genome editing methods, the issue of side and off-target effects have always been thorny in nature. A particular case is the investigation into the off-target effects of N-methylpyrrole-N-methylimidazole polyamides, a naturally inspired class of DNA binders with strong affinity to the minor-groove and sequence specificity, but at < 20 bases, their relatively short motifs also insinuate the possibility of non-unique genomic binding. Binding at non-intended loci potentially lead to the rise of off-target effects, issues that very few approaches are able to address to-date. We here report an analytical method to infer off-target binding, via expression profiling, based on probing the relative impact to various biochemical pathways; we also proposed an accompanying side effect prediction engine for the systematic screening of candidate polyamides. This method marks the first attempt in PI polyamide research to identify elements in biochemical pathways that are sensitive to the treatment of a candidate polyamide as an approach to infer possible off-target effects. Expression changes were then considered to assess possible outward phenotypic changes, manifested as side effects, should the same PI polyamide candidate be administered clinically. We validated some of these effects with a series of animal experiments, and found agreeable corroboration in certain side effects, such as changes in aspartate transaminase levels in ICR and nude mice post-administration.

Introduction

N-methylpyrrole-N-methylimidazole (PI) polyamides have been frequently discussed as viable alternatives for “undruggable” oncological targets otherwise inaccessible at the protein level, such as KRAS [13]. Their ability to interact with the DNA minor-groove binders with unrivaled sequence recognition translates to their distinctive ability to disrupt transcription, and proper functionalization with histone modifiers or alkylating agents transform these molecules to programmable epigenetic switches and anticancer agents in vivo [410]. A troubling issue for PI polyamides, however, is their relatively short motifs, leading to multiple genomic binding sites; lengthening a PI polyamide beyond its typical working range of 8–10 bases will reduce the number of genomic binding sites at the penalty of reducing its chemical affinity to the DNA minor groove due to increased structural rigidity [11, 12]; intriguingly, most PI polyamide, despite the presence of multiple binding sites, led to little toxicity [1315]. It would therefore imply that nonunique binding typically did not lead to ill biological effects, further complicating our understanding on the subject matter. Nevertheless, to improve the clinical prospects of PI polyamides, the evaluation of these so-called “off-target effects” would be unavoidable in a similar manner as genome editing methods such as CRISPR/Cas. Yet, even before we could approach the problem, we were met with a difficult question: for PI polyamides, how would one define the effect of off-target binding for PI polyamides?

In the most general sense, binding to any genomic location other than its intended site is technically “off-target.” Several studies have elected to use next-generation sequencing (NGS) methods such as Chem-seq [1618] to approach the question of off-target binding. While Chem-seq do provide the means of identifying binding sites in the genomic space, in addition to functionalizing a candidate PI polyamide with an affinity tag for enrichment, the method still requires the coupling of expression profiling to characterize the effect of genomic binding. Moreover, binding at an unintended site in certain situations may be more or less inconsequential; for instance, binding in genes downstream of the intended target may do very little since the process is already disrupted. To understand the effect of off-target binding, therefore, would necessitate one to perform additional analysis and process the expression profiling data beyond the conventional before/after fold-change calculations to elucidate or distinguish these effects.

Common knowledge suggest that chemical stimuli can environmental or oxidative stress homeostasis, leading to the alteration of expression profiles as a response to those perturbations. Consequently, while some genes are certainly to be downregulated upon the administration of a PI polyamide, it is entirely possible that those changes are as part of the homeostatic responses rather than “off-target” binding; how does one differentiate “true” off-target binding then? We can reformulate the question as follows: if a polyamide suppresses its primary target by some extent, is that suppression significant when compared to non-targets? Now, the natural response would be to compare changes to the primary target with some measures of significance testing with the rest of the targets in a background population; this nonetheless gets complicated, as gene expressions are also affected by innate feedback, homeostasis, regulatory mechanisms and complex secondary effects in vivo. By recognizing that most genes participate in various pathways that lead to phenotypic changes, these pathways can more natural units of analysis to evaluate off-target binding. The null hypothesis then is whether the effect onto “on-target” pathways differs from other “off-target” pathways. To evaluate these changes, NGS methods are also no longer required; expression microarrays in this regard are viable alternatives in exploring a potential candidate PI polyamide’s off-target effects, since one can simultaneously monitor changes in the genome at a reasonable cost through with nearly 50,000 data points per run.

With this testable hypothesis in mind, we devised a computational method to evaluate the relative impact to unintended pathways (“background”), compared to pathways with target participation (“foreground”). By evaluating the impact to a pathway as a mean change of all participating genes, we could search the background to identify genes relatively downregulated compared to the intended target to estimate the extent of erroneous binding. We here propose the use of expression microarrays to evaluate the effect of off-target binding and possible side effects via a pathway-based approach. The effectiveness of our analysis is further validated in mouse experiments. This is the first report of a method specifically aimed to address the complicated issue of off-target binding for PI polyamides, and hopefully these algorithms and methodologies may help accelerate the much-needed effort in motif-specific DNA binders in the field of chemical biology and precision medicine.

Materials and methods

Summary of polyamide synthesis

Polyamides 14 were synthesized piecewise following a Fmoc-based solid-phase peptide synthesis protocol on a PSSM-8 peptide synthesizer (Shimadzu) with a custom operating system and functionalized by indole-seco-CBI [19]. Purification of the product was performed using high-performance liquid chromatography system on a LC-20 (Shimadzu) using a 10 mm × 150 mm Gemini-NX3u 5-ODS-H reverse-phase column (Phenomenex). Separation was achieved with a linear gradient of acetonitrile in 0.1% acetic acid in Milli-Q water (typically 0–100%) at a flow rate of 10 ml m-1 with detection at 310 nm. Collected fractions were analyzed by a LC-MS2020 LC-MS system (Shimadzu). Conjugation of indole-seco-CBI was performed by a reaction of respective PIP backbone with NH2-indole-seco-CBI in N-methylpyrrolidone in the presence of 1-ethyl-3-(3-dimethylaminopropyl)carbodiimidyl hydrochloride. Reaction mixtures were then purified by liquid chromatography with a linear gradient of in 0.1% acetic acid in water (typically 30–75%), and the final products were subsequently lyophilized. Mass information (ESI-MS): polyamide 1, C100H110ClN34O18S, m/z calcd [M+2H]2+ 1071.95, found 1072.00; [M+3H]3+ 714.94, found 715.25; polyamide 2, C89H92ClN31O16, m/z calcd [M+H]+ 1886.70, found 1886.50; calcd [M+2H]2+ 943.86, found 944.05; calcd [M+3H]3+ 629.57, found 629.45; polyamide 3, C88H91ClN32O16, m/z calcd [M+H]+ 1888.32, found 1888.95; polyamide 4, C89H92ClN31O16, m/z calcd [M+2H]2+ 943.86, found 944.15; calcd [M+3H]3+ 629.57, found 630.10.

Cell cultures

Cells were maintained in a humidified environment at 37°C and 5% CO2 in their respective media and nutrient supplements (S1 Table) in addition to 1% penicillin/streptomycin. Reagents used for cultures were purchased from Gibco.

Microarray data processing

104–105 cells per condition were plated in a 6-well microtiter plate for overnight attachment prior to treatment of candidate polyamides or 0.05% dimethyl sulfoxide (DMSO). After RNA extraction with RNeasy Plus Mini Kit (Qiagen), replicated samples (2 × 2 for polyamides 1 and 2; 3 technical replicates for 3) were labelled with RNA Spike-In Kit and analyzed on SurePrint G3 Human GE 8x60K V2 microarrays per Agilent Technologies’ recommendations. Data acquisition was performed on an Agilent SureScan microarray scanner and analysis were completed with custom R codes utilizing the limma package [20]. Our procedure automatically searched for a suitable offset to correct for background effects and normalize replicate between arrays and determined expressions by empirical Bayes estimations in terms of fold changes from the difference between DMSO and each PI polyamide treatment. Spots with matching RefSeq mRNA & ncRNA identifiers, as well as Ensembl accession codes if applicable, were filtered. For the purpose of comparison, genes with promoter region (defined as 1000 bp upstream of transcription start site) binding were also annotated with corresponding mRNA expressions. Tissue-specific expression changes (as defined by the logarithm of mean tumor expressions compared to normal tissue expressions) from GENT [21] were extracted for the genes considered to be “on-target” to use as weights in the computation.

Pathway analysis

Gene-pathway data were retrieved from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [22]. After selecting a cell line as control (in the case of polyamide 2, SiHA), expression data in the form of a 6-column table detailing the chromosomal position of the gene, a corresponding RefSeq symbol and accession number followed by the expression in terms of logFC) were recomputed to determine the relative change in expression (RFC) across the test and control cell lines. These RFC’s were then mapped to a specific pathway (PI3K-Akt signaling pathway, hsa04151) based on pathway-gene information retrieved from KEGG; each clustered gene on the KEGG pathway map was then assigned a color based on their respective RFC; for those with RFC < -0.5, red; RFC > 0.5, green; orange otherwise. Binding site information was obtained from either Chem-seq experiment (polyamide 1) or based on searching the number of motif occurrences in the hg19 reference genome according to the order of methylpyrrole and methylimidazole pairings. The core pathway (genes immediately upstream of PIK3CA and downstream of Akt) was visualized using PathVisio [23].

Side effect prediction

Drug-adverse reaction data were obtained from DrugBank and SIDER following previously published procedures [24, 25] to generate a final matrix of drug-adverse reaction matrix. Gene expression datasets containing the selected drug candidates found in the matrix were manually retrieved from the GEO database (S2 Table) and those expressions were processed as training sets as predictors for each of the adverse reactions included in the matrix. Side effects occurring in fewer than 10 of the training set drugs were omitted. Expressions for primary target genes (e.g. KRAS for 1 and PIK3CA for 2) were numerically scaled with recorded tumor/normal expression ratios from GENT in their respective cancer origins prior to side effect prediction (see S3 Table for an example for PIK3CA). Random forest via the R ranger implementation with 1000 trees per ensemble was used for data training to generate a side effect prediction model for polyamides 13. Codes used for pathway analysis and building side effect prediction models were organized into pipoft, an R package available on GitHub (jlincbio/pipoft).

Animal experiments

Approval from the animal care and ethics committee of Chiba Cancer Center Research Institute were obtained and experiments were performed following strict guidelines to minimize animal suffering. Mice were housed in a designated restricted-access facility staffed by independent Japan Animal Care (JAC) personnel and provided with sterile water and CRF-1 Certified Diet (Oriental Yeast) feed pellets over the course of experimentation. Physical examinations and non-interactive visual checks were routinely performed to ensure subjects’ well-being until sacrifice by cervical dislocation. Collection of blood specimens were performed under anesthesia by minute amounts of ether, to effect. Invasive procedures and sacrifice were performed by a licensed veterinarian (A.T.). Female Balb/c nude mice (Charles River Laboratories) were subcutaneously inoculated with 106 ME-180 cervical cancer cells. Upon reaching a critical tumor size of ~ 50–70 mm3, the mice were intraperitoneally injected with DMSO (n = 4) or 3 mg/kg·week polyamide 2 (n = 5) and sacrificed after the tumors reached 2 × 103 mm3. For polyamide 1, sera were collected from mice treated with DMSO, 0.3 or 3 mg/kg of polyamide 1 at 3 and 36 days. To evaluate the effect of 10 mg/kg polyamide 4, we first performed a pilot experiment with two C57BL/6J mice to assess the need for an additional cohort of 4 × 2 ICR mice (Charles River Laboratories) for monitoring changes in appetite as well as AST, ALT, blood glucose levels. Mice were housed in individual cages so the amount of feed remaining in the food compartment could be measured in 24 h periods during the course of polyamide 4 treatment. The test and control groups were injected with a dose of 10 mg/kg polyamide 4 or DMSO in sterile phosphate-buffered saline solutions once a week. After sacrifice on day 16, liver tissues (C57BL/6J mice, CLEA Japan) and serum samples (ICR mice) were collected for histological examination and biochemistry panels. 150 μL of blood per mouse was incubated with 10 μL coagulant (Wako) at room temperature for 0.5 h before centrifugation at 16 × 103 rpm for 15 m for serum collection. Samples were stored at –80°C until analysis. Quantification of AST, ALT, CRE and blood glucose levels were performed by Starlight Biotech Inc. (Tokyo, Japan) or the Japan Mouse Clinic at RIKEN (Tsukuba, Japan) in the case of the pilot experiment with polyamide 4.

Cytotoxicity assays and correlation to GNA14

Cells (S4 Table) were seeded at a final density of 3 × 103 cells in a 96-well microtiter plate (Nunc) and incubated overnight for attachment. Upon treatments of various concentrations of polyamide 4 or 0.125% DMSO for 48 h, 10 μL WST-8 reagent (Dojindo) was added to each sample and incubated at 37°C for 2 h to assess the rate of cell proliferation. Absorbances were measured at 450 nm on a Corona MTP-310 microplate reader, with IC50 values calculated using GraphPad Prizm. GNA14 expressions were obtained from compiled U133A values in GENT [21].

Statistical analysis

Two sample t-tests, unless otherwise specified, were used to assess the null hypothesis that there were no differences in the sample means between gene expressions or marker levels. For gene expressions, we assessed significance against a predefined α-level of 0.05; for the mouse experiments, as sample sizes were n < 10 per experiment, we adopted the one-tailed α-level of 0.10, such that fewer than one out of the entire cohort may be a false positive at any time. Power analyses were performed with the pwr package in R [26, 27] to ensure that the adaptation of the new alpha level did not impact the assessment of significance (criteria: p < 0.10, statistical power > 90% and within 15% of the same metric under p < 0.05).

Results and discussion

Pathway-based approach to evaluating off-target effects

We utilized PI polyamides 1–3 (Fig 1A) targeting KRAS G12D/V mutations, PIK3CA E545K mutation and a third candidate which sought to suppress MYCN amplification as its mode of action [28, 29] to determine whether the newly formed hypothesis sufficiently addressed the question of determining the extent of off-target effects at the pathway level; polyamide 4, an additional unbiotinylated version of polyamide 1, was used for a follow-up validation experiment in vivo. All three polyamides were functionalized with indole-seco-CBI, an alkylating moiety; while polyamide 1 had an additional biotinyl moiety as it was originally developed for NGS applications, we previously [16] found no discernible differences compared to the non-biotinylated counterpart in microarray studies.

thumbnail
Fig 1. Design of a pathway-based method to evaluate off-target effects for PI polyamides.

(A) Structure of candidate PI polyamides: 1, designed to target G12D/V KRAS mutants; 2, a polyamide seeking to bind E545K PIK3CA mutation; 3, a polyamide meant to suppress MYCN amplifications; 4, unbiotinylated version of polyamide 1 used for in vivo validation experiments. N-methylpyrrole subunits, blue; N-methylimidazole subunits, red. (B) Scheme of automated systematic Analysis; (C) concept of pathway-based evaluation.

https://doi.org/10.1371/journal.pone.0215247.g001

After administrating these polyamides to their respective target cell lines (Fig 1B), we determined changes in expression (logFC) systematically and linked gene-pathway information available in KEGG to calculate the relative impact for all genes participating in the catalogued pathways. We used the relative percentile rank of logFC and measured their deviations from the median to provide a method to evaluate whether these differences were statistically meaningful. The incorporation of binding information (either from NGS results such as Chem-seq or by motif-based computational estimation), allowed us to determine the extent of “off-target” binding (Fig 1C). This procedure allowed us to separate severely affected genes from “undiscernible” genes that might have been affected by PI polyamide treatment but did not overall behave much differently from other members in the same pathway.

This method allowed us to estimate the relative effect of our candidate PI polyamides to their respective targets (Table 1). Polyamide 2, for instance, was estimated to have a larger number of off-target genes in SiHA, a cervical cancer cell line expressing wild-type PIK3CA; while this polyamide was capable of disrupting mutant PIK3CA preferentially, the cytotoxicity was still in the nanomolar range, the high number of off-target genes would provide an explanation for this observation. We found expressions to correlate roughly in mutant PIK3CA cell lines (ME180 and CaSki, Fig 2A), and the three cell lines shared a number of identical genes deemed “off-target” (Fig 2B). To further clarify how “on-target” polyamide 2 was, we expanded the algorithm to consider the relative differences between SiHA and other mutant PIK3CA cell lines. Since PIK3CA had been frequently asserted to be the primary driver gene in cervical cancer, assuming that most other mutations manifested little effect on the cancer phenotypes, we used SiHA as the “baseline” and computed the extent of which the other two cell lines deviated in terms of expressions in the PI3K-Akt signaling pathway. We saw a downward shift of this relative expression difference for PIK3CA as well as several downstream elements (Fig 2C and S1 and S2 Figs), for instance AKT, PTEN and p53 in the case of ME180 cells (by comparing SiHA/ME180 and CaSki/ME180 pairings). Several downstream elements of PIK3CA were invariant in their responses in SiHA and CaSki cells, suggesting that they were little affected by the binding of polyamide 2, allowing us to disregard quantitatively the effects to those elements and their subsequently participating pathways.

thumbnail
Fig 2. Off-target evaluation of polyamide 2.

(A) Correlations between changes in expression (logFC) after treatment of 10 nM of polyamide 2 against DMSO across ME180, CaSki and SiHA cells; numerical values above each pairing indicate the pairwise Spearman’s rank correlation coefficient. (B) Distribution of the relative percentile rank (RPR) for on-target and off-target genes expressed as ME180/CaSki (red) or SiHA (blue) pairings. PIK3CA (on-target) or ACTB (control) is expressed in black. (C) Differential comparison of changes in expression across different cell lines in the PI3K-Akt pathway (KEGG accession: hsa04151), with ME180 as the baseline; colors indicate the level of RFC (< -0.5, red; > 0.5, green), with divided boxes indicating nonuniform RFC behavior between CaSki/ME180 (left) and SiHA/ME180 (right) pairings. PIP3, phosphatidylinositol-3,4,5-triphosphate. Full pathway illustrations can be found as S1 and S2 Figs.

https://doi.org/10.1371/journal.pone.0215247.g002

thumbnail
Table 1. Assessment of off-target binding of polyamides 1–3.

https://doi.org/10.1371/journal.pone.0215247.t001

Inferencing potential side effects of PI polyamides

We now expand the discussion on off-target binding to understand the potential phenotypic changes that may be more clinically relevant and easier to conceptualize: side effects. The ability to eliminate PI polyamide candidates with more adverse effects a priori could beneficially improve the design and testing process and provide a way to monitor animal experiments to track and observe these small but critical shifts in phenotype. While various side-effect prediction methods exist [24, 25, 30, 31], most had been developed with protein-level based inhibitors in mind and focused largely on modelling changes through metabolic networks; since PI polyamides’ primary mode of action was DNA minor-groove binding, we thought that perhaps an approach utilizing gene expression profiling from microarray data would better reflect the direct effect more meaningfully. An expression-based prediction strategy also allowed us to account for the hypothetical possibility that certain gene targets could have a wide range of expression distributions in different tissues by allowing for separate microarray datasets to be evaluated or directly adjusting for expression discrepancies by scaling factors derived from tissue-specific information. We chose to implement this approach by testing different machine-learning methods such as support vector machines and neural networks with a publicly available set of cell-line expression arrays under various drug treatments coupled with available side effect information as training data. Upon testing, we found random forests to be the most robust; the method itself required merely a fraction of computing time compared to neural networks at the current size of drug-adverse reaction arrays and had lower prediction errors compared to support vector machines. Predictions across different cell lines of the same cancer type, i.e. SW480 vs. LS180 of colorectal cancer, or the ME180 vs. CaSki of cervical cancer (S3S5 Figs) also were more consistently correlated compared to raw expressions. We could also systematically identify key elements leading to certain side effects and evaluate the reliability of our predictions (Fig 3A). The predictor models were sufficiently robust, with a large number of side effects yielding AUCs greater than 0.70 (an example is provided in Fig 3B).

thumbnail
Fig 3. Side effect evaluation of PI polyamides.

(A) comparison of gene expressions and its relative importance in the prediction model; vertical axis, Z-normalized mean expressions for a particular gene in training data sets; horizontal axis, importance in terms of mean Gini index; top 100 most important genes were highlighted in red. (B) sample receiver operating characteristic curve for the side effect of increased coughing; FPR, false positive rate; TPR, true positive rate; AUC, area under curve (0.750). (C) Hierarchical clustering of common potential side effects of polyamides 1–3 from expression profiling; Cell lines and the corresponding polyamide are indicated in parentheses; side effects with likelihood scores are indicated in increasing shades of red. Arrows and asterisks indicate AST (*) and ALT (**) increases. (D) AST and ALT levels of ME180 xenograft in Balb/c nude mice after 30–35 d of exposure to 3 mg/kg polyamide 2 (cyan) or DMSO (0 mg/kg; orange). Statistical significance is assessed by two-sample t-test; error bars indicate mean ± SD.

https://doi.org/10.1371/journal.pone.0215247.g003

Side effect predictions with animal experiments

Throughout our testing of the three polyamides, several side effects were common across different cell lines and polyamides (Fig 3C and S5 Table); a large number of the side effects were qualitative (neuropathy, for instance), and barring actual patient-based testing there were few ways to validate these predictions. As the prediction model suggested that polyamide 2 could potentially alter aspartate and alanine transaminase levels (AST and ALT, respectively) among other qualitative side effects, we tried to confirm these quantifiable changes in concurrent in vivo experiment with ME180 xenografts in Balb/c nude mice and 3 mg/kg polyamide 2, a dose an order of magnitude below the estimated maximum tolerated dose (MTD) of 30 mg/kg based on our previous studies of other polyamides. Inspections of AST and ALT levels in these mice revealed increases supporting the likelihood of our hypothesis (Fig 3D; two-sample t-test p = 0.365 and p = 0.175 for AST and ALT, respectively), albeit with the difference in sample means above the threshold for significance. We also evaluated whether to reject the null hypothesis that there was no difference in the sample variance between the control and test groups with two-tailed F test (p = 0.1056 and p = 0.0869 for AST and ALT, respectively).

To confirm this prediction in other systems, we then performed a set of similar analyses with polyamide 4 (the unbiotinylated version of 1) against KRAS G12D/V mutations. We first computationally explored potential candidates prone to off-target binding, such as the type and number of off-target genes, as well as their possible connections to the primary target. We found several genes to be affected as a result of off-target binding (S6 Table). These results mirrored some of the previously established links [3234] between KRAS and EEA1, an endosomal marker as well as functional changes in GNA14. We tested a number of cancer cell lines for their sensitivity to polyamide 4 and were able to observe some correlation to GNA14 levels (Fig 4A and S4 Table); from this result, certain links likely existed between off-target binding to GNA14 and the resultant cytotoxic response. With recent reports that GNA14 mutations could be associated to liver abnormalities such as hepatic small vessel neoplasm [35], we then suspected that the accidental disruption of GNA14 could lead to, and potentially explain, the manifestation of liver-related effects. As results with C57BL/6J mice over the course of two weeks from a pilot experiment suggested marginal changes in ALT levels (Fig 4B) under the influence of polyamide 4, we set up a differently structured side effect experiment with a follow-up cohort of ICR mice (n = 8) at 10 mg/kg polyamide 4, a dosage level approximately half the order of magnitude from the MTD. In this experiment, we monitored changes in appetite, a predicted side effect, as well as AST, ALT and blood glucose (GLU) levels. We found AST levels to fluctuate more appreciably compared to ALT or GLU, although changes in ALT level were significant while AST marginal (Fig 4C; GLU, an unpredicted side effect, was determined to be invariant). This trend was markedly consistent in mice treated with different polyamides (Figs 3D, 4B and 4C) as similar to earlier results. As a consequence of the small cohort size (n < 10), we decided to assess significance at the alpha level of p < 0.10, or at most one per ten subjects could be a false positive; generally, elevating the alpha level had no negative effect on the statistical power during testing. The mice were responsive to polyamide treatment and displayed changes in the distribution of ALT levels in the cohort. Additionally, on days immediately following an injection, mice administered with polyamide 4 tended to exhibit reduced food intake (Fig 4D, black arrows) while changes in body weight were insignificant across the two groups.

thumbnail
Fig 4. Side effects of polyamides targeting KRAS with G12D/V mutations.

(A) Correlation of cytotoxicity of polyamide 4 to cell lines of varying KRAS codon 12 mutation status and their respective GNA14 expression. Cytotoxicity is expressed as the IC50 concentration determined by cell proliferation assay; red points indicate cell lines with non G12D/V KRAS mutation (“off-target”); black points indicate cell lines with G12D/V KRAS mutations. R2 for correlation model [IC50] = a0 + a1 [GNA14]: 0.2463; with a1 = 1.8635 (p = 0.0116). For reference, R2 for correlation model [IC50] = a0 + a1 [KRAS]: 0.04128, with a1 = -0.0430 (p = 0.3300). (B) ALT levels of Balb/c nude mice after 3 or 36 d of exposure to polyamide 4; n = 4 per test condition; orange, 0.3 mg/kg polyamide 4; cyan, 3.0 mg/kg. (C) Serum ALT, AST and glucose levels of ICR mice as well as (D) changes in appetite over time in ICR mice treated with polyamide 4 (n = 4 each; 0 mg/kg, orange; 10 mg/kg, cyan); red timepoints in the horizontal axis indicate points of injection. Error bars indicate mean ± SD; statistical significance is assessed by two-sample t-test; ‘*’, one-tailed p < 0.10. Arrows in (D) indicate days in which treatment of between 0 or 10 mg/kg of polyamide 4 showed statistical significance.

https://doi.org/10.1371/journal.pone.0215247.g004

These increases echoed earlier findings that certain, but not all, configurations of PI polyamides could induce increases in these aminotransferases [14] when amine substitutions were introduced to the γ-turn subunit. Considering that previous reports found the toxicity to be highly dose-limited and inconclusive on the contribution of polyamide structure on toxicity, an expression profiling-based approach would be more reliable for estimating the side effect of a polyamide due to potential off-target effects. Increases in blood creatinine level, a side effect predicted not to occur with polyamide 2, were found to be negative (~0.1–0.2 mg/dl), once again placing the noted aminotransferase increases in focus. ALT and AST levels recorded in these subjects tended to fall within the normal range of 25–60 U/L and ~ 50–100 U/L, respectively [36], suggesting that liver function was most likely also normal, well in corroboration with a previous examination of liver tissues in animal subjects [28]. Hierarchical clustering of the side effects additionally suggested no close relations to other liver-related side effects, and even at 10 mg/kg polyamide 4, elevations in AST and ALT did not correlate with liver damage upon histological examination (S6 Fig). Further model optimizations factoring in considerations such as concentration and dosage lengths could help further differentiate effects driven by the pharmacodynamics of PI polyamides in tumor and normal tissues. While cell-level observations may not fully translate to higher-order animal results, this procedure has the benefit of being able to screen out quickly the less desirable PI polyamide candidates. In lead development, initial decisions to proceed with certain candidates are frequently generated based on cell-level experiments, and our method can be implemented in-line of other experiments with relative ease.

Conclusion

Most developmental pipelines encounter a similar problem: it is not always certain whether the investment in preclinical studies will attribute to the eventual success for a marketable product. PI polyamides, while highly versatile and by their mechanistic nature tunable, also face this. By evaluating candidate polyamides with expression microarrays and this systematic method can greatly improve the throughput and meaningfully shorten the initial lead selection and optimization period. We understand how changing the structure of PI polyamides may affect its binding affinity, and how these molecules’ relative hydrophobicity can have on its tumor retention [37]; yet we still know very little about how these molecules will influence the outward phenotypes, especially when binding motifs are concerned. The lack of progress in understanding the off-target effect of PI polyamides can limit the growth of PI polyamide research and the translation to bedside applications. With our first report on the subject matter, we hope to facilitate the push of PI polyamides to bedside applications and accelerate the field of precision medicine.

Supporting information

S1 Fig. Full PI3K-Akt pathway for CaSki/ME180 cell lines upon administration of polyamide 2.

Levels of relative fold change across cell lines (RFC) as indicated: yellow, |RFC| < 0.5; red, RFC < -0.5; green, RFC > 0.5.

https://doi.org/10.1371/journal.pone.0215247.s001

(TIF)

S2 Fig. PI3K-Akt pathway for SiHA/ME180 cell lines upon administration of polyamide 2.

Levels of relative fold change across cell lines (RFC) as indicated: yellow, |RFC| < 0.5; red, RFC < -0.5; green, RFC > 0.5.

https://doi.org/10.1371/journal.pone.0215247.s002

(TIF)

S3 Fig. Correlation of expressions and side effect predictions of polyamide 1 in LS180 and SW480 colorectal cancer cells.

Above, green: correlation of gene expressions in LS180 (horizontal axis) and SW480 (vertical axis); below, purple: correlation of predicted side effect scores in LS180 (horizontal) and SW480 (vertical).

https://doi.org/10.1371/journal.pone.0215247.s003

(TIF)

S4 Fig. Correlation of expressions and predicted side effects of polyamide 2 in ME180, CaSki and SiHA cervical cancer cells.

Above diagonal, correlations of expressions (green); below diagonal, pairwise correlation of side effect scores (purple). Cell lines are labelled diagonally across the vertical and horizontal axes.

https://doi.org/10.1371/journal.pone.0215247.s004

(TIF)

S5 Fig. Correlation of expressions and side effect predictions of polyamide 3 in CHP134, KELLY, MC-IXC and SK-N-AS cell lines.

Above diagonal, correlations of expressions (green); below diagonal, pairwise correlation of side effect scores (purple). Cell lines are labelled diagonally across the vertical and horizontal axes.

https://doi.org/10.1371/journal.pone.0215247.s005

(TIF)

S6 Fig. Histological staining of the skin and liver from polyamide-treated C57BL/6J Mice.

Left half, skin; right half, liver tissue staining. DMSO (top half) is labeled 0 mg/kg; bottom, polyamide 4 at the dose of 10 mg/kg.

https://doi.org/10.1371/journal.pone.0215247.s006

(TIF)

S1 Table. List of cells and culture conditions.

ATCC, American Type Culture Collection; ECACC, European Collection of Authenticated Cell Cultures; RIKEN, RIKEN BioResource Research Center Cell Bank; NCI, National Cancer Institute Division of Cancer Treatment and Diagnosis Tumor Repository; MEM, Eagle’s Minimum Essential Medium; DMEM, Dulbecco’s Modified Eagle’s Medium; RPMI-1640, Roswell Park Memorial Institute medium 1640; IMDM, Iscove’s Modified Dulbecco’s Medium; FBS, fetal bovine serum; Gln, glutamine (if supplemented).

https://doi.org/10.1371/journal.pone.0215247.s007

(PDF)

S2 Table. Training expression datasets for the side effect prediction model.

https://doi.org/10.1371/journal.pone.0215247.s008

(PDF)

S3 Table. Relative distribution of PIK3CA in various tumor and normal tissues.

Values computed from average tissue-type U133 Plus 2 PIK3CA expressions in GENT as of August 2018; logFC indicates the log ratio of tumor vs. normal expressions for a particular tissue.

https://doi.org/10.1371/journal.pone.0215247.s009

(PDF)

S4 Table. Summary of cytotoxicity of polyamide 4 and GNA14 expressions in various cell lines.

KRAS G12, mutation status for codon 12 in KRAS; GNA14, expression of GNA14 in U133A datasets compiled by the Gene Expression Across Normal and Tumor Tissue Database. Class, classification of cell lines into “Off-Target” or “G12D/V” in Fig 4A.

https://doi.org/10.1371/journal.pone.0215247.s010

(PDF)

S5 Table. Common potential side effects of polyamides 1–3 from metabolic gene expression profiling.

Predicted scores are on a scale of 0–1; synd., syndrome; inc., increase; dec., decrease; did., disorder. The table is illustrated as a heatmap in Fig 3B.

https://doi.org/10.1371/journal.pone.0215247.s011

(PDF)

S6 Table. Potential off-targets of polyamide 1 in LS180 colorectal cancer cell line.

RPR, mean relative percentile rank of specified gene in all participating pathways; p0, t-test p-value under H0: RPR = 0.5000; p1, t-test p-value under H0: RPROff-Target = RPRTarget. Significance evaluated against a predefined threshold of p < 0.05 (log10 p < -1.3010); N/S, not significant; N/A, not evaluated; [a]number of pathways evaluated for KRAS (“foreground”): 68; number of background pathways: 244.

https://doi.org/10.1371/journal.pone.0215247.s012

(PDF)

Acknowledgments

We would like to thank Ms. Yuki Kaiho, Nina Matsuo and Yuki Nakamura as well as Drs. Takahiro Inoue, Kiriko Hiraoka and Yasutoshi Tatsumi at Chiba Cancer Center Research Institute for their experimental assistance. Dr. Tony Kuo at AIST Artificial Intelligence Research Center (AIST-AIRC) along with Drs. Etyan Ruppin and Itay Shaked at Tel Aviv University also provided insights and support during the initial period. High-performance computing was provided by the Institute of Medical Science at the University of Tokyo and AIST-AIRC. J.L. held a visiting appointment at AIST-AIRC for the duration of this project.

References

  1. 1. Makley LN, Gestwicki JE. Expanding the number of 'druggable' targets: non-enzymes and protein-protein interactions. Chem Biol Drug Des. 2013;81(1): 22–32. pmid:23253128
  2. 2. Ledford H. Cancer: The Ras renaissance. Nature. 2015;520(7547): 278–280. pmid:25877186
  3. 3. Cox AD, Fesik SW, Kimmelman AC, Luo J, Der CJ. Drugging the undruggable RAS: Mission possible? Nat Rev Drug Discov. 2014;13(11): 828–851. pmid:25323927
  4. 4. Meier JL, Yu AS, Korf I, Segal DJ, Dervan PB. Guiding the design of synthetic DNA-binding molecules with massively parallel sequencing. J Am Chem Soc. 2012;134(42): 17814–17822. pmid:23013524
  5. 5. Alagarswamy K, Shinohara KI, Takayanagi S, Fukuyo M, Okabe A, Rahmutulla B, et al. Region-specific alteration of histone modification by LSD1 inhibitor conjugated with pyrrole-imidazole polyamide. Oncotarget. 2018;9(50): 29316–29335. pmid:30034620
  6. 6. Pandian GN, Nakano Y, Sato S, Morinaga H, Bando T, Nagase H, et al. A synthetic small molecule for rapid induction of multiple pluripotency genes in mouse embryonic fibroblasts. Sci Rep. 2012;2: 544. pmid:22848790
  7. 7. Anandhakumar C, Li Y, Kizaki S, Pandian GN, Hashiya K, Bando T, et al. Next-generation sequencing studies guide the design of pyrrole-imidazole polyamides with improved binding specificity by the addition of beta-alanine. Chembiochem. 2014;15(18): 2647–2651. pmid:25371287
  8. 8. Pandian GN, Sato S, Anandhakumar C, Taniguchi J, Takashima K, Syed J, et al. Identification of a small molecule that turns ON the pluripotency gene circuitry in human fibroblasts. ACS Chem Biol. 2014;9(12): 2729–2736. pmid:25366962
  9. 9. Taylor RD, Chandran A, Kashiwazaki G, Hashiya K, Bando T, Nagase H, et al. Selective Targeting of the KRAS Codon 12 Mutation Sequence by Pyrrole-Imidazole Polyamide seco-CBI Conjugates. Chem Eur J. 2015;21(42): 14996–15003. pmid:26306751
  10. 10. Syed J, Chandran A, Pandian GN, Taniguchi J, Sato S, Hashiya K, et al. A Synthetic Transcriptional Activator of Genes Associated with the Retina in Human Dermal Fibroblasts. Chembiochem. 2015;16(10): 1497–1501. pmid:25900774
  11. 11. Chenoweth DM, Meier JL, Dervan PB. Pyrrole-imidazole polyamides distinguish between double-helical DNA and RNA. Angew Chem Int Ed Engl. 2013;52(1): 415–418. pmid:22987334
  12. 12. Swalley SE, Baird EE, Dervan PB. A Pyrrole-Imidazole Polyamide Motif for Recognition of Eleven Base Pair Sequences in the Minor Groove of DNA. Chem Eur J. 1997;3(10): 1600–1607.
  13. 13. Mishra R, Watanabe T, Kimura MT, Koshikawa N, Ikeda M, Uekusa S, et al. Identification of a novel E-box binding pyrrole-imidazole polyamide inhibiting MYC-driven cell proliferation. Cancer Sci. 2015;106(4): 421–429. pmid:25611295
  14. 14. Yang F, Nickols NG, Li BC, Szablowski JO, Hamilton SR, Meier JL, et al. Animal toxicity of hairpin pyrrole-imidazole polyamides varies with the turn unit. J Med Chem. 2013;56(18): 7449–7457. pmid:24015881
  15. 15. Liu K, Fang L, Sun H, Pan Z, Zhang J, Chen J, et al. Targeting Polo-like Kinase 1 by a Novel Pyrrole-Imidazole Polyamide-Hoechst Conjugate Suppresses Tumor Growth In Vivo. Mol Cancer Ther. 2018;17(5): 988–1002. pmid:29483218
  16. 16. Lin J, Hiraoka K, Watanabe T, Kuo T, Shinozaki Y, Takatori A, et al. Identification of Binding Targets of a Pyrrole-Imidazole Polyamide KR12 in the LS180 Colorectal Cancer Genome. PLoS ONE. 2016;11(10): e0165581. pmid:27798693
  17. 17. Chandran A, Syed J, Taylor RD, Kashiwazaki G, Sato S, Hashiya K, et al. Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing. Nucleic Acids Res. 2016;44(9): 4014–4024. pmid:27098039
  18. 18. Erwin GS, Grieshop MP, Bhimsaria D, Eguchi A, Rodriguez-Martinez JA, Ansari AZ. Genome-wide Mapping of Drug-DNA Interactions in Cells with COSMIC (Crosslinking of Small Molecules to Isolate Chromatin). J Vis Exp. 2016(107): e53510. pmid:26863565
  19. 19. Lajiness JP, Boger DL. Asymmetric synthesis of 1,2,9,9a-tetrahydrocyclopropa[c]benzo[e]indol-4-one (CBI). J Org Chem. 2011;76(2): 583–587. pmid:21192653
  20. 20. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7): e47. pmid:25605792
  21. 21. Shin G, Kang TW, Yang S, Baek SJ, Jeong YS, Kim SY. GENT: gene expression database of normal and tumor tissues. Cancer Inform. 2011;10: 149–157. pmid:21695066
  22. 22. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45(D1): D353–D361. pmid:27899662
  23. 23. Kutmon M, van Iersel MP, Bohler A, Kelder T, Nunes N, Pico AR, et al. PathVisio 3: an extendable pathway analysis toolbox. PLoS Comput Biol. 2015;11(2): e1004085. pmid:25706687
  24. 24. Shaked I, Oberhardt MA, Atias N, Sharan R, Ruppin E. Metabolic Network Prediction of Drug Side Effects. Cell Syst. 2016;2(3): 209–213. pmid:27135366
  25. 25. Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, et al. Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci U S A. 2007;104(6): 1777–1782. pmid:17267599
  26. 26. Cohen J. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Routledge; 1988.
  27. 27. Champely S. pwr: Basic Functions for Power Analysis. R package version 1.2–2 ed2018.
  28. 28. Hiraoka K, Inoue T, Taylor RD, Watanabe T, Koshikawa N, Yoda H, et al. Inhibition of KRAS codon 12 mutants using a novel DNA-alkylating pyrrole-imidazole polyamide conjugate. Nat Commun. 2015;6: 6706. pmid:25913614
  29. 29. Yoda H, Inoue T, Shinozaki Y, Lin J, Watanabe T, Koshikawa N, et al. Direct Targeting of MYCN Gene Amplification by Site-Specific DNA Alkylation in Neuroblastoma. Cancer Res. 2019;79(4): 830–840. pmid:30584073
  30. 30. Cheng F, Li W, Wang X, Zhou Y, Wu Z, Shen J, et al. Adverse drug events: database construction and in silico prediction. J Chem Inf Model. 2013;53(4): 744–752. pmid:23521697
  31. 31. Stanfield Z, Coskun M, Koyuturk M. Drug Response Prediction as a Link Prediction Problem. Sci Rep. 2017;7: 40321. pmid:28067293
  32. 32. Gelabert-Baldrich M, Soriano-Castell D, Calvo M, Lu A, Vina-Vilaseca A, Rentero C, et al. Dynamics of KRas on endosomes: involvement of acidic phospholipids in its association. FASEB J. 2014;28(7): 3023–3037. pmid:24719356
  33. 33. Lubeseder-Martellato C, Alexandrow K, Hidalgo-Sastre A, Heid I, Boos SL, Briel T, et al. Oncogenic KRas-induced Increase in Fluid-phase Endocytosis is Dependent on N-WASP and is Required for the Formation of Pancreatic Preneoplastic Lesions. EBioMedicine. 2017;15: 90–99. pmid:28057438
  34. 34. Lim YH, Bacchiocchi A, Qiu J, Straub R, Bruckner A, Bercovitch L, et al. GNA14 Somatic Mutation Causes Congenital and Sporadic Vascular Tumors by MAPK Activation. Am J Hum Genet. 2016;99(2): 443–450. pmid:27476652
  35. 35. Joseph NM, Brunt EM, Marginean C, Nalbantoglu I, Snover DC, Thung SN et al. Frequent GNAQ and GNA14 Mutations in Hepatic Small Vessel Neoplasm. Am J Surg Pathol. 2018;42(9): 1201–1207. pmid:29975248
  36. 36. Sher Y, Hung M. Blood AST, ALT and UREA/BUN Level Analysis. Bio-protocol. 2013;3(19): e931.
  37. 37. Inoue T, Shimozato O, Matsuo N, Mori Y, Shinozaki Y, Lin J, et al. Hydrophobic structure of hairpin ten-ring pyrrole-imidazole polyamides enhances tumor tissue accumulation/retention in vivo. Bioorg Med Chem. 2018;26(9): 2337–2344. pmid:29622411