Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification of drug combinations on the basis of machine learning to maximize anti-aging effects

  • Sun Kyung Kim,

    Roles Data curation, Writing – original draft

    Affiliation College of Pharmacy, Kyung Hee University, Seoul, Republic of Korea

  • Peter C. Goughnour,

    Roles Data curation, Writing – original draft

    Affiliation College of Pharmacy, Kyung Hee University, Seoul, Republic of Korea

  • Eui Jin Lee,

    Roles Data curation

    Affiliation College of Pharmacy, Kyung Hee University, Seoul, Republic of Korea

  • Myeong Hyun Kim,

    Roles Data curation

    Affiliation Center for Research and Development, Oncocross Ltd., Seoul, Republic of Korea

  • Hee Jin Chae,

    Roles Data curation

    Affiliation Center for Research and Development, Oncocross Ltd., Seoul, Republic of Korea

  • Gwang Yeul Yun,

    Roles Data curation, Methodology

    Affiliation Center for Research and Development, Oncocross Ltd., Seoul, Republic of Korea

  • Yi Rang Kim ,

    Roles Conceptualization, Supervision, Writing – review & editing (YRK); (JWC)

    Affiliations Center for Research and Development, Oncocross Ltd., Seoul, Republic of Korea, Department of Hematology/Oncology, Yuseong Sun Hospital, Daejeon, Republic of Korea

  • Jin Woo Choi

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing (YRK); (JWC)

    Affiliations College of Pharmacy, Kyung Hee University, Seoul, Republic of Korea, Department of Life and Nano-pharmaceutical Sciences, Kyung Hee University, Seoul, Republic of Korea


Aging is a multifactorial process that involves numerous genetic changes, so identifying anti-aging agents is quite challenging. Age-associated genetic factors must be better understood to search appropriately for anti-aging agents. We utilized an aging-related gene expression pattern-trained machine learning system that can implement reversible changes in aging by linking combinatory drugs. In silico gene expression pattern-based drug repositioning strategies, such as connectivity map, have been developed as a method for unique drug discovery. However, these strategies have limitations such as lists that differ for input and drug-inducing genes or constraints to compare experimental cell lines to target diseases. To address this issue and improve the prediction success rate, we modified the original version of expression profiles with a stepwise-filtered method. We utilized a machine learning system called deep-neural network (DNN). Here we report that combinational drug pairs using differential expressed genes (DEG) had a more enhanced anti-aging effect compared with single independent treatments on leukemia cells. This study shows potential drug combinations to retard the effects of aging with higher efficacy using innovative machine learning techniques.


Aging is recognized as a direct or indirect cause of many diseases [1]. The suppression and reversion of aging is a considered methodology to prevent and cure its related diseases [2]. Aging is a complex genetic phenomenon, and the accepted strategy to delay aging is to regulate gene expression [3] rather than focusing on developmental mutations [4, 5].

However, due to the complex networks that link aging-related factors, computation and machine learning using previously accumulated data may be a promising approach to understand hidden causative gene expression patterns and identify new anti-aging drugs [6].

Using computational and bioinformatics strategies, researchers can now generate and analyze different kinds of data and drug repositioning (repurposing); in particular, finding new uses for existing drugs has become a popular method for unique drug discovery [7, 8]. These advances in drug screening are primarily due to the increasing number of gene expression-profiling analyses that have exhibited desired drug effects.

Recent collaborative efforts have combined many different fields of study, and the computer-aided drug discovery/design [9] method is adopted to facilitate the time-consuming process and to increase the effectiveness of drug discovery [8]. Recently, many databases for in silico drug development have been established; for example, Drugbank (2006) is a drug database with comprehensive drug target data [10], and PDTD (2008) is a web-accessible protein database for drug target identification [11]. Furthermore, large public databases are available that contain information regarding relationships between drugs and genes, which have made drug repositioning or repurposing more accessible.

CMap (Connectivity Map), which can reveal unexpected connections among drugs, genes, and diseases [12], is a representative example of the use of drug repositioning. CMap shows relationships between different biological states through drug treatment using gene expression profiles and signatures [13]. Thus previously developed compounds can be predicted and applied for non-targeted diseases. CMap has been applied to various drug development processes. It is used as a logical reference because each chemical induces different gene expressions, and all illnesses are caused or accompanied by changes in their gene expression.

Nevertheless, CMap has limitations, such as the use of a database with different cell lines for gene expression profiles [14] that does reflect the real impact of a drug on a particular cell line [15]. CMap is limited to only five cell lines such as: MCF7 (Breast cancer), PC3 (Prostate cancer), HL60 (Leukemia), ssMCF7 (Breast cancer-charcoal-stripped serum), SKMEL5 (Skin cancer). Therefore, if there are some diseases that we want to target that are not in these five cell lines, it is difficult to derive an accurate drug lists. Also, Cmap can be used to select drugs that can down-regulate the up-regulated genes and vice versa. However, the drug’s gene expression pattern (derived from Cmap) and the inserted gene expression pattern used in the initial analysis do not always match. In other words, the drug derived from CMap does not cover all of the gene expressions for that particular disease.

These above shortcomings may reduce the success rate of identifying potent drug repositioning. Additionally, it is difficult to alter the total gene expression pattern in the expected direction using a single drug. Thus, similar to drug combination therapy, wherein multiple chemicals are combined to treat disease [16], by combining two to three compounds with different mechanisms of action, researchers can overcome shortcomings in the drug repositioning process. Drug combinations can also reduce the required concentration of individual drugs [17].

This paper discusses how drug combinations can broadly manage a disease mechanism that can be used to find unique drug targets. Also, it shows how machine learning can assist in combining previous drugs that may have been missed by prior mathematical algorithms. In the current study, gene expression profiles for acute myeloid leukemia patients were downloaded from Gene Expression Omnibus (GEO), a database repository of high throughput gene expression data and hybridization arrays, chips, microarrays. The purpose of our model was to determine what kind of gene, especially in PBMC (peripheral blood mononuclear cell), was differentially expressed between two groups the young and the old. But as any normal PBMC cohort didn’t meet the enough number of population, we used GSE6891 which was collected for leukemia research including the ‘age’ information.

After that, differentially expressed genes (DEGs) in the aged group were determined with an artificial neural network [18] and matched with drug-induced gene expression profiles from CMap to predict drug candidates that can reverse the DEG pattern of the aged group. Using a deep neural network (DNN), an artificial neural network (ANN) with multiple layers between the input and output layers was used to find the correct mathematical manipulation to turn the input into the output, whether it be a linear relationship or a non-linear relationship. An artificial neural network is an interconnected group of nodes, inspired by a simplification of neurons in a brain. A two-stage search strategy was adopted to find drug combinations to reverse the aging-related target DEGs more effectively than the one-drug approach.

The purpose of our DNN model was to determine if a specific gene was differentially expressed between two samples. In many studies, fold-change approach and statistical methods including t-test and non-parametric test has been used as criterion to select differentially expressed genes (DEGs). But these approaches are sensitive to outliers or sample size. In this study, our goal was to develop a DEGs selection method which is robust to the sample size. The training dataset consists of the DEGs of the results of LIMMA from multiple GSEs with various sample size to this end.

Materials and methods

Deep neural network model

In this study, we built a deep neural network (DNN) model using the Tensorflow framework [19] to predict whether a specific gene was differentially expressed between two samples. The model is composed of an input layer of two nodes, three hidden layers with ten nodes each, and an output layer with two nodes that represent “up-regulation” and “down-regulation.” Rectified linear unit (ReLu) and Softmax was used as an activation function of hidden layers and output layer, respectively. Gene expression data were downloaded from the Gene Expression Omnibus (GEO) site ( We selected 13 datasets of leukemia samples to train our DNN model. The training dataset consists of 730 samples, with 108 and 622 samples in the control and disease groups, respectively. We used the limma package in GEO2R [20] ( to compute p value for each probe with a moderated t-test. The null hypothesis was that the gene was expressed the same in the disease group and the control group. We cut off those probes with p value > 0.1. We labeled the probes as “up” or “down” according to the expression values. Then we trained the DNN to predict whether a specific gene is differentially expressed between two samples. According to the experimental design of each dataset, we trained the model with expression value pairs of each gene between samples, one from the control group and the other from the disease group within individual datasets.

Gene expression data of aged group

Microarray data were downloaded from NCBI GEO with accession number GSE6891, which were derived from 461 acute myeloid leukemia patients [21, 22]. GSE6891 is the clinical data that represents the gene expression profiles of AML samples of two independent cohorts (n = 247 and n = 214). Data analyses were carried out to discover and predict prognostically relevant subtypes in AML (<60 years) based on their gene expression signatures. We categorized these data into two groups based on the patients’ age. Samples from patients older than 50 were included in the aged group, and those younger than 30 were included in the normal (young) group. The number of samples in the normal and aged groups were 84 and 159, respectively. Differentially expressed genes were identified with GEO2R in the same way as described in the ‘Deep neural network model’ section above. Lastly, we mapped the probe IDs to Entrez-gene IDs.

Drug-induced gene expression data

CMap is a resource that uses transcriptional expression data to probe relationships between diseases, cell physiology, and therapeutics [23]. The gene expression data is downloaded from CMap, which shows transcriptomic changes following drug treatment. In this study, we analyzed gene expression data from 1,072 drug experiments with HL-60 cells.

Cell culture

HL-60 cells were cultured at 1 × 105 cells/mL in Roswell Park Memorial Institute (RPMI) 1640 medium supplemented with 10% fetal bovine serum (FBS) and 1% penicillin/streptomycin at 37 °C in a 5% CO2 humidified atmosphere.

Cell viability analysis

Trypan blue staining was performed to assess cell viability. HL-60 cells were incubated in 48-well plates at 3.0 × 104 cells per well. After chemical treatment with or without pre-treatment with 10μM hydrogen peroxide, cells were diluted by trypan blue working solution and counted with a cell counter to allow for growth curve construction [24].

Reactive oxygen species (ROS) assay

HL-60 cells were seeded in 96-well plates at a density of 3.0 × 104 cells per well and then incubated with 10μM hydrogen peroxide to achieve ROS-induced senescence. After 4hr, each reagent was treated for 36hr in serum-free conditions. For cells, ROS production was measured with the ROS Detection Assay Kit (Biovision, Inc., Milpitas, CA, USA) according to the manufacturer’s protocol [25]. The plate was finally scanned with the CLARIOstar® Plus microplate reader (BMG Labtech, Ortenberg, Germany). The drugs were administered at the following concentrations: 30nM trichostatin A, 1μM vorinostat, 5μM anisomycin, 2mM metformin, 20μM danazol, 10μM glibenclamide, 20μM ampyrone, and 20μM chlorzoxazone.

Nuclear morphology assay

For fluorescence microscope detection of nuclear morphology changes, chemically treated HL-60 cells were washed twice with phosphate-buffered saline (PBS) and fixed with 4% paraformaldehyde in PBS for 15 min. For nuclear staining, cells were incubated with 1.0μg/mL 4ʹ,6-diamidino-2-phenylindole (DAPI). Fluorescent microscopy images were obtained using a fluorescence microscope system [26].

Senescence-associated beta-galactosidase staining

HL-60 cells were seeded in 24-well plates at a density of 3.0 × 104 cells per well and then incubated with 10μM hydrogen peroxide to achieve senescence. After 4hr, each reagent was treated for 36hr in serum-free conditions. For cells, β-galactosidase assays were performed using a Biovison Beta-Galactosidase Staining Kit (Biovision, Inc., Milpitas, CA, USA). The protocol was conducted according to the manufacturer’s instructions [27]. After fluorescent microscopy images were obtained using a fluorescence microscope system, beta-galactosidase positive cells were counted with a cell counter.


Production of the gene expression pattern of the aged group and drug response patterns

The GEO2R program [20] ( was used to calculate differentially expressed genes between the aged and normal groups of leukemia patients cohort (GSE6891), which 1,415 genes had a p <0.1. The samples from patients older than 50 were included in the aged group, and those younger than 30 were included in the normal (young) group, as shown in S1 Fig. Using our deep neural network (DNN), we computed the predictive value of each probe and ruled out the probes with their predictive value less than 0.95 (Fig 1A). The model architecture and other parameters are shown in S1 Table. We selected 13 datasets of leukemia samples to train our DNN model (S2 Table). The performance measured by the loss, accuracy, and area under the curve [28] is shown in S3 Table. Then we classified the probes as “up-regulated” or “down-regulated” according to its predictive value (S2 Fig). After that, we substituted the probe IDs with Entrez IDs to match them with the drug-induced gene expression data. Our approach resulted in the DEGs list consisting of 1,293 probes (Fig 1B), in which blue represents up-regulated genes, and red represents down-regulated genes in the aged group compared to the normal group. In this study, which represents the original DEGs (o-DEG) consisted of 676 up-regulated genes and 617 down-regulated genes. Drug-induced DEG (d-DEG) pattern was created similarly. In the drug-induced expression data, DEGs were selected with limma (p value ≤ 0.1) and filtered with our DNN (predictive value ≥ 0.95) (Table 1, and S4 Table).

Fig 1. Original differentially expressed genes (o-DEGs) between normal and aged groups.

(A) A schematic of training deep neural network (DNN) model with gene expression data from 13 datasets from GEO. DEGs pattern of an aged group (o-DEG) was built and transformed into i-DEG for matching with d-DEG. d-DEG was created with drug-induced gene expression data from the Connectivity Map. i-DEG and d-DEG are matched and scored to rank drugs. (B) A heatmap of up-regulated genes (marked in blue) and down-regulated genes (marked in red) in the aged group compared to the normal group. The total gene number in this pattern is 1,293.

Table 1. Differential gene expression patterns using DNNs.

Initial and second matching

To identify a DEG pattern related to therapeutic direction, initial matching was conducted, which reversed the o-DEG pattern, so that up-regulated genes were down-regulated and vice versa. We called this reversed DEG pattern “i-DEG,” which means the “ideal” state to reverse the current o-DEG of the aging state. We screened common genes between the i-DEG pattern and each d-DEG pattern accordingly and built a common DEG (c-DEG) pattern called initial matching (Fig 2A and 2B). The gene number in the c-DEG of each drug was scored, and chemicals were ranked based on score (S5 Table). Vorinostat [29], trichostatin A [30], lycorine, and anisomycin were designated as results of the initial matching, but lycorine was ruled out due to its expected toxicity (Table 2).

Fig 2. Flow of initial and second matching.

(A) To identify a differentially expressed gene (DEG) pattern that is expected to have anti-aging effects, we reversed the original DEGs (o-DEGs) of the aged group. The i-DEGs comprised 617 up-regulated genes and 676 down-regulated genes. We performed matching with each drug-induced DEG (d-DEG) to select common DEGs (c-DEGs). In the case of vorinostat, 91 up-regulated genes and 68 down-regulated genes were shared with the i-DEG pattern. (B) To maximize the number of reversed genes, second matching was performed. Excluding the drugs chosen in initial matching, drug-induced DEGs (d-DEGs) of each drug were matched with genes not covered by the first drug. In the case of vorinostat and anisomycin, c-DEGs of vorinostat covered 159 genes, and anisomycin, 132 genes.

Table 2. Drugs matched to i-DEG pattern in initial matching.

Since the drugs found in initial matching cannot reverse all the gene expressions in the o-DEG pattern, a second matching phase was performed to discover other combinatorial drugs to maximize the number of reversed genes with drugs found in initial matching. In the second phase, the genes covered by the chemicals found in initial matching were excluded when scoring the common genes between the i-DEG list and d-DEG patterns. The suggested drugs, in combination with the drugs identified in initial matching, are shown in Table 3 and S6 Table. The o-DEG and d-DEG patterns of vorinostat were compared, and the reversed match pixels were removed and expressed as white color. Thus, white color represents higher simulated efficacy of the drug. Combinatory treatment with anisomycin was predicted by broader white areas and was expected to reverse more o-DEGs in combination with vorinostat (Fig 3). We performed analysis and found genes that were both up-regulated in the first drug and down-regulated in the second drug screening, and vice versa. A total of 14 drug combinations, genes that overlap with Vorinostat and Anisomycin were only valid. Genes down-regulated in Vorinostat were up-regulated with effective values in Anisomycin (logFC>1, p<0.1). However, out of a total of 18,476 genes (down-regulated in Vorinostat and up-regulated in Anisomycin), only two genes IL6 and CYP1A1 were reversely regulated effectively, so it is shown that Anisomycin does not neutralize the effect of Vorinostat (S7 Table).

Fig 3. Simulation of drugs applied to aged samples. Original DEG pattern of the aged group.

The drug-induced DEGs (d-DEGs) of vorinostat and simulated effects when applied to o-DEGs. The d-DEGs of anisomycin and simulated combination effects with vorinostat.

Validation of the anti-aging effect via ROS staining

We evaluated drug cytotoxicity with trypan blue assay, and the no observed adverse effects level (NOAEL) was selected as an experimental dose with sub-lethal conditions (S3A–S3H Fig). To rapidly induce cellular senescence, we tried to generate reactive oxygen species (ROS), chemically reactive chemical species containing oxygen, in HL-60 cells with hydrogen peroxide, and then directly measured the ROS concentration. In the first round of drug matching, intracellular ROS levels in trichostatin A-, vorinostat-, and anisomycin-treated cells decreased by 9% on average compared with their levels in the experimental control. (*p <.05) Interestingly, their combinatorial treatment slightly and significantly lowered the ROS concentration about 15% more than that with a treat by each chemical alone. (*p <.05) The predicted first and second combinatory treatments showed an average efficacy of 27% (Fig 4A and 4B, and S4A Fig). (**p <.01) Still, the randomly combined treatment did not decrease the ROS concentration significantly (average 14.5%, S4B Fig). Taken together, the combination of drugs seemed to protect against ROS-induced aging.

Fig 4. Removal activity of cellular ROS concentration was detected as an anti-aging effect.

ROS detection measured by fluorescence intensity using a microplate reader. Hydrogen peroxide (H2O2) was administered to induce ROS-based senescence, then single drugs or combinatorial drug pairs were administered to HL-60 cells to protect against ROS-induced aging. (A) HL-60 cells were exposed to H2O2 (10 μM) for 24 h and then treated with trichostatin A, metformin, danazol, glibenclamide, ampyrone, and each prior drug was co-treated with trichostatin A. or (B) with vorinostat, metformin, anisomycin, danazol, glibenclamide and each prior drug with vorinostat for 36 h. The intracellular ROS levels were detected with a microplate reader capable of measuring Ex/Em 495/529 nm spectra and recorded. *p <.05, **p <.01 vs H2O2 group.

Analysis of anti-aging effect based on nuclear shape alteration

Commonly, significant changes in nuclear shape were seen in senescent cells. Untreated cells were all relatively small in size and with adequately regular round morphology. Nevertheless, senescent cells were mainly large and characterized by an irregular distribution of fluorescence [31, 32]. To investigate the morphological alterations that occur in aging nuclei, hydrogen peroxide was used to induce ROS-based senescence in HL-60 cells. The morphology of DAPI-stained nuclei was analyzed (S5 Fig), and the ratio of distorted: altered nuclear shapes were counted. Following treatment with 10μM H2O2, the first CMap drug treatment restored the nuclear morphology to a more spherical shape by 32% in comparison with the experimental control. (*p <.05) Furthermore, combinatorial drug treatment produced nuclei that were about 25% more circular compared with those treated independently with the first or second round drugs (Fig 5A and 5B, and S6A Fig). (*p <.05) The treatments of glibenclamide in combination with trichostatin A or vorinostat decreased the nuclear shape alteration rate most markedly. Still, the randomly combined treatment did not reduce the nuclear shape alteration significantly. However, an unexpected combinatory treatment of vorinostat with trichostatin A showed an average efficiency of 41% (S6B Fig). (*p <.05) Combination drug treatment restored the nuclear morphology to a spherical shape compared to first drug only treatment.

Fig 5. Investigation of morphologic alterations by nuclear morphology assay.

Nuclear morphology changes in 4ʹ,6-diamidino-2-phenylindole (DAPI)-stained HL-60 cells assessed by fluorescence microscopy. (A) HL-60 cells were exposed to H2O2 (10 μM) for 24 h and then treated with trichostatin A, metformin, danazol, glibenclamide, ampyrone, and each prior drug was co-treated with trichostatin A or (B) with vorinostat, metformin, anisomycin, danazol, glibenclamide and each prior drug with vorinostat for 36 h. Nuclear altered shape nuclei were counted and graphed. *p <.05, **p <.01 vs H2O2 group.

Identify of anti-aging effect by senescence-associated beta-galactosidase staining

Senescence-associated beta-galactosidase (SA-β-gal or SABG) is a hypothetical hydrolase enzyme that catalyzes the hydrolysis of beta-galactosidase into monosaccharides only in senescent cells. Senescence associated beta-galactosidase is regarded to be a biomarker of cellular senescence. The alteration of senescent cell numbers was validated with beta-galactosidase staining (S7 Fig). HL-60 cells were co-treated with trichostatin A, vorinostat, anisomycin with combinatorial drug pairs for 36 h after incubation with 10μM H2O2. The percentage of beta-galactosidase positive cells treated with the first-round CMap drugs (trichostatin A, vorinostat, anisomycin) is slightly decreased compared with their levels in the experimental control (10μM H2O2 only). The percentage of trichostatin A and danazol or trichostatin A and ampyrone combination-treated cells were decreased 48–51% compared with their levels in the first-round drug-treated cells (Fig 6A and 6B, and S8A Fig). (**p <.01) However, the percentage of β-galactosidase positive cells treated with randomly combined drug pairs did not decrease significantly (S8B Fig). Taken together, combinational drug treat had a higher percentage of beta-galactosidase compared to single drug.

Fig 6. Combinatorial drug pairs decrease cell senescence in HL-60 cells.

(A) HL-60 cells were exposed to H2O2 (10 μM) for 24 h and then treated with trichostatin A, metformin, danazol, glibenclamide, ampyrone, and each prior drug was co-treated with trichostatin A. or (B) with vorinostat, metformin, anisomycin, danazol, glibenclamide and each prior drug with vorinostat for 36 h. Then the cells were incubated with β-galactosidase and stained cells were counted and plotted. *p <.05, **p <.01 vs H2O2 group.


In this study, we used the public database CMap and focused on devising a new machine learning-combined analytical method to increase the efficiency of target discovery. As expected, the combination of potential drugs displayed better effects than those of single treatment. Here, we suggest how the two-step CMap strategy can be used to improve the success rate of target discovery. We used a modified drug repositioning method that involved the combination of the remaining non-overlapping DEG profile. These DEGs were reinserted into CMap for a second time, and the drugs obtained through the first and second CMap utilization were combined with the expectation of broadening the coverage of existing drugs. Finally, the machine learning strategy was applied to accelerate the process and efficiently solve the input for the third drug.

The three drugs retrieved from the database were trichostatin A, vorinostat, and anisomycin. These were used only as components to evaluate whether 2-step filtered CMap works. Here, we do not insist that these are effective drugs for anti-aging trials; instead, they served as models to demonstrate the usefulness of our platform. Although some of these drugs have a potential risk of side effects, they were utilized only as experimental examples in our process.

Trichostatin A serves as an antifungal antibiotic and selectively inhibits the class I and II mammalian histone deacetylase (HDAC) families of enzymes [33]. Vorinostat is also a member of a larger class of compounds that inhibit histone deacetylases (HDAC) [34]. Anisomycin is an antibiotic produced by Streptomyces that inhibits eukaryotic protein synthesis [35]. Thus, these three chemicals are expected to pose nontoxic effects. One of the selected secondary drugs, metformin, is the first-line medication for the treatment of type 2 diabetes [36], particularly in people who are overweight [37]. Metformin is also used in the treatment of polycystic ovary syndrome. Danazol is used in the treatment of endometriosis, fibrocystic breast disease, hereditary angioedema, and other conditions [38, 39]. Glibenclamide is a medication used to treat diabetes mellitus type 2 [40]. Ampyrone is a metabolite of aminopyrine with analgesic, anti-inflammatory, and antipyretic properties [41]. Chlorzoxazone (INN) is a centrally acting muscle relaxant used to treat muscle spasms and the resulting pain or discomfort [42]. Based on their broad indications or combinatory effects (especially for glibenclamide), these drugs may be appropriate as secondary treatments, but not as main compounds for anti-aging treatments.

Interestingly, some of the above agents have been suggested to have anti-aging effects. Danazol may stimulate the recovery of shortened telomeres [43]. Although the exact mechanism has not been revealed, metformin is known to have an anti-aging effect [44]. Glibenclamide induces neurogenesis [45]. Two HDAC inhibitors were suggested as anti-aging agents, which were initially developed as anti-cancer therapeutics. HDAC inhibitors could be a regulator of the aging process due to its epigenetic control ability [46].

Here, we focused on the anti-aging phenotype rather than cell death in leukemia HL-60 cells or cytotoxicity for cancer treatment. Because anti-cancer drugs generally induce cell death and gene expression patterns for cell death can be easily shared [47], they are not suitable for research on gene expression patterns. Thus, we speculated that anti-aging effects could be used as a proper model to study gene expression-based drug repositioning. Leukemia and cancer cells are usually senescence-defective and fail to undergo apoptosis; however; these cells may undergo premature senescence, otherwise called interim cell proliferative arrest, and it has been show that treatments, such as LEE0011, to leukemia cells can induce cell senescence [4850].

We verified the anti-aging effects of these drugs through different experiments and found that when drugs were mixed, low doses of individual drugs did not influence the overall outcomes. Therefore, our method of identifying combination drug pairs may be sufficient to cover a broad range of disease statuses. This paper showed that CMap-based machine learning for drug repositioning could be used to develop unique targets for drug development through well-designed strategies [51]. In the past, the identification of drugs with drug repositioning has been serendipitous [52, 53]. However, as shown in this paper, rationalized off-target drug repositioning using CMap can take many forms, and modified repositioning involving two-step CMap analyses assisted drug discovery more effectively. It is expected that further bioinformatics analysis, systems biology approaches, or network-based approaches will be applied for new drug discovery as technology develops.

Our computational method to identify combinational drug pairs is useful to cover a broad range of diseases. These results show the potential of our strategy to discover new drug combinations using a large scale input data set that results in a more fine-tuned screening to propose the best anti-aging drug combinations.

Supporting information

S2 Table. List of datasets used as training dataset.


S5 Table. Comparison results of initial matching.


S6 Table. Comparison results of secondary matching.


S7 Table. Neutralized effect of combinatorial drugs.


S1 Fig. Categorization of young and aged groups.

Categorization of patient groups The samples from patients older than 50 were included in the aged group and younger than 30 were included in normal group. The numbers of samples in the normal and aged groups were 84 and 159, respectively.


S2 Fig. Training of DNN.

Pairs of samples from Control group (‘C’ in the left side of figure) and Disease group (‘D’ in the left side of figure) are randomly chosen from the same series in the training dataset and fed to the DNN.


S3 Fig. Cell viability assay.

(A) trichostatin A (0–300 nM), (B) vorinostat (0–50 μM), (C) anisomycin (0–100 μM), (D) metformin (0–50 mM), (E) glibenclamide (0–100 μM), (F) ampyrone (0–200 μM), (G) danazol (0–200 μM), and (H) chlorzoxazone (0–200 μM) were treated at the different dosages after 3 hours-pretreatment with 10 μM H2O2 to determine the sub-lethal dose (NOAEL, no observed adverse effects level).


S4 Fig. Cellular ROS levels were attenuated by anisomycin or randomly combined drug pairs.

(A) HL-60 cells were co-treated with anisomycin and metformin or trichostatin A for 36 h after incubation with 10 μM H2O2. (B) Cells were also treated with trichostatin A + chlorzoxazone, vorinostat + trichostatin A, anisomycin + glibenclamide, anisomycin + danazol. The intracellular ROS levels were detected by a microplate reader capable of measuring Ex/Em 495/529 nm spectra and recorded. *p <.05, **p <.01 vs H2O2 group.


S5 Fig. Investigation of nuclear morphology alterations treated with anisomycin or randomly combined drug pairs.

HL-60 cells were treated as indicated in each picture. Nuclei were stained with DAPI, and pictures were taken on a fluorescent inverted microscope. Cells with misshaped(dented) nucleus were indicated by red arrows.


S6 Fig. Quantification of morphologic alterations treated with anisomycin or randomly combined drug pairs.

Nuclear morphology changes in DAPI-stained HL-60 cells as shown above were counted and graphed. (A) anisomycin and combined drug pairs, (B) trichostatin A + chlorzoxazone, vorinostat + trichostatin A, anisomycin + glibenclamide, anisomycin + danazol. *p <.05, **p <.01 vs H2O2 group.


S7 Fig. Measuring Senescence by beta-galactosidase staining after treatment with anisomycin or randomly combined drug pairs.

HL-60 cells were treated as indicated in each picture Fluorescent microscopy images were obtained using a fluorescence microscope system, and then beta-galactosidase positive cells were indicated with red arrows.


S8 Fig. Percentage of beta-galactosidase positive cells by anisomycin or randomly combined drug pairs.

HL-60 cells shown above were quantified and (A) anisomycin and combined drug pairs, (B) trichostatin A + chlorzoxazone, vorinostat + trichostatin A, anisomycin + glibenclamide, anisomycin + danazol. *p <.05, **p <.01 vs H2O2 group.



  1. 1. Niccoli T, Partridge L. Ageing as a risk factor for disease. Curr Biol. 2012;22(17):R741–52. pmid:22975005
  2. 2. Blagosklonny MV. Validation of anti-aging drugs by treating age-related diseases. Aging (Albany NY). 2009;1(3):281–8. pmid:20157517
  3. 3. Yang J, Huang T, Petralia F, Long Q, Zhang B, Argmann C, et al. Synchronized age-related gene expression changes across multiple tissues in human and the link to complex diseases. Sci Rep. 2015;5:15145. pmid:26477495
  4. 4. Calvert S, Tacutu R, Sharifi S, Teixeira R, Ghosh P, de Magalhaes JP. A network pharmacology approach reveals new candidate caloric restriction mimetics in C. elegans. Aging Cell. 2016;15(2):256–66. pmid:26676933
  5. 5. Janssens GE, Lin XX, Millan-Arino L, Kavsek A, Sen I, Seinstra RI, et al. Transcriptomics-Based Screening Identifies Pharmacological Inhibition of Hsp90 as a Means to Defer Aging. Cell Rep. 2019;27(2):467–80 e6. pmid:30970250
  6. 6. Kerepesi C, Daroczy B, Sturm A, Vellai T, Benczur A. Prediction and characterization of human ageing-related proteins by using machine learning. Sci Rep. 2018;8(1):4094. pmid:29511309
  7. 7. Shim JS, Liu JO. Recent advances in drug repositioning for the discovery of new anticancer drugs. Int J Biol Sci. 2014;10(7):654–63. pmid:25013375
  8. 8. Ashburn TT, Thor KB. Drug repositioning: identifying and developing new uses for existing drugs. Nat Rev Drug Discov. 2004;3(8):673–83. pmid:15286734
  9. 9. Johnson RJ, Christodoulou J, Dumoulin M, Caddy GL, Alcocer MJ, Murtagh GJ, et al. Rationalising lysozyme amyloidosis: insights from the structure and solution dynamics of T70N lysozyme. J Mol Biol. 2005;352(4):823–36. pmid:16126226
  10. 10. Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, et al. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006;34(Database issue):D668–72. pmid:16381955
  11. 11. Gao Z, Li H, Zhang H, Liu X, Kang L, Luo X, et al. PDTD: a web-accessible protein database for drug target identification. BMC Bioinformatics. 2008;9:104. pmid:18282303
  12. 12. Musa A, Ghoraie LS, Zhang SD, Glazko G, Yli-Harja O, Dehmer M, et al. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform. 2018;19(3):506–23. pmid:28069634
  13. 13. De Bastiani MA, Pfaffenseller B, Klamt F. Master Regulators Connectivity Map: A Transcription Factors-Centered Approach to Drug Repositioning. Front Pharmacol. 2018;9:697. pmid:30034338
  14. 14. Huff RG, Bayram E, Tan H, Knutson ST, Knaggs MH, Richon AB, et al. Chemical and structural diversity in cyclooxygenase protein active sites. Chem Biodivers. 2005;2(11):1533–52. pmid:17191953
  15. 15. Raghavan R, Hyter S, Pathak HB, Godwin AK, Konecny G, Wang C, et al. Drug discovery using clinical outcome-based Connectivity Mapping: application to ovarian cancer. BMC Genomics. 2016;17(1):811. pmid:27756228
  16. 16. Barabasi AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68. pmid:21164525
  17. 17. Sun W, Sanderson PE, Zheng W. Drug combination therapy increases successful drug repositioning. Drug Discov Today. 2016;21(7):1189–95. pmid:27240777
  18. 18. Eetemadi A, Tagkopoulos I. Genetic Neural Networks: an artificial neural network architecture for capturing gene expression relationships. Bioinformatics. 2019;35(13):2226–34. pmid:30452523
  19. 19. Rampasek L, Goldenberg A. TensorFlow: Biology’s Gateway to Deep Learning? Cell Syst. 2016;2(1):12–4. pmid:27136685
  20. 20. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res. 2013;41(Database issue):D991–5. pmid:23193258
  21. 21. Goel S, Shoag JE, Gross MD, Al Hussein Al Awamlh B, Robinson B, Khani F, et al. Concordance Between Biopsy and Radical Prostatectomy Pathology in the Era of Targeted Biopsy: A Systematic Review and Meta-analysis. Eur Urol Oncol. 2020;3(1):10–20. pmid:31492650
  22. 22. Verhaak RG, Wouters BJ, Erpelinck CA, Abbas S, Beverloo HB, Lugthart S, et al. Prediction of molecular subtypes in acute myeloid leukemia based on gene expression profiling. Haematologica. 2009;94(1):131–4. pmid:18838472
  23. 23. Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, et al. The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science. 2006;313(5795):1929–35. pmid:17008526
  24. 24. Strober W. Trypan blue exclusion test of cell viability. Curr Protoc Immunol. 2001;Appendix 3:Appendix 3B. pmid:18432654
  25. 25. Liu B, Tan X, Liang J, Wu S, Liu J, Zhang Q, et al. A reduction in reactive oxygen species contributes to dihydromyricetin-induced apoptosis in human hepatocellular carcinoma cells. Sci Rep. 2014;4:7041. pmid:25391369
  26. 26. Klimaszewska-Wisniewska A, Halas-Wisniewska M, Tadrowski T, Gagat M, Grzanka D, Grzanka A. Paclitaxel and the dietary flavonoid fisetin: a synergistic combination that induces mitotic catastrophe and autophagic cell death in A549 non-small cell lung cancer cells. Cancer Cell Int. 2016;16:10. pmid:26884726
  27. 27. Takasaka N, Araya J, Hara H, Ito S, Kobayashi K, Kurita Y, et al. Autophagy induction by SIRT6 through attenuation of insulin-like growth factor signaling is involved in the regulation of human bronchial epithelial cell senescence. J Immunol. 2014;192(3):958–68. pmid:24367027
  28. 28. Heim SW, Nadkarni M, Rollins LK, Schorling JB, Waters DB, Hauck FR, et al. Modular lifestyle intervention tool: a handheld tool to assist clinicians in providing patient-tailored counseling. Ann Fam Med. 2005;3 Suppl 2:S65–7. pmid:16049095
  29. 29. Silva G, Cardoso BA, Belo H, Almeida AM. Vorinostat induces apoptosis and differentiation in myeloid malignancies: genetic and molecular mechanisms. PLoS One. 2013;8(1):e53766. pmid:23320102
  30. 30. Jasek E, Lis GJ, Jasinska M, Jurkowska H, Litwin JA. Effect of histone deacetylase inhibitors trichostatin A and valproic acid on etoposide-induced apoptosis in leukemia cells. Anticancer Res. 2012;32(7):2791–9. pmid:22753739
  31. 31. Mocali A, Giovannelli L, Dolara P, Paoletti F. The comet assay approach to senescent human diploid fibroblasts identifies different phenotypes and clarifies relationships among nuclear size, DNA content, and DNA damage. J Gerontol A Biol Sci Med Sci. 2005;60(6):695–701. pmid:15983170
  32. 32. Chen QM, Tu VC, Catania J, Burton M, Toussaint O, Dilley T. Involvement of Rb family proteins, focal adhesion proteins and protein synthesis in senescent morphogenesis induced by hydrogen peroxide. J Cell Sci. 2000;113 (Pt 22):4087–97. pmid:11058095
  33. 33. Furumai R, Komatsu Y, Nishino N, Khochbin S, Yoshida M, Horinouchi S. Potent histone deacetylase inhibitors built from trichostatin A and cyclic tetrapeptide antibiotics including trapoxin. Proc Natl Acad Sci U S A. 2001;98(1):87–92. pmid:11134513
  34. 34. Garcia-Manero G, Yang H, Bueso-Ramos C, Ferrajoli A, Cortes J, Wierda WG, et al. Phase 1 study of the histone deacetylase inhibitor vorinostat (suberoylanilide hydroxamic acid [SAHA]) in patients with advanced leukemias and myelodysplastic syndromes. Blood. 2008;111(3):1060–6. pmid:17962510
  35. 35. Mawji IA, Simpson CD, Gronda M, Williams MA, Hurren R, Henderson CJ, et al. A chemical screen identifies anisomycin as an anoikis sensitizer that functions by decreasing FLIP protein synthesis. Cancer Res. 2007;67(17):8307–15. pmid:17804746
  36. 36. Forslund K, Hildebrand F, Nielsen T, Falony G, Le Chatelier E, Sunagawa S, et al. Disentangling type 2 diabetes and metformin treatment signatures in the human gut microbiota. Nature. 2015;528(7581):262–6. pmid:26633628
  37. 37. Levri KM, Slaymaker E, Last A, Yeh J, Ference J, D’Amico F, et al. Metformin as treatment for overweight and obese adults: a systematic review. Ann Fam Med. 2005;3(5):457–61. pmid:16189063
  38. 38. Bretza JA, Novey HS, Vaziri ND, Warner AS. Hypertension: a complication of danazol therapy. Arch Intern Med. 1980;140(10):1379–80. pmid:7425772
  39. 39. Thomas GW, Rael LT, Shimonkevitz R, Curtis CG, Bar-Or R, Bar-Or D. Effects of danazol on endothelial cell function and angiogenesis. Fertil Steril. 2007;88(4 Suppl):1065–70. pmid:17382938
  40. 40. Landstedt-Hallin L, Adamson U, Lins PE. Oral glibenclamide suppresses glucagon secretion during insulin-induced hypoglycemia in patients with type 2 diabetes. J Clin Endocrinol Metab. 1999;84(9):3140–5. pmid:10487677
  41. 41. Ghorab MM, El-Gazzar MG, Alsaid MS. Synthesis, characterization and anti-breast cancer activity of new 4-aminoantipyrine-based heterocycles. Int J Mol Sci. 2014;15(5):7539–53. pmid:24798749
  42. 42. Powers BJ, Cattau EL Jr., Zimmerman HJ. Chlorzoxazone hepatotoxic reactions. An analysis of 21 identified or presumed cases. Arch Intern Med. 1986;146(6):1183–6. pmid:3521519
  43. 43. Townsley DM, Dumitriu B, Liu D, Biancotto A, Weinstein B, Chen C, et al. Danazol Treatment for Telomere Diseases. N Engl J Med. 2016;374(20):1922–31. pmid:27192671
  44. 44. Valencia WM, Palacio A, Tamariz L, Florez H. Metformin and ageing: improving ageing outcomes beyond glycaemic control. Diabetologia. 2017;60(9):1630–8. pmid:28770328
  45. 45. Ortega FJ, Jolkkonen J, Mahy N, Rodriguez MJ. Glibenclamide enhances neurogenesis and improves long-term functional recovery after transient focal cerebral ischemia. J Cereb Blood Flow Metab. 2013;33(3):356–64. pmid:23149556
  46. 46. Shen S, Sandoval J, Swiss VA, Li J, Dupree J, Franklin RJ, et al. Age-dependent epigenetic control of differentiation inhibitors is critical for remyelination efficiency. Nat Neurosci. 2008;11(9):1024–34. pmid:19160500
  47. 47. Nemade H, Chaudhari U, Acharya A, Hescheler J, Hengstler JG, Papadopoulos S, et al. Cell death mechanisms of the anti-cancer drug etoposide on human cardiomyocytes isolated from pluripotent stem cells. Arch Toxicol. 2018;92(4):1507–24. pmid:29397400
  48. 48. Pan Y, Meng M, Zheng N, Cao Z, Yang P, Xi X, et al. Targeting of multiple senescence-promoting genes and signaling pathways by triptonide induces complete senescence of acute myeloid leukemia cells. Biochem Pharmacol. 2017;126:34–50. pmid:27908660
  49. 49. Tao YF, Wang NN, Xu LX, Li ZH, Li XL, Xu YY, et al. Molecular mechanism of G1 arrest and cellular senescence induced by LEE011, a novel CDK4/CDK6 inhibitor, in leukemia cells. Cancer Cell Int. 2017;17:35. pmid:28286417
  50. 50. Yu W, Qin X, Jin Y, Li Y, Santiskulvong C, Vu V, et al. Tianshengyuan-1 (TSY-1) regulates cellular Telomerase activity by methylation of TERT promoter. Oncotarget. 2017;8(5):7977–88. pmid:28002788
  51. 51. Jia J, Zhu F, Ma X, Cao Z, Cao ZW, Li Y, et al. Mechanisms of drug combinations: interaction and network perspectives. Nat Rev Drug Discov. 2009;8(2):111–28. pmid:19180105
  52. 52. Smalley JL, Gant TW, Zhang SD. Application of connectivity mapping in predictive toxicology based on gene-expression similarity. Toxicology. 2010;268(3):143–6. pmid:19788908
  53. 53. Boolell M, Allen MJ, Ballard SA, Gepi-Attee S, Muirhead GJ, Naylor AM, et al. Sildenafil: an orally active type 5 cyclic GMP-specific phosphodiesterase inhibitor for the treatment of penile erectile dysfunction. Int J Impot Res. 1996;8(2):47–52. pmid:8858389