Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification of empagliflozin-related hub genes in atherosclerosis and their correlations with immune infiltration: Network pharmacology and bioinformatics analyses

  • Yicheng Rong ,

    Contributed equally to this work with: Yicheng Rong, Xinyu Liu

    Roles Writing – original draft, Writing – review & editing

    Affiliation School of Life Sciences, Nanjing University, Nanjing, China

  • Xinyu Liu ,

    Contributed equally to this work with: Yicheng Rong, Xinyu Liu

    Roles Formal analysis, Writing – original draft

    Affiliation Department of Cardiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China

  • Yuanyuan Sun,

    Roles Formal analysis

    Affiliation Department of Geriatric Cardiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China

  • Le Yang ,

    Roles Data curation, Methodology, Project administration

    clm19841002@126.com (LC); yangle0307@126.com (LY)

    Affiliation Department of Cardiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China

  • Liming Chen

    Roles Data curation, Funding acquisition, Writing – original draft, Writing – review & editing

    clm19841002@126.com (LC); yangle0307@126.com (LY)

    Affiliation Department of Cardiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China

Abstract

Background

Atherosclerosis (AS) is by far the most frequent underlying cause of atherosclerotic cardiovascular disease. Recently, sodium-glucose cotransporter 2 (SGLT2) inhibitors stand out for their anti-atherosclerotic effects. The present study was conducted to explore the potential genetic and molecular mechanisms of empagliflozin, a selective SGLT2 inhibitor, in preventing AS, and the correlation of empagliflozin-related hub genes with immune cells.

Methods

In our study, pharmacology platforms were accessed to identify the empagliflozin-related genes (ERGs). AS datasets GSE100927 and GSE43292 were downloaded from the public GEO database to identify AS-differentially expressed genes (AS-DEGs). Furthermore, the empagliflozin-related DEGs (ERDEGs) were obtained by intersecting AS-DEGs and ERGs. ERDEGs were further analyzed for Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses. The protein-protein interaction (PPI) network was constructed to screen for hub genes, which were subjected to regulatory network construction, ROC curve plotting, as well as validation analysis, correlation and Friends analysis, and CIBERSORT and ssGSEA analysis.

Results

A total of 33 genes were identified as ERDEGs, among which the empagliflozin-related hub genes were identified to be IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3 in AS. Furthermore, in terms of the characteristics of immune cells, there were significant correlations between ERBB2 and CD8 + T cells, IL1B and resting mast cells, LGALS3 and eosinophils, as well as JAK2 and CD56dim NK.

Conclusion

Our network pharmacology and bioinformatics analyses provide a comprehensive understanding of the potential mechanisms of empagliflozin in AS at the genomic level, with the discovery of significant correlations between the screened hub genes and various immune cell subsets.

1. Introduction

Atherosclerosis (AS) is a chronic large and medium-sized artery-involved inflammatory disease that may induce atherosclerotic cardiovascular disease (ASCVD) involvingischemic heart disease, stroke, and peripheral vascular disease. AS has affected humans for millennia [1,2]. Currently, the primary anti-atherosclerotic therapies included cardioprevention guideline-recommended treatments and effective lipid-lowering medications (e.g., statins, ezetimibe, and PCSK9 inhibitors), either individually or in combination. However, ASCVD continues to be a leading cause of death globally, responsible for approximately 17.6 million deaths annually, with its incidence on the rise. It imposes a heavy disease burden clinically, affecting quality of life, and escalating healthcare costs [35]. Noticeably, the remaining risk for major adverse cardiovascular events may be inflammatory in nature in the context of controlled low-density lipoprotein-cholesterol (LDL-C) [1]. Therefore, in order to improve prognosis and achieve better survival for ASCVD patients, novel anti-atherosclerotic treatments can be established preferentially by modulating inflammatory pathway/mechanism.

Serving as a new class of anti-diabetic therapy, SGLT2 inhibitors have been confirmed to possess significant extra-glycaemic benefits in controlling cardiovascular risk factors [6]. In particular, the landmark EMPA-REG OUTCOME study, for the first time, offered convincing evidence that empagliflozin could reduce the total burden of cardiovascular complications in patients with type 2 diabetes and ASCVD [7]. Moreover, several preclinical studies in animal models have further supported the anti-atherosclerotic effect of empagliflozin, with ongoing research exploring potential protective mechanisms, such as suppressing inflammation and sympathetic activity, decreasing vasoconstrictive eicosanoids and inflammation in the vasculature and perivascular adipose tissue, reducing macrophage infiltration in atherosclerotic plaques, etc [811]. However, there is currently a lack of comprehensive research analyzing the genetic and molecular mechanisms underlying SGLT2 inhibitors in AS, as well as investigations into their interactions within the human genome.

Network pharmacology, proposed by Hopkins et al. in 2007, is a promising tool to establish a network-centric perspective of drug action through the integration of bioinformatics and cheminformatics [1214]. Currently, in the era of big data, network pharmacology has been transitioned significantly, supported by a wealth of biological data generated from high-throughput technologies such as sequencing [14]. The practicality network pharmacology-based genetic and molecular research has been further enhanced owing to the public availability of system pharmacology platforms and bioinformatic databases [15].

In this study, cross-public datasets were accessed to extensively screen potential empagliflozin-related hub genes in AS. Our analysis included target identification, protein-protein interaction network analysis, regulatory network analysis, and hub gene analysis. Unlike previous studies that focused on single aspects of SGLT2 inhibitor mechanisms, our approach systematically combines target prediction, differential gene expression analysis, protein-protein interaction networks, and immune cell correlation analysis to provide a holistic understanding of empagliflozin’s anti-atherosclerotic effects at the genomic level. A comprehensive analysis of hub genes and immune cell characteristics was also adopted given the interdependence between AS and inflammation. Our study aimed to uncover the common molecular regulatory networks of empagliflozi in AS (Fig 1). Findings in this study are expected to provide valuable insights for anti-atherosclerotic mechanism research, as well as novel insights and intervention targets for clinical anti-atherosclerotic drug development.

thumbnail
Fig 1. Flow Chart for the Comprehensive Analysis of ERDEGs.

AS, Atherosclerosis; AA, Advanced Atherosclerotic Plaque Samples; EA, Early Atherosclerotic Plaque Samples; DEGs, Differentially Expressed Genes; ERGs, Empagliflozin-Related Genes; ERDEGs, Empagliflozin-Related Differentially Expressed Genes; GSEA, Gene Set Enrichment Analysis; GSVA, Gene Set Variation Analysis; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; PPI Network, Protein-Protein Interaction Network; TF, Transcription Factor; RBP, RNA-Binding Protein; ROC Curve, Receiver Operating Characteristic Curve; ssGSEA, single-sample Gene-Set Enrichment Analysis.

https://doi.org/10.1371/journal.pone.0339956.g001

2. Materials and methods

2.1. Prediction of targets

Empagliflozin targets were searched and acquired, with “Empagliflozin” as the search term, from PubChem (https://pubchem.ncbi.nlm.nih.gov), [16] SwissTargetPrediction (http://swisstargetprediction.ch/), [17] and DGIdb (https://dgidb.org/) [18]. The online platform-sourced targets were further integrated and visualized using Cytoscape [19]. Potential disease targets were identified from the CTD (https://ctdbase.org/) and GeneCards (https://www.genecards.org/) databases, using “Atherosclerosis” as the key keyword, followed by the intersection of the results [20]. Finally, empagliflozin targets and AS targets were intersected to obtain the ERGs.

2.2. Data acquisition and preparation

Using the R package “GEOquery”, AS datasets GSE100927, [21] GSE43292, [22] and GSE28829 [21] were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) [23,24]. All samples in these datasets were sourced from Homo sapiens Carotid Arteries. The chip platforms for each dataset were GPL17077, GPL6244, and GPL570, respectively (Table 1). Specifically, GSE100927 and GSE43292 were integrated to create a combined test set (hereinafter referred to as the combined dataset), after merging using the R package “sva” [25] to eliminate batch effects; while GSE28829 served as the validation set. Noticeably, prior to statistical analysis, the R package “limma” was used to conduct background correction and normalization procedures for the combined dataset [26]. In addition, to assess the efficacy of the process, PCA was employed to visualize the merged matrix both before and after batch effect removal [27].

2.3. Identification of ERDEGs

The samples in the combined dataset were divided into AS and control groups for inter-group differential gene analysis using the R package “limma”. AS-DEGs were identified for those with |log FC| ≥ 0.25, and adjusted p(adj. p)<0.05. The |logFC| ≥ 0.25 was chosen based on biological relevance rather than statistical optimization alone. This relatively lenient threshold was selected to capture genes with modest but potentially meaningful biological changes, as network pharmacology studies often identify therapeutic targets through cumulative effects of multiple genes with moderate expression changes rather than focusing solely on highly differentially expressed genes. The |logFC| ≥ 0.25 threshold was selected based on the specific requirements of network pharmacology analysis, where small fold changes may still be biologically meaningful for multi-target drug mechanisms. Lower thresholds are appropriate for pathway analysis when balanced with stringent statistical significance criteria, and our integration with p(adj. p)<0.05 ensures reliable identification of differentially expressed genes while maintaining comprehensive coverage for pathway enrichment analysis. P-values were adjusted using Benjamini-Hochberg (BH) algorithm and a significant DEGs volcano map was drawn using the R package “ggplot2”. Subsequently, the ERDEGs were obtained by intersecting AS-DEGs and ERGs, which were visualized using the Venn diagram. Additionally, this study continued to analyze the expression differences of ERDEGs between different sample groups in the combined datase, and visualized corresponding results using the R package “pheatmap”.

2.4. PPI network construction and hub gene screening

This study further constructed the PPI network, using the Cytoscape software and the Search Tool for the Retrieval of Interacting Genes (STRING), to enhance the comprehension of the interactions among ERDEGs [28]. A statistically significant interaction was confirmed whe the combined score was > 0.4. Using the CytoHubba plug-in in Cytoscape, [29] each gene was scored based on the maximal clique centrality (MCC), maximum neighborhood component (MNC), degree, EPC, and Closeness algorithms [30]. Finally, the top-10 genes in each algorithm were considered hub genes. The intersection of these hub genes at the intersection of the five algorithms, designated as empagliflozin-related hub genes for AS was presented in a Venn diagram.

2.5. GO and Kyoto KEGG pathway enrichment analysis of ERDEGs

The identified ERDEGs were subjected to GO and KEGG pathway enrichment analyses using the R package “clusterProfiler” [3133]. The BH algorithm was used for the correction of the p value. Enrichment was considered significant at adj. p < 0.05 and FDR value (q value) < 0.25. In addition, a drug-target-pathway network diagram was created in Cytoscape, based on the drug-target interaction network of empagliflozin, ERDEGs, GO terms, and enriched KEGG pathways.

2.6. Gene set enrichment analysis (GSEA)

GSEA assessed whether genes from a predefined set showed a non-random distribution pattern within a gene list ranked by their correlation with the phenotype, thereby indicating the set’s contribution to the phenotype [34]. Following the ranking of genes in the combined dataset from highest to lowest based on logFC values, the GSEA was performed using the R package “clusterProfiler”. The parameters for GSEA were: seed, 2,020; number of calculations, 5,000; minimum gene set size, 10; and maximum gene set size, 500. The gene set c2.all.v2022.1.Hs.symbols.gmt[Curated/Pathway](6449) in the Molecular Signatures Database (MSigDB) were subjected to GSEA [35]. Significant enrichment was determined based on the criteria of adj.P < 0.05 (adjusted using the BH algorithm) and FDR < 0.25.

2.7. Gene set variation analysis (GSVA)

In our study, in order to explore the different pathways enriched between the AS and control groups for all genes in the combined dataset, GSVA, a non-parametric unsupervised analysis, [36] was performed using MSigDB-derived hallmark gene set (h.all.v7.4.symbols.gmt) [35]. P-values were adjusted using the BH algorithm and the significance threshold was set at adj. p < 0.05.

2.8. Regulatory network of hub genes

Corresponding regulatory networks were constructed to further understand the regulatory mechanism of hub genes. MicroRNAs (miRNAs) are small functional ncRNA molecules that are involved in a wide range of biological processes [37]. Individual miRNAs typically regulate numerous target mRNAs, while a single mRNA transcript is often targeted by multiple miRNAs, forming complex regulatory networks [38]. After obtaining miRNAs associated with hub genes from the miRDB database (https://mirdb.org/), this study continued to clarify the relationship between empagliflozin-related hub genes and miRNA [39]. Given the role of transcription factors (TFs) in controlling gene dynamics and expression, TFs regulating the transcription of empagliflozin-related hub genes potentially were identified in CHIPBase (http://rna.sysu.edu.cn/chipbase/) [40] and the Human Transcription Factors Target (hTFtarget) database (https://bio.tools/hTFtarget) [41] to further illustrate the potential regulation network for these hub genes. RNA-binding Proteins (RBP), a vital player in post-transcriptional gene regulation, are involved in RNA splicing, maturation, transport, stability, degradation, and translation [42]. Therefor, the starBase v3.0 was searched to identify the targeting relationship between RBPs and empagliflozin-related hub genes [43]. Eventually, the Cytoscape software was used to visualize regulatory networks.

2.9. Correlation analysis among empagliflozin-related hub genes

To explore pairwise gene expression correlation for empagliflozin-related hub genes, the Spearman correlation analysis was performed and the results were visualized through correlation chord diagrams using the R packages “igraph” and “ggraph”. Additionally, the Friends analysis was also utilized to analyze the functional similarity of the six genes identified in the aforementioned analyses using the R package “GOSemSim” [44].

2.10. The ROC curve analysis of the correlation between hub genes expression and AS clinical characteristics

The ROC curve was plotted, using R package pROC, and analyzed to assess the accuracy of the hub gene as a marker for AS and control groups in the combined dataset. Simultaneously, we also generated the comparison chart based on the expression levels of hub genes. Meanwhile, the AUC was calculated to assess the diagnostic impact of hub gene expression on AS occurrence. The hub genes with AUC > 0.7 were deemed useful for disease diagnosis.

2.11. Validation of hub genes

To determine the clinical characteristics of AS, the GSE28829 dataset, consisting of 13 EA and 16 AA, was used for to further verify the expression differences of empagliflozin-related hub genes. P-value < 0.05 was considered statistically significant.

2.12. Immune cell characteristics and hub gene correlation analysis

Considering the significance of inflammation and immune cells in AS, the CIBERSORT algorithm (https://cibersort.stanford.edu/index.php) and ssGSEA were applied to evaluate the proportions of immune cells along with immune-related functions between AS and control groups [45,46].

Using the CIBERSORT algorithm, the proportions of the 22 types of immune cells were determined based on samples from the combined dataset, with the results visualized by bar charts. Subsequently, the R package “ggplot2” was utilized to compare the levels of the 22 types of immune cells between AS and controls, selecting results showing statistical significance (P < 0.05) for further analysis.

Furthermore, the ssGSEA algorithm was utilized to evaluate the infiltration abundance of 28 different immune cell types within each individual samples. Initially, the infiltrating immune cell types were categorized, which included activated CD8 T cell, activate dendritic cell, gamma delta T cell, natural killer cell, regulatory T cell, etc. Subsequently, immune cell subsets with significant differences between groups were filtered for further analysis. The correlation between immune cell subsets, calculated by Spearman algorithm, was visualized in a correlation heatmap using the R package “pheatmap”. Finally, the correlation between hub genes and immune cell subsets was visualized using a correlation bubble plot generated by the R package “ggplot2”.

2.13. Statistical analysis

All statistical analyses in our study were conducted with R software (version 4.2.2), unless otherwise specified, with a two-sided p-value < 0.05 considered statistically significant. Inter-group comparisons of continuous variables with normal distribution was estimated by a Student’s t-test; while differences in non-normally distributed variables across groups were analyzed by the Mann-Whitney U test (i.e., the Wilcoxon rank-sum test). Comparisons between three or more groups were performed by Kruskal-Wallis tests.

2.14. Ethics statement

No further institutional review board approval is needed for this study as it was conducted based on secondary analyses of publicly available and de-identified data.

Results

3.1. Prediction of targets

With access to PubChem, SwissTargetPrediction online platform, and DGIdp, we acquired 22, 100, and 2 empagliflozin targets respectively. After integration, 118 empagliflozin targets were identified, which were visualized by Cytoscape in Fig 2A. Searching in CTD and GeneCards resulted in 36,768 and 4,043 AS targets, with 4,014 targets yielded after merging. Finally, 76 ERGs were obtained by intersecting empagliflozin and AS targets (Fig 2B). Supplementary S1 Table shows the results of the preliminary screening.

thumbnail
Fig 2. Empagliflozin and Atherosclerosis Targets Interaction Analysis.

A. empagliflozin-atherosclerosis targets interaction network. The green oval is empagliflozin, and the blue circles are the predicted DGIdb targets; Pink circles are PubChem predicted targets; Yellow circles are the predicted targets from SwissTargetPrediction. B. Venn diagram of targets of the drug empagliflozin and disease atherosclerosis. E, Empagliflozin; ERGs, Empagliflozin-Related Genes; AS, Atherosclerosis; ASRGs, atherosclerosis-related Genes.

https://doi.org/10.1371/journal.pone.0339956.g002

3.2. Data acquisition and preparation

For data acquisition, we merged and normalized GSE100927 (9 AS cases and 12 controls) and GSE43292 (32 AS cases and 32 controls) datasets. A total of 61 AS cases and 44 controls were included in the test set. As revealed in the distribution boxplots and PCA plots, batch processing resulted in an effective elimination of the batch effect of samples in the combined datasets (Fig 3). In addition, the validation set GSE28829 consisted of 16 cases of advanced-stage atherosclerotic plaque (AA) and 13 cases of early-stage atherosclerotic plaques (EA).

thumbnail
Fig 3. Batch Effects Removal of GSE100927 and GSE43292.

A. Boxplot of combined datasets distribution before batch removal. B. Boxplot of combined dataset distribution post batch removal. C. PCA plot of the datasets before debatching. D. PCA map of the combined dataset after batch processing. PCA, Principal Component Analysis; AS, Atherosclerosis. The AS dataset GSE100927 is shown in blue, and the AS dataset GSE43292 is shown in pink.

https://doi.org/10.1371/journal.pone.0339956.g003

3.3. Identification of ERDEGs

Based on |logFC| > 0.25 and adj.p < 0.05, the combined dataset included 3,503 AS-DEGs. Among these, there were 1,820 up-regulated genes (logFC > 0.25 and adj.p < 0.05) and 1,683 down-regulated genes (logFC < −0.25 and adj.p < 0.05). Differential analysis of these dataset generated a volcano plot, as shown in Fig 4A. After overlapping between all AS-DEGs and the 76 ERGs, 33 genes were identified as ERDEGs (Fig 4B). These genes included ABCB1, ADORA2B, AGTR1, BCL2A1, CAT, CDK1, DPP4, EGFR, ERBB2, HCAR2, HK2, IGFBP3, IL1B, ILK, JAK2, LCK, LGALS3, LGALS9, MAP3K14, MAPK10, MME, MMP1, MMP8, PDE5A, PDGFRB, PLK1, PRF1, PRKG1, PYGM, SLC2A1, SNCA, SYK, and TBXA2R.

thumbnail
Fig 4. Differential Gene Expression Analysis.

A. Volcano plot of differentially expressed gene analysis of AS and Control in the combined dataset. The x-axis represents log2 fold change, y-axis represents -log10(adjusted p-value). Red dots indicate significantly upregulated genes (logFC > 0.25, adj.p < 0.05), blue dots indicate significantly downregulated genes (logFC < −0.25, adj.p < 0.05), and gray dots represent non-significant genes. B. Venn plot of AS-DEGs and ERGs in the combined datasets. C. Heat map of ERDEGs in the combined datasets. AS, Atherosclerosis; DEGs, Differentially Expressed Genes; ERGs, Empagliflozin-Related Genes; ERDEGs, Empagliflozin-Related Differentially Expressed Genes. Pink is AS and blue is Control. Red represents high expression and blue represents low expression in the heat map.

https://doi.org/10.1371/journal.pone.0339956.g004

3.4. PPI network construction and hub gene screening

Supported by STRING, this study further constructed a PPI network for the screened 33 ERDEGs. As displayed in Fig 5A, 28 ERDEGs were interconnected, including IL1B, EGFR, ERBB2, JAK2, LGALS3, AGTR1, DPP4, MMP1, IGFBP3, SYK, PRF1, SLC2A1, MME, PDGFRB, LCK, ABCB1, SNCA, CDK1, BCL2A1, LGALS9, PYGM, HK2, MMP8, HCAR2, TBXA2R, PRKG1, PLK1, and PDE5A. Topological parameters for these ERDEGs were analyzed and are presented in Table 2. Subsequently, the top-10 ERDEGs were identified by employing the five algorithms (MCC, MNC, Degree, EPC, and Closeness) in the CytoHubba plugin of Cytoscape, with the results of calculation detailed in S2 Table. Simultaneously, corresponding PPI networks were visualized, with circle color changing from red to yellow to indicate decreasing scores (Fig 5B-F). The Venn diagram (Fig 5G) was also generated to identify the intersection of genes from the five algorithms, resulting in 6 hub genes, involving IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3 (S3 Table).

thumbnail
Table 2. Topological parameter analysis for connected ERDEGs.

https://doi.org/10.1371/journal.pone.0339956.t002

thumbnail
Fig 5. PPI Network and Hub Genes Analysis.

A. PPI Network of ERDEGs calculated by the STRING database. B-F. PPI Network of TOP10 ERDEGs calculated by 5 algorithms of CytoHubba plug-in, including MCC (B), MNC (C), Degree (D), EPC (E) and Closeness (F). G. Venn diagram of the top 10 ERDEGs identified by five algorithms of the CytoHubba plugin. PPI Network, Protein-protein Interaction Network; ERDEGs, Empagliflozin-Related Differentially Expressed Genes.

https://doi.org/10.1371/journal.pone.0339956.g005

3.5. GO and KEGG pathway enrichment analyses of ERDEGs

Our subsequent investigation focused on GO and KEGG pathway enrichment analyses of the 33 ERDEGs associated with AS (Table 3). Specifically, according to the GO enrichment analysis, the BPs mainly included regulation of cell-cell adhesion, regulation of T cell activation, regulation of leukocyte cell-cell adhesion, leukocyte cell-cell adhesion, and regulation of T cell proliferation; the cellular components (CCs) were mostly related to the focal adhesion, cell-substrate junction, membrane raft, membrane microdomain, and apical plasma membrane; and the molecular functions (MFs) were chiefly associated with protein serine/threonine/tyrosine kinase activity, protein tyrosine kinase activity, protein serine/threonine kinase activity, phosphatidylinositol 3-kinase binding, and phosphatase binding. Furthermore, as indicated by the KEGG enrichment analysis, the predominant pathways enriched were Central carbon metabolism in cancer, NF-kappa B signaling pathway, Osteoclast differentiation, Coronavirus disease-COVID-19, and EGFR tyrosine kinase inhibitor resistance. The results of the GO terms and KEGG pathways are visualized in the bubble chart (Fig 6A), while the network diagrams for each analysis are displayed in Fig 6B-E. Lastly, Fig 7 presents the overall drug-target-pathway network diagram, illustrating the interaction of empagliflozin, ERDEGs, GO terms, and enriched KEGG pathways.

thumbnail
Table 3. Results of GO and KEGG enrichment analysis for ERDEGs.

https://doi.org/10.1371/journal.pone.0339956.t003

thumbnail
Fig 6. GO and KEGG Enrichment Analysis for ERDEGs.

A. Bubble diagram of GO and KEGG enrichment analysis for ERDEGs: BP, CC, MF, and KEGG. GO terms and KEGG terms are shown on the abscissa. Bubble size represents gene count (number of genes enriched in each term), and bubble color intensity represents adjusted p-value (darker/more intense colors indicate higher significance). The x-axis shows gene ratio (proportion of genes enriched in each term). B-E. Network diagram of GO and KEGG enrichment analysis r for ERDEGs showing: BP (B), CC (C), MF (D), and KEGG (E). Pink nodes represent items, blue nodes represent molecules, and lines represent the relationship between items and molecules. ERDEGs, Empagliflozin-Related Differentially Expressed Genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; BP, Biological Process; CC, Cellular Component; MF, Molecular Function. The screening criteria for GO and KEGG enrichment analysis were adj.p < 0.05 and FDR value (q value) < 0.25, and the p value correction method was Benjamini-Hochberg (BH).

https://doi.org/10.1371/journal.pone.0339956.g006

thumbnail
Fig 7. Drug-Targets-Pathways Network.

Drug-target-pathway network diagram of empagliflozin, ERDEGs, GO, and KEGG. Orange ellipse is empagliflozin, green circles are ERDEGs, blue circles are biological processes, pink circles are cellular components, yellow circles are molecular functions, and purple circles are biological pathways (KEGG).

https://doi.org/10.1371/journal.pone.0339956.g007

3.6. GSEA

The impact of all gene expression levels associated with AS in the combined dataset were analyzed based on GSEA (Fig 8A and Table 4). All genes were significantly enriched in TGFB1 Signaling Via NFIC 1hr Dn (Fig 8B), TP53 And TP63 Targets (Fig 8C), Pi3kci Pathway (Fig 8D), Apoptosis By CDKN1A Via TP53 (Fig 8E), Response To HGF Vs CSF2RB And IL4 Up (Fig 8F), Fceri Mediated Mapk Activation (Fig 8G), and NOTCH1 Targets Dn (Fig 8H).

thumbnail
Table 4. Results of GSEA for combined datasets.

https://doi.org/10.1371/journal.pone.0339956.t004

thumbnail
Fig 8. GSEA for Combined Dataset.

A. Bubble plot showcasing the GSEA of 7 biological functions in the combined datasets. B-H. GSEA showed that all genes were significantly enriched in TGFB1 Signaling Via NFIC 1hr Dn (B), TP53 And TP63 Targets (C), Pi3kci Pathway (D), NFIC 1HR DN (B), TP53 and TP63 targets (C), PI3KCI pathway (D). Apoptosis By CDKN1A Via TP53 (E), Response To HGF Vs CSF2RB And IL4 Up (F), Fceri Mediated Mapk Activation (G), NOTCH1 Targets Dn (H). The enrichment score (ES) curve is shown at the top of each panel, with the ranking metric scores in the middle, and gene positions marked as vertical lines. NES: normalized enrichment score. Positive NES indicates enrichment in AS group; negative NES indicates enrichment in control group. GSEA, Gene Set Enrichment Analysis. The screening criteria of GSEA were adj.p < 0.05 and FDR value (q value) < 0.25, and the p value correction method was Benjamini-Hochberg (BH).

https://doi.org/10.1371/journal.pone.0339956.g008

3.7. GSVA

Our research further compared the pathway differences in genes involved in AS and controls from the MSigDB database (Table 5). The top-20 enriched pathways (an adj. p < 0.05 and |logFC|) were identified and are visualized in the heatmap (Fig 9A). Through Mann–Whitney U test, the inter-group differencesare illustrated in Fig 9B. Consequently, there were statistically significant differences in pathways such as HALLMARK_MYOGENESIS, HALLMARK_UV_RESPONSE_DN, HALLMARK_TGF_BETA_SIGNALING, HALLMARK_EPITHELIAL_MESENCHYMAL_TRANSITION, HALLMARK_DNA_REPAIR, HALLMARK_NOTCH_SIGNALING, HALLMARK_APOPTOSIS, HALLMARK_G2M_CHECKPOINT, HALLMARK_REACTIVE_OXYGEN_SPECIES_PATHWAY, HALLMARK_TNFA_SIGNALING_VIA_NFKB, HALLMARK_IL2_STAT5_SIGNALING, HALLMARK_E2F_TARGETS, HALLMARK_PI3K_AKT_MTOR_SIGNALING, HALLMARK_P53_PATHWAY, HALLMARK_COMPLEMENT, HALLMARK_INFLAMMATORY_RESPONSE, HALLMARK_IL6_JAK_STAT3_SIGNALING, HALLMARK_ALLOGRAFT_REJECTION, HALLMARK_INTERFERON_GAMMA_RESPONSE, and HALLMARK_INTERFERON_ALPHA_RESPONSE, between the AS group and the control group (all p < 0.05).

thumbnail
Table 5. Results of GSVA for combined datasets.

https://doi.org/10.1371/journal.pone.0339956.t005

thumbnail
Fig 9. GSVA Analysis for Combined Dataset.

A-B. Heatmap (A) and group comparison plot (B) of GSVA results between AS and controls of the combined dataset. AS, Atherosclerosis; GSVA, Gene Set Variation Analysis. ** represents p value < 0.01, highly statistically significant; *** represents p value < 0.001 and highly statistically significant. Pink represents AS and blue represents Control. Blue represents down-regulation and red represents up-regulation in the heat map. The screening criteria for GSVA was adj.p < 0.05, and the p value correction method was Benjamini-Hochberg (BH).

https://doi.org/10.1371/journal.pone.0339956.g009

3.8. Regulatory network of hub genes

Following the analysis of data sourced from miRDB database, the mRNA-miRNA regulatory network (Fig 10A and S4 Table) revealed 5 hub genes and 40 miRNAs. Furthermore, the integration of 6 hub genes and 29 TFs from the ChIPBase and hTFtarget generated a mRNA-TF regulatory network, as presented in Fig 10B and S5 Table. Additionally, 44 RBPs predicted to bind to the 6 hub genes were identified using the StarBase database (Fig 10C and S6 Table).

thumbnail
Fig 10. Regulatory Network of Hub Genes.

A. mRNA-miRNA Regulatory Network of empagliflozin-related Hub Genes. B. mRNA-TF Regulatory Network of empagliflozin-related Hub Genes. C. mRNA-RBP Regulatory Network of empagliflozin-related Hub Genes. E; Empagliflozin; TF, Transcription Factor; RBP, RNA-Binding Protein. mRNAs are shown in blue, miRNAs in pink, TFs in green, and RBP in purple.

https://doi.org/10.1371/journal.pone.0339956.g010

3.9. Correlation and Friends analysis among empagliflozin-related hub genes

As illustrated in Fig 11A, the Chord diagram was employed to visualize the results from pairwise correlations of the 6 hub genes identified by Pearson correlation analysis, with negative correlations found in these hub genes primarily. Specifically, IL1B, SYK, and LGALS3 were positively correlated with each other but negatively correlated with ERBB2, EGFR, and JAK2, which conversely showed positive correlations. Additionally, ERBB2 was found to play an important role in AS, as evidenced by the box plot of functional similarity (Friends) analysis (Fig 11B).

thumbnail
Fig 11. Correlation and Friends Analysis of Hub Genes.

A. Chord diagram of the correlations among the 6 empagliflozin-related hub genes in the combined datasets B. Box plot of functional similarity (Friends) analysis results for hub genes. Pink represents a positive correlation, blue represents a negative correlation. The absolute value of the correlation coefficient (r value) below 0.3 is weak or no correlation, between 0.3 and 0.5 is weak correlation, between 0.5 and 0.8 is moderate correlation, and above 0.8 is strong correlation.

https://doi.org/10.1371/journal.pone.0339956.g011

3.10. ROC curve analysis of the correlation between hub gene expression and AS clinical characteristics

In the combined dataset, inter-group differences were observed in expression levels of IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3 between the AS and control groups (Fig 12A). As a result, there were highly statistically significant differences in the expression levels of the 6 hub genes between groups(all p 0.001). Among these, IL1B, SYK and LGALS3 showed significantly up-regulated expressions, while EGFR, ERBB2, and JAK2 exhibited obviously down-regulated expressions. Subsequently, based on the ROC curve analysis, the expression levels of IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3 exhibited reasonable accuracy in distinguishing AS and the controls, with corresponding AUC values of 0.816, 0.852, 0.836, 0.738, 0.895, and 0.845, respectively (Fig 12B-G).

thumbnail
Fig 12. Differential Expression Analysis and ROC Curve Analysis.

A. Comparison plot of hub genes in AS and control groups within the combined dataset. B-G. ROC curves of hub genes in the combined dataset. *** represents a p value < 0.001 and highly statistically significant. When AUC > 0.5, it indicates that the expression of the molecule is a trend to promote the occurrence of the event, and the closer the AUC is to 1, the better the diagnostic effect. The AUC had some accuracy in the range of 0.7 to 0.9. AS, Atherosclerosis; ROC, Receiver Operating Characteristic; AUC, Area Under the Curve. TPR, True Positive Rate; FPR, False Positive Rate. Blue represents Control and pink represents AS.

https://doi.org/10.1371/journal.pone.0339956.g012

3.11. Validation of hub genes

In accordance with the analysis of the hub gene expression in the validation set GSE28829, five of the 6 hub genes (i.e., JAK2, SYK, LGALS3, IL1B, and ERBB2) revealed statistically significant differences in expression levels between EA and AA groups, exhibiting high consistency with the findings from the combined dataset (Fig 13A). Specifically, compared to EA group, IL1B, SYK, and LGALS3 showed higher expression, while ERBB2 and JAK2 exhibited lower expression levels in the AA group. Moreover, the expression levels of IL1B, ERBB2, JAK2, SYK, and LGALS3 could accurately distinguish between EA and AA groups, as depicted in the the ROC curve, with AUC values of 0.721, 0.736, 0.880, 0.933, and 0.971, respectively (Fig 13B-F).

thumbnail
Fig 13. Differential Expression Validation and ROC Curve Analysis.

A. Comparison plot of hub genes in AA samples and EA samples within the validation set GSE28829. B-F. ROC curve of hub genes in the validation set GSE28829. *** represents p value < 0.001, highly statistically significant; * represents p value < 0.05, indicating statistical significance. When AUC > 0.5, it indicates that the expression of the molecule is a trend to promote the occurrence of the event, and the closer the AUC is to 1, the better the diagnostic effect. AUC between 0.7-0.9 has a certain accuracy; AUC > 0.9 has high accuracy. AA, Advanced Atherosclerotic Plaque Samples; EA, Early Atherosclerotic Plaque Samples; ROC, Receiver Operating Characteristic; AUC, Area Under the Curve. TPR, True Positive Rate; FPR, False Positive Rate. Pink represents AA samples and blue represents early EA samples.

https://doi.org/10.1371/journal.pone.0339956.g013

3.12. Immune cell characteristics and hub gene correlation analysis

This study observed remarkable correlations between immune cells and empagliflozin-related hub genes, when adopting CIBERSORT analysis and ssGSEA algorithm. To be specific, shown in Fig 14A. Inter-group comparisons showed significant differences in the proportion of immune cell infiltration between AS and control groups. Specifically, statistically significant differences in expressions were found in 12 immune cell types (all p < 0.05), including B cells naive, B cells memory, T cells CD8, T cells CD4 memory resting, T cells CD4 memory activated, T cells gamma delta, NK cells activated, monocytes, macrophages M0, macrophages M1, mast cells resting, and neutrophils (Fig 14B). The correlation thermogram in Fig 14C highlighted strong positive correlation between monocytes and NK cells activated (r = 0.47), as well as the greatest negative correlation between macrophages M0 and monocytes (r = −0.64). Fig 14D shows the correlation between 6 hub genes and the 12 immune cell types infiltration. Consequently, ERBB2 was significantly positively correlated with T cells CD8 (r = 0.58, p < 0.05), while IL1B was highly negatively correlated with mast cells resting (r = −0.61, p < 0.05).

thumbnail
Fig 14. Combined Datasets Immune Infiltration Analysis by CIBERSORT Algorithm.

A-B. Bar graph (A) and group comparison graph (B) of the proportion of immune cells in the combined dataset. C. Correlation heatmap of immune cell infiltration abundance in the combined dataset. D. Bubble plot of the correlation between hub genes and immune cell infiltration abundance in the combined dataset. AS, Atherosclerosis. * represents p value < 0.05, indicating statistical significance; ** represents p value < 0.01, highly statistically significant; *** represents p value < 0.001 and highly statistically significant. The absolute value of the correlation coefficient (r-value) below 0.3 was weak or no correlation, between 0.3 and 0.5 was a weak correlation, between 0.5 and 0.8 was a moderate correlation, and above 0.8 was a strong correlation. Pink is AS and blue is Control. Blue is a negative correlation, pink is a positive correlation, and the depth of the color represents the strength of the correlation.

https://doi.org/10.1371/journal.pone.0339956.g014

In view of the calculation results via the ssGSEA algorithm, this study also confrmed the characterized immune subtypes based on the abundance of 28 immune cells (Fig 15A). The correlation thermogram indicated a positive relationship between these immune cell types (Fig 15B). In the bubble plot (Fig 15C), a strongest positive correlation was found between LGALS3 and eosinophils (r = 0.70, p < 0.05), while a strongest negative correlation between JAK2 and CD56dim natural killer cells (r = −0.69, p < 0.05).

thumbnail
Fig 15. Immune Infiltration Analysis by ssGSEA Algorithm.

A. Group comparison plots of immune cells in AS and Control of the combined dataset. B. Correlation heatmap of immune cell infiltration abundance in the combined dataset. C. Bubble plot of the correlation between hub genes and immune cell infiltration abundance in the combined dataset. ssGSEA, single-sample Gene-Set Enrichment Analysis; AS, Atherosclerosis. ns stands for p value ≥ 0.05, not statistically significant; *** represents p value < 0.001, highly statistically significant. The absolute value of the correlation coefficient (r-value) below 0.3 was weak or no correlation, between 0.3 and 0.5 was a weak correlation, between 0.5 and 0.8 was a moderate correlation, and above 0.8 was a strong correlation. Pink is AS and blue is Control. Pink is a positive correlation and blue is a negative correlation. The depth of the color represents the strength of the correlation.

https://doi.org/10.1371/journal.pone.0339956.g015

4. Discussion

4.1. Main findings

From the most comprehensive perspective to date, the present study analyzed the genetic and molecular mechanisms underlying SGLT2 inhibitors in AS, and investigated into their interactions within the human genome. With the identification of 33 genes as ERDEGs, six genes (IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3) were identified as hub genes, with the discovery of interactions with miRNAs, TFs, and RBPs, as suggested by a regulatory network. Furthermore, it was observed with significant inter-group differences in the expression levels of the hub genes between AS and control groups, including up-regulated IL1B, SYK, and LGALS3, as well as down-regulated EGFR, ERBB2, and JAK2. Meanwhile, ROC curve demonstrated the accuracy of the hub genes in distinguishing these two groups, with subsequent verification in the validation set. In addition, the identified hub genes were also found to have potential relationships with immune cell subsets, with ERBB2 exhibiting significant positive correlation with T cells CD8, IL1B showing significant negative correlation with mast cells resting, LGALS3 having the strongest positive correlation with eosinophils, and JAK2 revealing the strongest negative correlation with CD56dim natural killer cells.

4.2. GO and KEGG analyses on ERDEGs

GO and KEGG analyses were conducted on ERDEGs to elucidate their roles in biological processes and metabolic pathways. Specifically, as indicated by GO analysis, ERDEGs were primarily involved in PBs such as the regulation of leukocyte cell-cell adhesion, leukocyte cell-cell adhesion, and regulation of T cell proliferation; significant enrichment regarding MFs in various enzyme activities and protein interactions. These results underscore the crucial roles of these biological processes and activities in multiple stages of AS, such as endothelial cell damage and dysfunction, smooth muscle cell proliferation and migration, inflammatory cell recruitment and activation, as well as plaque formation and stabilization [4751].

Meanwhile, as revealed by the KEGG pathway analysis, the screened genes were mainly involved in central carbon metabolism in cancer, NF-kappa B signaling pathway, osteoclast differentiation, COVID-19, and EGFR tyrosine kinase inhibitor resistance, all of which are important in AS development and progression [47,52,53]. Notably, the NF-kappa B signaling pathway plays a central role in inflammation and immunity [54].

Collectively, empagliflozin is critical in modulating AS-associated signaling pathways and gene expression, potentially explaining its mechanisms against the disease, which may facilitate the identification of new strategies to treat AS.

4.3. Characteristics of the six empagliflozin-related hub genes in AS

Through network pharmacology and bioinformatics-based approach, our study identified 6 hub genes that are dysregulated in AS and potentially associated with the anti-atherosclerotic effect of empagliflozin. The proteins encoded by these genes are primarily involved in inflammatory reaction, a significant event in AS occurrence and development. Specifically, IL1B, SYK, and LGALS3 participate significantly in the process of inflammatory reactions; [5557] moreover, JAK2, EGFR, ERBB2, and IL1B are contributors to cell proliferation and migration, which are also essential processes in AS where smooth muscle cells undergo abnormal migration and vascular remodeling [5860]. Given these findings, it may offer promising solutions to treat various diseases, including AS, by targeting these proteins with small molecule inhibitors and antibody drug. For instance, the CANTOS trial demonstrated reduced recurrent major adverse cardiovascular events by using anti-IL-1β antibody in treating stable cardiovascular patients post-myocardial infarction, along with standard care [61,62].

IL1B encodes interleukin-1β, a master regulator of inflammatory responses in atherosclerosis. Empagliflozin reduces IL-1B secretion through NLRP3 inflammasome inhibition and targets the TRAF3IP2/ROS/NLRP3/Caspase-1 pathway [63]. SGLT2 inhibitors significantly reduce cardiac inflammatory cytokine expression, including IL-1B, suggesting reduced inflammation-mediated vascular damage [64]. The anti-atherosclerotic effects are mediated through β-hydroxybutyrate-dependent mechanisms that suppress inflammasome activation.

SYK is a critical mediator involved in atherosclerosis progression through inflammatory signaling pathways. SYK inhibition demonstrates significant anti-proliferative effects on vascular smooth muscle cells, attenuating cell proliferation and migration associated with atherosclerosis development. SYK serves as a key mediator of inflammation in atherosclerotic processes, particularly in PCSK9-induced inflammatory responses [65]. Empagliflozin’s cardiovascular protective effects may involve SYK pathway modulation, contributing to reduced vascular inflammation and improved plaque stability.

LGALS3 encodes Galectin-3, a multifunctional protein involved in atherosclerosis progression through macrophage activation and foam cell formation. The upregulation of LGALS3 in atherosclerotic lesions suggests its involvement in inflammatory cascades, though its exact role warrants further investigation. Empagliflozin may modulate Gal-3 expression through anti-inflammatory pathways.

EGFR, a member of the ErbB receptor tyrosine kinase family, plays a critical role in atherosclerosis development. Myeloid cell-specific deletion of EGFR significantly reduces atherosclerotic lesion size by 43−54% through decreased macrophage accumulation, TNF-αand IL-6 production, and limited lipid uptake via CD36 downregulation [66]. ERBB2 (HER2) is abundantly expressed in endothelial cells and vascular smooth muscle cells, with elevated serum HER2 levels independently associated with coronary artery disease presence and stenotic vessel number. The downregulation of EGFR and ERBB2 observed in our analysis suggests empagliflozin may inhibit vascular smooth muscle cell proliferation and inflammatory responses, contributing to its anti-atherosclerotic effects.

JAK2 encodes Janus kinase 2, a critical tyrosine kinase mediating cytokine signaling through the JAK-STAT pathway. The JAK2V617F mutation confers a 12-fold increased cardiovascular disease risk, promoting accelerated atherosclerosis with increased inflammation, larger necrotic cores, and defective efferocytosis [67]. However, normal macrophage JAK2 is atheroprotective, as its deficiency causes impaired cholesterol efflux and accelerated atherosclerosis [68]. The JAK2/STAT3 pathway drives M1 macrophage polarization and NLRP3 inflammasome activation. Empagliflozin downregulates the JAK2/STAT1 pathway in macrophages, reducing inflammatory responses, suggesting empagliflozin’s cardioprotective effects partially involve JAK2 pathway modulation.

4.4. Critical assessment of findings

While our network pharmacology approach identified six hub genes with statistical significance, several limitations warrant careful interpretation. First, the modest fold change threshold (|logFC| ≥ 0.25) employed in this study, though justified for pathway analysis, means that individual gene expression changes may be biologically subtle. The clinical relevance of such modest expression differences remains uncertain without dose-response validation or functional studies demonstrating that these changes translate to meaningful biological effects. Second, the correlations identified between hub genes and immune cell subsets, while statistically significant, represent associations rather than causal relationships. For instance, the correlation between ERBB2 and CD8 + T cells (r = 0.58) or LGALS3 and eosinophils (r = 0.70) does not establish whether these genes directly regulate immune cell infiltration or merely reflect parallel processes in atherosclerotic tissue. Third, our findings are derived from cross-sectional gene expression data and cannot determine whether the observed dysregulation of these hub genes precedes, accompanies, or follows empagliflozin’s therapeutic effects. Given that empagliflozin’s cardiovascular benefits occur through multiple mechanisms (metabolic, hemodynamic, and anti-inflammatory), the relative contribution of each identified hub gene to its overall anti-atherosclerotic effect remains speculative without targeted intervention studies.

4.5. Immune cell characteristics and hub gene correlation analysis

Based on CIBERSORT and ssGSEA analyses, there were significant differences in the proportions and infiltration levels of immune cells between AS and controls. Our subsequent analyses also confirmed the relationships between hub genes and immune cell subsets, with significant correlations between ERBB2 and CD8 + T cells, IL1B and resting mast cells, LGALS3 and eosinophils, and JAK2 and CD56dim NK cells. Collectively, we may acquire novel perspective on the role of empagliflozin in immune regulation for anti-atherosclerotic effects, and additional targets for developing novel immunotherapeutic strategies. For further in-depth understanding, our future study should continue to explore the relationship between these immune cells and disease progression as well as treatment response.

4.6. Clinical importance

Our study has significant implications for both clinical practice and future research. First, our study may offer important scientific basis for future clinical research and treatment strategies, as evidenced by a comprehensive explanation of the mechanism of empagliflozin against AS. In particular, this study reveals significant correlations of empagliflozin-related hub genes with immune cell subsets, which are key players in AS onset and progression [69]. All these findings may hint future directions of validation and exploration of potential applications of empagliflozin clinically.

4.7. Strengths and limitations

Our research has significant advantages. Firstly, this study comprehensively explored the potential mechanisms of empagliflozin against AS within the human genome utilizing advanced network pharmacology methods, robust public data platforms, and powerful analytical tools. Secondly, our research addressed the limitations of animal experimental studies, which often fail to fully replicate human conditions that may hinder drug discovery and clinical translation.

However, the existence of several limitations underlines our cautious interpretation of results in the current study. Firstly, study based on network pharmacology, constrained by the timeliness of public databases, may be insufficient to realize real-time investigations owing to the potentially outdated target information. Secondly, this study did not validate the results of network pharmacology, necessitating more tangible pharmacological models, as reliance solely on computer and database networks may not be sufficient. Third, this study lacks experimental validation through wet lab or in vivo experiments, which limits the direct clinical translation of our findings. The identified hub genes and pathways require validation through cell culture studies, animal models, and ultimately clinical trials to confirm their therapeutic relevance. Moreover, the immune infiltration analysis using CIBERSORT and ssGSEA provides computational estimates rather than direct measurements of immune cell proportions. These deconvolution methods depend on reference signatures that may not fully represent immune cells in diseased atherosclerotic tissues, and bulk RNA sequencing has inherent limitations in distinguishing closely related cell subtypes. Additionally, different deconvolution algorithms can yield variable estimates for the same samples, and spatial information about immune cell distribution within plaques cannot be captured. Future validation using single-cell RNA sequencing, flow cytometry, or multiplexed immunofluorescence would provide more accurate immune cell characterization and confirm these computational predictions.

Therefore, our findings should be validated by further in vitro experiments and clinical trials, thereby promoting the clinical application of the discovered therapeutic targets.

5. Conclusion

In conclusion, this study represents the first comprehensive integration of network pharmacology and immune infiltration analysis to elucidate empagliflozin’s anti-atherosclerotic mechanisms. Our identification of six key hub genes (IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3) and their correlations with immune cell subsets provides a molecular framework for understanding empagliflozin’s cardiovascular protective effects. These findings lay the groundwork for future experimental validation studies and suggest potential combination therapeutic strategies targeting both metabolic and inflammatory pathways in atherosclerosis treatment.

Supporting information

S1 Table. Empagliflozin-Related Genes (ERGs) in Atherosclerosis.

Complete list of 76 genes identified at the intersection of empagliflozin targets and atherosclerosis-related genes from PubChem, SwissTargetPrediction, DGIdb, CTD, and GeneCards databases.

https://doi.org/10.1371/journal.pone.0339956.s001

(XLSX)

S2 Table. CytoHubba Algorithm Results for ERDEGs.

Top-10 genes ranked by five topological algorithms (MCC, MNC, Degree, EPC, and Closeness) from the CytoHubba plugin analysis of the protein-protein interaction network.

https://doi.org/10.1371/journal.pone.0339956.s002

(XLSX)

S3 Table. Empagliflozin-Related Hub Genes in Atherosclerosis.

List of six hub genes (IL1B, EGFR, ERBB2, JAK2, SYK, and LGALS3) identified at the intersection of the five CytoHubba algorithms.

https://doi.org/10.1371/journal.pone.0339956.s003

(XLSX)

S4 Table. mRNA-miRNA Regulatory Network.

Complete list of predicted interactions between 5 empagliflozin-related hub genes and 40 microRNAs identified from the miRDB database.

https://doi.org/10.1371/journal.pone.0339956.s004

(XLSX)

S5 Table. mRNA-Transcription Factor Regulatory Network

Complete list of predicted interactions between 6 empagliflozin-related hub genes and 29 transcription factors identified from ChIPBase and hTFtarget databases.

https://doi.org/10.1371/journal.pone.0339956.s005

(XLSX)

S6 Table. mRNA-RNA Binding Protein Regulatory Network

Complete list of predicted interactions between 6 empagliflozin-related hub genes and 44 RNA-binding proteins identified from the starBase v3.0 database.

https://doi.org/10.1371/journal.pone.0339956.s006

(XLSX)

References

  1. 1. Kobiyama K, Ley K. Atherosclerosis. Circulation Research. 2018;123(10):1118–20.
  2. 2. Pedro-Botet J, Climent E, Benaiges D. Atherosclerosis and inflammation. New therapeutic approaches. Med Clin (Barc). 2020;155(6):256–62. pmid:32571617
  3. 3. Roeters van Lennep JE, Tokgözoğlu LS, Badimon L, Dumanski SM, Gulati M, Hess CN, et al. Women, lipids, and atherosclerotic cardiovascular disease: A call to action from the European Atherosclerosis Society. Eur Heart J. 2023;44(39):4157–73. pmid:37611089
  4. 4. Nayor M, Brown KJ, Vasan RS. The molecular basis of predicting atherosclerotic cardiovascular disease risk. Circulation Research. 2021;128(2):287–303.
  5. 5. Fonarow GC, Keech AC, Pedersen TR, Giugliano RP, Sever PS, Lindgren P, et al. Cost-effectiveness of evolocumab therapy for reducing cardiovascular events in patients with atherosclerotic cardiovascular disease. JAMA Cardiol. 2017;2(10):1069–78. pmid:28832867
  6. 6. Paul SK, Bhatt DL, Montvida O. The association of amputations and peripheral artery disease in patients with type 2 diabetes mellitus receiving sodium-glucose cotransporter type-2 inhibitors: Real-world study. Eur Heart J. 2021;42(18):1728–38. pmid:33289789
  7. 7. McGuire DK, Zinman B, Inzucchi SE, Wanner C, Fitchett D, Anker SD, et al. Effects of empagliflozin on first and recurrent clinical events in patients with type 2 diabetes and atherosclerotic cardiovascular disease: A secondary analysis of the EMPA-REG OUTCOME trial. Lancet Diabetes Endocrinol. 2020;8(12):949–59. pmid:33217335
  8. 8. Liu Y, Wu M, Xu B, Kang L. Empagliflozin alleviates atherosclerosis progression by inhibiting inflammation and sympathetic activity in a normoglycemic mouse model. J Inflamm Res. 2021;14:2277–87. pmid:34103961
  9. 9. Ganbaatar B, Fukuda D, Shinohara M, Yagi S, Kusunose K, Yamada H, et al. Empagliflozin ameliorates endothelial dysfunction and suppresses atherogenesis in diabetic apolipoprotein E-deficient mice. Eur J Pharmacol. 2020;875:173040. pmid:32114052
  10. 10. Fu J, Xu H, Wu F, Tu Q, Dong X, Xie H, et al. Empagliflozin inhibits macrophage inflammation through AMPK signaling pathway and plays an anti-atherosclerosis role. Int J Cardiol. 2022;367:56–62. pmid:35931206
  11. 11. Xu J, Hirai T, Koya D, Kitada M. Effects of SGLT2 inhibitors on atherosclerosis: Lessons from cardiovascular clinical outcomes in type 2 diabetic patients and basic researches. J Clin Med. 2021;11(1):137. pmid:35011882
  12. 12. Yu L, Gai Y. Elucidating the mechanism of agrimonolide in treating colon cancer based on network pharmacology. Drug Des Devel Ther. 2023;17:2209–22.
  13. 13. Nogales C, Mamdouh ZM, List M, Kiel C, Casas AI, Schmidt HHHW. Network pharmacology: Curing causal mechanisms instead of treating symptoms. Trends Pharmacol Sci. 2022;43(2):136–50. pmid:34895945
  14. 14. Noor F, Asif M, Ashfaq UA, Qasim M, Tahir ul Qamar M. Machine learning for synergistic network pharmacology: A comprehensive overview. Brief Bioinform. 2023;24(3).
  15. 15. Deng YX, He WG, Cai HJ. Analysis and validation of hub genes in blood monocytes of postmenopausal osteoporosis patients. Front Endocrinol. 2022;12.
  16. 16. Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, et al. PubChem in 2021: New data content and improved web interfaces. Nucleic Acids Res. 2021;49(D1):D1388–95. pmid:33151290
  17. 17. Daina A, Michielin O, Zoete V. SwissTargetPrediction: updated data and new features for efficient prediction of protein targets of small molecules. Nucleic Acids Res. 2019;47(W1):W357–64. pmid:31106366
  18. 18. Freshour SL, Kiwala S, Cotto KC, Coffman AC, McMichael JF, Song JJ, et al. Integration of the Drug-Gene Interaction Database (DGIdb 4.0) with open crowdsource efforts. Nucleic Acids Res. 2021;49(D1):D1144–51. pmid:33237278
  19. 19. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. pmid:14597658
  20. 20. Davis AP, Wiegers TC, Johnson RJ, Sciaky D, Wiegers J, Mattingly CJ. Comparative toxicogenomics database (CTD): Update 2023. Nucleic Acids Res. 2023;51(D1):D1257–62. pmid:36169237
  21. 21. Zeng Y, Cao S, Chen M. Integrated analysis and exploration of potential shared gene signatures between carotid atherosclerosis and periodontitis. BMC Med Genomics. 2022;15(1):227. pmid:36316672
  22. 22. Meng L-B, Shan M-J, Qiu Y, Qi R, Yu Z-M, Guo P, et al. TPM2 as a potential predictive biomarker for atherosclerosis. Aging (Albany NY). 2019;11(17):6960–82. pmid:31487691
  23. 23. Davis S, Meltzer PS. GEOquery: A bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007;23(14):1846–7. pmid:17496320
  24. 24. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: Archive for functional genomics data sets--update. Nucleic Acids Res. 2013;41(Database issue):D991-5. pmid:23193258
  25. 25. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3. pmid:22257669
  26. 26. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47. pmid:25605792
  27. 27. Ben Salem KBA. Principal Component Analysis (PCA). Tunis Med. 2021;99(4):383–9.
  28. 28. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607–13. pmid:30476243
  29. 29. Chin C-H, Chen S-H, Wu H-H, Ho C-W, Ko M-T, Lin C-Y. cytoHubba: Identifying hub objects and sub-networks from complex interactome. BMC Syst Biol. 2014;8 Suppl 4(Suppl 4):S11. pmid:25521941
  30. 30. Yang X, Li Y, Lv R, Qian H, Chen X, Yang CF. Study on the multitarget mechanism and key active ingredients of herba siegesbeckiae and volatile oil against rheumatoid arthritis based on network pharmacology. Evid Based Complement Alternat Med. 2019;2019:8957245. pmid:31885670
  31. 31. Mi H, Muruganujan A, Ebert D, Huang X, Thomas PD. PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 2019;47(D1):D419–26. pmid:30407594
  32. 32. Goto MK a S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research. 2000;28(1):27–30.
  33. 33. Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–7. pmid:22455463
  34. 34. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. pmid:16199517
  35. 35. Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–40. pmid:21546393
  36. 36. Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7. pmid:23323831
  37. 37. Sun L, Chen G, Sun A, Wang Z, Huang H, Gao Z, et al. BAG2 promotes proliferation and metastasis of gastric cancer via ERK1/2 signaling and partially regulated by miR186. Front Oncol. 2020;10:31. pmid:32082999
  38. 38. Liu Z, Huang J, Jiang Q, Li X, Tang X, Chen S, et al. miR-125a attenuates the malignant biological behaviors of cervical squamous cell carcinoma cells through Rad51. Bioengineered. 2022;13(4):8503–14. pmid:35332852
  39. 39. Chen Y, Wang X. miRDB: an online database for prediction of functional microRNA targets. Nucleic Acids Res. 2020;48(D1):D127–31. pmid:31504780
  40. 40. Zhou K-R, Liu S, Sun W-J, Zheng L-L, Zhou H, Yang J-H, et al. ChIPBase v2.0: Decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data. Nucleic Acids Res. 2017;45(D1):D43–50. pmid:27924033
  41. 41. Zhang Q, Liu W, Zhang H-M, Xie G-Y, Miao Y-R, Xia M, et al. hTFtarget: A comprehensive database for regulations of human transcription factors and their targets. Genomics Proteomics Bioinformatics. 2020;18(2):120–8. pmid:32858223
  42. 42. Mukhopadhyay M. Recapitulating early cardiogenesis in vitro. Nat Methods. 2021;18(4):331. pmid:33828266
  43. 43. Li J-H, Liu S, Zhou H, Qu L-H, Yang J-H. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42(Database issue):D92-7. pmid:24297251
  44. 44. Yu G, Li F, Qin Y, Bo X, Wu Y, Wang S. GOSemSim: An R package for measuring semantic similarity among GO terms and gene products. Bioinformatics. 2010;26(7):976–8. pmid:20179076
  45. 45. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12(5):453–7. pmid:25822800
  46. 46. Xiao B, Liu L, Li A, et al. Identification and Verification of Immune-Related Gene Prognostic Signature Based on ssGSEA for Osteosarcoma. Front Oncol. 2020;10.
  47. 47. Pan H, Ho SE, Xue C, Cui J, Johanson QS, Sachs N, et al. Atherosclerosis is a smooth muscle cell-driven tumor-like disease. Circulation. 2024;149(24):1885–98. pmid:38686559
  48. 48. Mandatori S, Pacella I, Marzolla V. Altered Tregs differentiation and impaired autophagy correlate to atherosclerotic disease. Front Immunol. 2020;11.
  49. 49. Smith E, Prasad K-MR, Butcher M, Dobrian A, Kolls JK, Ley K, et al. Blockade of interleukin-17A results in reduced atherosclerosis in apolipoprotein E-deficient mice. Circulation. 2010;121(15):1746–55. pmid:20368519
  50. 50. Yang X, Zhang S, Dong Z. Insulin resistance is a risk factor for overall cerebral small vessel disease burden in old nondiabetic healthy adult population. Front Aging Neurosci. 2019;11.
  51. 51. Zigmond ZM, Song L, Martinez L, Lassance-Soares RM, Velazquez OC, Vazquez-Padron RI. c-Kit expression in smooth muscle cells reduces atherosclerosis burden in hyperlipidemic mice. Atherosclerosis. 2021;324:133–40. pmid:33781566
  52. 52. Weber C, Habenicht AJR, von Hundelshausen P. Novel mechanisms and therapeutic targets in atherosclerosis: inflammation and beyond. Eur Heart J. 2023;44(29):2672–81. pmid:37210082
  53. 53. Durham AL, Speer MY, Scatena M, Giachelli CM, Shanahan CM. Role of smooth muscle cells in vascular calcification: Implications in atherosclerosis and arterial stiffness. Cardiovasc Res. 2018;114(4):590–600. pmid:29514202
  54. 54. Fong CHY, Bebien M, Didierlaurent A, Nebauer R, Hussell T, Broide D, et al. An antiinflammatory role for IKKbeta through the inhibition of “classical” macrophage activation. J Exp Med. 2008;205(6):1269–76. pmid:18490491
  55. 55. Van Damme J, De Ley M, Opdenakker G, Billiau A, De Somer P, Van Beeumen J. Homogeneous interferon-inducing 22K factor is related to endogenous pyrogen and interleukin-1. Nature. 1985;314(6008):266–8. pmid:3920526
  56. 56. Wang L, Aschenbrenner D, Zeng Z, Cao X, Mayr D, Mehta M, et al. Gain-of-function variants in SYK cause immune dysregulation and systemic inflammation in humans and mice. Nat Genet. 2021;53(4):500–10. pmid:33782605
  57. 57. Blanda V, Bracale UM, Di Taranto MD, Fortunato G. Galectin-3 in cardiovascular diseases. Int J Mol Sci. 2020;21(23).
  58. 58. Bełtowski J, Jazmroz-Wiśniewska A. Transactivation of ErbB receptors by leptin in the cardiovascular system: Mechanisms, consequences and target for therapy. Curr Pharm Des. 2014;20(4):616–24. pmid:23688012
  59. 59. Ghosh P, Beas AO, Bornheimer SJ, Garcia-Marcos M, Forry EP, Johannson C, et al. A G{alpha}i-GIV molecular complex binds epidermal growth factor receptor and determines whether cells migrate or proliferate. Mol Biol Cell. 2010;21(13):2338–54. pmid:20462955
  60. 60. Sakatsume M, Igarashi KI, Winestock KD, Garotta G, Larner AC, Finbloom DS. The Jak kinases differentially associate with the α and β (accessory factor) chains of the interferon γ receptor to form a functional receptor unit capable of activating STAT transcription factors. J Biol Chem. 1995;270(29):17528–34.
  61. 61. Soehnlein O, Libby P. Targeting inflammation in atherosclerosis - from experimental insights to the clinic. Nat Rev Drug Discov. 2021;20(8):589–610. pmid:33976384
  62. 62. Cao L, Ai Y, Dong Y, Li D, Wang H, Sun K, et al. Bioinformatics analysis reveals the landscape of immune cell infiltration and novel immune-related biomarkers in moyamoya disease. Front Genet. 2023;14:1101612. pmid:37265961
  63. 63. Sukhanov S, Higashi Y, Yoshida T, Mummidi S, Aroor AR, Jeffrey Russell J, et al. The SGLT2 inhibitor Empagliflozin attenuates interleukin-17A-induced human aortic smooth muscle cell proliferation and migration by targeting TRAF3IP2/ROS/NLRP3/Caspase-1-dependent IL-1β and IL-18 secretion. Cell Signal. 2021;77:109825. pmid:33160017
  64. 64. Hodrea J, Saeed A, Molnar A, Fintha A, Barczi A, Wagner LJ, et al. SGLT2 inhibitor dapagliflozin prevents atherosclerotic and cardiac complications in experimental type 1 diabetes. PLoS One. 2022;17(2):e0263285. pmid:35176041
  65. 65. Shin D, Kim S, Lee H, Lee H-C, Lee J, Park H-W, et al. PCSK9 stimulates Syk, PKCδ, and NF-κB, leading to atherosclerosis progression independently of LDL receptor. Nat Commun. 2024;15(1):2789. pmid:38555386
  66. 66. Zeboudj L, Giraud A, Guyonnet L, Zhang Y, Laurans L, Esposito B, et al. Selective EGFR (epidermal growth factor receptor) deletion in myeloid cells limits atherosclerosis-brief report. Arterioscler Thromb Vasc Biol. 2018;38(1):114–9. pmid:29191921
  67. 67. Liu W, Östberg N, Yalcinkaya M. Erythroid lineage Jak2V617F expression promotes atherosclerosis through erythrophagocytosis and macrophage ferroptosis. J Clin Invest. 2022;132(13).
  68. 68. Dotan I, Yang J, Ikeda J, Roth Z, Pollock-Tahiri E, Desai H, et al. Macrophage Jak2 deficiency accelerates atherosclerosis through defects in cholesterol efflux. Commun Biol. 2022;5(1):132. pmid:35169231
  69. 69. Tan L, Xu Q, Shi R, Zhang G. Bioinformatics analysis reveals the landscape of immune cell infiltration and immune-related pathways participating in the progression of carotid atherosclerotic plaques. Artif Cells Nanomed Biotechnol. 2021;49(1):96–107. pmid:33480285