Figures
Abstract
Purpose
Diabetic retinopathy (DR), a major microvascular complication of diabetes and leading global blindness cause, involves uric acid (UA) in its onset and progression. This study aimed to identify UA-related genes (UARGs) in DR and clarify their molecular mechanisms for improved diagnosis and treatment.
Methods
Using public database transcriptome data, key UARGs were screened via differential expression analysis, machine learning, receiver operating characteristic (ROC) analysis, and expression profiling, followed by gene set enrichment analysis (GSEA), immune infiltration analysis, molecular regulatory network construction, and clinical validation with reverse transcription quantitative polymerase chain reaction (RT-qPCR).
Results
MMP15 and FOXK1 were identified as potential key genes, with significantly elevated expression in DR patient blood samples. GSEA showed MMP15 enriched in Apc targets requiring Myc and Smarca2 targets up, and FOXK1 in phosphatidylinositol signaling system and proteasome. Immune infiltration analysis revealed differences in 7 immune cell types, with central memory CD8 T cells and natural killer cells showing the strongest positive correlation; MMP15 was negatively correlated with NK cells, and FOXK1 with central memory CD4 T cells and effector memory CD8 T cells. Twelve transcription factors (e.g., HOXB7, CATA6) jointly targeted both genes.
Citation: Liang H, Yang T, Lu F, Lin L, Liang H (2026) Transcriptome sequencing combined with experimental verification to explore potential key genes related to uric acid in diabetic retinopathy. PLoS One 21(6): e0350132. https://doi.org/10.1371/journal.pone.0350132
Editor: Yalong Dang, Sanmenxia Central Hospital, Henan University of Science and Technilogy, CHINA
Received: February 6, 2026; Accepted: May 10, 2026; Published: June 2, 2026
Copyright: © 2026 Liang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The datasets ANALYZED for this study can be found in the [GeneCards database] [https://www.genecards.org/], and [GEO database] [http://www.ncbi.nlm.nih.gov/geo/].
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Abbreviation:: DR, diabetic retinopathy; UA, uric acid; UARGs, UA-related genes; ROC, receiver operating characteristic; GSEA, gene set enrichment analysis; AUC, area under the curve; RT-qPCR, reverse transcription quantitative polymerase chain reaction; TF, transcription factor; NPDR, non-proliferative diabetic retinopathy; PDR, proliferative diabetic retinopathy; GEO, Gene Expression Omnibus; PRP, anretinal photocoagulation; TNF-α, Tumor necrosis factor-α; IL-6, Interleukin-6; CRP, C-reactive protein; DEGs, differentially expressed genes; FC, fold change; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; LASSO, least absolute shrinkage and selection operator; ssGSEA, single-sample gene set enrichment analysis; PPI, protein-protein interaction; DN, diabetic nephropathy; DPN, diabetic peripheral neuropathy; ECM, extracellular matrix; LPS, lipopolysaccharide; Tem, Effector memory T cells; NKT, natural killer T; NK, natural killer
1. Introduction
Diabetic retinopathy (DR) is one of the most common microvascular complications of diabetes mellitus and a leading cause of vision loss worldwide. Its pathogenesis is driven by chronic hyperglycemia and involves complex processes such as microvascular damage and pathological neovascularization [1,2]. With the continuously increasing prevalence of diabetes globally, DR has become the primary cause of acquired blindness in the working-age population [2,3]. Although current therapeutic approaches, including anti-vascular endothelial growth factor (VEGF) agents, vitrectomy, and panretinal photocoagulation, have been applied clinically, they still have limitations such as the requirement for repeated injections, poor treatment compliance, and unsatisfactory responses in some patients [4–7]. Therefore, in-depth exploration of the key molecular pathways underlying the pathogenesis of DR is crucial for the development of more precise and effective therapeutic strategies.
Uric acid (UA) is the end product of purine metabolism. In addition to serving as a diagnostic marker for gout, epidemiological studies have suggested that elevated UA levels are associated with an increased risk of diabetes mellitus and its complications, including diabetic nephropathy and peripheral neuropathy [3]. However, current epidemiological evidence regarding the relationship between UA and DR remains highly inconsistent. Several studies have demonstrated that high UA levels are an independent risk factor for DR and correlate with the severity of DR [8,9], whereas other studies have found no independent association between the two, and even reported contradictory results [10]. Such inconsistency indicates that the mechanism underlying the role of UA in DR pathogenesis may be highly complex, rather than a simple linear correlation.
Mechanistic studies have shown that uric acid can trigger the release of inflammatory mediators, including tumor necrosis factor-α (TNF-α), interleukin-6 (IL-6), and C-reactive protein (CRP) [11], which subsequently promote the recruitment and activation of immune cells, thereby establishing a self-sustaining inflammatory amplification cascade. Consistent with this, recent meta-analyses have indicated that elevated IL-6 levels are associated with an increased risk of DR [12]. Furthermore, in vitro studies have revealed that under high-glucose conditions, increased uric acid levels promote the expression of inflammatory mediators and enhance the activity of the Notch signaling pathway in retinal endothelial cells [13]. In addition, hyperuricemia induces endothelial dysfunction by impairing endothelial cell function and upregulating vascular endothelial growth factor (VEGF) expression [14,15]. Nevertheless, the precise role of uric acid in DR pathogenesis remains elusive due to inconsistent epidemiological evidence and a lack of systematic bioinformatics investigations. Therefore, the present study aims to elucidate the potential key genes and regulatory pathways linking uric acid to DR through integrated analysis, so as to provide new insights into the association between UA and DR and identify potential molecular targets for targeted therapy of DR.
In this study, we leveraged publicly available transcriptome datasets to identify UA-related hub genes in DR through differential expression analysis, machine-learning-based feature selection, and expression validation. We then performed gene set enrichment analysis (GSEA), immune cell infiltration profiling, and GeneMANIA-based functional association analysis, and further constructed putative regulatory networks around these genes. Our goal was to elucidate plausible molecular mechanisms linking UA-associated inflammatory and innate immune activity to retinal vascular pathology in DR, and to provide a theoretical foundation for future translational and clinical studies.
2. Materials and methods
2.1. Data collection
Gene expression information of the GSE221521 and GSE185011 datasets related to DR was retrieved from the Gene Expression Omnibus (GEO). Specifically, GSE221521 (training set) (platform: GPL24676) included 69 DR patient samples and 50 control blood samples [1]. GSE185011 (validation set) (platform: GPL24676) included 5 DR patient samples and 5 control blood samples [16]. Both datasets were processed using official standardized pipelines. Specifically, the GEO official pipeline utilized HISAT2 to align reads to the GRCh38/hg38 reference genome, and featureCounts was used to obtain raw counts at the gene level. The raw counts were then normalized using the FPKM (Fragments Per Kilobase of transcript per Million mapped fragments) method: FPKM = C × 10⁹/ (N × L), where C represented the number of fragments for a gene, N represented the total number of aligned fragments (sequencing depth) for the sample, and L represented the total length of exons (bp) for the gene. Both datasets were processed using the same normalization pipeline, and the official FPKM matrix files were directly downloaded for subsequent analyses. Detailed clinical information for both datasets is provided in S1 Table. In addition, for independent external validation, a new dataset, GSE94019 (platform: GPL11154), which included 9 DR patient samples and 4 control samples, was introduced. For literature reference, the keyword “Uric Acid” was searched in the GeneCards database, and 3,806 uric acid-related genes (UARGs) were obtained [11] (S2 Table).
2.2. Differential expression analysis
In the GSE221521 dataset, differential expression analysis between DR patients and controls was performed using the DESeq2 package (v 1.40.2) [17] with thresholds of p < 0.05 and |log₂FC| > 0.5. The top 10 up- and down-regulated genes, ranked by log₂FC values, were visualized in a volcano plot via the EnhancedVolcano package (v 1.18.0) (https://github.com/kevinblighe/EnhancedVolcano) and an expression heatmap using the ComplexHeatmap package (v 2.16.0) [18].
2.3. Identification of candidate genes
The overlap of DEGs and UARGs (candidate genes) was visualized with the Venn Diagram package (v 1.7.3) [19]. Functional annotation of candidate genes was then conducted using Gene Ontology (GO) enrichment analysis—covering BP, CC, and MF categories—and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis for pathway annotation of all identified proteins. GO and KEGG enrichment analyses of candidate genes were executed via the clusterProfiler package (v 4.15.0.003) [20] (p-adjust < 0.05, Benjamini & Hochberg) with gene annotation derived from org.Hs.e.g.,db (v 3.17.0) [21]. The results were visualized via the ggplot2 package (v 3.5.1) [22]. Meanwhile, candidate genes were submitted to the STRING database to construct an interaction network (interaction score > 0.4), which was subsequently visualized in Cytoscape software (v 3.10.2) [23].
2.4. Identification of potential key genes
Machine learning, a robust data analysis methodology, has been extensively applied to genetic feature screening. To screen the obtained feature genes, an algorithm was used: the Least Absolute Shrinkage and Selection Operator (LASSO) regression. Based on the GSE221521 dataset, LASSO regression was executed via the glmnet package (v 4.1-8) [24], and using 10-fold cross-validation, the optimal λ value was determined to screen feature genes at the minimum error threshold.
In an effort to evaluate how feature genes differentiate DR blood samples from control ones, ROC analysis was conducted using the pROC package (v 1.18.5) [25]. on the feature genes in the DR and control blood samples of the GSE221521 dataset. The area AUC value was calculated. Feature genes (AUC > 0.7) for distinguishing DR blood samples were selected from the GSE221521 dataset, and their ROC curves were displayed.
For additional confirmation of the feature genes’ expression levels, a Wilcoxon test (p < 0.05) was used. Feature gene expression was validated across all samples from the GSE221521 and GSE185011 datasets. Using the ggplot2 package (v 3.5.1), boxplots were constructed to present the outcomes. Based on significant differential expression and coherent trends across both datasets, a core set of genes was defined for all downstream analyses. To further ensure the generalizability of the results, the cohens_d function from the R package rstatix was used to perform effect size testing on the validation dataset. The magnitude of the difference between the two groups was assessed based on the absolute value of Cohen’s d, with the following criteria: < 0.2 indicating a very small effect, 0.2-0.5 indicating a small effect, 0.5-0.8 indicating a medium effect, and ≥ 0.8 indicating a large effect.
2.5. GSEA
For a deeper understanding of the biological functions of potential key genes in all GSE221521 dataset samples, the reference gene set was derived from the KEGG pathway gene set c2.cp.all.v2022.1.Hs.symbols.gmt (provided by the MsigDB). The stats package (v 4.3.1) was employed for the purpose of conducting Spearman correlation analysis for each key gene against all other genes, respectively, to obtain the correlation coefficients of the genes. These coefficients were then sorted from largest to smallest, and GSEA was executed via the clusterProfiler package (v 4.15.0.003). The GSEA results were visualized with the enrichplot package (v 1.20.3) [26], applying a statistical threshold of p-adjust < 0.05 (Benjamini & Hochberg).
2.6. Immune infiltration analysis
Using the single-sample Gene Set Enrichment Analysis (ssGSEA) algorithm in the GSVA package (v 0.99.0) [27], the infiltration abundances of 28 immune cell types were evaluated in the GSE221521 dataset to compare DR and control blood samples [28]. Subsequently, infiltration differences of 28 immune cells between DR and control groups were assessed using the Wilcoxon test (p < 0.05) and visualized by boxplots generated with the ggplot2 package (v 3.5.1). Next, pairwise correlations involving differentially infiltrated immune cells and potential key genes across all GSE221521 samples were assessed using the Spearman method from the psych package (v 2.4.6.26) [29]. Significant correlations (|r| > 0.3, p < 0.05) were plotted using the ggplot2 package (v 3.5.1).
2.7. GeneMANIA analysis
The potential key genes were submitted to the GeneMANIA database (species: Homo sapiens) to construct a protein-protein interaction (PPI) network with functionally similar genes, thereby enabling functional characterization.
2.8. Molecular regulatory network analysis
An mRNA-TF regulatory network was established, and the transcriptional regulatory effects of transcription factors (TFs) on potential key genes were analyzed to reveal potential regulatory mechanisms. Furthermore, interaction information between TFs and their target genes was extracted from the ChIPBase database. Transcription factors with a combined count of “discovered samples (upstream)” and “discovered samples (downstream)” greater than 12 were designated as key TFs. Data was organized and filtered using the tidyverse package (v 2.0.0) [30], and construction of the regulatory network was executed using igraph, followed by visualization with Cytoscape software (v 3.10.2).
Correlation analysis between purine metabolism‑related genes and FOXK1 was performed To investigate whether FOXK1 exhibited co‑expression relationships with genes in the purine metabolism pathway, eight genes were selected, including the core purine synthesis enzymes (PPAT, ADSL, ATIC, IMPDH1/2), core degradation enzymes (ADA, XDH), and transporters/receptors (ENTPD1, CD39). Based on the gene expression matrix of all samples in the GSE221521 dataset, the Pearson correlation coefficients between FOXK1 and each of the above genes were calculated using the corr.test function of the R package psych (version 2.4.6.26), and the Benjamini‑Hochberg method was applied for multiple hypothesis testing correction. The significance threshold was set at adjusted P < 0.05 and |cor| > 0.3.
2.9. External validation of potential key genes and RT-qPCR validation were performed
To further validate the expression stability of the potential potential key genes, the GSE94019 dataset was downloaded from the GEO database. External validation was conducted using a t-test to compare the expression differences of the potential key genes between the two groups, and the magnitude of the difference between the two groups was assessed based on the absolute value of Cohen’s d. To validate the expression of potential key genes in clinical samples against the bioinformatics predictions, blood samples were collected from 5 healthy controls and 5 DR patients who were recruited from Guangxi Hospital, The First Affiliated Hospital, Sun Yat-sen University, Nanning, Guangxi Zhuang Autonomous Region, China. Clinical information of the study subjects was presented in S3 Table. The recruitment period of research subjects was from 20/06/2025 to 31/07/2025. The study was conducted in accordance with the Declaration of Helsinki. This study was approved by the Guangxi Hospital, The First Affiliated Hospital, Sun Yat-sen University, Nanning, Guangxi Zhuang Autonomous Region, China. The approval number and date of approval are as follows: [KY-IIT-2025–008] and [20 June 2025]. All patients provided written informed consent when clinical samples were collected for RT-qPCR experiments to ensure that the research process was in accordance with ethical norms and that the patients’ rights and wishes were fully respected. Total RNA from 10 samples was extracted (Trizol reagent, Vazyme), reverse-transcribed (HP All-in-one kit, Yungeng), and analyzed by RT-qPCR (2x Universal Blue SYBR Green Master Mix, Servicebio) with GAPDH as normalizer. The 2-ΔΔCt method and t-test (p < 0.05) in GraphPad Prism 10 were used for quantification and statistical comparison, respectively. Primer sequences are listed in S4 Table.
3. Results
3.1. Identification of 730 DEGs
A total of 730 DEGs (439 upregulated and 291 downregulated) were identified in DR blood samples compared to controls, based on thresholds of p < 0.05 and |log2FC| > 0.5 (Fig 1A-B) (S5 Table).
(A) Volcano plot of DEGs. Yellow dots represent up-regulated DEGs in the DR group, blue dots represent down-regulated DEGs. The top 10 up- and down-regulated DEGs are labeled. (B) Heatmap of DEGs. The heatmap displays the expression trends of the top 10 up- and down-regulated genes. Yellow indicates higher expression, blue indicates lower expression.
3.2. Acquisition of 103 candidate genes
Next, the intersection of 730 DEGs and 3,806 UARGs was taken, and a total of 103 candidate genes related to UA in DR were obtained for subsequent analysis (Fig 2A). GO enrichment analyses showed that 103 candidate genes were enriched in 465 GO signaling pathways (p-adjust < 0.05). Among them, there were 436 BP pathways, including nephron development, regulation of macrophage derived foam cell differentiation, etc. There were 11 CC pathways, such as collagen-containing extracellular matrix, and 18 MF pathways, such as protein-lipid complex binding (Fig 2B) (S6 Table). These genes were enriched in 186 KEGG signaling pathways (p-adjust < 0.05), such as proximal tubule bicarbonate reclamation (Fig 2C) (S7 Table).
(A) Venn diagram showing the candidate genes. (B) GO enrichment results. The y-axis labels the GO terms, and the x-axis represents the number of genes enriched in the corresponding term (Hypergeometric test). (C) KEGG enrichment results (Hypergeometric test). (D) PPI network. Red circles represent proteins, and connecting lines represent interactions between proteins.
A PPI network with 69 nodes and 193 edges (interaction score > 0.4) was subsequently constructed from the candidate genes. These interactions included the interaction between MMP15 and MMP9, as well as between HCFC1 and FOXK1, etc. (Fig 2D).
3.3. Identification of 2 potential key genes
Using the Lasso algorithm, the optimal λ (log(lambda.min)) value was −2.09, and a total of 9 feature genes were screened, namely ESR1, MMP15, TRPM2, H6PD, NUDC, CC2D2A, NDUFAF2, FOXK1, and SLC25A34 (Fig 3A-B).
(A) LASSO coefficient profile of candidate genes. Each curve represents a gene. The x-axis is the log(λ), and the y-axis is the coefficient. (B) Optimal lambda (λ) selection in the LASSO model. (C) ROC curves (GSE221521 dataset). A red curve above the grey line with an AUC value greater than 0.7 indicates good classification performance. (D-E) Expression levels for screening biomarkers (T test). (D) GSE221521 dataset, (E) GSE185011 dataset. ns, not significant; *, p < 0.05; **, p < 0.01; ***, p < 0.001.
Subsequently, ROC analysis of the nine feature genes in DR versus control samples revealed that all exhibited AUC values greater than 0.7. Such as ESR1 had an AUC of 0.771, and FOXK1 had an AUC of 0.79. These results demonstrated that these feature genes exhibited moderate discriminatory ability and could distinguish DR from control blood samples to some extent (Fig 3C).
Finally, Wilcoxon test analysis confirmed significant differential expression for all nine feature genes in the GSE221521 dataset (p < 0.001), following validation across the GSE221521 and GSE185011 datasets (Fig 3D). By contrast, significant differential expression in the GSE185011 dataset was limited to two genes, MMP15 and FOXK1 (p < 0.05) (Fig 3E), and their expression trends were consistent with those in GSE221521. Both genes were significantly upregulated. Therefore, MMP15 and FOXK1 were identified as Potential key genes. Further Cohen’s d effect size analysis on the validation dataset showed that the effect sizes of the differences between the two groups for MMP15 (Cohen’s d = −1.57) and FOXK1 (Cohen’s d = −1.74) were both at a “very large” level, which to some extent demonstrated the generalizability of the results.
3.4. Biological pathways of 2 potential key genes
GSEA showed that MMP15 was enriched in a total of 740 pathways, such as breast cancer 16p13 amplicon, Apc targets require Myc, and Smarca2 targets up. FOXK1 was enriched in 833 pathways, including EBNA1 anticorrelated, phosphatidylinositol signaling system, and proteasome (p < 0.05) (Fig 4) (S8 Table). These results suggested that these pathways might be associated with DR.
3.5. Immune microenvironment of 2 potential key genes
Assessment of 28 immune cell types in the GSE221521 dataset (Fig 5A) identified 7 with significantly altered infiltration (Wilcoxon test, p < 0.05), including downregulated activated CD4 and CD8 T cells in DR (Fig 5B). A separate correlation analysis highlighted a strong positive interaction between central memory CD8 T cells and natural killer cells (cor = 0.55, p = 6.23e-11) (Fig 5C) (S9 Table) (S10 Table). In addition, a statistically significant negative association was detected between MMP15 and natural killer cells (cor = −0.35, p = 9.50e-05), and FOXK1 was significantly negatively correlated with central memory CD4 T cells (cor = −0.38, p = 2.05e-05) and effector memory CD8 T cells (cor = −0.39, p = 1.24e-05). (Fig 5D) (S11 Table) (S12 Table). These findings suggested that FOXK1 might be associated with the infiltration levels of central memory CD4 T cells and effector memory CD8 T cells, and might participate in the disease process of DR.
(A) Relative proportions of immune infiltration in the DR group vs. the Control group. Different colors represent the ssGSEA scores for various immune cells. (B) Differences in immune cell infiltration proportions between the DR group and the Control group. (C) Correlations among differentially infiltrated immune cells. Blue indicates negative correlation, red indicates positive correlation (wilcox test). (D) Correlation between biomarkers and differentially infiltrated immune cells. ns, not significant; *, p < 0.05; **, p < 0.01; ***, p < 0.001; ****, p < 0.0001 (wilcox test).
3.6. GeneMANIA of 2 potential key genes
The interaction network among MMP15, FOXK1, and other related genes was constructed using the GeneMANIA database. The results showed that the relationships among MMP15, FOXK1, and other genes primarily involved physical interactions, shared protein domains, and pathways (Fig 6).
3.7. Regulatory network of 2 potential key genes
An mRNA-TF network with 214 nodes and 224 edges was constructed. 76 TFs targeted MMP15, and 148 TFs targeted FOXK1. Among these, a total of 12 TFs were found to co-regulate both potential key genes, including HOXB7 and GATA6 targeting both MMP15 and FOXK1 (Fig 7). At the transcriptional level, these TFs, which had a regulatory effect on MMP15 and FOXK1, may influence the development of DR.
Green nodes represent biomarker genes, red nodes represent TFs.
3.8. Correlation analysis between FOXK1 and purine metabolism-related genes was performed
To investigate whether FOXK1 exhibited co‑expression relationships with genes in the purine metabolism pathway, the expression correlations between FOXK1 and eight purine metabolism‑related genes in the GSE221521 dataset were analyzed. The Pearson correlation analysis revealed two significant correlation pairs: FOXK1 was significantly positively correlated with IMPDH1 (cor = 0.3, P = 0.00091) and significantly negatively correlated with ATIC (cor = −0.3, P = 0.00079). These results suggested that FOXK1 might participate in the purine metabolism process by regulating the expression of IMPDH1 and ATIC, thereby affecting uric acid levels (S1 Fig).
3.9. High expression of MMP15 and FOXK1 in DR patient blood samples
To further validate the reliability of the two screened potential key genes (MMP15 and FOXK1), an independent dataset GSE94019 was analyzed in this study. The results showed that, consistent with the findings in the training set, the expression levels of both genes were significantly higher in DR samples than in control samples (p < 0.05), and their expression trends were consistent with those observed in the GSE221521 and GSE185011 datasets (S2 Fig). Furthermore, Cohen’s d effect size analysis indicated that the effect sizes of the differences between the two groups for MMP15 (Cohen’s d = −1.34) and FOXK1 (Cohen’s d = −2.13) were both at a “very large” level, suggesting that the observed differences were not only statistically significant but also of substantial practical significance.
Clinical samples were used for RT-qPCR validation of MMP15 and FOXK1. The results confirmed that MMP15 and FOXK1 were significantly upregulated in DR patient blood samples (p < 0.05) (Fig 8A-B). In summary, the concordance between RT-qPCR results and bioinformatics data collectively verified key gene expression patterns in DR. These findings suggested that the expression patterns of the key genes were reproducible in clinical samples and provided a reference for further investigation of their roles in DR.
(A) MMP15, (B) FOXK1. *, p < 0.05; **, p < 0.01.
4. Discussion
In the present study, we identified MMP15 and FOXK1 as potential uric acid–related potential key genes in DR through machine learning–based screening, ROC curve evaluation, and expression validation. Both genes were significantly upregulated in DR samples and exhibited moderate-to-high discriminatory ability (AUC > 0.7). Functional enrichment analysis indicated that both genes were enriched in pathways implicated in the pathogenesis of DR. Immune infiltration analysis revealed imbalanced infiltration of multiple immune cell subsets in DR patients, in which MMP15 and FOXK1 were negatively correlated with natural killer cells and memory T cells, respectively. Regulatory network analysis identified 12 transcription factors that could co-target these two potential key genes. RT-qPCR validation confirmed elevated expression levels of MMP15 and FOXK1 in the peripheral blood of DR patients, supporting their potential value as diagnostic biomarkers.
MMP15 (MT2‑MMP) is a membrane‑anchored matrix metalloproteinase capable of degrading the extracellular matrix (ECM) and regulating growth factor signaling [31]. These characteristics are consistent with the pathological alterations in retinal vascular diseases: MMP15 expression is upregulated in the vitreous humor of patients with proliferative diabetic retinopathy (PDR) and is thought to contribute to disease progression [32]. Ranibizumab, an anti‑VEGF agent, can reshape the systemic expression pattern of MMP‑ and TIMP‑related genes in patients with neovascular age‑related macular degeneration, and MMP15 has been proposed to be potentially involved in this therapeutic response [33]. Moreover, MMP15 has been identified as a hub gene in glaucoma, suggesting its broader pathogenic relevance in ocular vascular and neurodegenerative disorders [34]. In the present study, MMP15 was significantly upregulated in the peripheral blood transcriptome of patients with DR, consistently appeared as a high‑weight feature gene in machine learning‑based screening, and its elevated expression was further confirmed by RT‑qPCR. These findings indicate that MMP15 represents a detectable and reproducible peripheral blood biomarker with potential translational application value.
According to existing literature, although the roles of MMP‑2 and MMP‑9 in DR have been extensively documented, MMP15 has not been established as a key molecule linking uric acid to DR. Combined with our enrichment analysis, the expression signature of MMP15 is closely associated with ECM remodeling, increased endothelial permeability, and transendothelial migration of inflammatory cells — processes that underlie early blood‑retinal barrier breakdown, exudation, and immune cell infiltration into retinal tissue in DR [31,32,35]. We hypothesize that the inflammatory microenvironment induced by elevated uric acid may upregulate MMP15 expression [36,37]. As a membrane‑type matrix metalloproteinase, MMP15 can activate downstream MMP‑2 [38], and this cascade may constitute a potential molecular pathway connecting uric acid metabolic disturbance to DR vasculopathy.
FOXK1 (Forkhead Box K1) is a member of the forkhead box transcription factor family and participates in key cellular processes including cell cycle regulation, DNA damage response, metabolic reprogramming, angiogenesis, and apoptosis [39,40]. The VEGF/VEGFC signaling axis serves as a core driver of pathological angiogenesis and vascular leakage in DR [41]. Recent evidence indicates that upregulation of FOXK1 enhances VEGFC expression [42]. Conversely, inhibition of FOXK1 suppresses the PI3K/Akt/mTOR pathway and subsequent cell proliferation and angiogenesis [43]. These findings position FOXK1 as a relevant regulator at the interface between metabolism and vascular remodeling. Notably, FOXK1 also exhibits similar pathogenic roles in other microvascular complications. For instance, FOXK1 knockdown has been shown to alleviate glomerular endothelial injury and renal fibrosis in mice [44]. In the present study, FOXK1 was consistently upregulated in two independent peripheral blood transcriptome datasets of DR patients, and its elevated expression was further validated by RT‑qPCR. In addition, correlation analysis revealed that FOXK1 was significantly positively correlated with IMPDH1, a key enzyme in purine nucleotide synthesis, while negatively correlated with ATIC, which participates in purine nucleotide interconversion. This suggests a potential regulatory link between FOXK1 and purine metabolism at the transcriptional level. Collectively, these findings imply that FOXK1 may participate in metabolic reprogramming and angiogenesis regulation in DR patients. This hypothesis warrants further mechanistic investigation for validation.
Regarding pathway enrichment analysis, the Gene Set Enrichment Analysis (GSEA) results for MMP15 and FOXK1 revealed distinct functional orientations. The gene sets associated with MMP15 were enriched in the lipopolysaccharide (LPS) response pathway, which is linked to the activation of innate immune cells. Previous studies have demonstrated that LPS can affect the function of retinal microvascular endothelial cells by activating the TLR4 signaling pathway [45], suggesting that MMP15 may be associated with the innate immune response and thereby indirectly participate in vascular lesions in DR. In contrast, the gene sets related to FOXK1 were enriched in the phosphatidylinositol signaling system and proteasome pathways. As an upstream regulatory component of the PI3K/Akt signaling axis, the phosphatidylinositol signaling system and its associated axis have been confirmed to be involved in DR-related pathological angiogenesis [46]. The proteasome pathway, in turn, is involved in maintaining intracellular protein homeostasis [47]. As a metabolism-related transcription factor [40], the enrichment profile of FOXK1 indicates that this gene may participate in the formation of retinal neovascularization during the progression of DR by influencing intracellular metabolic signal transduction and protein homeostasis. It should be emphasized that the above speculations are mainly based on indirect evidence from existing literature and require further functional experiments for validation.
In the present study, we observed a negative correlation between MMP15 expression and natural killer [48] cell infiltration level. NK cells are thought to exert potential protective functions in DR [49]. As a membrane-type matrix metalloproteinase, MMP15 participates in extracellular matrix remodeling and the processing and release of inflammatory factors [50]. Combined with the above evidence, we hypothesize that high MMP15 expression may indirectly modulate the recruitment or activity of NK cells by shaping the local inflammatory cytokine network, thereby attenuating their potential protective effects. This hypothesis provides a plausible molecular entry point for understanding the involvement of MMP15 in immune regulation in DR.In addition, FOXK1 expression was negatively correlated with the infiltration levels of central memory CD4 ⁺ T cells and effector memory CD8 ⁺ T cells. As a key transcription factor involved in metabolic reprogramming and cellular stress responses, FOXK1 contributes to insulin regulation and mitochondrial-dependent cellular metabolism and function [40]. The differentiation, maintenance, and migratory capacity of memory T cells are highly dependent on metabolic status [51]. We therefore speculate that elevated FOXK1 expression may alter the metabolic fitness of T cells, thereby affecting the distribution or survival of specific memory T-cell subsets in peripheral blood and further contributing to DR-related systemic immune alterations. However, the above speculations are mainly based on statistical associations and indirect evidence from existing literature, and it remains unclear whether these immune changes act as drivers of DR progression or are concomitant phenomena secondary to systemic metabolic disorders. Future studies incorporating retinal tissue samples, longitudinal cohorts, and functional experiments are warranted to further clarify the specific roles of MMP15 and FOXK1 in immune regulation and their association with the pathological mechanisms of DR.
Of particular note, as a local disease centered on retinal microvascular lesions, ideal molecular biomarker studies for DR should ideally be derived directly from ocular tissue. However, numerous recent studies have demonstrated that DR is not an isolated ocular disorder, but is associated with systemic metabolic disturbances, chronic low-grade inflammation, and imbalanced immune status [52]. Moreover, an increasing number of studies have employed peripheral blood samples to identify potential key genes or biomarkers related to DR, which have shown favorable diagnostic and risk-prediction value [53,54]. Therefore, the use of peripheral blood data for biomarker discovery offers unique advantages in clinical translation, including noninvasiveness, easy accessibility, and convenience for dynamic monitoring. Nevertheless, we acknowledge that peripheral blood gene expression profiles may not fully and precisely reflect local molecular events in the retina. The precise localization, expression changes, and functional mechanisms of the currently identified potential key genes in retinal tissue still require further experimental validation. Accordingly, in future studies we will combine animal models and retinal tissue specimens to further investigate the exact roles of the screened potential key genes in local ocular lesions, with the aim of more comprehensively elucidating their pathological significance.
Taken together, our findings support a preliminary working model: uric acid-driven chronic low-grade inflammation and metabolic stress may be linked to MMP15-mediated alterations in vascular permeability and FOXK1-related PI3K/Akt signaling, which collectively contribute to the pathogenesis of DR. This model warrants further validation by functional experiments. Nevertheless, several limitations should be acknowledged. The sample size remains relatively small, and experimental validation is currently restricted to peripheral blood. We lack direct functional evidence from single-cell spatial resolution, metabolic profiling, as well as models including endothelial barrier assays, angiogenesis assays, and in vivo intervention studies. Furthermore, due to missing information, the included samples lacked comparisons of key clinical characteristics such as diabetes duration, DR stage, blood glucose, and uric acid levels, which may introduce certain bias in interpreting gene expression differences. Therefore, the precise causal positioning of MMP15 and FOXK1 in the DR pathological cascade remains to be determined. Future work will focus on expanding the sample size and recruiting large independent cohorts with complete clinical information, to further clarify the independent expression regulatory relationships of MMP15 and FOXK1 in DR by adjusting for confounders through multivariate analysis. Meanwhile, future studies will determine whether MMP15 directly regulates blood‑retinal barrier permeability, whether FOXK1 amplifies pathological neovascularization via the PI3K/Akt/mTOR pathway, and whether these two genes form a synergistic regulatory loop under uric acid-driven chronic inflammation. Addressing these issues may provide a molecular basis for early risk stratification, progression control, and individualized intervention in DR.
Supporting information
S1 Table. Clinical Information Table of the Dataset.
https://doi.org/10.1371/journal.pone.0350132.s001
(XLSX)
S3 Table. Clinical Sample Clinical Information Form.
https://doi.org/10.1371/journal.pone.0350132.s003
(DOCX)
S4 Table. Primer sequences used for RT-qPCR validation of Potential key genes.
https://doi.org/10.1371/journal.pone.0350132.s004
(XLS)
S5 Table. DEGs identified in the GSE221521 dataset.
https://doi.org/10.1371/journal.pone.0350132.s005
(XLSX)
S6 Table. GO enrichment analysis of candidate genes.
https://doi.org/10.1371/journal.pone.0350132.s006
(XLSX)
S7 Table. KEGG pathway enrichment analysis of candidate genes.
https://doi.org/10.1371/journal.pone.0350132.s007
(XLSX)
S8 Table. GSEA results for Potential key genes.
https://doi.org/10.1371/journal.pone.0350132.s008
(XLSX)
S9 Table. Correlation matrix of differentially infiltrated immune cell types in the GSE221521 dataset.
https://doi.org/10.1371/journal.pone.0350132.s009
(XLSX)
S10 Table. P-value matrix for immune cell type correlations in the GSE221521 dataset.
https://doi.org/10.1371/journal.pone.0350132.s010
(XLSX)
S11 Table. Correlation coefficients between potential key genes and differentially infiltrated immune cells.
https://doi.org/10.1371/journal.pone.0350132.s011
(XLSX)
S12 Table. P-values for correlations between potential key genes and immune cells.
https://doi.org/10.1371/journal.pone.0350132.s012
(XLSX)
S1 Figure. Analysis results of the correlation between FOXK1 and genes related to purine metabolism.
https://doi.org/10.1371/journal.pone.0350132.s013
(TIF)
S2 Figure. Validation of potential key gene expressions in the GSE94019 dataset.
https://doi.org/10.1371/journal.pone.0350132.s014
(TIF)
Acknowledgments
We would like to express our sincere gratitude to all individuals and organizations who supported and assisted us throughout this research. Special thanks to Yongshun Liang from the Department of Ophthalmology, The First Affiliated Hospital of Guangxi Medical University. In conclusion, we extend our thanks to everyone who has supported and assisted us along the way. Without your support, this research would not have been possible.
References
- 1. Wang N, Ding L, Liu D, Zhang Q, Zheng G, Xia X, et al. Molecular investigation of candidate genes for pyroptosis-induced inflammation in diabetic retinopathy. Front Endocrinol (Lausanne). 2022;13:918605. pmid:35957838
- 2. Ling F, Zhang C, Zhao X, Xin X, Zhao S. Identification of key genes modules linking diabetic retinopathy and circadian rhythm. Front Immunol. 2023;14:1260350. pmid:38124748
- 3. Guo Y, Liu S, Xu H. Uric Acid and Diabetic Retinopathy: A Systematic Review and Meta-Analysis. Front Public Health. 2022;10:906760. pmid:35712295
- 4. Hashida N, Nishida K. Recent advances and future prospects: Current status and challenges of the intraocular injection of drugs for vitreoretinal diseases. Adv Drug Deliv Rev. 2023;198:114870. pmid:37172783
- 5. Rush RB, Gomez PL, Rush SW, Gomez Bastar P. INTERNAL LIMITING MEMBRANE PEELING IN PATIENTS UNDERGOING VITRECTOMY FOR TRACTIONAL RETINAL DETACHMENT SECONDARY TO DIABETIC RETINOPATHY. Retina. 2023;43:1282–90.
- 6. Everett LA, Paulus YM. Laser Therapy in the Treatment of Diabetic Retinopathy and Diabetic Macular Edema. Curr Diab Rep. 2021;21(9):35. pmid:34487257
- 7. Liu J, Zhang F, Shi X. The role of metal nanocarriers, liposomes and chitosan-based nanoparticles in diabetic retinopathy treatment: A review study. Int J Biol Macromol. 2025;291:139017. pmid:39708854
- 8. Hou L, Shi Y, Wang S, Chen Q, Li Q, Zhao M, et al. Associations of serum uric acid level with diabetic retinopathy and albuminuria in patients with type 2 diabetes mellitus. J Int Med Res. 2020;48(12):300060520963980. pmid:33334216
- 9. Chen D, Sun X, Zhao X, Liu Y. Associations of serum uric acid and urinary albumin with the severity of diabetic retinopathy in individuals with type 2 diabetes. BMC Ophthalmol. 2020;20(1):467. pmid:33256661
- 10. Xia Q, Zhang S-H, Yang S-M, Zhu X-L, Su S, Hu A-P, et al. Serum uric acid is independently associated with diabetic nephropathy but not diabetic retinopathy in patients with type 2 diabetes mellitus. J Chin Med Assoc. 2020;83(4):350–6. pmid:32132382
- 11. Zhuang C, Liu Y, Gu R, Du S, Long Y. Prognostic signature of colorectal cancer based on uric acid-related genes. Heliyon. 2023;9(12):e22587. pmid:38213580
- 12. Mason RH, Minaker SA, Lahaie Luna G, Bapat P, Farahvash A, Garg A, et al. Changes in aqueous and vitreous inflammatory cytokine levels in proliferative diabetic retinopathy: a systematic review and meta-analysis. Eye (Lond). 2022;:10.1038/s41433-022-02127–x. pmid:35672457
- 13. Zhu D-D, Wang Y-Z, Zou C, She X-P, Zheng Z. The role of uric acid in the pathogenesis of diabetic retinopathy based on Notch pathway. Biochem Biophys Res Commun. 2018;503(2):921–9. pmid:29932924
- 14. Xiong Q, Liu J, Xu Y. Effects of Uric Acid on Diabetes Mellitus and Its Chronic Complications. Int J Endocrinol. 2019;2019:9691345. pmid:31737070
- 15. Nie C, Ma Q, Liu C, Chen L, Wang C, Hou X. Associations of serum uric acid to eGFR ratio with diabetic retinopathy in individuals with type 2 diabetes. Sci Rep. 2025;15(1):16625. pmid:40360591
- 16. Ye C, Fu Y, Zhou X, Zhou F, Zhu X, Chen Y. Identification and validation of NAD+ metabolism-related biomarkers in patients with diabetic peripheral neuropathy. Front Endocrinol (Lausanne). 2024;15:1309917. pmid:38464965
- 17. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. pmid:25516281
- 18. Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32(18):2847–9. pmid:27207943
- 19. Chen H, Boutros PC. VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics. 2011;12:35. pmid:21269502
- 20. Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z. ClusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb). 2021;2:100141.
- 21. Feng J, Hu Y, Peng P, Li J, Ge S. Potential biomarkers of aortic dissection based on expression network analysis. BMC Cardiovasc Disord. 2023;23:147.
- 22. Gustavsson EK, Zhang D, Reynolds RH, Garcia-Ruiz S, Ryten M. ggtranscript: an R package for the visualization and interpretation of transcript isoforms using ggplot2. Bioinformatics. 2022;38(15):3844–6. pmid:35751589
- 23. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. pmid:14597658
- 24. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010;33(1):1–22. pmid:20808728
- 25. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77. pmid:21414208
- 26. Li H, Gao L, Du J, Ma T, Ye Z, Li Z. Differentially expressed gene profiles and associated ceRNA network in ATG7-Deficient lens epithelial cells under oxidative stress. Front Genet. 2022;13:1088943. pmid:36568386
- 27. Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics. 2013;14:7. pmid:23323831
- 28. Yu M, Peng J, Lu Y, Li S, Ding K. Silencing immune-infiltrating biomarker CCDC80 inhibits malignant characterization and tumor formation in gastric cancer. BMC Cancer. 2024;24(1):724. pmid:38872096
- 29. Robles-Jimenez LE, Aranda-Aguirre E, Castelan-Ortega OA, Shettino-Bermudez BS, Ortiz-Salinas R, Miranda M, et al. Worldwide Traceability of Antibiotic Residues from Livestock in Wastewater and Soil: A Systematic Review. Animals (Basel). 2021;12(1):60. pmid:35011166
- 30. Tao B, Wang Z, Xie D, Cui H, Zhao B, Li J, et al. Exploring the prognostic analysis of autophagy and tumor microenvironment based on monocyte cells in lung cancer. Aging (Albany NY). 2024;16(13):10931–42. pmid:38942606
- 31. Wu M-H, Wu P-R, Hsieh Y-H, Lin C-L, Liu C-J, Ying T-H. Silencing PROK2 Inhibits Invasion of Human Cervical Cancer Cells by Targeting MMP15 Expression. Int J Mol Sci. 2020;21(17):6391. pmid:32887509
- 32. Abu El-Asrar AM, Nawaz MI, Allegaert E, Siddiquei MM, Ahmad A, Gikandi P, et al. Differential Expression and Localization of ADAMTS Proteinases in Proliferative Diabetic Retinopathy. Molecules. 2022;27(18):5977. pmid:36144730
- 33. Chen T, Wan L, Xiao Y, Wang K, Wu P, Li C, et al. Curcumin/pEGCG-encapsulated nanoparticles enhance spinal cord injury recovery by regulating CD74 to alleviate oxidative stress and inflammation. J Nanobiotechnology. 2024;22(1):653. pmid:39443923
- 34. Xie R-L, Nie H-Y, Xu Y-X. Identification of hub genes for glaucoma: a study based on bioinformatics analysis and experimental verification. Int J Ophthalmol. 2023;16(7):1015–25. pmid:37465497
- 35. Zeng X, Li Y, Sun C, Liu Z, Zhao J, Ma X, et al. A New Model for Predicting Nonsentinel Lymph Node Metastasis in Early-Stage Breast Cancer Using MMP15. J Oncol. 2022;2022:8675705. pmid:36035312
- 36. Xiao N, Liu T-L, Li H, Xu H-C, Ge J, Wen H-Y, et al. Low Serum Uric Acid Levels Promote Hypertensive Intracerebral Hemorrhage by Disrupting the Smooth Muscle Cell-Elastin Contractile Unit and Upregulating the Erk1/2-MMP Axis. Transl Stroke Res. 2020;11(5):1077–94. pmid:32323149
- 37. Tao M, Shi Y, Tang L, Wang Y, Fang L, Jiang W, et al. Blockade of ERK1/2 by U0126 alleviates uric acid-induced EMT and tubular cell injury in rats with hyperuricemic nephropathy. Am J Physiol Renal Physiol. 2019;316(4):F660–73. pmid:30648910
- 38. Visse R, Nagase H. Matrix metalloproteinases and tissue inhibitors of metalloproteinases: structure, function, and biochemistry. Circ Res. 2003;92(8):827–39. pmid:12730128
- 39. Li X, Lu J, Liu L, Li F, Xu T, Chen L, et al. FOXK1 regulates malignant progression and radiosensitivity through direct transcriptional activation of CDC25A and CDK4 in esophageal squamous cell carcinoma. Sci Rep. 2023;13(1):7737. pmid:37173384
- 40. Sakaguchi M, Cai W, Wang C-H, Cederquist CT, Damasio M, Homan EP, et al. FoxK1 and FoxK2 in insulin regulation of cellular and mitochondrial metabolism. Nat Commun. 2019;10(1):1582. pmid:30952843
- 41. Yang JM, Park CS, Kim SH, Noh TW, Kim J-H, Park S, et al. Dll4 Suppresses Transcytosis for Arterial Blood-Retinal Barrier Homeostasis. Circ Res. 2020;126(6):767–83. pmid:32078435
- 42. Zheng S, Yang L, Zou Y, Liang J-Y, Liu P, Gao G, et al. Long non-coding RNA HUMT hypomethylation promotes lymphangiogenesis and metastasis via activating FOXK1 transcription in triple-negative breast cancer. J Hematol Oncol. 2020;13(1):17. pmid:32138762
- 43. Long H, Hu Q, Yang L, Li X. FOXK1 regulates apoptosis, migration and angiogenesis in high glucose-induced vascular endothelial cells through p-AKT/AKT signaling pathway. BMC Biotechnol. 2025;25(1):110. pmid:41039445
- 44. Zhang L, Tian M, Zhang M, Li C, Wang X, Long Y, et al. Forkhead Box Protein K1 Promotes Chronic Kidney Disease by Driving Glycolysis in Tubular Epithelial Cells. Adv Sci (Weinh). 2024;11(36):e2405325. pmid:39083268
- 45. Tsioti I, Steiner BL, Escher P, Zinkernagel MS, Benz PM, Kokona D. Endothelial Toll-like receptor 4 is required for microglia activation in the murine retina after systemic lipopolysaccharide exposure. J Neuroinflammation. 2023;20(1):25. pmid:36739425
- 46. Gao Y, Liu H, Yang T, Wang J. Baicalin affects the progression of diabetic retinopathy through the RAGE/PXDN/PI3K/AKT pathway. J Transl Med. 2025;24(1):95. pmid:41408559
- 47. Basler M, Groettrup M. On the Role of the Immunoproteasome in Protein Homeostasis. Cells. 2021;10(11):3216. pmid:34831438
- 48. Braga TT, Forni MF, Correa-Costa M, Ramos RN, Barbuto JA, Branco P. Soluble Uric Acid Activates the NLRP3 Inflammasome. Sci Rep. 2017;7:39884.
- 49. Josef R, Jitka P, Martina Z, Vlastimil K, Ivana S, Lucie DR, et al. Concentration of NK cells after β-glucan and vitamin D supplementation in patients with diabetic retinopathy. Folia Microbiol (Praha). 2020;65(4):755–61. pmid:32248405
- 50. Lee H, Ibrahimi L, Azar DT, Han K-Y. The Role of Membrane-Type 1 Matrix Metalloproteinase-Substrate Interactions in Pathogenesis. Int J Mol Sci. 2023;24(3):2183. pmid:36768503
- 51. Balyan R, Gautam N, Gascoigne NRJ. The Ups and Downs of Metabolism during the Lifespan of a T Cell. Int J Mol Sci. 2020;21(21):7972. pmid:33120978
- 52. Atak Tel BM, Khalid A, Aktas G. Serum C-reactive protein and C-reactive protein derived markers in diabetic microvascular complications. World J Diabetes. 2025;16(12):113237. pmid:41480614
- 53. Liu J, Li J, Tang Y, Zhou K, Zhao X, Zhang J, et al. Transcriptome analysis combined with Mendelian randomization screening for biomarkers causally associated with diabetic retinopathy. Front Endocrinol (Lausanne). 2024;15:1410066. pmid:39022342
- 54. Li X, Long H, Peng S. Identification and Validation of FOXK1-Related Biomarkers in Diabetic Retinopathy. Int J Gen Med. 2026;19:581708. pmid:41716682