Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Integrative identification of hub genes in development of atrial fibrillation related stroke

  • Kai Huang ,

    Contributed equally to this work with: Kai Huang, Xi Fan, Yuwen Jiang

    Roles Data curation, Formal analysis

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Xi Fan ,

    Contributed equally to this work with: Kai Huang, Xi Fan, Yuwen Jiang

    Roles Data curation

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Yuwen Jiang ,

    Contributed equally to this work with: Kai Huang, Xi Fan, Yuwen Jiang

    Roles Methodology, Software

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Sheng Jin,

    Roles Data curation, Formal analysis

    Affiliation Department of Physiology, Hebei Medical University, Shijiazhuang, China

  • Jiechun Huang,

    Roles Data curation, Formal analysis

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Liewen Pang,

    Roles Formal analysis

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Yiqing Wang ,

    Roles Data curation, Formal analysis, Funding acquisition, Software

    drsunxiaotian@126.com (XS); wuyum@yahoo.com (YW); wangyiqing@huashan.org.cn (YW)

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

  • Yuming Wu ,

    Roles Data curation, Formal analysis, Funding acquisition, Investigation, Methodology

    drsunxiaotian@126.com (XS); wuyum@yahoo.com (YW); wangyiqing@huashan.org.cn (YW)

    Affiliation Department of Physiology, Hebei Medical University, Shijiazhuang, China

  • Xiaotian Sun

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology

    drsunxiaotian@126.com (XS); wuyum@yahoo.com (YW); wangyiqing@huashan.org.cn (YW)

    Affiliation Department of Cardiothoracic Surgery, Huashan Hospital of Fudan University, Shanghai, China

Abstract

Background

As the most common arrhythmia, atrial fibrillation (AF) is associated with a significantly increased risk of stroke, which causes high disability and mortality. To date, the underlying mechanism of stroke occurring after AF remains unclear. Herein, we studied hub genes and regulatory pathways involved in AF and secondary stroke and aimed to reveal biomarkers and therapeutic targets of AF-related stroke.

Methods

The GSE79768 and GSE58294 datasets were used to analyze AF- and stroke-related differentially expressed genes (DEGs) to obtain a DEG1 dataset. Weighted correlation network analysis (WGCNA) was used to identify modules associated with AF-related stroke in GSE66724 (DEG2). DEG1 and DEG2 were merged, and hub genes were identified based on protein–protein interaction networks. Gene Ontology terms were used to analyze the enriched pathways. The GSE129409 and GSE70887 were applied to construct a circRNA-miRNA-mRNA network in AF-related stroke. Hub genes were verified in patients using quantitative real-time polymerase chain reaction (qRT-PCR).

Results

We identified 3,132 DEGs in blood samples and 253 DEGs in left atrial specimens. Co-expressed hub genes of EIF4E3, ZNF595, ZNF700, MATR3, ACKR4, ANXA3, SEPSECS-AS1, and RNF166 were significantly associated with AF-related stroke. The hsa_circ_0018657/hsa-miR-198/EIF4E3 pathway was explored as the regulating axis in AF-related stroke. The qRT-PCR results were consistent with the bioinformatic analysis.

Conclusions

Hub genes EIF4E3, ZNF595, ZNF700, MATR3, ACKR4, ANXA3, SEPSECS-AS1, and RNF166 have potential as novel biomarkers and therapeutic targets in AF-related stroke. The hsa_circ_0018657/hsa-miR-198/EIF4E3 axis could play an important role regulating the development of AF-related stroke.

Background

Cardioembolic stroke often results in severe neurological deficits and disability [1]. One of its most common causes is atrial fibrillation (AF). AF increases the overall risk of cerebrovascular events by three- to fivefold, causing one-third of all ischemic strokes [2,3]. AF patients suffering from cardioembolic strokes significantly display worse outcomes than those not suffering from them [4]. The prevalence of AF varies from 0.1% to 4% in different countries [5], causing global increases in stroke and associated disabilities and mortalities [6]. According to the Oxford Vascular Study (OXVASC) trial, AF patients aged over 80 years display a threefold increase in the risk of ischemic stroke despite the introduction of anticoagulants [7]. Although traditional stroke risk scores are widely used in clinical practice, identifying the relationship between AF and secondary stroke and predicting stroke risk in AF patients are still in high demand.

With the development of novel technologies, including microarray and next generation sequencing, the exploration of underlying molecular mechanisms behind AF-related disease becomes possible. Xie, Y et al. reported higher level of miR-641 and miR-30e-5p in serum exosomes in AF related stroke patients than in AF patients [8]. What’s more, the addition of miR-107 and miR-146a-5p to the MACE score increased the predictive performance of AF related stroke [9]. Zou, R et al. found that AF and stroke are related and ZNF566, PDZK1IP1, ZFHX3, and PITX2 genes are significantly associated with novel biomarkers involved in AF-related stroke [10]. By using GWAS analysis, Hsieh, C. S et al first identified deletions in chromosomal regions 1p36.32-1p36.33, 5p15.33, 8q24.3 and 19p13.3 and amplifications in 14q11.2 that were significantly associated with AF-related stroke and GNB1, PRKCZ, and GNG7 genes related to the alpha-adrenergic receptor signaling pathway that play a major role in determining the risk of an AF-related stroke [11].

However, no research has been performed to clarify the hub genes and underlying circRNA-miRNA-mRNA regulatory networks in AF-related stroke.

Serum miR-106 and MYL4 levels are closely related to the prevalence of atrial fibrillation, which can reflect the risk of thromboembolism in patients with atrial fibrillation and can be used as a biological indicator to predict the prognosis of patients with atrial fibrillation.

In this study, we aimed to identify AF-related differentially expressed genes (DEGs) and cardiogenic stroke-related DEGs. The DEGs of AF-related stroke was further identified, and the key modules and hub genes were explored. Finally, we identified differentially expressed miRNA (DEMis) and differentially expressed circRNA (DECircs) and constructed a circRNA-miRNA-mRNA network to elucidate the molecular mechanisms of AF-related stroke.

Methods and materials

Data acquisition

Microarray datasets of stroke and AF patients were downloaded from Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geoprofiles). The mRNA expression profile GSE79768 was performed on Platform GPL570 with paired left atrial (LA) and right atrial specimens obtained from patients with persistent AF (n = 13) or sinus rhythm (n = 13), which described the mechanisms of AF-related LA remodeling, arrhythmogenesis, and thrombogenesis. The mRNA expression profile GSE58294 was performed on Platform GPL570 containing plasma samples from patients with cardioembolic stroke (n = 69) and healthy controls (n = 23). Subjects were analyzed at three time points: less than 3 h, 5 h, and 24 h after stroke. The mRNA expression profile data GSE66724 was based on peripheral blood cells in eight patients with AF and stroke and eight AF subjects without stroke (same anticoagulation and arrhythmia therapy). The miRNA dataset GSE70887 was performed on Platform GPL19546, containing LA biopsies of two sinus rhythm and four persistent AF patients. The circRNA dataset GSE129409 was performed on GPL21825 with heart tissues from three AF patients and three healthy controls. The flowchart of this study is shown in Fig 1.

thumbnail
Fig 1. Flow chart of data preparation, processing, and analysis.

GSE79768 (atrial fibrillation mRNA dataset), GSE58294 (stroke mRNA dataset), GSE66724 (atrial fibrillation related stroke mRNA dataset), GSE70887 (atrial fibrillation miRNA dataset), GSE129409 (atrial fibrillation circRNA dataset). DEMis (differently expressed miRNAs), DECircs (differently expressed circRNAs), ceRNA (competitive endogenous RNA).

https://doi.org/10.1371/journal.pone.0283617.g001

Data screening strategy

The limma package in R [12] was applied to evaluate all the five matrix datasets. The Bayesian method was used to correct for batch effects. If more than one probe mapped to the same gene, the average expression value was used to equal its expression value. The Benjamini-Hochberg method was used to adjust original p-values, and the false discovery rate procedure was used to calculate fold changes (FC). The DEGs were screened by P < 0.05 and log FC > average (abs(log FC)) + 2 * SD (abs(log FC)) [13]. The expression values of the |log FC| > 0.618 and adjusted P < 0.05 were used for filtering AF-DEGs. The |log FC| > 0.62 and P < 0.05 were used to identify stroke-DEGs. The |log FC| > 0.273 and P < 0.05 were used to identify AF-related stroke-DEGs. As per DEMis and DECircs, the cutoff values for log FC were 1.14 and 2.09, respectively. Volcano maps and heatmaps were also drawn by the plot and pheatmap package in the R studio to exhibit all DEGs.

Construction of a weighted gene co-expression network and identification of significant modules

The co-expression network of DEGs was constructed based on the GSE66724 microarray dataset using the R package “weighted correlation network analysis (WGCNA) [14].” The soft-thresholding power was set to 5 when 0.85 was used as the correlation coefficient threshold, and 50 was selected as the minimum number of genes in the modules. To merge possibly similar modules, we defined 0.25 as the threshold for cut height.

Identification of hub genes and efficacy evaluation

DEG1 was confirmed by intersecting AF-DEGs and stroke-DEGs. Then, genes in the module of interest by WGCNA were intersected with AF-related stroke-DEGs, and DEG2 was obtained. Thereafter, DEG1 and DEG2 were merged to obtain AF-related stroke genes (DEG3). The venn plot of these three DEG datasets were showed in S1 Fig. The PPI network of DEG3 were showed in S2 Fig. The ROC curve was then plotted and the AUC was calculated to evaluate the capability of selected genes to distinguish AF-related stroke patients and AF patients without stroke. Only genes with an AUC > 0.8 were considered hub genes. The comparative toxicogenomic database (http://ctdbase.org/) was used to analyze possible relationships between hub genes and nervous or cardiovascular diseases (Fig 1).

Functional enrichment analysis

Functional annotation of Gene Ontology (GO) [15,16] was performed with the ClusterProfiler package [17] in R studio. GO terms (biological processes, molecular functions, and cellular components) with a P < 0.05 were considered significantly enriched. Subsequently, the AmiGO database (v2.0; http://amigogeneontology.org/amigo/) was used to analyze the GO consortium for filtered hub genes, verify their accuracy, and annotate their biological functions.

Protein–protein interaction (PPI) networks

To calculate the interactions between molecules in AF-DEGs, stroke-DEGs, and WGCNA modules, the online database STRING [18] (https://string-db.org/) was applied with a confidence score > 0.4. Cytoscape software (Version 3.7.2; http://cytoscape.org/) was used to visualize and analyze the biological networks and node degrees [19].

Construction of circRNA-miRNA-mRNA network for AF-related stroke

Three online databases, including miRDB (http://www.mirdb.org), miRTarBase (http://mirtarbase.mbc.nctu.edu.tw), and TargetScan (http://www.targetscan.org), were used to predict the targets of DEMis. The targeted genes fulfilling the qualification of at least two databases were selected. Predicted genes were further filtered by matching the hub genes selected above. Then, the miRNA-mRNA pairs were determined. The miRNA targets of DECircs were predicted using the online database circBank (http://www.circbank.cn/), and they were validated by the DEMis selected above. Then, the circRNA-miRNA pairs were determined. Cytoscape was used to visualize this circRNA-miRNA-mRNA network.

Blood sample collection and quantitative real-time polymerase chain reaction

AF patients with or without stroke in the Huashan Hospital were included in this study. The exclusion criteria were coronary artery heart disease, hypertension, diabetes, and obstructive sleep apnea syndrome. Blood samples from three AF patients with stroke and three AF patients without stroke were collected. Samples were immediately preserved in RNALock Reagent (TIANGEN, Beijing, China) to inhibit RNA degradation. This study was approved by the Medical Ethics Committee of Huashan Hospital, Fudan University. All patients participating in this study provided written informed consent before blood collection.

The total RNA was extracted from blood samples with an RNAprep pure Blood Kit (TIANGEN, Beijing, China). Total RNA was quantified with a NanoDrop spectrophotometer 2000 (Thermo Fisher Scientific, Waltham, MA, USA) and then reversely transcribed into cDNA using random primers with the PrimeScript™ RT Reagent Kit (TaKara, Dalian, China). A quantitative real-time polymerase chain reaction (qRT-PCR) was carried out with TB Green™ Premix Ex Taq™ (TaKara, Dalian, China) on a CFX-96 real-time PCR System (Bio-Rad, Shanghai, China). Expression data was normalized by GAPDH, and the 2−ΔΔCT method was applied to analyze the results. All sequences for RNA primers were purchased from Sangon Biotech, Shanghai, China.

Results

Identification of differently expressed genes

After data pre-processing, we screened the DEGs of the datasets GSE79768, GSE58294, GSE66724, GSE70887, and GSE129409. In GSE79768 (mRNA dataset of AF), we identified 160 upregulated and 93 downregulated genes (S1 Table). In GSE58294 (stroke mRNA dataset), we identified 348 upregulated and 445 downregulated genes at <3 h, 555 upregulated and 632 downregulated genes at 5 h, and 525 upregulated and 629 downregulated genes at 24 h. Here, we defined 566 co-expressed DEGs at the three time points mentioned above as the stroke-DEGs (S2 Table). In GSE66724 (AF-related stroke mRNA dataset), we identified 195 upregulated and 133 downregulated genes (S3 Table). In GSE70887 (AF miRNA dataset), we identified 12 upregulated and 9 downregulated miRNAs (S4 Table). In GSE129409 (AF circRNA dataset), we identified 270 upregulated and 162 downregulated circRNAs (S5 Table). Heatmaps of the top 30 AF-DEGs and stroke-DEGs are shown in Fig 2A–2D.

thumbnail
Fig 2. Identification of differentially expressed genes.

Heatmap of the top 30 differentially expressed genes based on GSE79768 (A), and at < 3 h (B), 5 h (C), 24 h (D) after stroke in GSE58294. The color intensity (from red to green) suggests the higher to lower expression.

https://doi.org/10.1371/journal.pone.0283617.g002

The volcanic diagram of all genes and the expression heatmap of the top 30 DEGs in GSE66724 (S3A Fig, Fig 3A), GSE70887 (S3B Fig, Fig 3B), and GSE129409 (S3C Fig, Fig 3C).

thumbnail
Fig 3. Identification of differentially expressed genes.

Heatmap of the top 30 differentially expressed genes based on A GSE66724, B GSE70887, and C GSE129409. The color intensity (from red to green) suggests the higher to lower expression.

https://doi.org/10.1371/journal.pone.0283617.g003

Construction of co-expression networks and identification of key modules

A hierarchical clustering tree was created based on the dynamic hybrid cut with a scale-free network and topological overlaps (Fig 4A). Based on the scale-free topology criterion, a soft-thresholding power of 5 was selected (scale-free R2 = 0.85; Fig 4B and 4C). Five modules were identified for further analysis. The cluster dendrogram of the modules is shown in Fig 4D, while the clustering of module eigengenes is provided in Fig 4E. Moreover, we analyzed the association of gene modules by comparing AF with stroke and AF without stroke (Fig 5A). The turquoise module showed the highest positive correlation (r = 0.34). Therefore, we identified the turquoise module as the key one for further analysis. A total of 432 genes were included in the turquoise module (S6 Table). Moreover, we illustrated in turquoise the module membership and gene significance for AF with stroke (correlation coefficient = 0.32, P < 0.001) (Fig 5B). In addition, genes in the turquoise module were overlapped with DEGs in GSE66724. Fourteen genes (DEG2), including ZNF595, BC044596, MATR3, LOC389765, PDCD4, SEPSECS-AS1, GRINA, ELANE, TVP23B, AMFR, ROGDI, ZNF700, PLVAP, and MYL4, were hub gene candidates. We overlapped these two gene sets (DEG3) and obtained 21 genes: ZNF595, MATR3, PDCD4, SEPSECS-AS1, GRINA, ELANE, TVP23B, AMFR, ROGDI, ZNF700, PLVAP, MYL4, RNF166, ACKR4, EIF4E3, PDZK1IP1, HTRA1, NELL2, BEX2, ANXA3, and SLC51A.

thumbnail
Fig 4. Sample clustering and network construction of the weighted co-expressed genes.

Clustering dendrogram of 8 AF without Stroke and 8 AF with Stroke (A). The color intensity was proportional to disease status (with or without Stroke). Analysis of the scale independence (B) and the mean connectivity (C) for various soft‑thresholding powers. The soft‑thresholding power of 5 was selected based on the scale‑free topology criterion. Dendrogram clustered was based on a dissimilarity measure (1‑TOM). Gene expression similarity is assessed by a pair‑wise weighted correlation metric and clustered based on a topological overlap metric into modules. Each color below represents one co‑expression module, and every branch stands for one gene (D). The cluster dendrogram of module eigengenes was demonstrated (E).

https://doi.org/10.1371/journal.pone.0283617.g004

thumbnail
Fig 5. The identification of key modules via weighted gene co‑expression network analysis.

Heatmap of the correlation between module eigengenes and the disease status of AF-related Stroke (A). The corresponding correlation coefficient along with P‑value is given in each cell, and each cell is color‑coded by correlation according to the color (legend at right). The turquoise module was most significantly correlated with AF-related Stroke. Scatter plot of module eigengenes in the turquoise module was presented (B). The Venn diagram of genes from the key module and DEGs from GSE66724 was drawn (C).

https://doi.org/10.1371/journal.pone.0283617.g005

PPI network analysis and functional GO term enrichment analysis

We identified 155 and 43 nodes from the PPI network of AF-DEGs and stroke-DEGs, as shown in Fig 6A and 6B. The top 10 hub nodes in AF-DEGs included Toll-like receptor 4 (TLR4, degree = 18), Toll-like receptor 8 (TLR8, degree = 16), complement C3 (C3, degree = 15), cathepsin S (CXCR2, degree = 13), keratin 5 (KRT5, degree = 12), myeloid cell nuclear differentiation antigen (MNDA, degree = 11), snail family transcriptional repressor 2 (SNAI2, degree = 11), caspase 1 (CASP1, degree = 9), and purinergic receptor P2Y13 (P2RY13, degree = 9). The top 10 hub genes in stroke-DEGs included G protein subunit gamma transducin 1 (GNGT1, degree = 18), phospholipase C gamma 1 (PLCG1, degree = 15), platelet factor 4 (PF4, degree = 15), AKT serine/threonine kinase 1 (AKT1, degree = 14), gamma-glutamyl hydrolase (GGH, degree = 13), catenin beta 1 (CTNNB1, degree = 13), junction plakoglobin (JUP, degree = 11), major histocompatibility complex, class II, DQ alpha 1 (HLA-DQA1, degree = 11), LCK proto-oncogene (LCK, degree = 11), and adenylate cyclase 4 (ADCY4, degree = 11). Fig 6C showed that DEG1 owned 9 genes, including RNF166, ACKR4, EIF4E3, PDZK1IP1, HTRA1, NELL2, BEX2, ANXA3, SLC51A.

thumbnail
Fig 6.

PPI network of AF-related DEGs (A), PPI network of Stroke-related DEGs (B), and Venn diagrams of AF-related stroke genes (C) were presented. Red, greater degree. blue, lesser degree.

https://doi.org/10.1371/journal.pone.0283617.g006

We performed interactomics and GO functional enrichment analysis on DEG3 based on GeneMania (http://genemania.org/). As shown in Fig 7A, hydrolase activity, secretory granule lumen, azurophil granule and primary lysosome were the main enriched terms.

thumbnail
Fig 7. Enrichment analysis of key modules.

Gene ontology enrichment analysis in DEG1 (A), DEG2 (B), and DEG3 (C). The significance of enrichment gradually increases from blue to red, and the size of the dots indicates the number of genes contained in the corresponding pathway. ROC curves of hub genes in GSE66724 (D) and GSE58294 (E) were present.

https://doi.org/10.1371/journal.pone.0283617.g007

Validation of hub genes of AF-related stroke

Based on the results of DEG3 and ROC curves in GSE66724 and GSE58294 (Fig 7D and 7E), we filtered eight hub genes of AF-related stroke (AUC > 0.8), including EIF4E3, ZNF595, ZNF700, MATR3, ACKR4, ANXA3, SEPSECS-AS1, and RNF166. The database showed that hub genes targeted several nervous system and cardiovascular diseases (Fig 8A–8H). GO term enrichment related to biological processes, molecular functions, and cellular components of hub genes were associated with various processes, as indicated in Table 1.

thumbnail
Fig 8.

Nervous and cardiovascular diseases related to hub genes based on the CTD database (A-H), circRNA-miRNA-mRNA network (I).

https://doi.org/10.1371/journal.pone.0283617.g008

thumbnail
Table 1. The Gene Ontology (GO) terms enrichment for hub genes of the AF-related stroke.

https://doi.org/10.1371/journal.pone.0283617.t001

Reconstruction of the circRNA-miRNA-mRNA network in AF-related stroke

As shown in Fig 8I, a circRNA-miRNA-mRNA regulatory network bearing ten circRNAs, eight miRNAs, and one mRNA was constructed to demonstrate pathophysiologic mechanisms in AF-related stroke. The parameters of degree, closeness, and betweenness in the network were calculated by the plugin cytoHubba in Cytoscape. The top 5 nodes were hsa-miR-33a-3p, hsa_circ_0000444, hsa-miR-452-3p, hsa-miR-198, and hsa_circ_0018657. EIF4E3 is the target of hsa-miR-198, and hsa-miR-198 is the target of hsa_circ_0018657, suggesting that hsa_circ_0018657/hsa-miR-198/EIF4E3 could be an important pathway regulating the development of AF-related stroke.

Experimental validation of hub genes

The expression levels of eight hub genes, miR-198, and hsa_circ_0018657 were detected in blood samples by qRT-PCR (Fig 9A–9J). Results showed that the expression of EIF4E3, ANXA3, and hsa_circ_0018657 was significantly higher in AF-related stroke patients than those in AF patients without stroke, which was consistent with the bioinformatic analysis. The expression of MATR4, ACKR4, RNF166, and miR-198 in AF-related stroke patients was lower than those in AF patients without stroke.

thumbnail
Fig 9. The expression levels of 8 hub genes, miR-198, and hsa_circ_0018657 (n = 3).

*: P < 0.05.

https://doi.org/10.1371/journal.pone.0283617.g009

Discussion

As a serious public health concern, AF has shown an increasing incidence and prevalence in the elderly population, which is associated with elevated risks of cerebrovascular disease events and deaths [2022]. Discriminating individuals with AF who are prone to develop atrial mural thrombus and cardiogenic stroke is of great concern. Despite continuous cardiac rhythm monitoring, around 30% of patients show no obvious signs before the occurrence of cerebrovascular events. Thus, identifying biomarkers of AF-related stroke and elucidating the relationship between AF and embolic events may provide novel therapeutic targets for primary care [23].

In this study, a series of bioinformatics analyses were carried out to filter hub genes of AF-related stroke. We first took the overlap of DEGs from AF and stroke patients to obtain DEG1. Then, WGCNA was used to identify key modules associated with AF-related stroke to obtain DEG2. The overlap of DEG1 and DEG2 (DEG3) was considered to include important genes in the development of AF-related stroke. Then, the PPI network of AF- and stroke-related DEGs was constructed, and we filtered eight hub genes based on ROC analyses. Those hub genes could facilitate the prevention of AF-related stroke and provide novel therapeutic targets. We also investigated the biological processes, cellular components, and molecular functions of the DEGs, revealing that these genes are significantly associated with the regulation of the BMP signaling pathway, smooth muscle cell proliferation, and extracellular matrix disassembly. Finally, we constructed a circRNA-miRNA-mRNA network of AF-related stroke and screened out the hsa_circ_0018657/hsa-miR-198/EIF4E3 axis as an important regulatory pathway in the development of AF-related stroke. These results were further validated in our qRT-PCR experiments.

In the present study, EIF4E3 was explored as a potential molecular signature for AF patients with a high probability of stroke occurrence, which may play an important role in the development of AF-related stroke. IF4E3 belongs to the EIF4E family as a translational initiation factor that interacts with the 5-prime cap structure of mRNA and recruit mRNA to the ribosome. Interestingly, although considered to regulate the nervous system directly or indirectly, EIF4E3 also has a role in the cardiovascular system. Mrvová, S and colleagues performed bioinformatics analyses of expressed sequence tags and the 3’-UTRs of the main transcript splice variants of the translational initiation factor EIF4E3 and showed that EIF4E3 mRNAs have a great potential in heavy post-transcriptional regulation [24]. EIF4E3 truncated transcript variants were mainly found in the brain. However, EIF4E3 also promotes angiogenesis in the region surrounding myocardial infarction [25].

Along with genetic susceptibility, thrombus vulnerability is another main reason for AF-related stroke. miRNAs have also been proposed as potential biomarkers for vulnerable plaques. Hoekstra, M et al. explored peripheral blood mononuclear cell microRNA profiles from coronary artery disease patients and found that miR-198 exhibited a high expression level in unstable angina pectoris patients compared with such levels in stable patients [26]. Sepramaniam, S and colleagues performed a miRNA microarray of peripheral blood samples from acute stroke patients and healthy controls, presenting that miR-198 was dysregulated in stroke patients [27], which is consistent with our results. Based on previous studies, we believe that miR-198 could play an important role in the development of AF-related stroke by binding to the 3’UTR of EIF4E3.

As the host gene of hsa_circ_0018657, EIF4EBP2 was also found to participate in brain dysfunction. Martín-Flores, N showed the significant association of SNP rs1043098 in the EIF4EBP2 gene with the onset of dyskinesia induced by L-DOPA administration [28]. In multiple sclerosis patients, EIF4EBP2 expression was downregulated compared to those in healthy controls [29]. Thus, hsa_circ_0018657 could also play a part in cerebral disease, which needs further verification. As a miRNA sponge, hsa_circ_0018657 could regulate miRNA-198 by targeting EIF4E3. The role of this axis in the development of AF-related stroke deserves further exploration.

ZNF595 and ZNF700 belong to the zinc finger protein family, whose members function as transcription factors that can regulate a broad variety of developmental and cellular processes. In schizophrenia (SCZ) patients, nonsense de novo mutations of ZNF595 are common. Data from genome-wide association studies suggested that common variants in the ZNF595 gene may be associated with SCZ and SCZ-related traits [30]. Pathogenic mutations in selenocysteine synthase (SEPSECS) cause neurodevelopmental disorders [3133]. While using sequencing data analysis, Laan, L identified regional epigenetic changes in the transcription factor gene ZNF700, which is relevant in Down syndrome brain development, providing a novel framework for further studies on epigenetic changes and transcriptional dysregulation during chromosome 21 neurogenesis [34].

The MATR3 gene encodes a nuclear matrix protein, which is proposed to stabilize certain mRNA species. Previous studies proved that mutations in MATR3 cause hereditary amyotrophic lateral sclerosis [3540]. In a genome-wide association study using memory performance in a cohort of elderly individuals (>60years), MATR3 was significantly associated with neuronal development, synaptic plasticity, and memory-related processes [41]. In addition, the 3’ UTR of MATR3 encodes the nuclear matrix protein MATR3, which is strongly expressed in the neural crest, developing heart, and great vessels [42]. Thus, subtle perturbations in MATR3 expression appear to cause similar left ventricular outflow tract defects in humans and mice. ACRK4 is a member of the G protein-coupled receptor family and is a receptor for C-C type chemokines. ACKR4 binds the homeostatic chemokines CCL19, CCL21, CCL25, and CXCL13 and has been attributed scavenging properties [43]. The expression of ACKR4 was upregulated in the border/infarct area after myocardial infarction, and knocking out ACKR4 protected against adverse ventricular remodeling in mice post-infarction, indicating that ACKR4 may be a novel therapeutic target to ameliorate cardiac remodeling [44]. ANXA3 is recognized as a regulator of cerebral ischemia/reperfusion injury. The upregulation of ANXA3 could promote cell viability, decrease cell apoptosis, and reduce the production of inflammatory cytokines in neurons after oxygen-glucose deprivation [45]. The silencing of ANXA3 would promote repair and healing of the myocardium after infarction by the activation of the PI3K/Akt signaling pathway [46]. Thus, there may be a relationship between cardiovascular and nervous system diseases, arising from loci mutations or gene variants [23].

This study also has several limitations. First, this study was based on microarray analysis, and gene expression may be not directly equivalent to protein expression. Second, data for the analysis of AF-related stroke mainly refers to persistent AF patients. Although persistent AF was most hazardous in stroke cases, other AF forms should be studied in the future. Finally, although we performed qRT-PCR to verify the expression levels of genes, more in vitro and in vivo experiments should be carried out to validate our results.

Conclusion

The hub genes of EIF4E3, ZNF595, ZNF700, MATR3, ACKR4, ANXA3, SEPSECS-AS1, and RNF166 may link AF and secondary stroke. The hsa_circ_0018657/hsa-miR-198/EIF4E3 pathway could be an important regulating axis in AF-related stroke. In addition, the hub genes TLR4, TLR8, C3, CXCR2, KRT5, MNDA, SNAI2, CASP1, and P2RY13 may be associated with AF recurrence and maintenance. GNGT1, PLCG1, PF4, AKT1, GGH, CTNNB1, JUP, HLA-DQA1, LCK, and ADCY4 may be associated with stroke.

Supporting information

S1 Fig. The venn plot of these three DEG datasets.

https://doi.org/10.1371/journal.pone.0283617.s001

(TIF)

S3 Fig. The volcanic diagram of all genes in GSE66724.

https://doi.org/10.1371/journal.pone.0283617.s003

(TIF)

S1 Table. 160 upregulated and 93 downregulated mRNAs in GSE79768.

https://doi.org/10.1371/journal.pone.0283617.s004

(XLSX)

S2 Table. 583 co-expressed DEGs at the three time points in GSE58294.

https://doi.org/10.1371/journal.pone.0283617.s005

(XLSX)

S3 Table. 195 upregulated and 133 downregulated genes in GSE66724.

https://doi.org/10.1371/journal.pone.0283617.s006

(XLSX)

S4 Table. 12 upregulated and 9 downregulated genes in GSE70887.

https://doi.org/10.1371/journal.pone.0283617.s007

(XLSX)

S5 Table. 270 upregulated and 162 downregulated genes in GSE129409.

https://doi.org/10.1371/journal.pone.0283617.s008

(XLSX)

S6 Table. 432 genes in the turquoise module.

https://doi.org/10.1371/journal.pone.0283617.s009

(XLSX)

References

  1. 1. Freeman W D and Aguilar M I. Prevention of cardioembolic stroke. Neurotherapeutics, 2011. 8(3): 488–502. pmid:21638139
  2. 2. Pistoia F, Sacco S, Tiseo C, Degan D, Ornello R, and Carolei A. The Epidemiology of Atrial Fibrillation and Stroke. Cardiol Clin, 2016. 34(2): 255–68. pmid:27150174
  3. 3. Kamel H and Healey J S. Cardioembolic Stroke. Circ Res, 2017. 120(3): 514–526. pmid:28154101
  4. 4. Barra S, Narayanan K, Boveda S, Primo J, Gonçalves H, Baran J, et al. Atrial Fibrillation Ablation and Reduction of Stroke Events: Understanding the Paradoxical Lack of Evidence. Stroke, 2019. 50(10): 2970–2976. pmid:31510894
  5. 5. Rahman F, Kwan G F, and Benjamin E J. Global epidemiology of atrial fibrillation. Nat Rev Cardiol, 2014. 11(11): 639–54. pmid:25113750
  6. 6. Collaborators G B D C o D Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet, 2017. 390(10100): 1151–1210. pmid:28919116
  7. 7. Yiin G S, Howard D P, Paul N L, Li L, Luengo-Fernandez R, Bull L M, et al. Age-specific incidence, outcome, cost, and projected future burden of atrial fibrillation-related embolic vascular events: a population-based study. Circulation, 2014. 130(15): 1236–44. pmid:25208551
  8. 8. Xie Y, Zhou L, Yan M, Zhao W, and Hu Z. Differentiation of Atrial Fibrillation and Atrial Fibrillation-Associated Ischemic Stroke Based on Serum Exosome miRNA-seq. Cardiology, 2023. pmid:36758532
  9. 9. Rivera-Caravaca J M, Teruel-Montoya R, Roldán V, Cifuentes-Riquelme R, Crespo-Matas J A, de Los Reyes-García A M, et al. Pilot Study on the Role of Circulating miRNAs for the Improvement of the Predictive Ability of the 2MACE Score in Patients with Atrial Fibrillation. J Clin Med, 2020. 9(11). pmid:33198388
  10. 10. Zou R, Zhang D, Lv L, Shi W, Song Z, Yi B, et al. Bioinformatic gene analysis for potential biomarkers and therapeutic targets of atrial fibrillation-related stroke. J Transl Med, 2019. 17(1): 45. pmid:30760287
  11. 11. Hsieh C S, Huang P S, Chang S N, Wu C K, Hwang J J, Chuang E Y, et al. Genome-Wide Copy Number Variation Association Study of Atrial Fibrillation Related Thromboembolic Stroke. J Clin Med, 2019. 8(3). pmid:30857284
  12. 12. Ritchie M E, Phipson B, Wu D, Hu Y, Law C W, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res, 2015. 43(7): e47. pmid:25605792
  13. 13. Huang K, Wen S, Huang J, Wang F, Pang L, Wang Y, et al. Integrated Analysis of Hub Genes and miRNAs in Dilated Cardiomyopathy. Biomed Res Int, 2020. 2020: 8925420. pmid:33015184
  14. 14. Zhang B and Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol, 2005. 4: Article17. pmid:16646834
  15. 15. The Gene Ontology C The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res, 2019. 47(D1): D330–D338. pmid:30395331
  16. 16. Ashburner M, Ball C A, Blake J A, Botstein D, Butler H, Cherry J M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet, 2000. 25(1): 25–9. pmid:10802651
  17. 17. Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb), 2021. 2(3): 100141. pmid:34557778
  18. 18. Szklarczyk D, Gable A L, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res, 2019. 47(D1): D607–D613. pmid:30476243
  19. 19. Chin C H, Chen S H, Wu H H, Ho C W, Ko M T, and Lin C Y. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol, 2014. 8 Suppl 4: S11. pmid:25521941
  20. 20. Benjamin E J, Wolf P A, D’Agostino R B, Silbershatz H, Kannel W B, and Levy D. Impact of atrial fibrillation on the risk of death: the Framingham Heart Study. Circulation, 1998. 98(10): 946–52. pmid:9737513
  21. 21. Soliman E Z, Lopez F, O’Neal W T, Chen L Y, Bengtson L, Zhang Z M, et al. Atrial Fibrillation and Risk of ST-Segment-Elevation Versus Non-ST-Segment-Elevation Myocardial Infarction: The Atherosclerosis Risk in Communities (ARIC) Study. Circulation, 2015. 131(21): 1843–50. pmid:25918127
  22. 22. January C T, Wann L S, Calkins H, Chen L Y, Cigarroa J E, Cleveland J C Jr., et al. 2019 AHA/ACC/HRS Focused Update of the 2014 AHA/ACC/HRS Guideline for the Management of Patients With Atrial Fibrillation: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines and the Heart Rhythm Society in Collaboration With the Society of Thoracic Surgeons. Circulation, 2019. 140(2): e125–e151. pmid:30686041
  23. 23. Zou R, Zhang D, Lv L, Shi W, Song Z, Yi B, et al. Bioinformatic gene analysis for potential biomarkers and therapeutic targets of atrial fibrillation-related stroke. J Transl Med, 2019. 17(1): 45. pmid:30760287
  24. 24. Mrvová S, Frydrýšková K, Pospíšek M, Vopálenský V, and Mašek T. Major splice variants and multiple polyadenylation site utilization in mRNAs encoding human translation initiation factors eIF4E1 and eIF4E3 regulate the translational regulators? Mol Genet Genomics, 2018. 293(1): 167–186. pmid:28942592
  25. 25. Pang J, Ye L, Chen Q, Wang J, Yang X, He W, et al. The Effect of MicroRNA-101 on Angiogenesis of Human Umbilical Vein Endothelial Cells during Hypoxia and in Mice with Myocardial Infarction. Biomed Res Int, 2020. 2020: 5426971. pmid:32953883
  26. 26. Hoekstra M, van der Lans C A, Halvorsen B, Gullestad L, Kuiper J, Aukrust P, et al. The peripheral blood mononuclear cell microRNA signature of coronary artery disease. Biochem Biophys Res Commun, 2010. 394(3): 792–7. pmid:20230787
  27. 27. Sepramaniam S, Tan J R, Tan K S, DeSilva D A, Tavintharan S, Woon F P, et al. Circulating microRNAs as biomarkers of acute stroke. Int J Mol Sci, 2014. 15(1): 1418–32. pmid:24447930
  28. 28. Martín-Flores N, Fernández-Santiago R, Antonelli F, Cerquera C, Moreno V, Martí M J, et al. MTOR Pathway-Based Discovery of Genetic Susceptibility to L-DOPA-Induced Dyskinesia in Parkinson’s Disease Patients. Mol Neurobiol, 2019. 56(3): 2092–2100. pmid:29992529
  29. 29. Yang Q, Pan W, and Qian L. Identification of the miRNA-mRNA regulatory network in multiple sclerosis. Neurol Res, 2017. 39(2): 142–151. pmid:27809691
  30. 30. Jiang S, Zhou D, Wang Y Y, Jia P, Wan C, Li X, et al. Identification of de novo mutations in prenatal neurodevelopment-associated genes in schizophrenia in two Han Chinese patient-sibling family-based cohorts. Transl Psychiatry, 2020. 10(1): 307. pmid:32873781
  31. 31. Schweizer U and Fradejas-Villar N. Why 21? The significance of selenoproteins for human health revealed by inborn errors of metabolism. Faseb j, 2016. 30(11): 3669–3681. pmid:27473727
  32. 32. Hady-Cohen R, Ben-Pazi H, Adir V, Yosovich K, Blumkin L, Lerman-Sagie T, et al. Progressive cerebello-cerebral atrophy and progressive encephalopathy with edema, hypsarrhythmia and optic atrophy may be allelic syndromes. Eur J Paediatr Neurol, 2018. 22(6): 1133–1138. pmid:30100179
  33. 33. Anttonen A K, Hilander T, Linnankivi T, Isohanni P, French R L, Liu Y, et al. Selenoprotein biosynthesis defect causes progressive encephalopathy with elevated lactate. Neurology, 2015. 85(4): 306–15. pmid:26115735
  34. 34. Laan L, Klar J, Sobol M, Hoeber J, Shahsavani M, Kele M, et al. DNA methylation changes in Down syndrome derived neural iPSCs uncover co-dysregulation of ZNF and HOX3 families of transcription factors. Clin Epigenetics, 2020. 12(1): 9. pmid:31915063
  35. 35. Johnson J O, Pioro E P, Boehringer A, Chia R, Feit H, Renton A E, et al. Mutations in the Matrin 3 gene cause familial amyotrophic lateral sclerosis. Nat Neurosci, 2014. 17(5): 664–666. pmid:24686783
  36. 36. Müller T J, Kraya T, Stoltenburg-Didinger G, Hanisch F, Kornhuber M, Stoevesandt D, et al. Phenotype of matrin-3-related distal myopathy in 16 German patients. Ann Neurol, 2014. 76(5): 669–80. pmid:25154462
  37. 37. Gallego-Iradi M C, Clare A M, Brown H H, Janus C, Lewis J, and Borchelt D R. Subcellular Localization of Matrin 3 Containing Mutations Associated with ALS and Distal Myopathy. PLoS One, 2015. 10(11): e0142144. pmid:26528920
  38. 38. Jones A R, Troakes C, King A, Sahni V, De Jong S, Bossers K, et al. Stratified gene expression analysis identifies major amyotrophic lateral sclerosis genes. Neurobiol Aging, 2015. 36(5): 2006.e1–9. pmid:25801576
  39. 39. Moloney C, Rayaprolu S, Howard J, Fromholt S, Brown H, Collins M, et al. Transgenic mice overexpressing the ALS-linked protein Matrin 3 develop a profound muscle phenotype. Acta Neuropathol Commun, 2016. 4(1): 122. pmid:27863507
  40. 40. Tada M, Doi H, Koyano S, Kubota S, Fukai R, Hashiguchi S, et al. Matrin 3 Is a Component of Neuronal Cytoplasmic Inclusions of Motor Neurons in Sporadic Amyotrophic Lateral Sclerosis. Am J Pathol, 2018. 188(2): 507–514. pmid:29128563
  41. 41. Weiss K, Treiber T, Meister G, and Schratt G. The nuclear matrix protein Matr3 regulates processing of the synaptic microRNA-138-5p. Neurobiol Learn Mem, 2019. 159: 36–45. pmid:30790622
  42. 42. Quintero-Rivera F, Xi Q J, Keppler-Noreuil K M, Lee J H, Higgins A W, Anchan R M, et al. MATR3 disruption in human and mouse associated with bicuspid aortic valve, aortic coarctation and patent ductus arteriosus. Hum Mol Genet, 2015. 24(8): 2375–89. pmid:25574029
  43. 43. Gencer S, van der Vorst E P C, Aslani M, Weber C, Döring Y, and Duchene J Atypical Chemokine Receptors in Cardiovascular Disease. Thromb Haemost, 2019. 119(4): 534–541. pmid:30716778
  44. 44. Zhang M, Zhang M, Zhou T, Liu M, Xia N, Gu M, et al. Inhibition of fibroblast IL-6 production by ACKR4 deletion alleviates cardiac remodeling after myocardial infarction. Biochem Biophys Res Commun, 2021. 547: 139–147. pmid:33610913
  45. 45. Min X L, He M, Shi Y, Xie L, Ma X J, and Cao Y. miR-18b attenuates cerebral ischemia/reperfusion injury through regulation of ANXA3 and PI3K/Akt signaling pathway. Brain Res Bull, 2020. 161: 55–64. pmid:32380186
  46. 46. Meng H, Zhang Y, An S T, and Chen Y. Annexin A3 gene silencing promotes myocardial cell repair through activation of the PI3K/Akt signaling pathway in rats with acute myocardial infarction. J Cell Physiol, 2019. 234(7): 10535–10546. pmid:30456911