Figures
Abstract
Background
Irritable bowel syndrome (IBS) is a prominent functional gastrointestinal disorder, yet the precise causes and mechanisms behind it remain largely unclear. Numerous environmental compounds have been associated with the intestinal health of individuals suffering from IBS. This study sought to explore the impact of environmental chemicals on the condition of IBS.
Methods
We analyzed genome-wide association study (GWAS) data comprising 455,321 individuals of white British descent, among which 28,518 individuals with IBS underwent transcriptome-wide association study (TWAS) analysis. Reference gene expression data were sourced from tissues including the small intestine, transverse colon, sigmoid colon, whole blood, and peripheral blood.
Results
Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were conducted utilizing the significant genes identified through TWAS. Additionally, protein-protein interaction network analysis was performed using the STRING database to elucidate the functions of the proteins encoded by these genes. Furthermore, chemical-related gene set enrichment analysis (CGSEA) was employed to explore the associations between environmental chemicals and IBS. In total, TWAS identified 33 significant genes (PFDR < 0.05), while CGSEA revealed 112 chemicals significantly correlated with IBS (PFDR < 0.05, |NES| > 1). Both TWAS (targeting genetic influences) and CGSEA (focusing on environmental influences) were instrumental in pinpointing chemicals associated with IBS.
Citation: Zhao X, Yang Y, Liu Z, Sun R, Suo A (2026) Identifying chemicals associated with irritable bowel syndrome by integrating a transcriptome-wide association study with chemical-gene-interaction analysis. PLoS One 21(2): e0340666. https://doi.org/10.1371/journal.pone.0340666
Editor: Yanggang Hong, Wenzhou Medical University, CHINA
Received: August 19, 2025; Accepted: December 22, 2025; Published: February 3, 2026
Copyright: © 2026 Zhao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: GWAS summary statistics for irritable bowel syndrome were obtained from Wu et al. (Nat Commun, 2021, 12(1): 1146) and are available at https://cnsgenomics.com/content/data. TWAS analyses were performed using FUSION (http://gusevlab.org/projects/fusion/). Gene expression data (GEO accession: GSE166869) were downloaded from GEO (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE166869). PPI information was retrieved from the STRING database (https://cn.string-db.org/). Chemical-gene interaction data were acquired from CTD (http://ctdbase.org/downloads/). All datasets are publicly available.
Funding: This study was supported by [Shaanxi Province Natural Science Foundation] in the form of a grant awarded to [ZG L] (2025JC-YBMS-992) and [Shaanxi Province Natural Science Foundation] in the form of a salary for [ZG L]. The specific roles of this author are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Irritable bowel syndrome (IBS) represents a significant functional gastrointestinal disorder, exhibiting a prevalence rate ranging from approximately 7% to 21% [1]. Although IBS usually appears in early adulthood [2], its prevalence has significantly increased among children over the last two decades [3]. There are notable gender differences in IBS prevalence, with women in some Western countries being nearly twice as likely to be diagnosed compared to men; this disparity may be even greater in Asian countries [4]. IBS is mainly characterized by recurring abdominal pain and changes in bowel habits [5]. Various factors may contribute to the development of IBS, including genetic factors, changes in gut bacteria, immune system issues, problems with bile salt metabolism, and irregular serotonin levels [6]. Notably, interactions between the digestive system and the nervous system (the gut-brain axis) are increasingly recognized as a key pathophysiological mechanism, potentially explaining the high comorbidity with psychiatric conditions like anxiety and depression Despite these insights, the precise etiology and pathogenesis of IBS remain largely elusive. The syndrome considerably impairs nutritional absorption, growth, and overall quality of life for those affected [7]. Consequently, it is crucial to elucidate the etiology and associated risk factors of IBS to inform and optimize treatment and prevention strategies.
The management of IBS incorporates both pharmacological and non-pharmacological approaches, with the primary aim of alleviating distressing symptoms [8]. Available pharmacotherapeutics for IBS encompass antispasmodics, laxatives, antidiarrheals, analgesics, and probiotics. Nevertheless, the effectiveness of many of these medications is limited, with several studies indicating that their benefits do not significantly surpass those of a placebo [9].As a result, there has been a growing emphasis on non-pharmacological interventions for IBS in recent years [10]. Current clinical guidelines advocate for the use of soluble fiber to address overall IBS symptoms, while antispasmodic agents are presently available in the United States for symptom management [11]. Although dietary factors can exacerbate IBS, they also play a critical role in symptom relief [12]. Moreover, various environmental compounds are linked to intestinal health in patients with IBS [13]. Research has been conducted to explore the interconnections among complex diseases, genetic factors, and environmental chemicals by evaluating the interactions between genetic predispositions and environmental influences [14,15]. Therefore, understanding the relationship between environmental chemicals and IBS is vital for both treatment and prevention of the syndrome. Identifying specific chemicals associated with IBS could pave the way for novel preventive approaches based on exposure reduction, opening the possibility of developing preventive strategies based on chemical exposure identification.
Advancements in bioinformatics and high-throughput sequencing technologies have significantly increased the focus on the genetic factors contributing to IBS. Genome-wide association studies (GWAS) have become a key method for investigating the genetic loci linked to complex human diseases and traits. Through these studies, six genetic susceptibility loci for IBS have been identified: NCAM1, CADM2, PHF2/FAM120A, DOCK9, CKAP2/TPTE2P3, and BAG6 [16]. However, GWAS has its limitations, particularly in assessing the risk associated with complex polygenic disorders, as most of the identified single nucleotide polymorphisms (SNPs) are found in non-coding regions of genes [17]. To overcome this issue, transcriptome-wide association studies (TWAS) have been proposed. TWAS combines GWAS findings with expression quantitative trait loci (eQTLs) data to provide a clearer picture of gene-trait associations [18]. In this approach, pre-computed gene expression weights are integrated with SNP data, which enhances our understanding of the genetic architecture that underlies IBS [19].
This study assessed how genetic determinants affect IBS by performing a TWAS using a GWAS dataset. This analysis evaluated gene expression in various tissues, including the small intestine, transverse colon, sigmoid colon, whole blood, and peripheral blood. Furthermore, we conducted a functional analysis of the genes identified through TWAS and identified chemicals associated with IBS. This research improves our understanding of the genetic and environmental factors contributing to gastrointestinal autoimmune disorders.
Methods
GWAS data
The summary data for the IBS GWAS were acquired from previously published research [20]. In detail, the study utilized genotype and phenotype data from the European ancestry subset of the UK Biobank (UKB) cohort, which is a population-based volunteer longitudinal cohort of ~500,000 individuals recruited across 22 centers in the United Kingdom. This European ancestry subset included 456,327 individuals, among whom 348,441 were unrelated.
The IBS phenotype was defined based on multi-source data including death registers, self-reported records, hospital admission data, and primary care records (UKB data field: 131639). Initially, 29,524 participants were identified as original IBS cases; after excluding 1,006 participants who also had inflammatory bowel disease (IBD) diagnoses, 28,518 individuals were finally assigned to the IBS case group, with the remaining 426,803 participants in the European ancestry subset serving as controls.
Genotype data of the UKB cohort were imputed using the Haplotype Reference Consortium (HRC) and UK10K as reference samples. Genotype probabilities were transformed into definitive genotypes utilizing PLINK2 (hard-call 0.1). Single nucleotide polymorphisms (SNPs) were excluded from the analysis if they met any of the following criteria: minor allele count < 5, Hardy-Weinberg equilibrium test P-value < 1.0 × 10 ⁻ ⁵, missing genotype rate > 0.05, or imputation accuracy (Info) score < 0.3. Comprehensive details regarding subjects, genotyping procedures, imputation, and quality control measures can be referenced in the original study.
TWAS analysis
TWAS is a powerful method that integrates gene expression profiles with GWAS summary data. Compared with linkage disequilibrium-based (LD-based) estimates of local genetic correlation, the TWAS approach effectively distinguishes between direct genetic effects on gene expression and indirect LD confounding [21]. In this research, TWAS was conducted utilizing Fusion-based software. Among the most widely used TWAS methods (e.g., PrediXcan, TWAS-Fusion, and SMR) that interrogate causal relationships between gene expression and complex traits [22]. TWAS-Fusion is favored for its unique advantages in our research: 1) GWAS summary data, without relying on individual-level genotypes; 2) Bayesian sparse linear-mixed models excel at capturing cis-regulatory effects. Briefly, Bayesian sparse linear-mixed models [23] were utilized to calculate SNP expression weights for each gene within its 1-Mb cis-regulatory region. The association between predicted gene expression levels and IBS was then estimated using the formula: Z = w·Z/(w·[Lw]) [21], where w represents the SNP weight for gene expression prediction, Z denotes the Z-score from IBS GWAS summary data, and L stands for the SNP correlation matrix (i.e., LD structure). Each genetic feature was defined as a contiguous 100,000 bp genomic segment. For the joint model construction, features with a minimum P-value ≤ 0.05 were included; features with an r > 0.9 were considered redundant (identical) and merged, while those with an r < 0.008 were treated as independent to avoid overcounting.
We specifically analyzed gene expression in the small intestine, transverse colon, sigmoid colon, whole blood (comprising blood cells, plasma, and all other components in the blood), and peripheral blood (blood predominantly composed of blood cells), employing pre-calculated functional weights in conjunction with the IBS GWAS summary data to evaluate gene expression through the predictive model available in FUSION. The gene expression weight panel for the aforementioned tissues was downloaded from the FUSION website (http://gusevlab.org/projects/fusion/). A false discovery rate (FDR) correction was applied to all TWAS findings.
Gene Expression Omnibus (GEO) datasets
Although FUSION enables the mapping of disease-associated loci identified by TWAS to the transcriptional level, a notable limitation exists in its analytical framework: the transcriptomic data utilized for constructing gene expression weights in FUSION are exclusively derived from normal tissues. Such normal tissue-based expression profiles can only partially recapitulate the transcriptional characteristics of disease states, leading to potential inconsistencies between the predicted gene expression patterns and the actual transcriptional landscape in pathological conditions. To address this constraint and improve the reliability of our findings, we employed previously published transcriptomic data retrieved from the GEO database to further filter the candidate genes identified by FUSION, thereby enhancing the credibility and biological relevance of our study outcomes [24].
The GSE166869 dataset was retrieved from the GEO database (https://www.ncbi.nlm.nih.gov/geo/). This study enrolled 26 patients with IBS, including 12 with constipation-predominant IBS (IBS-C) and 14 with diarrhea-predominant IBS (IBS-D), as well as 15 healthy volunteers. Probe-based confocal laser endoscopy was used, and mucosal biopsies of the duodenum and jejunum were obtained for bulk RNA sequencing (RNA-seq). The aim of this study was to identify the small intestinal mechanisms underlying lipid-induced symptoms and rectal hypersensitivity in IBS based on RNA-seq data [25].
Gene functional analyses
GEO2R [26] was used to find DEGs between diarrhea-irritable bowel syndrome (d-IBS) patients and control groups. Adj.P value < 0.05 and |Log2FC| > 0.5 were considered as differentially expressed genes (DEGs) [27]. Quality control was performed using principal component analysis (PCA) via the prcomp function and ggplot2 to verify sample grouping. The volcano plot and heatmap analysis were done using R packages [28,29].Functional enrichment analysis of DEGs was conducted with the clusterProfiler package for Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, with significant terms filtered by adjusted P-value < 0.05(Benjamini-Hochberg method for false discovery rate correction) [30].
Protein-protein interaction (PPI)
PPI networks for TWAS-identified genes were constructed and visualized using Cytoscape software (version 3.9.0 or later), with PPI data retrieved from the STRING database (confidence score ≥ 0.4 as the cutoff) and visualization parameters (node size, edge thickness, color gradients) adjusted to distinguish gene/protein nodes and their interaction strengths [31]. Subsequently, the MCODE plugin was applied to the PPI network with optimized core parameters (degree cutoff = 2, node score cutoff = 0.2, k-core = 2, maximum depth from seed = 100) to cluster densely connected sub-networks, and the top 5 modules with the highest MCODE scores were selected as key functional clusters. Meanwhile, the CytoHubba plugin was used to screen hub genes by employing three complementary algorithms: Maximal Clique Centrality (MCC), which accurately identifies functionally critical core hub genes by considering both direct and indirect interactions via clique structures; Edge Percolated Component (EPC), which highlights nodes pivotal to pathway stability by evaluating contributions to network connectivity and robustness; and Degree, a computationally efficient and intuitive method for preliminary screening of hub genes with extensive direct interactions.
Chemical-related Gene Set Enrichment Analysis (CGSEA)
In this investigation, the chemical gene set enrichment analysis was conducted utilizing the CGSEA tool, which is based on previously published research [32]. The CGSEA tool serves to evaluate the correlations between chemicals and intricate diseases. We employed the Comparative Toxicogenomics Database (CTD) chemical-gene interaction network alongside TWAS expression associations to analyze the statistics related to Irritable Bowel Syndrome (IBS) in order to examine the connections between chemicals and diseases from a genomic functional standpoint. Additionally, we applied weighted Kolmogorov-Smirnov run sum statistics to further investigate the association between chemicals and IBS. The dataset for chemical-associated gene expression annotations utilized in this study was acquired from the CTD (http://ctdbase.org/downloads/), which includes 1,788,149 chemical-gene pair annotation terms, resulting in a compilation of 11,190 chemical-associated genes [33]. To address the issue of multiple testing bias inherent in high-throughput chemical-gene association analysis, we applied FDR correction to the CGSEA results in addition to the P-value filtering. The screening thresholds for significant chemicals were defined as follows: 1) P_CGSEA < 0.05 (primary association signal); 2) FDR < 0.05 (corrected for multiple comparisons); 3) |normalized enrichment score (NES)| > 1 [34].
Results
TWAS of IBS
TWAS identified 11,000 unduplicated genes from the GWAS summary data: 2,844, 5,261, 4,829, 4,679, and 2,445 genes for the small intestine, transverse colon, sigmoid colon, whole blood, and peripheral blood, respectively (Fig 1, Supplementary Information S1 File).
The horizontal line indicates PTWAS, with green dots indicating PTWAS<0.05 and red dots indicating PTWAS<0.01. Each point represents the genetic predicted expression of a specific gene by the prediction models for small intestine, transverse colon, colon sigmoid, whole blood, and peripheral blood tissues. The x-axis represents the genomic position of the corresponding gene, and the y-axis represents the negative logarithm of the association P-value.
Functional exploration of the TWAS-identified genes associated with IBS
We performed FDR correction on the TWAS-identified genes, yielding 33 significant genes after correction; following the removal of duplicates, 27 genes were retained (Fig 2A). We subjected genes with a TWAS Z value > 0 and Z value < 0 to KEGG and GO enrichment analyses separately (Fig 2B). Results showed that genes with a TWAS Z value > 0 were significantly enriched in pathways including regulation of sodium ion transport, cerebral cortex GABAergic interneuron differentiation, response to histamine, regulation of dopamine receptor signaling pathway, dopamine uptake involved in synaptic transmission, peristalsis, branching morphogenesis of a nerve, negative regulation of ion transport, regulation of cation channel activity, and postsynaptic modulation of chemical synaptic transmission. The activation of sodium ion transport pathways suggests that the upregulation of these genes may be associated with abnormal water-electrolyte metabolism in diarrhea-predominant IBS patients, while the activation of neurotransmitter-related pathways indicates that the upregulation of relevant genes may be linked to abnormalities in neurotransmitter uptake, secretion, and signal transmission in IBS. In contrast, genes with a TWAS Z value < 0 were significantly enriched in pathways such as antigen processing and presentation of endogenous peptide antigen via MHC class I, antigen processing and presentation of endogenous peptide antigen, antigen processing and presentation of endogenous antigen, regulation of integrin activation, positive regulation of macrophage migration, MHC protein complex, MHC protein binding, Natural killer cell mediated cytotoxicity, Allograft rejection, and Autoimmune thyroid disease. The activation of antigen processing and presentation pathways as well as autoimmune-related pathways suggests that the downregulation of these genes may be associated with chronic low-grade inflammation and autoimmune tendency in IBS, while the activation of integrin activation pathways indicates that the downregulation of relevant genes may be related to impaired integrity of the intestinal barrier in IBS.
A: Heatmap of TWAS-significant genes corrected by FDR in all five tissues, shown with TWAS Z value. B: Bar plot of KEGG and GO enrichment of TWAS Z value > 0 genes and TWAS Z value < 0 genes. C: Venn diagram reveals the overlap of TWAS-significant genes in all five tissues. Purple: small intestine; Yellow: transverse colon; Green: sigmoid colon; Red: peripheral blood. D: Bar plot of KEGG and GO enrichment of the 5 overlapping genes. KEGG: Kyoto Encyclopedia of Genes and Genomes; BP: biological processes; CC: cellular components; MF: molecular functions.
To identify genes co-expressed in different tissues, we performed an overlap analysis of the 1120 significant TWAS-identified genes (P TWAS < 0.05). The Venn diagram suggested that there were five significant TWAS-identified genes associated with IBS among the five tissues (Fig 2C, Table 1), including MutS homolog 2 (MSH2), DQ alpha 1 (HLA-DQA1), butyrophilin subfamily 3 member A2 (BTN3A2), transmembrane protein 80 (TMEM80), and cathepsin W (CTSW). Detection of intestinal tissues is notably limited in clinical practice; thus, these five overlapping genes are promising candidates as blood biomarkers that can reflect the gene expression profile of intestinal tissues. Subsequently, we conducted a functional analysis of the genes associated with IBS identified through TWAS. This analysis utilized the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) frameworks, and the top three pathways exhibiting significant enrichment from each database are illustrated in Fig 2D. The KEGG pathway analysis highlighted three notably enriched terms linked to asthma, allograft rejection, and mismatch repair mechanisms. In terms of biological processes, three GO terms were significantly enriched, specifically regarding the antigen receptor-mediated signaling pathway, the positive regulation of isotype switching towards IgG subclasses, and the positive regulation of isotype switching to IgG isotypes. Furthermore, three enriched GO terms related to cellular components were identified, including the MHC class II protein complex, the mismatch repair complex, and the lumen of platelet-dense granules. The molecular functions of the enriched GO terms primarily encompassed binding to mismatch repair complexes, MHC class II receptor activity, and the binding of oxidized DNA. These signal pathways suggest that the pathological processes of IBS are mainly characterized by immune-inflammatory imbalance and autoimmune responses.
Tissue-specific functional analysis
We further performed enrichment analysis separately on the differentially expressed genes (DEGs) in the small intestine, colon, and blood tissues (Fig 3). To further capture the relationships between the terms, a subset of enriched terms was selected and rendered as a network plot, where terms with a similarity > 0.3 are connected by edges. The network was visualized using Cytoscape 5, where each node represents an enriched term and is colored first by its cluster ID and then by its p-value. The differentially expressed genes in the three tissues (colon, small intestine, and blood) exhibit distinct tissue-specific pathways, each focusing on their respective physiological roles and exerting unique biological significance in IBS. For the colon, its unique core pathways include VEGFR2 mediated cell proliferation, MHC class II protein complex assembly, Butanoate metabolism, eicosanoid metabolic process, and progesterone metabolic process, with functional focus on intestinal local immune regulation (e.g., MHC class II-mediated antigen presentation in mucosal immunity), utilization of gut microbiota metabolites (e.g., Butanoate, a key energy source for colonic epithelial cells), and intestinal mucosal cell proliferation; biologically, abnormal Butanoate metabolism in the colon may impair intestinal barrier integrity by reducing energy supply to epithelial cells and weakening tight junctions, thereby exacerbating IBS symptoms like abdominal pain and diarrhea, while dysregulated MHC class II protein complex assembly can disrupt antigen presentation to T cells, leading to mucosal immune imbalance and chronic low-grade inflammation (a core pathological feature of IBS), and abnormal VEGFR2-mediated cell proliferation may hinder colonic mucosal repair, further compromising the intestinal barrier. In the small intestine, the unique pathways are mitochondrial tRNA aminoacylation, sulfur compound biosynthetic process, Ovarian steroidogenesis, Primary ovarian insufficiency, and regulation of leukocyte mediated immunity, focusing on energy metabolism (mitochondrial function, as mitochondrial tRNA aminoacylation is critical for mitochondrial protein synthesis), steroid hormone biosynthesis, and local intestinal immune activation (leukocyte-mediated immune responses); in terms of IBS relevance, mitochondrial tRNA aminoacylation abnormalities can reduce energy production in small intestinal epithelial cells, disrupting nutrient absorption and intestinal motility to cause IBS-related malabsorption and irregular bowel habits, dysregulated leukocyte-mediated immunity can trigger local inflammatory responses to accelerate mucosal damage and enhance visceral hypersensitivity (a key driver of IBS abdominal pain), and abnormal Ovarian steroid metabolism may explain the higher prevalence of IBS in women, as sex hormone fluctuations (e.g., during the menstrual cycle) can exacerbate small intestinal dysfunction. As for the blood, its unique core pathways include Cellular responses to stress, Adaptive Immune System, Membrane Trafficking, endocytosis, Metabolism of RNA, and Ebola virus infection in host, with functional emphasis on systemic immune response (regulation by the adaptive immune system), stress adaptation (cellular stress signaling), and subcellular material transport (membrane trafficking and endocytosis); biologically, abnormal cellular responses to stress in the blood highlight the role of the “brain-gut axis” in IBS, as psychological or environmental stressors can trigger systemic stress signaling via the circulatory system to disrupt intestinal motility and visceral sensitivity, explaining stress-induced exacerbation of IBS symptoms (e.g., increased abdominal pain, altered defecation frequency), dysregulated adaptive immunity indicates a systemic inflammatory state in IBS, where circulating immune cells (e.g., T cells, B cells) and inflammatory factors may infiltrate the intestinal mucosa to amplify local inflammation and worsen barrier dysfunction, and impaired membrane trafficking/endocytosis can disrupt the secretion and uptake of neurotransmitters and inflammatory cytokines in the circulatory system, hindering signal communication between blood and intestinal tissues and perpetuating the “intestinal-systemic” pathological loop in IBS.
Common genes shared by TWAS and mRNA expression profiling
Differential analysis was performed on the small intestinal tissues of GSE166869, resulting in 24 upregulated DEGs and 17 downregulated DEGs. The intersection of DEGs from the small intestinal tissues of GSE166869 and TWAS-identified significant genes in small intestinal tissues was taken; unfortunately, no overlapping genes were obtained (Fig 4A). Differential analysis was conducted on the colon tissues of GSE166869, yielding 616 upregulated DEGs and 625 downregulated DEGs. Taking the intersection of DEGs from the colon tissues of GSE166869 and TWAS-identified significant genes in colon tissues, 24 overlapping genes were acquired (Fig 4B). The details of these 24 overlapping genes are detailed in Table 2, including 13 upregulated DEGs and 11 downregulated DEGs. KEGG and GO enrichment analyses were performed on the upregulated and downregulated DEGs among the overlapping genes, respectively (Fig 4C). The activation of inflammatory mediators and inflammatory pathways suggests that the upregulation of these genes may be associated with excessive inflammatory activation in irritable bowel syndrome (IBS). The activation of Glutathione metabolism and branched-chain amino acid degradation pathways indicates that the upregulation of relevant genes may be related to oxidative stress in the intestinal cells of IBS patients. The downregulation of MHC class II antigen processing and presentation pathways suggests that the downregulation of relevant genes may be associated with immune recognition defects in IBS. The downregulation of pathways related to negative regulation of cell growth indicates that the downregulation of corresponding genes may be linked to abnormal cell proliferation and impaired mucosal repair capacity in IBS.
A:volcano plot of small intestinal tissues from GSE166869, and Venn diagram revealing the overlap between differentially expressed genes (DEGs) of small intestinal tissues in GSE166869 and TWAS-significant genes; B: volcano plot of colon tissues from GSE166869, and Venn diagram revealing the overlap between DEGs of colon tissues in GSE166869 and TWAS-significant genes; C: bar charts of KEGG and GO enrichment analyses for upregulated and downregulated genes among the overlapping genes between colon tissue DEGs in GSE166869 and TWAS-significant genes.
Notably, the tissue-specific distribution of TWAS genes and GEO-derived DEGs shows strong consistency: colon-related genes (629 TWAS candidate genes, 1229 DEGs) are substantially more abundant than small intestinal-related genes (241 TWAS candidate genes, 41 DEGs) in both datasets. This discrepancy highlights that the pathological changes of IBS are predominantly concentrated in the colon rather than the small intestine, which is consistent with the clinical manifestation of IBS—abnormal intestinal function mainly involving the colonic segment. Further analysis reveals three key implications: first, the quantitative dominance of colon-related genes reflects the higher genetic and transcriptional regulatory activity in colonic tissues during IBS pathogenesis, suggesting that the colon is the core tissue mediating IBS-related genetic effects and transcriptional dysregulation. Second, the lack of overlapping genes between small intestinal TWAS candidates and GEO-derived small intestinal DEGs may be attributed to the relatively milder pathological alterations in the small intestine of IBS patients, leading to weak transcriptional signals that are difficult to capture. Third, the consistent tissue-specific enrichment pattern between TWAS and GEO datasets enhances the reliability of our findings—both genetic association (TWAS) and transcriptional change (DEGs) analyses independently confirm the colon as the primary site of IBS pathology, providing a solid basis for subsequent targeted exploration of colon-specific biomarkers and therapeutic targets. Regarding the numerical difference between tissue-specific gene sets, the large gap between colon and small intestinal genes (both in TWAS and GEO datasets) further reflects the tissue heterogeneity of IBS genetic regulatory networks. Genetic effects and transcriptional responses during disease progression are not uniformly distributed across intestinal segments; instead, they are concentrated in the colon, which may be related to the colon’s unique physiological functions (e.g., water absorption, fecal formation, and microbial colonization) that make it more susceptible to environmental and genetic perturbations. This tissue-specific feature also explains the low overlap ratio between small intestinal-related gene sets, while the colon-related gene sets show a relatively meaningful overlap (24/629, 3.82% of colon TWAS genes; 24/1229, 1.95% of colon DEGs)—a ratio that is biologically reasonable for complex diseases, considering the multi-factorial nature of IBS pathogenesis.
CGSEA of the TWAS-identified genes
We conducted a CGSEA analysis to explore the environmental factors affecting IBS, identifying 1,789 chemicals (Supplementary Information 2 in S1 File), including 112 significant chemicals (FDR < 0.05, |normalized enrichment score, |NES| > 1). Table 3 lists the top50 compounds ranked by NES. We classified 112 significant chemicals and mapped them to the genes identified by TWAS (Fig 5A). The classifications include Drugs, Nutrients, Organic Compounds, Pollutants/Heavy Metals/Carcinogens, Inorganic Compounds, and Pesticides. The number and proportion of compounds corresponding to each category are shown in Fig 5B. We further performed PPI (Protein-Protein Interaction) analysis on the TWAS-identified genes and used the MCODE plugin to identify 5 key MCODEs in the PPI network (Fig 5C). Additionally, we analyzed the hub genes in the network using the CytoHubba plugin (Fig 5D). Notably, HLA-DQA1 and HLA-DQA2 are not only hub genes but were also identified in our previous analysis.
A: Classification of 112 significant chemicals and their mapping to TWAS-identified genes. B: Number and proportion of compounds in each classification. C: PPI network of TWAS-identified genes and top 5 key MCODEs. D: HUB genes in the PPI network.
Discussion
The health-related quality of life of IBS patients is significantly reduced, increasing the disease burden [35]. Unlike many other gastrointestinal diseases, IBS is a symptom based diagnosis that presents significant challenges in terms of diagnosis, potential pathophysiology, and management [36]. In addition to gastrointestinal-specific mechanisms, interactions between the digestive system and the nervous system are increasingly recognized among the possible causes of IBS. Therefore, the study of IBS-related gene expression differences can help identify new biomarkers to assist in diagnosis. In addition, studying environmental compounds associated with IBS provides new ideas for identifying risk factors,and opens the possibility of developing preventive strategies based on chemical exposure identification.
We performed TWAS analysis to predict the significant genes expressed in the small intestine, transverse colon, sigmoid colon, whole blood, and peripheral blood. Our results suggest that there are five significant TWAS-identified genes associated with IBS among the five tissues, including MSH2, HLA-DQA1, BTN3A2, TMEM80, and CTSW. A large-scale epidemiological review [37] has demonstrated a significantly higher incidence of colorectal cancer following an IBS diagnosis, with an overall risk ratio of approximately 1.5 and a nearly 6.8-fold increase within the first year after diagnosis, indicating that genetic variations in MSH2 (a gene involved in DNA mismatch repair) and related loci may underlie this predisposition. The low level of inflammation underlying the pathogenesis of IBS has received increasing research attention [38]. HLA-DQA1 (a gene involved in DNA mismatch repair), BTN3A2 (butyrophilin subfamily 3 member A2, involved in immune regulation), TMEM80 (transmembrane protein 80), and CTSW (cathepsin W, a cysteine protease expressed in lymphocytes) are all associated with immune response. We further performed functional enrichment analysis of these TWAS-identified genes. KEGG and GO enrichment analyses showed that the MHC class II protein complex pathway is the main pathway that plays an important role in IBS. Intestinal epithelial cells express MHC class II molecules during homeostasis and inflammation and can produce different functional outcomes [39]; this is consistent with our findings. We also enriched for sex hormone-related pathways, such as steroid binding, which helps to explain the sex bias in IBS [40]. Neurotransmitter-gut interactions are one of the key regulatory mechanisms underlying the pathogenesis of IBS [41], and our study also identified cellular amide metabolic process pathways, which may involve neurotransmitters such as serotonin (5-HT) and gamma-aminobutyric acid (GABA), known to be critical in gut-brain communication. In this study, we determined some new pathways associated with IBS, such as galactosyltransferase, acetylglucosaminyltransferase, and acetylglucosaminyltransferase activities, showing that the role of gut microbes and their metabolites in IBS is not negligible [42].
IBS is a functional disease [43], thus, avoiding exposure to risk factors and alleviating its symptoms may be more important. We extended the classic GSEA approach to detect the association between environmental chemicals and IBS using CGSEA analysis. We identified various chemicals, including drugs, pesticides, organic compounds, inorganic compounds, nutrients, pollutants, heavy metals, and carcinogens. IBS is often comorbid with psychiatric disorders such as anxiety and depression [44]. The gut-brain axis is a bidirectional neurohumoral integrated communication between the microbiota and autonomic nervous system [45]. Gut-brain interaction disorders are associated with upper and lower gastrointestinal food intake-induced symptoms, which may be an emerging mechanism for studying IBS [46]. Imipramine is the most significant compound for NES obtained in our analysis using CGSEA and is also a commonly used clinical antidepressant [47]. Other antipsychotics, such as methylphenidate, flecainide, riluzole, pentylenetetrazole, and chlorpromazine, were also observed in our analysis. It is noteworthy that while antipsychotics as a class were enriched, not all drugs within this family showed significant associations, suggesting specificity in the mechanisms linking certain psychoactive compounds to IBS pathophysiology.For instance, the differential enrichment of specific tricyclic antidepressants (e.g., imipramine) but not others may reflect their distinct pharmacological profiles, such as varying affinities for neurotransmitter transporters or ion channels that modulate gut-brain signaling. Similarly, among anti-inflammatory agents, only a subset was significantly enriched, pointing to distinct inflammatory pathways being relevant. We have enriched the anti-inflammatory class of drugs, including ibuprofen, valdecoxib, geldanamycin, and metribolone, which support the role played by inflammation in IBS from the perspective of gene-compound interactions. The absence of enrichment for other common anti-inflammatory drugs (e.g., aspirin or certain corticosteroids) may indicate that the association is specific to compounds affecting particular molecular targets (e.g., COX-2 inhibition or HSP90 modulation) rather than a broad class effect. Similarly, among anti-inflammatory agents, only a subset was significantly enriched, pointing to distinct inflammatory pathways being relevant. Another group of compounds of interest is flavonoids such as catechin, grape seed proanthocyanidins, and sesame oil. Research [48] has shown that plant polyphenols are beneficial for IBS by reducing oxidative stress, altering gut permeability, and affecting gut enzyme activity and microbes. A recent study reported gastrointestinal distress in humans after exposure to pesticides, which usually manifests as IBS. Our results also showed that pesticides might be related to IBS, indicating that the effect of pesticides on IBS may be a new direction for future research. However, it is important to note that the associations identified between environmental chemicals and IBS in this study are correlational and do not imply causality; further experimental and epidemiological validation is required to establish causal links.
Our study explored a large number of compounds associated with IBS by analyzing gene-environment interactions, providing novel insights for IBS research. The identification of key genes such as MSH2, HLA-DQA1, and others, along with enriched pathways related to immune function, sex hormones, and neurotransmitter metabolism, advances our understanding of IBS mechanisms. The association of specific chemical classes, including antipsychotics, anti-inflammatories, and flavonoids, with IBS opens new avenues for etiological research and potential preventive strategies. However, this study still has certain limitations. Firstly, the GWAS summary data used in this research is mainly derived from European populations, which means the findings may exhibit varying degrees of differences across different populations. The lack of multi-ethnic validation cohorts (e.g., Asian, African, or American populations) as well as independent clinical cohorts with detailed phenotypic records may restrict the generalizability of our genetic association results. Secondly, the transcriptomic data used to validate the identified IBS-related key genes was primarily obtained from newly diagnosed IBS-D patients who met the Rome III criteria, prospectively recruited at a gastroenterology clinic by Grover, M et al. [25]. The diagnosis of IBS is relatively challenging, resulting in a small number of patients, which may introduce bias into the validation results. Finally, the IBS-related key genes and compounds identified in this study require further validation in subsequent research, including functional experiments to elucidate their biological functions in the pathogenesis of IBS and their potential pathogenic or therapeutic value. Future work should focus on mechanistic studies to compare the actions of enriched versus non-enriched drugs within the same families, longitudinal assessments of chemical exposures in relation to IBS onset, and multi-omics integration to clarify causal pathways.
Conclusions
This study identified 33 significant genes and 112 environmental chemicals associated with IBS through integrated transcriptomic and chemical-gene interaction analyses. Five key genes (MSH2, HLA-DQA1, BTN3A2, TMEM80, and CTSW) were consistently expressed across all examined tissues, primarily involved in immune-inflammatory pathways and gut-brain interactions. The chemical associations highlight potential environmental risk factors, particularly neuroactive and anti-inflammatory compounds. These findings provide new insights into IBS pathogenesis and identify promising targets for future research on biomarkers and therapeutic strategies.
Supporting information
S1 File. Supporting information.
Supplementary information 1: Transcriptome-Wide association study results of irritable bowel syndrome. Supplementary information 2: Chemical-Gene-Interaction analysis resluts of the transcriptome-wide association study identified genes.
https://doi.org/10.1371/journal.pone.0340666.s001
(ZIP)
References
- 1. Chey WD, Kurlander J, Eswaran S. Irritable bowel syndrome: a clinical review. JAMA. 2015;313(9):949–58. pmid:25734736
- 2. Zielińska A, Sałaga M, Włodarczyk M, Fichna J. Chronic abdominal pain in irritable bowel syndrome - current and future therapies. Expert Rev Clin Pharmacol. 2018;11(7):729–39. pmid:29957084
- 3. Hyams JS, Di Lorenzo C, Saps M, Shulman RJ, Staiano A, van Tilburg M. Functional disorders: children and adolescents. Gastroenterology. 2016;:S0016-5085(16)00181-5. pmid:27144632
- 4. Wang J, Yang P, Zhang L, Hou X. A Low-FODMAP diet improves the global symptoms and bowel habits of adult IBS patients: a systematic review and meta-analysis. Front Nutr. 2021;8:683191. pmid:34490319
- 5. Aziz I, Simrén M. The overlap between irritable bowel syndrome and organic gastrointestinal diseases. Lancet Gastroenterol Hepatol. 2021;6(2):139–48. pmid:33189181
- 6. Holtmann GJ, Ford AC, Talley NJ. Pathophysiology of irritable bowel syndrome. Lancet Gastroenterol Hepatol. 2016;1(2):133–46. pmid:28404070
- 7. Raskov H, Burcharth J, Pommergaard H-C, Rosenberg J. Irritable bowel syndrome, the microbiota and the gut-brain axis. Gut Microbes. 2016;7(5):365–83. pmid:27472486
- 8. Adriani A, Ribaldone DG, Astegiano M, Durazzo M, Saracco GM, Pellicano R. Irritable bowel syndrome: the clinical approach. Panminerva Med. 2018;60(4):213–22. pmid:30257542
- 9. Ranjbar S, Seyednejad SA, Nikfar S, Rahimi R, Abdollahi M. How can we develop better antispasmodics for irritable bowel syndrome?. Expert Opin Drug Discov. 2019;14(6):549–62. pmid:30920313
- 10. Eriksson EM, Andrén KI, Kurlberg GK, Eriksson HT. Aspects of the non-pharmacological treatment of irritable bowel syndrome. World J Gastroenterol. 2015;21(40):11439–49. pmid:26523108
- 11. Lacy BE, Pimentel M, Brenner DM, Chey WD, Keefer LA, Long MD, et al. ACG clinical guideline: management of irritable bowel syndrome. Am J Gastroenterol. 2021;116(1):17–44. pmid:33315591
- 12. Rej A, Aziz I, Tornblom H, Sanders DS, Simrén M. The role of diet in irritable bowel syndrome: implications for dietary advice. J Intern Med. 2019;286(5):490–502. pmid:31468640
- 13. de Silva PS, Yang X, Korzenik JR, Goldman RH, Arheart KL, Caban-Martinez AJ. Association of urinary phenolic compounds, inflammatory bowel disease and chronic diarrheal symptoms: Evidence from the National Health and Nutrition Examination Survey. Environ Pollut. 2017;229:621–6. pmid:28689150
- 14. Lu M, Feng R, Qin Y, Deng H, Lian B, Yin C, et al. Identifying environmental endocrine disruptors associated with the age at menarche by integrating a transcriptome-wide association study with chemical-gene-interaction analysis. Front Endocrinol (Lausanne). 2022;13:836527. pmid:35282430
- 15. Cheng S, Wen Y, Ma M, Zhang L, Liu L, Qi X, et al. Identifying 5 common psychiatric disorders associated chemicals through integrative analysis of genome-wide association study and chemical-gene interaction datasets. Schizophr Bull. 2020;46(5):1182–90. pmid:32291453
- 16. Eijsbouts C, Zheng T, Kennedy NA, Bonfiglio F, Anderson CA, Moutsianas L, et al. Genome-wide analysis of 53,400 people with irritable bowel syndrome highlights shared genetic pathways with mood and anxiety disorders. Nat Genet. 2021;53(11):1543–52. pmid:34741163
- 17. Adams MK, Belyaeva OV, Wu L, Kedishvili NY. The retinaldehyde reductase activity of DHRS3 is reciprocally activated by retinol dehydrogenase 10 to control retinoid homeostasis. J Biol Chem. 2014;289(21):14868–80. pmid:24733397
- 18. Porcu E, Rüeger S, Lepik K, eQTLGen Consortium, BIOS Consortium, Santoni FA, et al. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun. 2019;10(1):3300. pmid:31341166
- 19. Gong L, Zhang D, Lei Y, Qian Y, Tan X, Han S. Transcriptome-wide association study identifies multiple genes and pathways associated with pancreatic cancer. Cancer Med. 2018;7(11):5727–32. pmid:30334361
- 20. Wu Y, Murray GK, Byrne EM, Sidorenko J, Visscher PM, Wray NR. GWAS of peptic ulcer disease implicates Helicobacter pylori infection, other gastrointestinal disorders and depression. Nat Commun. 2021;12(1):1146. pmid:33608531
- 21. Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48(3):245–52. pmid:26854917
- 22. Zhang Y, Quick C, Yu K, Barbeira A, GTEx Consortium, Luca F, et al. PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis. Genome Biol. 2020;21(1):232. pmid:32912253
- 23. Zhou X, Carbonetto P, Stephens M. Polygenic modeling with bayesian sparse linear mixed models. PLoS Genet. 2013;9(2):e1003264. pmid:23408905
- 24. Wu C, Tan S, Liu L, Cheng S, Li P, Li W, et al. Transcriptome-wide association study identifies susceptibility genes for rheumatoid arthritis. Arthritis Res Ther. 2021;23(1):38. pmid:33482886
- 25. Grover M, Berumen A, Peters S, Wei T, Breen-Lyles M, Harmsen WS, et al. Intestinal chemosensitivity in irritable bowel syndrome associates with small intestinal TRPV channel expression. Aliment Pharmacol Ther. 2021;54(9):1179–92. pmid:34472640
- 26.
Smyth GK. limma: Linear Models for Microarray Data. In: Gentleman R, Carey VJ, Huber W, Irizarry RA, Dudoit S, editors. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. New York, NY: Springer New York; 2005. p. 397–420.
- 27. Cao S, Wei Y, Yue Y, Wang D, Xiong A, Yang J, et al. Bioinformatics Identification and experimental verification of disulfidptosis-related genes in the progression of osteoarthritis. Biomedicines. 2024;12(8):1840. pmid:39200304
- 28. Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32(18):2847–9. pmid:27207943
- 29. Hill DP, Blake JA, Richardson JE, Ringwald M. Extension and integration of the gene ontology (GO): combining GO vocabularies with external vocabularies. Genome Res. 2002;12(12):1982–91. pmid:12466303
- 30. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30. pmid:10592173
- 31. Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, et al. STRING 8--a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 2009;37(Database issue):D412-6. pmid:18940858
- 32. Cheng S, Ma M, Zhang L, Liu L, Cheng B, Qi X, et al. CGSEA: A flexible tool for evaluating the associations of chemicals with complex diseases. G3 (Bethesda). 2020;10(3):945–9. pmid:31937547
- 33. Mattingly CJ, Colby GT, Rosenstein MC, Forrest JN Jr, Boyer JL. Promoting comparative molecular studies in environmental health research: an overview of the comparative toxicogenomics database (CTD). Pharmacogenomics J. 2004;4(1):5–8. pmid:14735110
- 34. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. pmid:16199517
- 35. Heidelbaugh JJ, Stelwagon M, Miller SA, Shea EP, Chey WD. The spectrum of constipation-predominant irritable bowel syndrome and chronic idiopathic constipation: US survey assessing symptoms, care seeking, and disease burden. Am J Gastroenterol. 2015;110(4):580–7. pmid:25781368
- 36. Devanarayana NM, Rajindrajith S. Irritable bowel syndrome in children: Current knowledge, challenges and opportunities. World J Gastroenterol. 2018;24(21):2211–35. pmid:29881232
- 37. Wu X, Wang J, Ye Z, Wang J, Liao X, Liv M, et al. Risk of colorectal cancer in patients with irritable bowel syndrome: a meta-analysis of population-based observational studies. Front Med (Lausanne). 2022;9:819122. pmid:35308554
- 38. Burns GL, Talley NJ, Keely S. Immune responses in the irritable bowel syndromes: time to consider the small intestine. BMC Med. 2022;20(1):115. pmid:35354471
- 39. Heuberger C, Pott J, Maloy KJ. Why do intestinal epithelial cells express MHC class II?. Immunology. 2021;162(4):357–67. pmid:32966619
- 40. So SY, Savidge TC. Sex-Bias in irritable bowel syndrome: linking steroids to the gut-brain axis. Front Endocrinol (Lausanne). 2021;12:684096. pmid:34093447
- 41. Chen M, Ruan G, Chen L, Ying S, Li G, Xu F, et al. Neurotransmitter and intestinal interactions: focus on the microbiota-gut-brain axis in irritable bowel syndrome. Front Endocrinol (Lausanne). 2022;13:817100. pmid:35250873
- 42. Xiao L, Liu Q, Luo M, Xiong L. Gut microbiota-derived metabolites in irritable bowel syndrome. Front Cell Infect Microbiol. 2021;11:729346. pmid:34631603
- 43. Rolland-Fourcade C, Denadai-Souza A, Cirillo C, Lopez C, Jaramillo JO, Desormeaux C, et al. Epithelial expression and function of trypsin-3 in irritable bowel syndrome. Gut. 2017;66(10):1767–78. pmid:28096305
- 44. Staudacher HM, Mikocka-Walus A, Ford AC. Common mental disorders in irritable bowel syndrome: pathophysiology, management, and considerations for future randomised controlled trials. Lancet Gastroenterol Hepatol. 2021;6(5):401–10. pmid:33587890
- 45. Jacobs JP, Gupta A, Bhatt RR, Brawer J, Gao K, Tillisch K, et al. Cognitive behavioral therapy for irritable bowel syndrome induces bidirectional alterations in the brain-gut-microbiome axis associated with gastrointestinal symptom improvement. Microbiome. 2021;9(1):236. pmid:34847963
- 46. Van den Houte K, Bercik P, Simren M, Tack J, Vanner S. Mechanisms underlying food-triggered symptoms in disorders of gut-brain interactions. Am J Gastroenterol. 2022;117(6):937–46. pmid:35506862
- 47. Kriston L, von Wolff A, Westphal A, Hölzel LP, Härter M. Efficacy and acceptability of acute treatments for persistent depressive disorder: a network meta-analysis. Depress Anxiety. 2014;31(8):621–30. pmid:24448972
- 48. Hagan M, Hayee BH, Rodriguez-Mateos A. (Poly)phenols in inflammatory bowel disease and irritable bowel syndrome: a review. Molecules. 2021;26(7):1843. pmid:33805938