Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Gene co-expression network analysis of the human gut commensal bacterium Faecalibacterium prausnitzii in R-Shiny

Abstract

Faecalibacterium prausnitzii is abundant in the healthy human intestinal microbiota, and the absence or scarcity of this bacterium has been linked with inflammatory diseases and metabolic disorders. F. prausnitzii thus shows promise as a next-generation probiotic for use in restoring the balance of the gut microbial flora and, due to its strong anti-inflammatory properties, for the treatment of certain pathological conditions. However, very little information is available about gene function and regulation in this species. Here, we utilized a systems biology approach—weighted gene co-expression network analysis (WGCNA)–to analyze gene expression in three publicly available RNAseq datasets from F. prausnitzii strain A2-165, all obtained in different laboratory conditions. The co-expression network was then subdivided into 24 co-expression gene modules. A subsequent enrichment analysis revealed that these modules are associated with different kinds of biological processes, such as arginine, histidine, cobalamin, or fatty acid metabolism as well as bacteriophage function, molecular chaperones, stress response, or SOS response. Some genes appeared to be associated with mechanisms of protection against oxidative stress and could be essential for F. prausnitzii’s adaptation and survival under anaerobic laboratory conditions. Hub and bottleneck genes were identified by analyses of intramodular connectivity and betweenness, respectively; this highlighted the high connectivity of genes located on mobile genetic elements, which could promote the genetic evolution of F. prausnitzii within its ecological niche. This study provides the first exploration of the complex regulatory networks in F. prausnitzii, and all of the “omics” data are available online for exploration through a graphical interface at https://shiny.migale.inrae.fr/app/faeprau.

Introduction

The human gut microbiota plays a fundamental role in human health. The collective genomes of bacteria and other microorganisms in this ecosystem, including fungi, viruses, and archaea, have received increasing attention in the past two decades (Integrative HMP (iHMP) Consortium, 2019). Taken together, this research has revealed the extent to which the gut microbiome actively affects numerous host functions, including immune system development, maintenance of intestinal mucosal integrity, human metabolism, brain processes, and behavior [14]. The community composition and behavior of this assemblage can be influenced by a variety of factors, such as diet and lifestyle. Alterations to the gut microbiota, referred to as dysbiosis, can disrupt essential health-promoting services and are associated with a variety of illnesses, including Inflammatory Bowel Diseases (IBDs) and cancer [57]. Within the gut microbiota, microorganisms interact in multiple contexts and exhibit heterogeneous behaviors. The diversity and number of these interactions can generate unpredictable population dynamic, and the emergent properties and cross-scale interactions within these complex systems are best studied using comprehensive investigations of the entire community. In this context, the integrative approaches used in systems biology represent important tools for studying the interplay between the different biological systems within the whole, and may help to elucidate host microbiome interactions [8,9].

Among the strictly anaerobic bacteria present in the human colon, Faecalibacterium praunsitzii is one of the most common taxa and serves as a general health biomarker in humans [10]. The abundance of F. prausnitzii is reflective of the health status and colonic environment of the host. Low levels of this species in the gut are associated with various gastrointestinal disorders, such as IBDs, Irritable Bowel Syndrome, colorectal cancer, or obesity [11,12]. One of the mechanisms by which F. prausnitzii exerts a beneficial effects is the production of butyrate, which is involved in maintaining the gut lining and in fighting inflammation [1315]. Moreover, this species produces several bioactive molecules that affect inflammation and gut barrier function, such as the microbial anti-inflammatory molecule (MAM) [1618]. F. prausnitzii is thus considered a promising next generation probiotic (NGP) which can help to not only restore the balance of the microbial flora but also to aid in the treatment of certain pathological conditions [19,20]. Despite the importance of F. prausnitzii within the gut, nothing is known about the regulatory factors responsible for a range of important processes, including intestinal colonization, quorum sensing, and stress responses to bile salt, acidic pH, or oxidative stress. In this bacterium and for NGPs in general, the first step in understanding the interactions and behavior within the gut is to consider individual bacteria as a biological system, one that responds to environmental perturbations through complex networks of gene interactions.

Recent studies have utilized high-throughput RNA sequencing (RNA-seq) to examine gene expression in F. prausnitzii strain A2-165, mostly with the goal of identifying differential gene expression under different conditions [2123]. However, certain underlying properties can only be explained by studying organisms as complex systems [2426]. To this end, expression data can be used to accurately group genes into functional modules based on co-expression patterns. In particular, the algorithm developed for weighted gene co-expression network analysis (WGCNA) enables the construction of gene networks through consideration of the co-expression patterns between two genes as well as the overlap between neighboring genes [27]. Highly correlated genes are clustered into larger modules based on similarities in their expression profiles, and members of a given module are often involved in similar functional processes [28]. To date, WGCNA has been successfully used to construct gene co-expression networks in several bacteria, including Mycobacterium tuberculosis [29], Escherichia coli [30], Lactococcus lactis [31], Vibrio cholerae [32], and Streptococcus oralis [33].

In the current study, gene expression data were obtained for F. prausnitzii A2-165 from three public RNAseq datasets. Co-expression networks were analyzed using WGCNA and genes were clustered into modules with similar expression patterns. For each module, we investigated enrichment in biological functions. The structure of the network was also analyzed to identify hub and bottleneck genes; of the 25 we identified, the majority were located in the region of six mobile genetic elements that could promote intracellular or intercellular DNA mobility. This study represents the first time that a gene expression network has been constructed for F. prausnitzii. In addition, an R-Shiny application is available online to make the data and the analyses accessible and usable to a wide audience (https://shiny.migale.inrae.fr/app/faeprau).

Materials and methods

Data sources

In the Lebas dataset [22], the authors explored the effects on the transcriptomic profile of F. prausnitzii A2-165 of treatment with supernatant from Lactococcus lactis subsp. lactis CNCM I-1631, Lactococcus lactis subsp. cremoris CNCM I-3558, Lactobacillus paracasei CNCM I-3689, and Streptococcus thermophilus CNCM I-3862. The raw RNA-seq fastq files are available at ArrayExpress under project E-MTAB-9387. In the D’hoe dataset, the authors sequenced RNA from F. prausnitzii A2-165 monocultures at three different time points [21]. The RNA-seq results were deposited in the Short Read Archive under the study identifier SRP136465. In the Kang dataset [23], the authors compared the transcriptomic profiles of F. prausnitzii A2-165 cultured using different carbohydrates (galactose, fructose, glucose, N-acetylglucosamine) as the sole carbon source. Data are in the Short Read Archive under the study identifier SRX10245665. Datasets and source code have been deposited in the GitLab repository at https://gitlab.com/adlin-science-public/FaePrau.

Data processing

The workflow for this study is presented in S1 Fig. A read count table was obtained using the raw sequencing reads. After trimming reads with fastp v.0.20.0 (default parameters, [34]), fastq-formatted reads were aligned to the genome of F. prausnitzii strain A2-165 (Genome Assembly ASM273414v1) using BWA v.0.7.17 [35], allowing a single mismatch in the read. Then, sam-formatted alignments were sorted and converted to bam output files using SAMtools v.1.10 [36]. The number of reads per transcript from each sample was counted using HTSeqCount v.0.12.4 [37] and GFF-formatted gene annotations downloaded from NCBI. We checked the distribution of raw counts and performed principal component analysis in each dataset (S2 Fig). Gene expression values were normalized using the DEseq2 package v.1.34.0, in R [38]. The count table, containing 2,950 genes (S1 Table), was filtered to eliminate non-expressed genes. The resulting final dataset contained 2,902 genes and was further processed with WGCNA [39]. The quality of the expression matrix was evaluated by hierarchical clustering based on the distance between different samples, measured using Spearman’s correlation. No outliers were detected (S3 Fig).

Data analysis

Differential expression analysis was performed with the DEseq2 package v.1.34.0. We applied cutoffs of P-adj (corrected FDR) < 0.05 and absolute log2FoldChange (log2FC) > 1.5 for the detection of genes that were differentially expressed between the conditions considered and the negative control. Partial least squares-discriminant analysis (PLS-DA) implemented in the mixOmics package of R (v.6.18.1) [40] was used.

Construction of weighted gene co-expression network

WGCNA is widely used in systems biology to construct a scale-free network from gene expression data [41]. In this study, we used the WGCNA package in R (v.1.71–3). In brief, Pearson’s correlation matrices were calculated for all pairs of genes and were subsequently transformed into matrices of connection strengths using a power function; in this case, the power was β = 17. We selected pairs of genes with distance correlation coefficients greater than 0.5 from the three datasets. The connection strengths were then used to calculate topology overlap (TO) [42], which measures the connectivity of a pair of genes. Hierarchical average linkage clustering [43] based on TO was used to identify gene co-expression modules, which groups genes with similar patterns of expression. We chose the minimum module size of 20 without merging the modules (a height cut of 0.0).

Functional analysis and networks

After extracting the list of genes present in each module, we used eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments [44]. Functional enrichment analysis was performed in each module with STRING (version 9.0; https://string-db.org/) in March 2022. Enrichments with strength > 0.5 were selected. The resulting interaction networks contained direct (physical) and indirect (functional) interactions, derived from numerous sources including experimental repositories and computational prediction methods.

Nodes with a high degree of connectivity are called “hubs”; they interact with several other genes and may thus play a central role in the network. Instead, “betweenness” measures the total number of shortest nonredundant paths passing through a given node or edge. By definition, most of the shortest paths in a network go through nodes with a high degree of betweenness. These nodes, named “bottlenecks”, become central points controlling the communication among other nodes in the network. In this study, we looked at the measures of connectivity and betweeness within each module by using a relative hub/bottleneck threshold of top 10%. Network visualization was performed with the igrah package v1.2.8.

Results and discussion

Comprehensive transcriptomic profiles from RNAseq data

Raw data from previous gene expression profiling analyse [2123] were preprocessed using a similar workflow (S1 Fig). Details on these experiments and the samples used are presented in Table 1. To investigate differences in the transcriptomic profiles among the different conditions analyzed, we conducted a PLS-DA. This revealed a clear separation along the gene expression datasets from the different publications along the first two principal components, which explained 40% and 9% of the observed covariance (Fig 1A). While component 1 was sufficient to distinguish the Lebas dataset, the second component was necessary for separation of the D’hoe and Kang datasets. In total, we identified 50 genes that could be used to discriminate between the Lebas and Kang samples (component 1) and 50 genes that could be used to distinguish the D’hoe and Kang datasets (component 2) (Fig 1B). As shown in Fig 1C, the first component was principally influenced by clusters of orthologous genes (COGs) related to translation, carbohydrate transport and metabolism, cell cycle control, and lipid transport, and metabolism. The second component was more strongly affected by genes involved in coenzyme transport and metabolism, and inorganic ion transport and metabolism. In particular, among these discriminatory genes, we detected rubrerythrin (CG447_11045) and reverse rubrerythrin (CG447_01540), a putative iron transporter (CG447_11790 and CG447_11800), and enzymes of the cobalamin biosynthesis pathway (CG447_11805, CG447_11815, and CG447_11845). Rubrerythrin and reverse rubrerythrin are known to function primarily in the defense against oxidative radicals in anaerobic bacteria, while oxygen sensing and tight regulation of iron homeostasis are closely linked to bacterial growth in anaerobic conditions [45,46]. In addition, cobalamin has been demonstrated to have a protective effect against oxidative stress in some bacteria [47]. Therefore, it appears that some of the key discriminatory genes we identified are functionally related to each other and associated with mechanisms for protection against oxidative stress, which could be essential for the growth of F. prausnitzii under the different anaerobic conditions investigated in the three previous publications.

thumbnail
Fig 1. Partial least-square discriminant analysis of RNA-seq data.

(A) PLS-DA plot of RNA-seq data showing clear transcriptome-based discrimination among the three publication datasets. Each point represents the transcriptome signature of one sample, with ellipses representing 95% confidence level. (B) Genes with high discriminatory ability were identified from PLS-DA. (C) Histogram showing the distribution of COG categories (Clusters of Orthologous Genes) associated with the genes with the highest discriminatory power in the two components generated by PLS-DA. See S2 Table for the full list of discriminatory genes.

https://doi.org/10.1371/journal.pone.0271847.g001

thumbnail
Table 1. Description of the dataset and samples used in this study.

https://doi.org/10.1371/journal.pone.0271847.t001

Exploration of transcriptomic data with R-Shiny

We developed an R-Shiny application to facilitate deeper exploration of the transcriptomic data from each of the three studies. For example, using this approach on the Lebas dataset, we observed that treatment of F. prausnitzii with supernatant from L. lactis subsp. lactis generated more transcriptional downregulation than upregulation relative to the control (see our R-Shiny application), confirming the findings of the original study [22]. In the Kang dataset, a comparison between bacteria raised on glucose or fructose as the sole carbon source highlighted that the most upregulated gene was CG447_08360, which encodes a putative PTS glucose transporter subunit IIBC (Fig 2A and 2B). This gene was also highly upregulated in the presence of galactose. As the transcription PTS genes is usually regulated in response to substrate availability [48], this suggests that CG447_08360 encodes a galactose/glucose transporter. Finally, the D’hoe dataset provided the opportunity to track F. prausnitzii gene expression across three growth time-points. In the R-Shiny application, it was possible to compare, for example, expression at time 15h with that at time 3h and to highlight the most upregulated and downregulated genes between these time-points (Fig 2C–2H). Notably, serine/threonine and ferrous iron transporters were highly upregulated at the earlier time compared to later, which suggests that available serine, threonine, and ferrous iron are imported into the cell during early growth (Fig 2D and 2E). It was also interesting to observe the upregulation of CG447_14545 at time 3h (Fig 2F); this gene encodes a multi-antimicrobial extrusion (MATE) efflux transporter, which plays a role in antibiotic resistance as well as potentially in bacterial-host interactions and intercellular signaling [49,50]. Instead, time-point 15h was marked by strong upregulation of CG447_09920 and CG447_06950, which encode energy-coupling factor (ECF) transport S component and DNA polymerase IV, respectively (Fig 2G and 2H). ECF transporters are responsible for vitamin uptake in prokaryotes [51], while DNA polymerase IV is part of the SOS response to DNA damage in bacteria and contributes significantly to cell fitness in late stationary phase cultures in the absence of any exogenous DNA damage [52]. Our R-Shiny application thus provided new insight into each dataset and reveals genes that may play a role in the fitness of F. prausnitzii under the different growth conditions employed.

thumbnail
Fig 2. Illustration of transcriptomic data mining.

(A) and (C) Visualization of RNAseq results with volcano plots. (B, D, E, F, G, H) Boxplots showing the read counts for genes of interest among different growth conditions.

https://doi.org/10.1371/journal.pone.0271847.g002

Construction of weighted gene co-expression network

To choose the best soft-thresholding power, we analyzed network topology and determined that 17 was the lowest appropriate power value, with a scale-free topology fit index of 0.95 and a relatively high average connectivity (Fig 3A). WGCNA revealed 24 distinct modules representing highly co-expressed networks of genes (Fig 3B–3D). The list of genes in each module of the WGCNA analysis is available in S3 Table. Two modules, turquoise and blue, were more notably larger than the rest, with 568 and 431 genes, respectively (Table 2). The grey module contained the non-clustering genes, of which there were only 36, suggesting that WGCNA performed well under the parameters applied here.

thumbnail
Fig 3. Network construction and module detection with WGCNA.

(A) Network topology analysis for various values of soft-thresholding powers, with the scale-free index and the mean connectivity as a function of the soft-thresholding power. (B) Dendrogram of all genes divided into 24 modules, with dissimilarity based on topological overlap, presented with assigned module colors. (C) Dendogram representing the 24 modules identified by WGCNA. The heightcut (red line, heightcut = 0.0) was used to unmerge modules. The grey module represents genes that are not included in any of the other modules. (D) A heatmap depicting the topological overlap matrix (TOM) among all genes in the analysis. The intensity of the red color indicates the strength of the correlation between all pairs of genes.

https://doi.org/10.1371/journal.pone.0271847.g003

Individual modules can represent independent units that are responsible for certain biological functions. By linking the gene expression of the modules with the experimental conditions employed, it may be possible to identify modules that have important functions under certain conditions. To this end, we used WGCNA to correlate each module with the conditions used in the three publications (Lebas, D’hoe and Kang) by calculating values of Module Significance (MS) for module-trait correlations (Fig 4A). The expression levels for each module across the 24 samples were then visualized (Fig 4B). Several large modules represent expression changes that clearly delineate between Lebas on one side and D’hoe and Kang on the other. Indeed, the blue, cyan, pink, brown, and purple modules (Pearson correlation coefficient ≥ 0.75) correspond to genes that were upregulated in the D’hoe and Kang datasets, while the green, salmon, red, and yellow modules (Pearson correlation coefficient ≤ - 0.75) represent genes that were upregulated in the Lebas dataset. This suggests that the expression patterns of the genes in these modules reflect the ways in which the growth of F. prausnitzii differed among the conditions used in the different studies.

thumbnail
Fig 4. Identification of the modules associated with conditions in the three original datasets.

(A) Heatmap depicting the correlation between module eigengenes and the original datasets. Pearson coefficient correlations are indicated. The p-value is indicated in parentheses. (B) Heatmap of gene expression levels in the modules across the samples in the three original datasets.

https://doi.org/10.1371/journal.pone.0271847.g004

Functional and pathway enrichment analysis

The top biological processes associated with the large brown and blue modules were “structural constituent of ribosome” and “regulation of translation”, respectively (Table 3). Several modules were enriched in metabolic functions such as arginine, histidine, cobalamin, or fatty acid biosynthesis (green, red, lightgreen, or royal blue modules, respectively; Table 3 and Fig 5). We also identified clusters of genes related to bacteriophage function, chaperones, stress response, or SOS response, suggesting the involvement of functions related to adaptive responses (darkred, tan and black modules). Overall, we found that genes involved in similar pathways or with the same biological function tended to belong to the same expression cluster with WGCNA, which supports the relevance and utility of this integrative bioinformatics approach in this context.

thumbnail
Fig 5. Illustration of enrichment analysis with STRING (version 9.0).

Results are presented for the modules “Lightgreen” (red color: Cobalamin biosynthetic pathway), “Darkred” (blue color: Bacteriophage functions), “Royalblue” (red color: Fatty acid biosynthesis), “Purple” (green color: Transposon-encoded proteins; red color: Molybdenum cofactor biosynthesis; yellow color: Endonuclease/relaxase).

https://doi.org/10.1371/journal.pone.0271847.g005

thumbnail
Table 3. Functional enrichment in each module, analyzed with STRING 9.0.

https://doi.org/10.1371/journal.pone.0271847.t003

In their original study, Lebas et al. showed that F. prausnitzii A2-165 responds to cell-free supernatants from lactic acid bacteria by downregulating mobilome genes and upregulating cell-wall related genes. Here, we observed that the purple module (downregulated in Lebas relative to D’hoe and Kang) contained clusters of genes related to transposon and mobilization proteins, while conversely, the yellow module (upregulated in Lebas relative to D’hoe and Kang) was linked with processes associated to the cell wall and the regulation of cell shape. In addition, we found that some genes involved in mobilization or cell-wall synthesis form potential networks of interaction (Table 3). In this way, the results from our WGCNA strengthen the observations by Lebas et al.

Identification of hub and bottleneck genes

Next, we focused on the genes with the highest intramodular connectivity (hub genes), as these serve as the best representatives of the expression of the module as a whole [39] and play important roles in biological processes. Network analysis highlighted the 25 genes with highest connectivity, and all belong to the turquoise module (Fig 6 and Table 4). Genes with the highest degree of betweenness (called bottlenecks) control most of the information flow, and thus represent the critical points of the network [53]. In our network, we observed that the top 25 bottleneck genes also belonged to the turquoise module, and they tended to be hubs as well (Table 4). Among these, we identified genes encoding proteins involved in DNA metabolism such as DNA topoisomerase III (CG447_13500 and CG447_13565), DNA primase (CG447_13520 and CG447_13625), relaxase (CG447_02250), and integrase (CG447_13030), as well as group II intron reverse transcriptase/maturase (CG447_11550). When we analyzed the location of these genes on the chromosome, we found that they were distributed in seven regions: six encoded mobile genetic elements (MGEs) and one carried prophage genes. Interestingly, the TraG protein family, encoded by putative MGEs 1, 4, and 6 (Table 4), has been shown to be involved in bacterial conjugation [54]. However, to our knowledge, no conjugation system has been described in F. prausnitzii and no endogenous plasmid was isolated in the sequenced strains. The absence of conjugative plasmids but the presence of many conjugation-related genes suggests the presence of integrative and conjugative elements (ICEs) in F. prausnitzii. In addition, we observed that the six MGEs highlighted in the turquoise module were strongly associated within a gene network. This raises questions about possible shared means of regulation and/or induction for these elements.

thumbnail
Fig 6. Network plot depicting the top connections in the “turquoise” module.

Nodes represent genes, and node size is correlated with the degree of connectivity of the gene.

https://doi.org/10.1371/journal.pone.0271847.g006

thumbnail
Table 4. Identification of hub and bottleneck genes from the WGCNA network.

https://doi.org/10.1371/journal.pone.0271847.t004

The conditions in the gut, which are characterized by extremely high concentrations of microbial cells and phages, represent one of the most favorable ecological niches for horizontal gene exchange. Indeed, the gut microbiome responds to selective pressure through the genetic restructuring of gut populations, driven mainly via horizontal gene exchange [55,56]. Recently, it was shown that ICEs play an important role in the adaptation of Streptococcus salivarius to human oral, pharyngeal and gut environments [57]. Indeed, ICEs encode numerous functions such as resistance to stress or antibiotics and numerous enzymes involved in diverse cellular metabolic pathways. Further investigation is needed to understand the role(s) played by MGEs in the evolution of the F. prausnitzii genome, their influence on the dynamic response to selective pressure (i.e. antibiotics, IBDs), and the consequences for host health.

Conclusions

Despite the importance of F. prausnitzii to gut health, the gene regulatory networks that determine the behavior of this bacterium within its ecological niche are still unknown. Using an integrative bioinformatics approach, this work describes the first reconstruction of co-expression networks in F. prausnitzii strain A2-165. Our findings provide insights into the relationships among several cellular processes, and provide evidence for a tight network of mobile genetic elements. This work highlights candidate genes that should be further investigated for their role(s) in the response of F. prausnitzii to different factors in the gut, including antibiotics, therapies, lifestyle, or diet. As F. prausnitzii is considered a promising next generation probiotic, our results raise the question of its functionality, which seems related to environmental conditions. This aspect must be taken into account for the optimization in the administration of F. prausnitzii as a probiotic in patients with an altered digestive ecosystem. In addition, we provide an online tool at https://faeprau.omics.ovh/ for data exploration through a user-friendly graphical interface that may be useful for the scientific community. In the future, this interface may evolve in order to add RNAseq data generated by research teams studying F. prausnitzii.

Supporting information

S1 Fig. Flow chart of data preparation, processing, and analysis.

https://doi.org/10.1371/journal.pone.0271847.s001

(TIF)

S2 Fig. Exploratory analyses of each dataset.

After processing the raw data, we checked the distribution of raw counts and performed principal component analysis.

https://doi.org/10.1371/journal.pone.0271847.s002

(TIF)

S3 Fig. Sample clustering.

The quality of the expression matrix was evaluated by sample clustering based on the distance between different samples, measured as Spearman’s correlation. No outliers were detected.

https://doi.org/10.1371/journal.pone.0271847.s003

(TIF)

S2 Table. List of discriminatory genes in components 1 and 2 with PLS-DA analysis.

The tool eggNOG-mapper was used for functional annotation based on precomputed orthology assignments.

https://doi.org/10.1371/journal.pone.0271847.s005

(XLSX)

S3 Table. List of the genes in the detected modules.

https://doi.org/10.1371/journal.pone.0271847.s006

(XLSX)

References

  1. 1. Antonini M, Lo Conte M, Sorini C, Falcone M. How the Interplay Between the Commensal Microbiota, Gut Barrier Integrity, and Mucosal Immunity Regulates Brain Autoimmunity. Front Immunol. 2019;10: 1937. pmid:31475000
  2. 2. Agus A, Clément K, Sokol H. Gut microbiota-derived metabolites as central regulators in metabolic disorders. Gut. 2021;70: 1174–1182. pmid:33272977
  3. 3. Cryan JF, O’Riordan KJ, Cowan CSM, Sandhu KV, Bastiaanssen TFS, Boehme M, et al. The Microbiota-Gut-Brain Axis. Physiological Reviews. 2019;99: 1877–2013. pmid:31460832
  4. 4. Zheng D, Liwinski T, Elinav E. Interaction between microbiota and immunity in health and disease. Cell Res. 2020;30: 492–506. pmid:32433595
  5. 5. Imhann F, Vich Vila A, Bonder MJ, Fu J, Gevers D, Visschedijk MC, et al. Interplay of host genetics and gut microbiota underlying the onset and clinical presentation of inflammatory bowel disease. Gut. 2018;67: 108–119. pmid:27802154
  6. 6. Sokol H, Leducq V, Aschard H, Pham H-P, Jegou S, Landman C, et al. Fungal microbiota dysbiosis in IBD. Gut. 2017;66: 1039–1048. pmid:26843508
  7. 7. Zitvogel L, Galluzzi L, Viaud S, Vétizou M, Daillère R, Merad M, et al. Cancer and the gut microbiota: An unexpected link. Sci Transl Med. 2015;7. pmid:25609166
  8. 8. Altay O, Nielsen J, Uhlen M, Boren J, Mardinoglu A. Systems biology perspective for studying the gut microbiota in human physiology and liver diseases. EBioMedicine. 2019;49: 364–373. pmid:31636011
  9. 9. Rosario D, Boren J, Uhlen M, Proctor G, Aarsland D, Mardinoglu A, et al. Systems Biology Approaches to Understand the Host–Microbiome Interactions in Neurodegenerative Diseases. Front Neurosci. 2020;14: 716. pmid:32733199
  10. 10. Lopez-Siles M, Duncan SH, Garcia-Gil LJ, Martinez-Medina M. Faecalibacterium prausnitzii: from microbiology to diagnostics and prognostics. ISME J. 2017;11: 841–852. pmid:28045459
  11. 11. Sokol H, Pigneur B, Watterlot L, Lakhdari O, Bermúdez-Humarán LG, Gratadoux J-J, et al. Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of Crohn disease patients. Proc Natl Acad Sci U S A. 2008;105: 16731–16736. pmid:18936492
  12. 12. Feng J, Tang H, Li M, Pang X, Wang L, Zhang M, et al. The abundance of fecal Faecalibacterium prausnitzii in relation to obesity and gender in Chinese adults. Arch Microbiol. 2014;196: 73–77. pmid:24292154
  13. 13. Leonel AJ, Alvarez-Leite JI. Butyrate: implications for intestinal function. Curr Opin Clin Nutr Metab Care. 2012;15: 474–479. pmid:22797568
  14. 14. Miquel S, Leclerc M, Martin R, Chain F, Lenoir M, Raguideau S, et al. Identification of metabolic signatures linked to anti-inflammatory effects of Faecalibacterium prausnitzii. mBio. 2015;6: e00300–15. pmid:25900655
  15. 15. Lenoir M, Martín R, Torres-Maravilla E, Chadi S, González-Dávila P, Sokol H, et al. Butyrate mediates anti-inflammatory effects of Faecalibacterium prausnitzii in intestinal epithelial cells through Dact3. Gut Microbes. 2020;12: 1–16. pmid:33054518
  16. 16. Martín R, Bermúdez-Humarán LG, Langella P. Searching for the Bacterial Effector: The Example of the Multi-Skilled Commensal Bacterium Faecalibacterium prausnitzii. Front Microbiol. 2018;9: 346. pmid:29559959
  17. 17. Quévrain E, Maubert MA, Michon C, Chain F, Marquant R, Tailhades J, et al. Identification of an anti-inflammatory protein from Faecalibacterium prausnitzii, a commensal bacterium deficient in Crohn’s disease. Gut. 2016;65: 415–425. pmid:26045134
  18. 18. Auger S, Kropp C, Borras-Nogues E, Chanput W, Andre-Leroux G, Gitton-Quent O, et al. Intraspecific Diversity of Microbial Anti-Inflammatory Molecule (MAM) from Faecalibacterium prausnitzii. Int J Mol Sci. 2022;23: 1705. pmid:35163630
  19. 19. Martín R, Miquel S, Benevides L, Bridonneau C, Robert V, Hudault S, et al. Functional Characterization of Novel Faecalibacterium prausnitzii Strains Isolated from Healthy Volunteers: A Step Forward in the Use of F. prausnitzii as a Next-Generation Probiotic. Frontiers in Microbiology. 2017;8. pmid:28713353
  20. 20. Langella P, Guarner F, Martín R. Editorial: Next-Generation Probiotics: From Commensal Bacteria to Novel Drugs and Food Supplements. Front Microbiol. 2019;10: 1973. pmid:31507575
  21. 21. D’hoe K, Vet S, Faust K, Moens F, Falony G, Gonze D, et al. Integrated culturing, modeling and transcriptomics uncovers complex interactions and emergent behavior in a three-species synthetic gut community. eLife. 2018;7: e37090. pmid:30322445
  22. 22. Lebas M, Garault P, Carrillo D, Codoñer FM, Derrien M. Metabolic Response of Faecalibacterium prausnitzii to Cell-Free Supernatants from Lactic Acid Bacteria. Microorganisms. 2020;8: 1528. pmid:33027936
  23. 23. Kang D, Ham H, Lee S, Cho Y, Kim Y, Yoon C, et al. Functional dissection of the phosphotransferase system provides insight into the prevalence of Faecalibacterium prausnitzii in the host intestinal environment. Environ Microbiol. 2021;23: 4726–4740. pmid:34296500
  24. 24. Kitano H. Systems Biology: A Brief Overview. Science. 2002;295: 1662–1664. pmid:11872829
  25. 25. Trewavas A. A Brief History of Systems Biology: “Every object that biology studies is a system of systems.” Francois Jacob (1974). Plant Cell. 2006;18: 2420–2430. pmid:17088606
  26. 26. von Bronk B, Götz A, Opitz M. Complex microbial systems across different levels of description. Phys Biol. 2018;15: 051002. pmid:29757151
  27. 27. Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4: Article17. pmid:16646834
  28. 28. Galán-Vásquez E, Perez-Rueda E. Identification of Modules With Similar Gene Regulation and Metabolic Functions Based on Co-expression Data. Front Mol Biosci. 2019;6: 139. pmid:31921888
  29. 29. Jiang J, Sun X, Wu W, Li L, Wu H, Zhang L, et al. Construction and application of a co-expression network in Mycobacterium tuberculosis. Sci Rep. 2016;6: 28422. pmid:27328747
  30. 30. Liu W, Li L, Long X, You W, Zhong Y, Wang M, et al. Construction and Analysis of Gene Co-Expression Networks in Escherichia coli. Cells. 2018;7: 19. pmid:29518040
  31. 31. Omony J, de Jong A, Kok J, van Hijum SAFT. Reconstruction and inference of the Lactococcus lactis MG1363 gene co-expression network. Bhunia AK, editor. PLoS ONE. 2019;14: e0214868. pmid:31116749
  32. 32. DuPai CD, Wilke CO, Davies BW. A Comprehensive Coexpression Network Analysis in Vibrio cholerae. 2020;5: 12.
  33. 33. Chen X, Ma J. Weighted gene co-expression network analysis (WGCNA) to explore genes responsive to Streptococcus oralis biofilm and immune infiltration analysis in human gingival fibroblasts cells. Bioengineered. 2021;12: 1054–1065. pmid:33781179
  34. 34. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34: i884–i890. pmid:30423086
  35. 35. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25: 1754–1760. pmid:19451168
  36. 36. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25: 2078–2079. pmid:19505943
  37. 37. Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31: 166–169. pmid:25260700
  38. 38. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15: 550. pmid:25516281
  39. 39. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9: 559. pmid:19114008
  40. 40. Rohart F, Gautier B, Singh A, Lê Cao K-A. mixOmics: An R package for ‘omics feature selection and multiple data integration. PLoS Comput Biol. 2017;13: e1005752. pmid:29099853
  41. 41. Horvath S, Dong J. Geometric Interpretation of Gene Coexpression Network Analysis. PLoS Comput Biol. 2008;4: e1000117. pmid:18704157
  42. 42. Yip AM, Horvath S. Gene network interconnectedness and the generalized topological overlap measure. BMC Bioinformatics. 2007;8: 22. pmid:17250769
  43. 43. Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabási AL. Hierarchical organization of modularity in metabolic networks. Science. 2002;297: 1551–1555. pmid:12202830
  44. 44. Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47: D309–D314. pmid:30418610
  45. 45. Carpenter C, Payne SM. Regulation of iron transport systems in Enterobacteriaceae in response to oxygen and iron availability. J Inorg Biochem. 2014;133: 110–117. pmid:24485010
  46. 46. Beauchene NA, Mettert EL, Moore LJ, Keleş S, Willey ER, Kiley PJ. O2 availability impacts iron homeostasis in Escherichia coli. Proc Natl Acad Sci U S A. 2017;114: 12261–12266. pmid:29087312
  47. 47. Ferrer A, Rivera J, Zapata C, Norambuena J, Sandoval Á, Chávez R, et al. Cobalamin Protection against Oxidative Stress in the Acidophilic Iron-oxidizing Bacterium Leptospirillum Group II CF-1. Front Microbiol. 2016;7: 748. pmid:27242761
  48. 48. Yoon C-K, Kang D, Kim M-K, Seok Y-J. Vibrio cholerae FruR facilitates binding of RNA polymerase to the fru promoter in the presence of fructose 1-phosphate. Nucleic Acids Res. 2021;49: 1397–1410. pmid:33476373
  49. 49. Alcalde-Rico M, Hernando-Amado S, Blanco P, Martínez JL. Multidrug Efflux Pumps at the Crossroad between Antibiotic Resistance and Bacterial Virulence. Front Microbiol. 2016;7. pmid:27708632
  50. 50. Huang L, Wu C, Gao H, Xu C, Dai M, Huang L, et al. Bacterial Multidrug Efflux Pumps at the Frontline of Antimicrobial Resistance: An Overview. Antibiotics. 2022;11: 520. pmid:35453271
  51. 51. Rodionov DA, Hebbeln P, Eudes A, ter Beek J, Rodionova IA, Erkens GB, et al. A Novel Class of Modular Transporters for Vitamins in Prokaryotes. J Bacteriol. 2009;191: 42–51. pmid:18931129
  52. 52. Corzett CH, Goodman MF, Finkel SE. Competitive Fitness During Feast and Famine: How SOS DNA Polymerases Influence Physiology and Evolution in Escherichia coli. Genetics. 2013;194: 409–420. pmid:23589461
  53. 53. Yu H, Kim PM, Sprecher E, Trifonov V, Gerstein M. The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics. PLoS Comput Biol. 2007;3: e59. pmid:17447836
  54. 54. Cabezón E, Sastre JI, de la Cruz F. Genetic evidence of a coupling role for the TraG protein family in bacterial conjugation. Mol Gen Genet. 1997;254: 400–406. pmid:9180693
  55. 55. Lerner A, Matthias T, Aminov R. Potential Effects of Horizontal Gene Exchange in the Human Gut. Front Immunol. 2017;8: 1630. pmid:29230215
  56. 56. Groussin M, Poyet M, Sistiaga A, Kearney SM, Moniz K, Noel M, et al. Elevated rates of horizontal gene transfer in the industrialized human microbiome. Cell. 2021;184: 2053-2067.e18. pmid:33794144
  57. 57. Lao J, Guédon G, Lacroix T, Charron-Bourgoin F, Libante V, Loux V, et al. Abundance, Diversity and Role of ICEs and IMEs in the Adaptation of Streptococcus salivarius to the Environment. Genes (Basel). 2020;11: 999. pmid:32858915