Gene expression profiling meta-analysis reveals novel gene signatures and pathways shared between tuberculosis and rheumatoid arthritis

  • M. T. Badr,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    mohamed.tarek.badr@uniklinik-freiburg.de

    Affiliation Institute of Medical Microbiology and Hygiene, Medical Center—University of Freiburg, Faculty of Medicine, Freiburg, Germany

  • G. Häcker

    Roles Conceptualization, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Institute of Medical Microbiology and Hygiene, Medical Center—University of Freiburg, Faculty of Medicine, Freiburg, Germany, BIOSS Centre for Biological Signaling Studies, University of Freiburg, Freiburg, Germany

Abstract

Tuberculosis (TB) is among the leading causes of death by infectious diseases. An epidemiological association between Mycobacterium tuberculosis infection and autoimmune diseases like rheumatoid arthritis (RA) has been reported but it remains unclear if there is a causal relationship, and if so, which molecular pathways and regulatory mechanisms contribute to it. Here we used a computational biology approach by global gene expression meta-analysis to identify candidate genes and pathways that may link TB and RA. Data were collected from public expression databases such as NCBI GEO. Studies were selected that analyzed mRNA-expression in whole blood or blood cell populations in human case control studies at comparable conditions. Six TB and RA datasets (41 active TB patients, 33 RA patients, and 67 healthy controls) were included in the downstream analysis. This approach allowed the identification of deregulated genes that had not been identified in the single analysis of TB or RA patients and that were co-regulated in TB and RA patients compared to healthy subjects. The genes encoding TLR5, TNFSF10/TRAIL, PPP1R16B/TIMAP, SIAH1, PIK3IP1, and IL17RA were among the genes that were most significantly deregulated in TB and RA. Pathway enrichment analysis revealed ‘T cell receptor signaling pathway’, ‘Toll-like receptor signaling pathway,’ and ‘virus defense related pathways’ among the pathways most strongly associated with both diseases. The identification of a common gene signature and pathways substantiates the observation of an epidemiological association of TB and RA and provides clues on the mechanistic basis of this association. Newly identified genes may be a basis for future functional and epidemiological studies.

Introduction

Tuberculosis (TB) is an infectious disease caused predominantly by Mycobacterium tuberculosis (Mtb). With an estimated 10.4 million active cases and 1.3 million TB-related deaths per year, TB is one of the most important infections worldwide [1]. In healthy subjects, contact with M. tuberculosis results mostly in chronic, latent infection without clinical symptoms, but a small number of patients go on to develop active TB. Other than the lack of T cell function, which drives active TB especially in HIV patients, the factors determining predisposition to the progression to active TB are not well understood, although several clinical conditions predispose to active TB [2,3]. TB has many extrapulmonary manifestations, with bone and joint involvement being the most common (10–11% of extrapulmonary TB) [4].

Rheumatoid arthritis (RA) is a chronic systemic inflammatory autoimmune disorder that mainly causes symptoms in the synovial joints such as swelling, pain, and stiffness. RA is estimated to affect approximately 0.5% to 1% of the world’s population. Extra-articular manifestations such as subcutaneous nodules, pericarditis, pulmonary effusion and arteritis are common in RA. The main pathological pathways and underlying mechanisms that initiate and lead to the development of RA remain undetermined [5].

Infections with several agents including Mtb have been repeatedly found to be associated with a wide variety of conditions and syndromes of immune deregulation [6] such as sarcoidosis, psoriasis, Sjögren’s syndrome, systemic lupus erythematosus, and rheumatoid arthritis [7–10]. TB patients are specifically known to produce antibodies that are also found in autoimmune syndromes, such as anti-cyclic citrullinated peptide (anti-CCP) and anti-arginine-containing peptide (anti-CAP) antibodies [11,12].

(Active) TB and RA are different conditions with different pathogenesis. However, they share aspects of chronic immune activation and the concept of immune deviation. Since only a minority of patients develop active TB, it may be argued that a specific form of immune reactivity is required. In RA, although the trigger is unknown, the immune system is activated in an unwarranted fashion. There is some evidence of similar immunological activity in both conditions. RA patients often show a good response to immunosuppressive drugs, notably to blockade of tumor necrosis factor (TNF) signaling. Such blockade of the RA immune activity can drive the progression of latent to active TB, suggesting that mechanisms that drive RA play a role in containing TB [13–15].

Recent studies have further shown that infection with Mtb may induce or at least aggravate arthritis. In an arthritis model, mice treated with collagen emulsion plus killed M. tuberculosis showed significantly elevated arthritis scores, while control mice treated with the collagen emulsion alone did not develop arthritis [16]. Furthermore, a significant osteoclast presence in the subchondral bones and increased serum-levels of IL-6 were seen in Mtb-infected mice. A contribution from Toll-like receptor 2 (TLR2) has been described, as TLR2 deficient mice showed significantly less disease-severity in the same model, and TLR2 has been shown in other studies to regulate various invasive mechanisms in RA [17]. Previous population-based studies have further found an unexpectedly high prevalence of TB in RA patients, an association that was even stronger than the established association with other comorbidities such as kidney disease, diabetes, and hypertension [18,19]. Despite this epidemiological and mechanistically suggestive evidence of a link between TB and RA, there are no mechanistic explanations for this association.

Global gene expression analysis is a well-established way of characterizing complex cellular responses. Novel molecular pathways have been identified in various conditions using this approach [20]. The large number of publicly available datasets permits pooling of gene expression data, which increases sensitivity by increasing the number of data points. This strategy has already been used to identify gene signatures and pathways that are co-regulated in a number of autoimmune conditions [21]. Similarly, a gene-expression meta-analysis has been successful in identifying novel genes and pathways deregulated in active TB [22]. Here we use this in silico approach to test the hypothesis that there are immune responses common to both TB and RA that are either the result of, or perhaps even a mechanistic basis for, the association between the two types of disease. We used publicly available microarray datasets to investigate gene co-expression patterns of human blood cells in TB and RA patients. To the best of our knowledge, this is the first study to investigate the shared molecular pathways between RA and TB using this approach.

Methods

Data collection

Collection of the meta-analysis data was carried out by searching public expression databases (NCBI GEO and ArrayExpress; however, only GEO data were eventually included). We used the following search terms (rheumatoid arthritis, RA, tuberculosis, TB, Mycobacterium tuberculosis) and the following filters: organism (Homo sapiens), study type (expression profiling by array), and entry type (Dataset/Series). Initially 787 entries were recovered. Duplicates and irrelevant studies were excluded, and 34 studies remained. These studies were further refined using the inclusion criteria (below) to reach the 6 final studies included in our analysis.

We included only studies that had analyzed gene expression in whole blood, PBMC or blood cell components but excluded studies using other tissues such as synovial fluid, chondrocytes or lung tissue to ensure comparable gene expression and to remove potential bias through tissue specific gene expression. Only samples from untreated TB or RA were included. Studies investigating patients with latent TB were also excluded. In one case (Dataset GSE62525) we were unable to annotate the data properly; this was also excluded. The database-search followed the Preferred Reporting Items of Systematic reviews and Meta-Analyses (PRISMA) statement and is documented in the PRISMA Flow Diagram (S1 File) [23].

After a thorough search and exclusion of datasets as specified above, two RA datasets (GSE15573 and GSE4588) and four TB datasets (GSE54992, GSE65517, GSE19435, and GSE19444) [24–27] were selected for further analysis. A total of 141 samples were considered for downstream analysis, containing data from 41 TB patients, 33 RA patients, and 67 healthy controls. The R programming language was used for initial processing and analysis of the datasets. The datasets were downloaded from the NCBI GEO database using the GEOquery R package [28]. As the original CEL files for the dataset GSE4588 were unavailable, the deposited gene expression matrix was directly retrieved from the NCBI GEO database using the GEOquery R package and processed as previously described [21]. After including it in our meta-analysis pipeline and performing cross-study normalization, we investigated the effect of batch normalization by principal component analysis. As no bias was detected, this dataset was considered suitable for analysis. For each study we extracted the GEO accession number, platform, sample type and gene expression data. The microarray chip identifiers were transformed to other suitable gene IDs, including Entrez Gene identifiers, for downstream analysis. Datasets were merged after annotation with the Entrez Gene identifiers. A condition (case or control) and a class (TB, RA, or healthy control) were assigned to each sample, and further analysis was carried out with the web-based tool NetworkAnalyst [29,30].
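The annotation and merging step described above can be illustrated with a minimal sketch. It is not the pipeline used in the study (which relied on R, GEOquery and NetworkAnalyst); the function names, the probe identifiers, the placeholder gene IDs, and the choice to average probes that map to the same gene are all assumptions made for illustration.

```python
from statistics import mean


def collapse_to_genes(probe_values, probe_to_gene):
    """Average the values of all probes that map to the same gene ID."""
    grouped = {}
    for probe, value in probe_values.items():
        gene = probe_to_gene.get(probe)
        if gene is None:  # probes without annotation are dropped
            continue
        grouped.setdefault(gene, []).append(value)
    return {gene: mean(values) for gene, values in grouped.items()}


def merge_on_common_genes(datasets):
    """Keep genes present in every dataset; map gene ID -> one value per dataset."""
    common = set.intersection(*(set(d) for d in datasets))
    return {gene: [d[gene] for d in datasets] for gene in sorted(common)}


# Hypothetical toy data: two platforms with different probe identifiers.
platform_a = {"A_01": 8.0, "A_02": 9.0, "A_03": 5.5}
platform_b = {"B_77": 7.2, "B_78": 6.1}
map_a = {"A_01": "G1", "A_02": "G1", "A_03": "G2"}  # placeholder gene IDs
map_b = {"B_77": "G1", "B_78": "G3"}

merged = merge_on_common_genes([
    collapse_to_genes(platform_a, map_a),
    collapse_to_genes(platform_b, map_b),
])
print(merged)
```

Only genes that can be annotated on every platform survive the merge, which is one reason why a cross-platform meta-analysis typically covers fewer genes than any single dataset.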

Data processing

Each dataset was normalized by log2 transformation followed by autoscaling. Each dataset was then visually inspected using PCA plots to ensure the absence of outliers. The individual analysis of each dataset was carried out using the Benjamini–Hochberg false discovery rate (FDR) procedure [31] with a p-value cut-off of <0.05. To adjust for the batch effect between the different datasets we used the ComBat batch effect method through the INMEX tool [32]. For detecting the significantly deregulated genes between cases (TB or RA) and controls, the effect size method was used. This approach offers two models for the analysis, the fixed and random effects models (FEM and REM). The most suitable model can be determined by estimating the statistical heterogeneity with Cochran’s Q test. Based on Cochran’s Q test we settled on the REM, which usually gives more conservative results, extracting fewer DEGs but with more confidence. We set a significance threshold of p < 0.01 in the REM to identify the most significant DEGs for our downstream analysis. A heatmap of DEGs was created using the visual inspection tools of NetworkAnalyst and clustered using the single linkage method.
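To make the effect size approach concrete, the following sketch combines per-study effect sizes for a single gene with a random effects model. It assumes the DerSimonian-Laird estimator for the between-study variance, which is a common choice but is not confirmed here as the estimator implemented in NetworkAnalyst/INMEX; the function names and the example numbers are hypothetical.

```python
import math


def cochran_q(effects, variances):
    """Cochran's Q: weighted squared deviations from the fixed-effect estimate."""
    weights = [1.0 / v for v in variances]
    pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, effects))
    return q, weights


def random_effects(effects, variances):
    """Pool per-study effect sizes with a DerSimonian-Laird random effects model."""
    k = len(effects)
    if k < 2:
        raise ValueError("at least two studies are required")
    q, w = cochran_q(effects, variances)
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)  # between-study variance, floored at 0
    w_star = [1.0 / (v + tau2) for v in variances]
    combined = sum(wi * e for wi, e in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    z = combined / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value from the normal distribution
    return {"q": q, "tau2": tau2, "combined_es": combined, "se": se, "p": p}


# Hypothetical effect sizes and variances for one gene across six datasets.
effects = [1.2, 1.6, 0.9, 1.8, 1.4, 1.1]
variances = [0.10, 0.15, 0.08, 0.20, 0.12, 0.09]
print(random_effects(effects, variances))
```

When Q is large relative to its degrees of freedom (k − 1), the estimated between-study variance is positive and every study is down-weighted, which widens the standard error. This is why the REM tends to report fewer DEGs than the FEM.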

Hub genes network analysis

NetworkAnalyst was implemented to generate a protein-protein interaction (PPI) network by integrating the innateDB interactome database [33]. With the original seed of 304 deregulated genes a first-order PPI network was generated with 3,433 nodes representing the proteins and 6,851 edges representing the interaction between these proteins. For better visualization of the network and focusing on key connections a zero-order PPI network was created with 57 nodes connected with 80 edges.
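The difference between the first-order and the zero-order network can be sketched as follows, assuming that a first-order network keeps every interaction that involves at least one seed gene, whereas a zero-order network keeps only interactions between two seed genes. The function names, node names and edges are hypothetical placeholders, not interactions taken from innateDB.

```python
def first_order_network(edges, seeds):
    """Keep interactions that involve at least one seed gene."""
    return [(a, b) for a, b in edges if a in seeds or b in seeds]


def zero_order_network(edges, seeds):
    """Keep only interactions in which both partners are seed genes."""
    return [(a, b) for a, b in edges if a in seeds and b in seeds]


def node_degrees(edges):
    """Count the number of interactions per node."""
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    return degree


# Hypothetical interactome: S* are seed genes (DEGs), N* are other proteins.
interactome = [("S1", "S2"), ("S1", "N1"), ("S2", "S3"), ("N1", "N2"), ("S3", "N2")]
seeds = {"S1", "S2", "S3"}

first = first_order_network(interactome, seeds)
zero = zero_order_network(interactome, seeds)
print(len(first), len(zero), node_degrees(zero))
```

Restricting the network to seed-to-seed interactions removes the many non-seed neighbours, which is consistent with the much smaller node and edge counts reported for the zero-order network.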

Integrative pathway analysis

Overrepresented biological terms in the list of deregulated genes were identified using DAVID (Database for Annotation, Visualization, and Integrated Discovery) [34]. The analysis was performed by uploading a list of the Entrez Gene identifiers and comparing the genes on that list to all human genes in the genome. The Functional Annotation Chart and Clustering program was used in accordance with the developer’s protocol [34]. Default settings of the functional annotation chart and clustering tool were used, and Fisher's exact tests were used to calculate p-values. To rank the clusters of terms based on their biological significance a group enrichment score is used, meaning that the top ranked groups have lower p-values for their members. To better visualize and explore the enriched pathways and biological terms related to our DEGs, we repeated the analysis using the ClueGO v2.5.0 [35,36] tool, a visualization plug-in implemented in the Cytoscape v3.6.0 environment. We included the EBI Gene Ontology and GO Annotations QuickGO [37,38], KEGG [39], and Reactome pathway databases [40,41] in our analysis and used the GO term fusion option to exclude redundancy, with enrichment (left-sided) hypergeometric distribution tests. The leading biological terms were ranked based on their significance with a p-value significance level of ≤0.05, followed by the Bonferroni adjustment for the terms and the groups, with the Kappa-statistics score threshold set to 0.3.
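The statistical idea behind such enrichment tests can be sketched with a plain hypergeometric calculation followed by a Bonferroni adjustment. The sketch computes the upper-tail probability, which is the usual form of an over-representation test (equivalent to a one-sided Fisher's exact test); it does not reproduce the exact options or background sets of DAVID or ClueGO, and all names and counts are hypothetical.

```python
from math import comb


def enrichment_p(n_universe, n_term, n_list, n_overlap):
    """Upper-tail hypergeometric p-value: P(overlap >= n_overlap) by chance.

    n_universe: genes in the background
    n_term:     background genes annotated with the term
    n_list:     genes in the submitted list (e.g. the DEGs)
    n_overlap:  submitted genes annotated with the term
    """
    upper = min(n_term, n_list)
    total = comb(n_universe, n_list)
    tail = sum(
        comb(n_term, k) * comb(n_universe - n_term, n_list - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


def bonferroni(p_values):
    """Multiply each p-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical counts: 20,000 background genes, 341 DEGs, three terms tested.
raw = [
    enrichment_p(20000, 100, 341, 12),
    enrichment_p(20000, 250, 341, 6),
    enrichment_p(20000, 60, 341, 1),
]
print(bonferroni(raw))
```

A term is reported as enriched only if its adjusted p-value stays below the chosen significance level after correction for the number of terms tested.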

Transcription factor analysis

To detect overrepresented transcription factor binding sites (TFBS) among the DEGs, the transcription factor discovery module in NetworkAnalyst was used, and processing was carried out against both the JASPAR and ENCODE databases. To discover further possible connections between DEGs and transcription factors, we repeated the analysis using the EnrichR tool, which computes enrichment through 35 different gene-set libraries [42,43]. We detected the binding motif sites in our gene list using the position weight matrix (PWM) analysis from TRANSFAC and JASPAR. The PWMs from TRANSFAC and JASPAR were used to scan the promoters of all human genes in the region between −2,000 and +500 relative to the transcription start site (TSS).
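How a PWM is used to scan a promoter sequence can be shown with a small sketch. It builds log-odds scores from base counts and slides the matrix along one strand of a sequence; the motif, the pseudocount, the uniform background, the score threshold and the function names are assumptions for illustration and do not correspond to any TRANSFAC or JASPAR matrix.

```python
import math

BASES = "ACGT"


def pwm_from_counts(counts, pseudocount=1.0, background=0.25):
    """Convert per-position base counts into log2-odds scores."""
    pwm = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm


def scan(sequence, pwm, threshold):
    """Return (offset, score) for each window scoring at or above the threshold."""
    width = len(pwm)
    hits = []
    for i in range(len(sequence) - width + 1):
        window = sequence[i:i + width]
        if any(base not in BASES for base in window):
            continue  # skip windows containing ambiguous bases such as N
        score = sum(pwm[j][window[j]] for j in range(width))
        if score >= threshold:
            hits.append((i, score))
    return hits


# Hypothetical 4-bp motif with consensus TATA, built from 8 aligned sites.
counts = [
    {"A": 0, "C": 0, "G": 0, "T": 8},
    {"A": 8, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 0, "T": 8},
    {"A": 8, "C": 0, "G": 0, "T": 0},
]
pwm = pwm_from_counts(counts)
hits = scan("GGTATAGG", pwm, threshold=5.0)
print(hits)
```

For a promoter window spanning −2,000 to +500, an offset returned by `scan` would be converted to a TSS-relative coordinate by subtracting 2,000; a full scan would also cover the reverse-complement strand.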

Results

Meta-analysis data selection and preparation pipeline

From the initial datasets acquired by searching public databases, six matched our predetermined inclusion criteria (see methods). A detailed pipeline of inclusion and analysis workflow can be found in S1 Fig. The six datasets included in the further analysis contained samples from 41 patients with active TB, 33 RA patients, and 67 healthy controls (141 samples in total). The data summary of the included datasets and samples can be found in S1 and S2 Tables.

Data acquisition and normalization

The individual dataset gene expression normalization was carried out using the NetworkAnalyst log2 transformation function, followed by autoscaling. The individual datasets were inspected with PCA plots before and after normalization, and PCA plots of gene expression data of the 6 datasets before and after normalization are shown in S2 and S3 Figs. No major differences were seen that could be attributed to differences in dataset platforms or conditions and that could have introduced a bias.
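The normalization step can be sketched for a single gene as follows, assuming that autoscaling means mean-centering followed by division by the sample standard deviation. This is a common definition and is not verified here against the NetworkAnalyst implementation; the function name and the intensities are hypothetical.

```python
import math
from statistics import mean, stdev


def log2_autoscale(values):
    """Log2-transform raw intensities, then center to mean 0 and scale to SD 1."""
    logged = [math.log2(v) for v in values]
    mu = mean(logged)
    sd = stdev(logged)  # sample standard deviation (n - 1 denominator)
    return [(x - mu) / sd for x in logged]


# Hypothetical raw intensities of one gene across four samples.
raw_intensities = [120.0, 240.0, 480.0, 960.0]
scaled = log2_autoscale(raw_intensities)
print(scaled)
```

After this transformation every gene has the same mean and spread within a dataset, so that genes and datasets measured on different intensity scales become comparable in PCA plots and in the subsequent effect size calculation.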

Identification of a common gene expression signature of TB and RA

Based on the results of Cochran’s Q test (S4 Fig), the REM in NetworkAnalyst was used. Using the REM, 341 genes were identified as significantly differentially expressed between the TB and RA datasets versus healthy controls (p<0.01 in the REM). A list of the 50 most significantly up- or downregulated genes is shown in S3 Table. A main hypothesis of this meta-analysis was that, through the inclusion of datasets from both TB and RA, DEGs would be identified that had not been significantly regulated in the analysis of individual datasets or in the analysis of only one condition. As illustrated in the Venn diagram (Fig 1), this was indeed the case. A total of 172 DEGs were found to be significantly deregulated only in the meta-analysis but not in the analysis of individual conditions. Further, 408 genes were identified that were significant only in the individual disease analysis but not in the meta-analysis.
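The comparison summarized in the Venn diagram amounts to simple set operations on the lists of significant genes. The sketch below uses hypothetical placeholder gene names and a hypothetical function name; with the real gene lists, the size of the meta-analysis-only region corresponds to the 172 genes reported above. If these 172 genes are all part of the 341 combined DEGs, the remaining 341 − 172 = 169 genes were also significant in at least one single-disease analysis.

```python
def compare_signatures(meta, tb, ra):
    """Split significant genes into the regions discussed in the text."""
    individual = tb | ra  # significant in at least one single-disease analysis
    return {
        "meta_only": meta - individual,
        "individual_only": individual - meta,
        "shared": meta & individual,
    }


# Hypothetical gene sets, each thresholded at the same p-value cut-off.
meta = {"g1", "g2", "g3", "g4"}
tb = {"g3", "g5"}
ra = {"g4", "g6", "g7"}

regions = compare_signatures(meta, tb, ra)
for name, genes in regions.items():
    print(name, sorted(genes))
```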

Fig 1. Venn diagram of DEGs.

In comparison to the individual meta-analyses of the tuberculosis (TB) and rheumatoid arthritis (RA) datasets, the combined meta-analysis of both diseases yields many DEGs (172) that were significant only with this approach. Loss of genes that were only significant in their respective disease datasets (genes that play no role common to both conditions) is also expected. Datasets were analyzed with the same parameters in NetworkAnalyst, and genes with a p-value < 0.01 were considered significant.

https://doi.org/10.1371/journal.pone.0213470.g001

Particularly striking was the highly significant upregulation of the genes encoding Toll-like receptor 5 (TLR5) (combined ES of 1.467 and adjusted p-value of 2.47E-09) and death receptor ligand TNFSF10/TRAIL (combined ES of 2.0036 and adjusted p-value of 4.86E-09), as well as the downregulation of protein phosphatase 1 regulatory subunit 16B PPP1R16B/TIMAP (combined ES of 1.3586 and adjusted p-value of 3.34E-07) and the E3 ubiquitin protein ligase 1 (SIAH1) (combined ES of 1.055 and adjusted p-value of 1.65E-05) in both conditions. In addition, numerous immune response regulating genes were significantly deregulated only in the meta-analysis, including DDX58 and CLEC7A. A complete list of the differentially regulated genes with the highest significance is shown in S3 Table. A heatmap of the most highly differentially regulated genes is shown in Fig 2.

Fig 2. Heatmap of most significantly differentially expressed genes.

Heatmap showing the relative expression of the 50 most significantly differentially expressed genes (DEGs) of the 341 significant DEGs identified through the meta-analysis, where 205 genes were co-up-regulated, and 136 genes were co-down-regulated (TB and RA versus control). The heatmap indicates the normalized expression value of each DEG in the individual samples, and genes were clustered based on their condition (cases vs controls) and their original datasets. The heatmap was created by the visualization module in NetworkAnalyst, and genes with p-value < 0.01 in the Random Effect Model analysis were considered significant.

https://doi.org/10.1371/journal.pone.0213470.g002

Hub genes network analysis

To extract more biologically relevant information, we performed a network-based analysis. This analysis identified key hub genes among the most highly deregulated genes (Fig 3). MEPCE with the combined ES of -1.0558 and adjusted p-value of 1.65E-05 and RPS4X with the combined ES of -0.86739 and adjusted p-value of 0.00045462 were found to be among the most highly ranked hub genes among the downregulated DEGs, while ILK (combined ES of 0.96394 and adjusted p-value of 0.0061503) and PML (combined ES of 0.74222 and adjusted p-value of 0.003701) were among the most highly ranked hub genes in the overexpressed DEGs.
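Hub genes are ranked by topology measures such as degree and betweenness centrality. The sketch below computes both for a small undirected, unweighted network using Brandes' algorithm. It is a generic illustration with hypothetical node names and function names, and it does not reproduce the NetworkAnalyst implementation or any normalization it may apply.

```python
from collections import deque


def build_adjacency(edges):
    """Build an undirected adjacency map from a list of node pairs."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def betweenness(adjacency):
    """Unnormalized betweenness centrality (Brandes' algorithm, unweighted)."""
    score = {v: 0.0 for v in adjacency}
    for source in adjacency:
        order = []                                  # nodes in order of discovery
        preds = {v: [] for v in adjacency}          # predecessors on shortest paths
        sigma = {v: 0 for v in adjacency}           # number of shortest paths
        dist = {v: -1 for v in adjacency}
        sigma[source], dist[source] = 1, 0
        queue = deque([source])
        while queue:                                # breadth-first search
            v = queue.popleft()
            order.append(v)
            for w in adjacency[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adjacency}
        for w in reversed(order):                   # accumulate dependencies
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # In an undirected graph each pair of end points is visited twice.
    return {v: s / 2 for v, s in score.items()}


# Hypothetical network: one hub connected to four otherwise unconnected nodes.
adjacency = build_adjacency([("H", "A"), ("H", "B"), ("H", "C"), ("H", "D")])
degree = {v: len(neighbours) for v, neighbours in adjacency.items()}
centrality = betweenness(adjacency)
print(degree["H"], centrality["H"])
```

A node with high betweenness lies on many shortest paths between other nodes, so a hub can rank highly even when its own differential expression is moderate.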

Fig 3. Network analysis of the most highly deregulated genes.

Shared differentially expressed genes (DEGs) (TB and RA versus controls) were integrated in NetworkAnalyst tools to visualize gene interactions. A ‘zero order’ interaction network with 57 nodes was used. The most highly ranked nodes across the dataset based on network topology measures were MEPCE (betweenness centrality = 662.8; degree = 9), and RPS4X (betweenness centrality = 294.76; degree = 8).

https://doi.org/10.1371/journal.pone.0213470.g003

Identification of overrepresented biological pathways and gene ontology terms

Pathway enrichment analysis was first performed using the KEGG-based pathway enrichment identification module in NetworkAnalyst with the 341 genes identified as significantly deregulated (S4 Table). As expected, infection-related pathways and host immune defense pathways such as the Toll-like receptor signaling pathway were found among the most highly enriched pathways. Further pathways were found that may be linked to the triggering of autoimmunity in RA, such as the osteoclast differentiation and T cell receptor signaling pathways. We further conducted enrichment analysis using the functional annotation chart tool of DAVID (Table 1) and the functional annotation clustering tool, which permits clustering of biologically related groups/terms with similar annotations and genes. The two most strongly enriched pathway clusters are the innate immunity related pathway cluster, with an enrichment score of 7.415, and the antiviral defense pathway cluster, with an enrichment score of 3.60.

Table 1. Overrepresented biological pathways and gene ontology terms.

https://doi.org/10.1371/journal.pone.0213470.t001

To visualize the enriched pathways and biological terms in a nonredundant manner, we analyzed the identified DEGs with the ClueGO tool [35] in Cytoscape. This method identified cytokine signaling in immune response, viral translation, CD4-positive αβ-T cell differentiation, and innate immune response among the most significant terms. A network showing the most highly enriched pathways and terms from this analysis is shown in Fig 4. The GO/pathway terms match to a great degree the results obtained with the other tools such as DAVID or NetworkAnalyst.

Fig 4. Over-represented biological pathways and gene ontology terms.

Gene ontology and pathway enrichment analysis were conducted using the ClueGO plugin in Cytoscape. (A) The most significant terms, where the node size correlates with the term enrichment significance. Pathways related to innate immunity activation and pathogen defense mechanisms appear as the most significant terms in our analysis. (B) The same most highly enriched terms shown as bars, where the bars represent the number of genes associated with each term and the bar label gives the percentage of genes per term.

https://doi.org/10.1371/journal.pone.0213470.g004

Analysis of transcription factors

We searched for overrepresented transcription factor binding sites (TFBS) in the identified DEGs, using the TF exploration module in NetworkAnalyst. We scanned the regions around these DEGs with all core vertebrate transcription factor binding profiles present in the JASPAR and ENCODE databases. A network of transcription factors in the JASPAR analysis is shown in S5 Fig. We further used the EnrichR web-based tool to elucidate other possible regulatory mechanisms that may affect these target genes through the detection of binding motif sites in the gene list using PWMs from TRANSFAC and JASPAR (Table 2).

Table 2. Significant transcription factor binding sites associated with DEGs.

https://doi.org/10.1371/journal.pone.0213470.t002

Discussion

These results illustrate the usefulness of global gene expression meta-analysis for extracting information that would not be visible in the analysis of individual datasets. Combining independent datasets from similar patient cohorts increases sensitivity, as it does whenever several studies of a similar nature are analyzed together. In addition, combining datasets from different conditions opens the possibility of identifying gene sets that are similarly regulated in separate conditions. Here we used this approach to identify DEGs, pathways, and regulatory networks that are shared between the two conditions of TB and RA.

In this approach, 172 genes were found as differentially regulated in the disease cohorts only through the gene expression meta-analysis, besides many potentially shared pathways between RA and TB that can shed light on the mechanisms of TB triggered autoimmunity. Innate immunity pathways and antiviral defense pathways were identified as the most highly significantly enriched pathways. This may not be surprising per se, but it supports functional studies and provides the basis for in-depth analyses.

One of the earliest events that can be detected in RA development is an innate immune response and the activation of various antigen-presenting cells such as dendritic cells and macrophages [44,45]. The host reaction against Mtb begins with the recognition of pathogen-associated molecular patterns (PAMPs) by various pattern recognition receptors, such as C-type lectin receptors [46] and Toll-like receptors (the C-type lectin receptor CLEC4D and TLR5/8 were found to be upregulated in our meta-analysis, see S2 Table), and with the reaction of macrophages and dendritic cells (DC) [47]. It has also long been known that infection with Mtb leads to upregulation of various TLRs, in turn leading to the activation of proinflammatory signals and the production of cytokines [48]. TLR5, which is the most significantly upregulated gene in this analysis, is a member of the TLR family best known for its binding to bacterial flagellin [49,50]. A mycobacterial ligand of TLR5 is not known. However, TLR5 has far-reaching immunoregulatory properties, such as stimulation of IL-17 production by dendritic cells in different tissues such as the lungs, spleen and mucosa, and the modulation of the TLR2 and TLR4 antimicrobial response [51]. TLR5 can determine vaccine efficiency [52] and the composition of the microbiome [53]. Intriguingly, TLR5 has been identified as a mediator of osteoclast differentiation and bone loss [54], as well as of myeloid cell infiltration in RA [55]. TLR5 regulation may therefore contribute to the clinical association of RA and TB.

Key players in RA are B cells [56]. B-cell activating factor (TNFSF13B/BAFF), upregulated in the datasets analyzed here, is an important factor of humoral immunity and has been implicated in the overproduction of antibodies in RA [57–59]. In terms of signaling, BAFF activates the noncanonical NF-κB pathway [60], which has implications for B cell survival, maturation and T cell adhesion [61]. BAFF may therefore also be a factor that facilitates transition between TB and RA. All of these upstream immune signals through TLRs, DCs, T cells or B cells can lead to the downstream activation of fibroblasts and osteoclasts, which invade the synovial membrane of RA patients and play a prominent role in the mediation of synovial inflammation and bone destruction [45,62].

In our analysis we also saw many genes that regulate cell death (necroptosis or pyroptosis) pathways, such as TRAIL (tumor necrosis factor ligand superfamily member 10; also involved in apoptosis), MLKL (mixed lineage kinase domain-like), EIF2AK2 and caspase-1/5 (all upregulated). The role of programmed cell death by necroptosis, pyroptosis or apoptosis during Mtb infection is still somewhat controversial, with various studies suggesting (for different cell types) an enhancement or attenuation of cell death during TB infection [63]. A possible explanation for this complex picture is that Mtb may inhibit apoptotic/necroptotic signals in immune cells, especially macrophages, during the initial stages of the disease to allow for replication, and then in more advanced stages induce cell death through MLKL-dependent necroptosis or apoptosis to allow immune evasion and pathogen distribution. This infection-dependent, and especially TB-dependent, deregulation of the necroptosis/apoptosis pathways ties in with studies reporting upregulation of TRAIL expression in T cells of RA patients compared to controls, correlating with disease activity [64]. This deregulation of cell death and cell cycle pathways may also explain why many cancer-related pathways such as PI3K/AKT signaling, including GSK3, ITGA, PTEN, and FGF, are also enriched in our analysis. Deregulation of cell death and enhanced proliferation are features of both cancer and the immune response.

Enrichment in antiviral defense related pathways harboring various differentially expressed genes such as EIF2AK2, DDX58, OAS1, CASP1, and GSK3B was also seen; these genes have been found to play a role in influenza, HPV, EBV and other viral response-related pathways. Indeed, it has previously been shown in a comparative gene expression profiling study of disease-discordant twins with systemic autoimmune diseases including RA that antiviral pathways are consistently upregulated [65]. While the results will ultimately have to be confirmed experimentally, and the links we identify will have to be validated by other studies, we believe that these results provide interesting information on pathways shared by the pathology of the two diseases and will help to understand their pathogenesis and potential interactions.

Conclusion

Understanding the causative factors for autoimmune diseases such as RA remains a challenge, especially when studying it in a narrow context like the genetic role or the environmental role alone. The patient’s genetic background (such as the carriage of the HLA-DRB1*04 epitope cluster) is likely to play a role in predisposing patients to be more sensitive towards environmental insults such as infections [45]. Our study could provide further evidence of a role of TB infection in the initiation of autoimmune response in RA, and elucidate possible regulatory mechanisms of this process, by the recognition of Mtb PAMP through PRRs (for instance TLRs), enhancing the production of proinflammatory cytokines (such as IL-17), which promote synovitis, and help in T and B cell activation. This activation will impact on the pathogenicity of the disease through enhanced antigen presentation and cytokine production, by promoting angiogenesis, inflammatory cell infiltration and osteoclast formation. Such processes will eventually cause synovial inflammation and articular damage. The findings reported here will have implications on understanding the pathogenesis of RA in response to various environmental and infectious stimuli. Identified genes may represent new potential targets for the understanding and treatment of patients with comorbidity of RA and TB.

Supporting information

S1 Fig. Datasets meta-analysis pipeline.

https://doi.org/10.1371/journal.pone.0213470.s001

(PDF)

S2 Fig. PCA plots of included datasets gene expression before and after normalization.

https://doi.org/10.1371/journal.pone.0213470.s002

(PDF)

S3 Fig. PCA plot of gene expression data.

https://doi.org/10.1371/journal.pone.0213470.s003

(PDF)

S4 Fig. Quantile-Quantile plot of the Cochran’s Q test.

https://doi.org/10.1371/journal.pone.0213470.s004

(PDF)

S1 Table. Summary of the datasets integrated in the meta-analysis pipeline.

https://doi.org/10.1371/journal.pone.0213470.s007

(PDF)

S2 Table. List of included samples in the meta-analysis.

https://doi.org/10.1371/journal.pone.0213470.s008

(PDF)

S3 Table. The 50 most highly up- or downregulated genes.

https://doi.org/10.1371/journal.pone.0213470.s009

(PDF)

S4 Table. A list of the significantly deregulated genes.

https://doi.org/10.1371/journal.pone.0213470.s010

(PDF)

References

  1. WHO | Tuberculosis [Internet]. WHO. Available from: http://www.who.int/mediacentre/factsheets/fs104/en/
  2. Bloom BR, Atun R, Cohen T, Dye C, Fraser H, Gomez GB, et al. Tuberculosis. In: Holmes KK, Bertozzi S, Bloom BR, Jha P, editors. Major Infectious Diseases 3rd ed. Washington (DC): The International Bank for Reconstruction and Development / The World Bank; 2017. Available from: http://www.ncbi.nlm.nih.gov/books/NBK525174/
  3. Cui T, He Z-G. Improved understanding of pathogenesis from protein interactions in Mycobacterium tuberculosis. Expert Review of Proteomics. 2014 Dec 1;11(6):745–55. pmid:25327725
  4. Malaviya AN, Kotwal PP. Arthritis associated with tuberculosis. Best Practice & Research Clinical Rheumatology. 2003 Apr 1;17(2):319–43.
  5. Smolen JS, Aletaha D, McInnes IB. Rheumatoid arthritis. Lancet. 2016 Oct 22;388(10055):2023–38. pmid:27156434
  6. Elkington P, Tebruegge M, Mansour S. Tuberculosis: An Infection-Initiated Autoimmune Disease? Trends Immunol. 2016;37(12):815–8. pmid:27773684
  7. Chao W-C, Lin C-H, Liao T-L, Chen Y-M, Chen D-Y, Chen H-H. Association between a history of mycobacterial infection and the risk of newly diagnosed Sjögren’s syndrome: A nationwide, population-based case-control study. PLOS ONE. 2017 May 9;12(5):e0176549. pmid:28486537
  8. Ramagopalan SV, Goldacre R, Skingsley A, Conlon C, Goldacre MJ. Associations between selected immune-mediated diseases and tuberculosis: record-linkage studies. BMC Med. 2013 Apr 4;11:97. pmid:23557090
  9. Sogkas G, Atschekzei F, Schacht V, von Falck C, Jablonka A, Jacobs R, et al. First Association of Interleukin 12 Receptor Beta 1 Deficiency with Sjögren’s Syndrome. Front Immunol. 2017;8:885. Published 2017 Jul 28. pmid:28804486
  10. Bordignon V, Bultrini S, Prignano G, Sperduti I, Piperno G, Bonifati C, et al. High prevalence of latent tuberculosis infection in autoimmune disorders such as psoriasis and in chronic respiratory diseases, including lung cancer. J Biol Regul Homeost Agents. 2011 Jun;25(2):213–20. pmid:21880210
  11. Kakumanu P, Yamagata H, Sobel ES, Reeves WH, Chan EKL, Satoh M. Patients With Pulmonary Tuberculosis Are Frequently Positive for Anti–Cyclic Citrullinated Peptide Antibodies, but Their Sera Also React With Unmodified Arginine-Containing Peptide. Arthritis Rheum. 2008 Jun;58(6):1576–81. pmid:18512773
  12. Elkayam O, Segal R, Lidgi M, Caspi D. Positive anti-cyclic citrullinated proteins and rheumatoid factor during active lung tuberculosis. Ann Rheum Dis. 2006 Aug;65(8):1110–2. pmid:16361276
  13. Dixon WG, Hyrich KL, Watson KD, Lunt M, Galloway J, Ustianowski A, et al. Drug-specific risk of tuberculosis in patients with rheumatoid arthritis treated with anti-TNF therapy: results from the British Society for Rheumatology Biologics Register (BSRBR). Annals of the Rheumatic Diseases. 2010 Mar 1;69(3):522–8. pmid:19854715
  14. 14. Cantini F, Nannini C, Niccoli L, Petrone L, Ippolito G, Goletti D. Risk of Tuberculosis Reactivation in Patients with Rheumatoid Arthritis, Ankylosing Spondylitis, and Psoriatic Arthritis Receiving Non-Anti-TNF-Targeted Biologics. Mediators Inflamm. 2017;2017: 8909834 pmid:28659665
  15. 15. Navarra SV, Tang B, Lu L, Lin H-Y, Mok CC, Asavatanabodee P, et al. Risk of tuberculosis with anti-tumor necrosis factor-α therapy: substantially higher number of patients at risk in Asia. Int J Rheum Dis. 2014 Mar;17(3):291–8. pmid:24131578
  16. 16. Kanagawa H, Niki Y, Kobayashi T, Sato Y, Katsuyama E, Fujie A, et al. Mycobacterium tuberculosis promotes arthritis development through Toll-like receptor 2. J Bone Miner Metab. 2015 Mar;33(2):135–41. pmid:24633489
  17. 17. McGarry T, Veale DJ, Gao W, Orr C, Fearon U, Connolly M. Toll-like receptor 2 (TLR2) induces migration and invasive mechanisms in rheumatoid arthritis. Arthritis Res Ther. 2015;17(1):153. Published 2015 Jun 9. pmid:26055925
  18. 18. Shen T-C, Lin C-L, Wei C-C, Chen C-H, Tu C-Y, Hsia T-C, et al. Previous history of tuberculosis is associated with rheumatoid arthritis. Int J Tuberc Lung Dis. 2015 Nov;19(11):1401–5. pmid:26467595
  19. 19. Brode SK, Jamieson FB, Ng R, Campitelli MA, Kwong JC, Paterson JM, et al. Risk of mycobacterial infections associated with rheumatoid arthritis in Ontario, Canada. Chest. 2014 Sep;146(3):563–72. pmid:24384637
  20. 20. Sezin T, Vorobyev A, Sadik CD, Zillikens D, Gupta Y, Ludwig RJ. Gene Expression Analysis Reveals Novel Shared Gene Signatures and Candidate Molecular Mechanisms between Pemphigus and Systemic Lupus Erythematosus in CD4+ T Cells. Front Immunol. 2018;8:1992. Published 2018 Jan 17. pmid:29387060
  21. 21. Toro-Domínguez D, Carmona-Sáez P, Alarcón-Riquelme ME. Shared signatures between rheumatoid arthritis, systemic lupus erythematosus and Sjögren’s syndrome uncovered through gene expression meta-analysis. Arthritis Res Ther. 2014;16(6):489. Published 2014 Dec 3. pmid:25466291
  22. 22. Wang Z, Arat S, Magid-Slav M, Brown JR. Meta-analysis of human gene expression in response to Mycobacterium tuberculosis infection reveals potential therapeutic targets. BMC Syst Biol. 2018 Jan 10;12. pmid:29321020
  23. 23. Moher D, Liberati A, Tetzlaff J, Altman DG, Group TP. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLOS Medicine. 2009 Jul 21;6(7):e1000097. pmid:19621072
  24. 24. Teixeira VH, Olaso R, Martin-Magniette M-L, Lasbleiz S, Jacq L, Oliveira CR, et al. Transcriptome Analysis Describing New Immunity and Defense Genes in Peripheral Blood Mononuclear Cells of Rheumatoid Arthritis Patients. PLoS One. 2009 Aug 27. pmid:19710928
  25. 25. Cai Y, Yang Q, Tang Y, Zhang M, Liu H, Zhang G, et al. Increased complement C1q level marks active disease in human tuberculosis. PLoS ONE. 2014;9(3):e92340. pmid:24647646
  26. 26. Bergenfelz C, Larsson A-M, von Stedingk K, Gruvberger-Saal S, Aaltonen K, Jansson S, et al. Systemic Monocytic-MDSCs Are Generated from Monocytes and Correlate with Disease Progression in Breast Cancer Patients. PLoS ONE. 2015;10(5):e0127028. pmid:25992611
  27. 27. Berry MPR, Graham CM, McNab FW, Xu Z, Bloch SAA, Oni T, et al. An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature. 2010 Aug 19;466(7309):973–7. pmid:20725040
  28. 28. Davis S, Meltzer PS. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007 Jul 15;23(14):1846–7. pmid:17496320
  29. 29. Xia J, Gill EE, Hancock REW. NetworkAnalyst for statistical, visual and network-based meta-analysis of gene expression data. Nature Protocols. 2015 May 7;10(6):823–44. pmid:25950236
  30. 30. Xia J, Benner MJ, Hancock REW. NetworkAnalyst—integrative approaches for protein–protein interaction network analysis and visual exploration. Nucleic Acids Res. 2014 Jul 1;42(Web Server issue):W167–74. pmid:24861621
  31. 31. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological). 1995;57(1):289–300.
  32. 32. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007 Jan;8(1):118–27. pmid:16632515
  33. 33. Breuer K, Foroushani AK, Laird MR, Chen C, Sribnaia A, Lo R, et al. InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation. Nucleic Acids Res. 2013 Jan;41(Database issue):D1228–33. pmid:23180781
  34. 34. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. pmid:19131956
  35. 35. Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009 Apr 15;25(8):1091–3. pmid:19237447
  36. 36. Bindea G, Galon J, Mlecnik B. CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics. 2013 Mar 1;29(5):661–3. pmid:23325622
  37. 37. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017 Jan 4;45(Database issue):D331–8. pmid:27899567
  38. 38. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000 May;25(1):25–9. pmid:10802651
  39. 39. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000 Jan 1;28(1):27–30. pmid:10592173
  40. 40. Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, et al. The Reactome Pathway Knowledgebase. Nucleic Acids Res. 2018 Jan 4;46(Database issue):D649–55.
  41. 41. Annotating Cancer Variants and Anti-Cancer Therapeutics in Reactome. Cancers (Basel). 2012;4(4):1180–211. Published 2012 Nov 8. pmid:24213504
  42. 42. Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics. 2013 Apr 15;14:128. pmid:23586463
  43. 43. Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016 08;44(W1):W90–97. pmid:27141961
  44. 44. Smolen JS, Steiner G. Therapeutic strategies for rheumatoid arthritis. Nat Rev Drug Discov. 2003 Jun;2(6):473–88. pmid:12776222
  45. 45. Choy E. Understanding the dynamics: pathways involved in the pathogenesis of rheumatoid arthritis. Rheumatology (Oxford). 2012 Jul 1;51(suppl_5):v3–11.
  46. 46. Wagener M, Hoving JC, Ndlovu H, Marakalala MJ. Dectin-1-Syk-CARD9 Signaling Pathway in TB Immunity. Front Immunol. 2018;9:225. Published 2018 Feb 13. pmid:29487599
  47. 47. Lerner TR, Borel S, Gutierrez MG. The innate immune response in human tuberculosis. Cell Microbiol. 2015 Sep;17(9):1277–85. pmid:26135005
  48. 48. Faridgohar M, Nikoueinejad H. New findings of Toll-like receptors involved in Mycobacterium tuberculosis infection. Pathog Glob Health. 2017 Jul;111(5):256–64. pmid:28715935
  49. 49. Uematsu S, Jang MH, Chevrier N, Guo Z, Kumagai Y, Yamamoto M, et al. Detection of pathogenic intestinal bacteria by Toll-like receptor 5 on intestinal CD11c+ lamina propria cells. Nat Immunol. 2006 Aug;7(8):868–74. pmid:16829963
  50. 50. Vicente-Suarez I, Brayer J, Villagra A, Cheng F, Sotomayor EM. TLR5 Ligation by Flagellin Converts Tolerogenic Dendritic Cells into Activating Antigen-Presenting Cells that Preferentially Induce T-Helper 1 Responses. Immunol Lett. 2009 Aug 15;125(2):114–8. pmid:19555720
  51. 51. Van Maele L, Carnoy C, Cayet D, Songhet P, Dumoutier L, Ferrero I, et al. TLR5 signaling stimulates the innate production of IL-17 and IL-22 by CD3negCD127+ immune cells in spleen and mucosa. J Immunol. 2010 Jul;185(2):1177–85. pmid:20566828
  52. 52. Oh JZ, Ravindran R, Chassaing B, Carvalho FA, Maddur MS, Bower M, et al. TLR5-Mediated Sensing of Gut Microbiota Is Necessary for Antibody Responses to Seasonal Influenza Vaccination. Immunity. 2014 Sep 18;41(3):478–92. pmid:25220212
  53. 53. Fulde M, Sommer F, Chassaing B, Vorst K van, Dupont A, Hensel M, et al. Neonatal selection by Toll-like receptor 5 influences long-term gut microbiota composition. Nature. 2018 Aug;560(7719):489–93. pmid:30089902
  54. 54. Kassem A, Henning P, Kindlund B, Lindholm C, Lerner UH. TLR5, a novel mediator of innate immunity-induced osteoclastogenesis and bone loss. FASEB J. 2015 Nov;29(11):4449–60. pmid:26207027
  55. 55. Kim S, Chen Z, Chamberlain ND, Essani AB, Volin MV, Amin MA, et al. Ligation of TLR5 promotes myeloid cell infiltration and differentiation into mature osteoclasts in RA and experimental arthritis. J Immunol. 2014 Oct 15;193(8):3902–13. pmid:25200955
  56. 56. Bugatti S, Vitolo B, Caporali R, Montecucco C, Manzo A. B cells in rheumatoid arthritis: from pathogenic players to disease biomarkers. Biomed Res Int. 2014;2014:681678. pmid:24877127
  57. 57. Steri M, Orrù V, Idda ML, Pitzalis M, Pala M, Zara I, et al. Overexpression of the Cytokine BAFF and Autoimmunity Risk. New England Journal of Medicine. 2017 Apr 27;376(17):1615–26. pmid:28445677
  58. 58. Seyler TM, Park YW, Takemura S, Bram RJ, Kurtin PJ, Goronzy JJ, et al. BLyS and APRIL in rheumatoid arthritis. J Clin Invest. 2005 Nov 1;115(11):3083–92. pmid:16239971
  59. 59. Cambridge G, Stohl W, Leandro MJ, Migone T-S, Hilbert DM, Edwards JCW. Circulating levels of B lymphocyte stimulator in patients with rheumatoid arthritis following rituximab treatment: relationships with B cell depletion, circulating antibodies, and clinical relapse. Arthritis Rheum. 2006 Mar;54(3):723–32. pmid:16508933
  60. 60. Gardam S, Brink R. Non-Canonical NF-κB Signaling Initiated by BAFF Influences B Cell Biology at Multiple Junctures. Front Immunol. 2014;4:509. Published 2014 Jan 6. pmid:24432023
  61. 61. Vincent FB, Saulep-Easton D, Figgett WA, Fairfax KA, Mackay F. The BAFF/APRIL system: emerging functions beyond B cell biology and autoimmunity. Cytokine Growth Factor Rev. 2013 Jun;24(3):203–15. pmid:23684423
  62. 62. Teitelbaum SL. Bone resorption by osteoclasts. Science. 2000 Sep 1;289(5484):1504–8. pmid:10968780
  63. 63. Stutz MD, Ojaimi S, Allison C, Preston S, Arandjelovic P, Hildebrand JM, et al. Necroptotic signaling is primed in Mycobacterium tuberculosis -infected macrophages, but its pathophysiological consequence in disease is restricted. Cell Death & Differentiation. 2018 May;25(5):951–65.
  64. 64. Bisgin A, Terzioglu E, Aydin C, Yoldas B, Yazisiz V, Balci N, et al. TRAIL Death Receptor-4, Decoy Receptor-1 and Decoy Receptor-2 Expression on CD8+ T Cells Correlate with the Disease Severity in Patients with Rheumatoid Arthritis. BMC Musculoskeletal Disorders. 2010 Aug 27;11:192. pmid:20799941
  65. 65. Gan L, O’Hanlon TP, Lai Z, Fannin R, Weller ML, Rider LG, et al. Gene Expression Profiles from Disease Discordant Twins Suggest Shared Antiviral Pathways and Viral Exposures among Multiple Systemic Autoimmune Diseases. PLoS One 2015;10(11):e0142486. Published 2015 Nov 10. pmid:26556803