Recent identifications of associations between novel variants in inflammation-related genes and several common diseases emphasize the need for systematic evaluations of these genes in disease susceptibility. Considering that many genes are involved in the complex inflammation responses and many genetic variants in these genes have the potential to alter the functions and expression of these genes, we assembled a list of key inflammation-related genes to facilitate the identification of genetic associations of diseases with an inflammation-related etiology. We first reviewed various phases of inflammation responses, including the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. Assisted by the Ingenuity Pathway Analysis, we then identified 17 functional sub-pathways that are involved in one or multiple phases. This organization would greatly increase the chance of detecting gene-gene interactions by hierarchical clustering of genes with their functional closeness in a pathway. Finally, as an example application, we have developed tagging single nucleotide polymorphism (tSNP) arrays for populations of European and African descent to capture all the common variants of these key inflammation-related genes. Assays of these tSNPs have been designed and assembled into two Affymetrix ParAllele customized chips, one each for European (12,011 SNPs) and African (21,542 SNPs) populations. These tSNPs have greater coverage for these inflammation-related genes compared to the existing genome-wide arrays, particularly in the African population. These tSNP arrays can facilitate systematic evaluation of inflammation pathways in disease susceptibility. For additional applications, other genotyping platforms could also be employed. For existing genome-wide association data, this list of key inflammation-related genes and associated subpathways can facilitate comprehensive inflammation pathway- focused association analyses.
Citation: Loza MJ, McCall CE, Li L, Isaacs WB, Xu J, Chang B-L (2007) Assembly of Inflammation-Related Genes for Pathway-Focused Genetic Analysis. PLoS ONE 2(10): e1035. https://doi.org/10.1371/journal.pone.0001035
Academic Editor: Jeffrey Gold, Oregon Health & Science University, United States of America
Received: August 14, 2007; Accepted: September 25, 2007; Published: October 17, 2007
Copyright: © 2007 Loza et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The study is supported by National Heart, Lung, and Blood Institute-Women's Health Initiative HHSN268200764321C, National Cancer Institute CA119069 and CA105055.
Competing interests: The authors have declared that no competing interests exist.
Inflammation is an essential component of immune-mediated protection against pathogens and tissue damage. Immune responses are also responsible for the unfavorable rejection of tissue/organ transplants, hypersensitivity reactions (e.g., atopy, anaphylaxis, contact hypersensitivity, delayed-type hypersensitivity), and septic shock. Aberrant or unchecked immune responses may lead to a state of chronic inflammation –. This may occur when the immune response: 1) is activated in the absence of ‘danger’ signals; 2) fails to fully turn-off (resolve) after elimination of the danger; and 3) fails to completely clear the danger stimulus. Factors that may influence the initiation, activity, and resolution of immune responses include health (physical and emotional), age, diet, medications, and genetic predisposition.
Inflammation may also be a contributing factor for some diseases. The role of chronic inflammation in a wide variety of diseases is well-appreciated, including rheumatoid arthritis and other autoimmune disorders , cardiovascular disease –, gastrointestinal disorders , , and a number of cancers –. Perhaps the best evidence for the importance of chronic inflammation in disease is the efficacy of NSAIDs in reducing the risk or severity of these disorders . There is mounting evidence that dietary factors that may influence inflammation, such as the balance of omega-3 vs. omega-6 polyunsaturated fatty acids (PUFAs), have an impact on disease risk and progression . Genetic studies also provide evidence that variant alleles of genes associated with inflammatory pathways impact the risk of disease initiation, progression, and severity (see Table 1). The role of inflammation as a mediator of disease is currently receiving extensive attention, resulting in the National Institute of Allergy and Immunologic Diseases (NIAID) plans for an NIH Roadmap initiative with the overarching theme: “Inflammation as a Common Mechanism of Disease” (http://nihroadmap.nih.gov/inflammation/index.asp).
Numerous genetic linkage and case-control association studies have implicated genetic variations in genes important in immunity and inflammation and inflammatory diseases. Single missense heritable mutations can be the sole or major determinant for inflammatory diseases, such as Familial Cold Autoinflammatory Syndrome (missense mutations in exon 3 of Cias1 account for all cases) – and Familial Mediterranean Fever (MEFV, five founder missense mutations, when homozygous, account for 74% of cases) . For many complex inflammatory diseases, polymorphisms in inflammatory genes are more likely to act as modifiers for disease susceptibility rather than sole determinants. Recent meta-analyses report modest associations between single nucleotide polymorphisms (SNPs) in TNF (encoding TNF-α) and increased risk for asthma , system lupus erythamatosus (SLE) , and psoriatic arthritis . A variant of PTPN22 (encoding a lymphoid-specific protein tyrosine phosphatase) is modestly associated with multiple autoimmune diseases (rheumatoid arthritis, SLE, type 1 diabetes, and Graves' disease) . Associations of IRF5 (interferon regulator factor 5) genetic variants and increased SLE risk have been highly replicated . IFIH1, encoding an innate immunity viral mRNA detector (early type I IFN-β responsive gene, Helicard), is strongly associated with type 1 diabetes risk . Association of a small risk for type 1 diabetes and CTLA4 has been consistently replicated –. An insertion polymorphism in CARD15, the gene encoding the microbial nucleotide detector Nod2, is a major risk factor for Crohn's disease . An association between SNP's in IL23R (IL-23 receptor β-chain) and increased risk of inflammatory bowel disease has been reported for a genome-wide association study and confirmed in three independent populations . A recent meta-analysis demonstrated that an IL4R (IL-4 receptor alpha chain) variant modestly increases risk for atopic asthma . A description of these studies and reported associations is provided in Table 1. This list is not intended to be comprehensive but rather serves as an example of representative genetic variations which have been well-established to be associated with common inflammatory disorders.
In addition to inflammatory/autoimmune diseases, polymorphisms in inflammation-associated genes may also contribute to risk for diseases in which inflammatory/immune-disorders are not the primary characteristic. There is evidence from the Breast Cancer Association Consortium that CASP8 (caspase 8) and TGFB1 (TGF-β1) variants impart risk, albeit low penetrance, for breast cancer . Analysis of two independent populations suggest that a rare polymorphism in TNF may also be a low-penetrance risk factor for breast cancer . Several groups report associations between various PTGS2 (COX-2) genetic variants and colorectal cancer risk –. There is also evidence to suggest that a low frequency COX2 variant allele decreases prostate cancer risk , particularly in subjects who frequently eat fish . Interestingly, in a preliminary study using MegAllele™ I&I panel, including 9,275 SNPs in 1,086 genes involved in immunity and inflammation, more SNPs were found to be significantly associated with prostate cancer risk than expected by chance, which suggests multiple genetic variants in this pathway impart modest risk to prostate cancer .
Because genetic associations of disease and genetic variations in inflammatory genes are often relatively modest, it is likely that polymorphisms in multiple inflammatory genes cooperate in an additive or synergistic manner to impact disease risk. Pathway analyses may help to reveal gene-gene interactions or risks imparted independently from other genes in the pathway. The advantages of performing analyses at pathway levels are illustrated by Dinu et al. . Associations between CFH (complement factor H) and age-related macular degeneration have been replicated in numerous studies . To test whether genetic variations in the multiple complement pathway genes impact macular degeneration risk, Dinu et al. analyzed the existing genome-wide association study data of Klein et al. , restricting the analyses to genes in the complement pathway in subjects carrying the CFH risk allele. Significant associations were detected for a C7 and MBL2 variants and severity of macular degeneration in the context of the complement pathway analysis and the CHF risk allele, and these associations would not have been significant in a genome-wide analysis of the data.
Various aspects of immunity contribute to the development of an overall inflammatory immune response. These phases include the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. To broadly cover most aspects of inflammatory responses, the various phases of immune responses were considered in choosing genes for the SNP array panel, outlined in Table 2. A schematic representation of most of these phases is presented in Fig. 1.
Depicted is a schematic representation of an immune response to a generic pathogenic insult. The phases of immune responses (described in Table 2) are shown in bold. Additional aspects not shown are the involvement of secondary lymphoid tissues for initial T cell and B cell activation by dendritic cells that migrate from the site of inflammation to lymph nodes and other secondary lymphoid structures. The resolution of immune responses, immunological memory, and homeostasis are also not depicted.
Priority was given first to genes of known function in inflammatory responses (in both immune and non-immune cells), and then to genes expressed in immune cells with function implied by homology to other genes but exact function not clear. Ubiquitously expressed genes required for the normal function of most cell types of diverse origin were given lower priority. However, special emphasis was placed on genes at nodes for signaling to and from multiple pathways, most notably genes in NF-κB, MAPK, and PI3K signaling pathways.
Pathways were built using Ingenuity Pathways Analysis, as described in Methods section, using both pre-defined ‘canonical pathways’ and custom-built pathways based on our own queries for genes/pathways not included in the canonical pathways.
Multiple functional pathways are involved each of the immune response phases, and each functional pathway may contribute to several of the immune response phases. For example (Fig. 2), the response of a macrophage responding to a danger signal during a gram-negative bacterial infection involves innate pathogen recognition of LPS by the TLR4 complex. TLR4 transduces signals via NF-κB, MAPK, and PI3K signaling pathways, stimulating synthesis of eicosanoids and cytokines to signal other cells of the danger. LPS may also stimulate expression of stress-induced proteins, such as MIC-A and MIC-B (ligands for natural killer cell activating receptors) and T cell co-stimulatory molecules (B7 family proteins).
The concerted action of multiple functional subpathways in the initial response of a macrophage to bacteria or virus is depicted. Solid arrows indicate signaling events and dashed arrows stimulated production of proteins and other inflammatory mediators (including autocrine/paracrine responses of the macrophage to the released molecules).
Because immune response phases utilize multiple functional pathways and these pathways are overlapping among phases, the genes chosen for the SNP array panel were assigned to one of the following functional pathways:
Adhesion-Extravasation-Migration: adhesion molecules; chemoattractants and chemoattractant receptors; cytoskeletal rearrangement signaling, motility proteins.
Apoptosis signaling: death receptors and ligands and extrinsic apoptosis pathway signaling; mitochondrial-dependent, intrinsic apoptosis pathway signaling; cellular stress signaling.
Calcium signaling: NF-ATs; calcineurins; calcium/calmodulin-dependent kinases.
Complement cascade: components of classical, alternative, and lectin-dependent complement pathways.
Cytokines and cytokine signaling: cytokines; cytokine receptors; cytokine-dependent signaling, including JAK-STAT and interferon-regulatory factor (IRF) pathways; suppressor of cytokine signaling proteins (SOCS).
Eicosanoid synthesis and receptors: enzymes involved in synthesis of prostanoids, leukotrienes, hepoxylins (12-HETE), and lipoxins from arachidonic acid; prostanoid and leukotriene receptors.
Glucocorticoid/PPAR signaling: nuclear receptors for glucocorticoids; steroid-interacting proteins; PPARs and associated proteins.
G-Protein Coupled Receptor Signaling: GPCRs (other than eicosanoid receptors and chemokine receptors); G-protein-dependent signaling pathways, including cAMP-PKA and phospholipase B2.
Innate pathogen detection: Toll-like receptors (TLRs); intracellular nucleotide detectors (e.g., Nod1 and putative members of CARD/Nod family); peptidoglycan recognition proteins; associated signaling molecules linking these detectors to major signaling pathways.
Leukocyte signaling: Signaling molecules, receptors, and adaptors important for regulation of leukocyte activation beyond major signaling pathways (i.e., MAPK, PI3K/Akt, NF-κB, GPCR, and cytokine signaling pathways); including, but not limited to, T cell receptor (TCR) and B cell receptor (BCR) signaling components, B7 family, phosphatases, Foxp3, immunoglobulin receptors, leukocyte inhibitory receptor (CD85) family, scavenger receptors.
MAPK signaling: p38 stress-activated protein kinase, p42/p44 extracellular-regulated kinase (Erk), and Jun kinase (Jnk) signaling pathways.
Natural Killer Cell Signaling: Natural killer (NK) cell-specific activating and inhibitory receptors; adaptors and signaling molecules for transducing NK cell-specific receptor signaling.
NF-kB signaling: Molecules for regulation of NF-κB activation, including adaptors linking. other pathways to and from the NF-κB pathway (e.g., from MAPK, pathogen-detection, and TNF-α signaling pathways).
Antigen presentation: Major Histocompatibility Complex (MHC) molecules and associated proteins; proteins involved in uptake, processing, and loading of peptides on MHC molecules.
PI3K/AKT Signaling: Molecules involved in regulation of PI3K-dependent signaling, including adaptors linking other pathways to and from the PI3K pathway.
ROS/glutathione/cytotoxic granules: Molecules involved in the generation and response to leukocyte-derived cytotoxic agents (reactive oxygen species (ROS), nitric oxide, cytotoxic granules of granulocytes and natural killer cells), including contents of granulocyte and NK cell cytotoxic granules; glutathione peroxidases; peroxiredoxins; catalase; proteinases; superoxide dismutase.
TNF superfamily and signaling: Receptors and ligands of the TNF-α superfamily; adaptors and signaling molecules involved in transducing signals from receptor stimulation to other major signaling pathways (e.g., MAPK and NF-κB pathways).
Examples of the functional subpathways and types of genes chosen for the different phases of immune responses are presented in Table 3 (proteins encoded by the genes are provided). Most of these immune response phases also utilize common signaling pathways, including elements of MAPK, NF-κB, PI3K/Akt, GCPR, cytokine, and leukocyte signaling pathways. An example of the integration of multiple functional subpathways for one phase of an immune response (danger signal) is depicted in Fig. 2. The numbers of genes chosen in each subpathway are listed in Table 4, and the complete list of 1027 candidate genes and their primary subpathway (and secondary subpathway for a subset of genes) is provided in Supplementary Table S1.
Example application: WFINFLAM tSNP panel
Because the components comprising inflammation are very numerous and interacting across many pathways, without strong a priori evidence it is difficult to choose a handful of candidate genes to fully cover the potential genetic risk factors contributing to the inflammatory component of a particular disease. For this reason, panels of single nucleotide polymorphisms (SNPs) in an array of inflammation-related genes broadly covering most aspects of immunity and inflammation based on our assembled list will be critical in objectively evaluating the impact of genetic variations in inflammation-related genes on an inflammation-dependent outcome.
There is a commercially available product, Affymetrix GeneChip® Human Immune and Inflammation 9K SNP panel, that attempts to serve the purpose. This application -specific panel contains ∼9,000 SNPs to cover ∼1,000 immunity- and inflammation- related genes (http://www.affymetrix.com/support/technical/datasheets/humanimmune_9k_snp_datasheet.pdf). However, the rationale for choosing the ∼1,000 immunity and inflammation genes in this panel is not clear, and the coverage of these genes, regardless of their biological functions, may not be sufficient. In order to investigate the impact of genetic variations in a broad array of inflammation-related genes on disease risk, we created two tagging SNP (tSNP) panels, one each for populations with either Caucasian or African ancestries, for Affimetrix ParAllele genotyping chips. These tSNP panels were designed to capture majority of the genetic variations in these 1027 inflammation candidate genes.
tSNPs for the 1027 inflammation-associated candidate genes were chosen based on a pair-wise r2 threshold of 0.8 and MAF ≥5% using data in the HapMap Phase II database (HapMap Data Release 21a/phaseII; http://hapmap.org/index.html.en). tSNPs were chosen separately for the CEU (representing European ancestry) and YRI populations (representing African ancestry). In order to accommodate as many relevant inflammation genes as possible, less stringent criteria with r2 threshold >0.5 were employed for intronic regions (excluding 5kb in the beginning and end of these big introns) greater than 50kb in certain large genes with >100 tSNPs. There were seven genes in this category, and the intronic regions for which a less stringent r2 threshold was applied are listed in Supplementary Table S2. For genes without genotype information from HapMap, additional SNPs in six inflammation genes were chosen to be included based on information from other resources (see Supplementary Table S3), as described in Methods section.
The resulting inflammation tSNP panels, WFINFLAM-CEU for Caucasians and WFINFLAM-YRI for African descent, include 12,011 SNPs and 21,542 SNPs respectively in 1027 inflammation-associated candidate genes. There is an average of 11.7 and 21 SNPs in each candidate gene in WFINFLAM-CEU and WFINFLAM-YRI panels, respectively. Table 4 briefly summarizes the numbers of genes and tSNPs included in each subpathway. The annotation table for this panel, including the list of 1027 genes, their chromosomal positions, their associated primary and secondary subpathways, and the number of tSNPs chosen for each gene can be found in Supplementary Table S1 (and also our website: http://www1.wfubmc.edu/Genomics/PublicationsandData/). Additionally, the annotation file for the SNPs included in the WFINFLAM-CEU and WFINFLAM-YRI panels, including the chromosomal locations, their associated genes and the sub-pathways, can also be found in Supplementary Tables S4 and S5 (and our website: http://www1.wfubmc.edu/Genomics/PublicationsandData/). The coverage for the 1027 inflammation candidate genes in CEU is better compared to other widely used genome-wide association panels. For the coverage of the 1027 inflammation-associated candidate genes, 90.4% of the genes have 90% or more SNPs within these genes that can be captured by r2≥0.8 in CEU using WFINFLAM-CEU panel, compared to 78.9% for the Illumina HumanHap 550 genome-wide panel and 45.8% for the Affymetrix 500k genome-wide panel. For populations with African ancestry, the coverage for the 1027 inflammation candidate genes in YRI is greatly improved compared to other widely used genome-wide association panels. For the coverage of the 1027 inflammation-associated candidate genes, 88.2% of the genes have 90% or more SNPs within these genes that can be captured by r2≥0.8 in YRI using WFINFLAM-YRI panel, compared to 27.1% for the Illumina HumanHap 650k genome-wide panel (which was designed to capture more genetic information from YRI population), and 12.5% for the Affymetrix 500k genome-wide panel. The coverage for all these genotyping panels is detailed in Supplementary Table S6.
Various components and complex interactions comprise immune and inflammation responses, and numerous genes are involved in this complex network. With a thorough review of various aspects of inflammatory immune responses, and a systematic search for gene-gene interactions using Ingenuity Pathway Analysis, we have provided a comprehensive list of inflammation-associated genes and subpathways for genetic association studies.
Genome-wide association studies have been a very popular approach to test the association between disease phenotypes and genetic variations. However, we believe there are still several advantages for a pathway-focused study. First of all, compared to whole-genome analyses, restricting analyses to SNPs in a specific pathway reduces the number of multiple tests performed in the analysis of a study population, thereby reducing the probability of false positive associations and increasing the effective power of the study. This kind of study design is particularly effective when inflammation plays an important role in disease etiology and the goal of the studies is to delineate genetic variations in inflammation pathway to disease risk and/or progression. A related second advantage of restricted pathway analysis is in study design. A large proportion of investigators may not have access to the very large number of subjects and multiple confirmation populations needed to overcome false positive associations due to multiple testing in genome-wide association studies. Studies restricted to a pathway analysis permit the use of study populations that are not large enough for use in whole-genome association studies. When target diseases are not prevalent and inflammation is obviously involved in disease etiology, researchers will gain the most out of an inflammation pathway-specific study design. Although some genes not related to inflammation found in whole-genome panels may impart some risk to inflammation-associated diseases, associated genetic variants would be anticipated to be concentrated in a panel of SNPs in inflammation-associated genes. Therefore, the drawback of potentially missing associated non-inflammation genes is offset by the increased probability of detecting true associations in an inflammation-restricted panel. Thirdly, pathway analysis is far less expensive to perform than whole-genome analysis, especially considering the cost for second, and/or third stage confirmation studies needed to follow-up the significant results from an initial screening in order to rule out false positive associations. Lastly, the functional subpathways are also pre-defined with available biological information. This refined information provides investigators with the opportunity to test gene-gene interactions within subpathways in which synergistic interactions are more likely to be concentrated. Additionally, the interplays between subpathways are also clearly defined to enable investigators to test biologically feasible interactions between subpathways.
However, the results from this manuscript also have potential utility for investigators who have more interests in surveying the whole genome. For whole-genome analyses where there is a prior hypothesis for inflammation being associated with the outcome, the inflammation pathway and subpathways defined in this manuscript may provide a framework for testing whether SNPs in the inflammation pathway or subpathways as a whole are overrepresented for significant associations to the outcome. Although pathway networks can be constructed for whole-genome analyses, such networks should be designed a priori before beginning the study.
The WFINFLAM and WFINFLAM-YRI SNP array panels for inflammation-associated genes provide a powerful tool for analyzing the contribution of genetic variation in diseases that have inflammatory components. Although whole-genome SNP panels are currently available that include almost all of the genes included in the WFINFLAM panels, the coverage of SNPs in genes included in the WFINFLAM array is superior to the coverage of currently available in whole-genome arrays, especially for populations with African ancestry background. For researchers who would like to use other genotyping platforms, the inflammatory gene list provided here would be a good starting point for designing genotyping assays for other platforms. Additionally, alternate approaches, other than r2 based method, for choosing SNPs based on the inflammatory gene list provided here could also be considered. For example, researchers may specifically focus on “high-prior” polymorphisms that are known to be functional or have been previously linked to the specific diseases under study, alone or in combination with the tSNPs provided in the WFINFLAM panel. This approach may be more efficient and powerful than the r2 based method alone, especially if the targeted “high-prior” polymorphisms are causal and their linkage to the nearby tSNPs is incomplete.
In addition, precaution may be warranted for tSNP panels designed based on the HapMap project. The transferability of the LD patterns between populations studied in HapMap project and other study populations may need to be validated. The transferability of HapMap-based selection of tSNPs using the reference CEU population to several other diverse populations of European ancestry has been demonstrated to be almost as effective for overall SNP coverage in selected genomic regions or randomly selected SNPs in the respective populations –. However, the coverage with tSNPs based on HapMap CEU data for a small subset of specific genes or SNPs may be lower in certain populations despite overall similar coverage , particularly for isolated indigenous populations . Other than the data provided by the HapMap project, we are not aware of complete sequence variant information available for validation of the transferability of tSNP panels that were designed based on HapMap data to other study populations.
In summary, pathway analysis of inflammation-associated genes is a powerful approach for determining genetic risk factors for both inflammatory diseases and other diseases that may have an under-appreciated modest inflammatory component, such as cancers. The inflammation pathway gene list and functionally-defined subpathways provide useful tools for assessing the impact of genetic variations in inflammation pathways on disease risk, in situations where either pathway-focused studies or genome-wide analyses are employed.
Selection of genes
Networks of genes involved in the regulation of the phases of immune responses (described in Results and Table 2 ) were built using Ingenuity Pathways Analysis (Ingenuity Systems: www.analysis.ingenuity.com). Both pre-defined ‘canonical pathways’ and custom-built pathways based on our own queries for genes/pathways not included in the canonical pathways were used to establish these networks. Canonical pathways included: actin cytoskeleton signaling; antigen presentation pathway; apoptosis signaling; B cell receptor signaling; calcium signaling; cAMP signaling; chemokine signaling; complement and coagulation cascade; death receptor signaling; ERK/MAPK signaling; FcEpsilon receptor signaling; G-coupled protein receptor signaling; GM-CSF signaling; IGF-1 signaling; IL-10 signaling; IL-2 signaling; IL-4 signaling; IL-6 signaling; integrin signaling; interferon signaling; JAK/STAT signaling; leukocyte extravasation signaling; natural killer cell signaling; NF-kB signaling; nitric oxide signaling; notch signaling; p38 MAPK signaling; PI3K/Akt signaling; PPAR signaling; protein ubiquination pathway; PTEN signaling; SPK/JNK signaling; T cell receptor signaling; TGF-β signaling; Toll-like receptor signaling.
These networks were then arranged into inflammation subpathways by:
- combining several networks together (e.g., ERK/MAPK, p38 MAPK, and SAPK/JNK canonical pathways into the ‘MAPK signaling’ inflammation subpathway; IL-2-, IL-4-, IL-6-, IL-10-, Interferon-, GM-CSF-, IGF-1-, JAK/STAT6-, and TGF-β- signaling canonical pathways into ‘cytokine signaling’ inflammation subpathway; actin cytoskeleton-, chemokine-, integrin-, and leukocyte extravasation- signaling canonincal pathways into the ‘adhesion-extravasation-migration’ inflammation subpathway; etc.);
- adding additional genes to bridge networks within a subpathway and to include appropriate genes not included in the canonical pathways (e.g., additional cytokines and their receptors were added to the ‘cytokine signaling’ inflammation subpathway; additional integrins and chemokines/chemoattractant molecules were added to the ‘adhesion-extravasation-migration’ inflammation subpathway; CD antigens expressed by leukocytes not already included were considered for addition to several subpathways; other missing genes/pathways considered important by the panel of investigators, such as the scavenger receptor network for ‘leukocyte signaling’ inflammation subpathway and Nod1/CARD family networks for ‘innate pathogen detection’ inflammation subpathway; etc.);
- trimming the networks of genes with low priority for inclusion in the inflammation panel. Genes with lower priority include: genes not expressed in immune cells or not directly involved in cells responding to inflammation, including non-immune cells (e.g., skeletal muscle-specific myosins in the actin cytoskeleton signaling canonical pathway; calsequestrins expressed mainly in various muscle cells, in the calcium signaling canonical pathway; GH1 and GHR, growth hormone expressed in pituitary gland and its receptor, and NGFB and NGFR, nerve growth factor and its receptor, in the NF-κB canonical pathway; etc.; MAPK8IP1, specific for pancreatic cell function, in the PI3K/AKT canonical pathway; etc.); genes with unknown function, though genes with high homology to known inflammatory mediators were considered (e.g., bcl-2 family homologs, IL-1β family homologs). Special emphasis was placed on genes at nodes for signaling to and from multiple pathways, most notably genes in NF-κB, MAPK, and PI3K signaling pathways.
Choosing tagging SNPs
Tagging SNPs (tSNPs) for candidate genes were chosen using Tagger server (http://www.broad.mit.edu/mpg/tagger/server.html). The target sequences included genomic regions containing the entire candidate genes, 5kb before transcription start site, and 2kb after the transcription end site, based on annotation in NCBI Build 35. When candidate regions for two or more genes overlapped, the combined genomic regions were used for choosing tSNPs. Two separate lists of non-synonymous coding SNPs for the CEPH population (CEU) and for the Yuruba population (YRI) were downloaded from HapMap (rel21a_NCBI_Build35) and forced in as tagging SNPs for ancestry-specific panels. Pair-wise r2 threshold of 0.8 and MAF≥5% were used. For genes without genotype information from HapMap, SNPs in Affymetrix 500k array, as well as SNPs with frequency data from the Innate Immunity PGA (IIPGA) (http://innateimmunity.net/), were manually chosen to be included in the list based on allele frequencies and inter-SNP distance (the chromosomal positions are based on IIPGA annotation because some of these genes/SNPs were ambiguously mapped in build 35). Some tSNPs were dropped out from the final list because they did not pass the Affymetrix design review due to the following reasons: non-biallelic SNPs, existing SNPs or ambiguous bases too close to the SNP of interest, SNPs exceeding Tm ranges, or SNPs failing BLAST searches. In the attempt to fill in the gaps due to tSNPs failing design criteria, tSNPs for genes with dropped-out tSNPs were chosen again by forcing out the tSNPs failing design review and forcing in the remaining tSNPs when running Tagger. In total, there are 12,011 SNPs in WFINFLAM panel for Caucasians and 21,542 SNPs in WFINFLAM-YRI for African descents in 1027 inflammation-associated candidate genes that passed the Affymetrix design review.
Estimation of genomic coverage by tagging SNPs
We used LdCompare (Hao 2006; http://www.affymetrix.com/support/developer/tools/devnettools.affx) to estimate the coverage of the 1027 inflammation-associated candidate genes with both WFINFLAM panel for Caucasians and WFINFLAM-YRI panel for African descents. We have also computed the coverage of several commercial genotyping arrays, including Affymetrix Mapping 500K, Illumina HumanHap 550 and HumanHap 650, for these 1027 genes. Single-marker coverage (pairwise r2) and multiple-marker coverage (multiple-marker r2) were computed and combined to estimate the coverage for the genomic regions containing the entire candidate genes, 5kb before transcription start site, and 2kb after the transcription end site. The summary for the coverage for these genes in all these genotyping panels are detailed in Table S6.
List of genes included in WFINFLAM and their associated sub-pathways
(0.19 MB XLS)
Genes with big intronic regions
(0.06 MB XLS)
SNPs chosen for the six genes without HapMap genotyping data
(0.04 MB XLS)
Annotation for tSNPs in WFINFLAM_CEU panel
(1.64 MB XLS)
Annotation for tSNPs in WFINFLAM_YRI panel
(3.39 MB XLS)
Conceived and designed the experiments: JX BC ML. Analyzed the data: BC ML. Wrote the paper: BC ML. Other: Contributed to compilation of data: JX WI LL CM ML BC.
- 1. Lawrence T, Gilroy DW (2007) Chronic inflammation: a failure of resolution? Int J Exp Pathol 88: 85–94.
- 2. Han J, Ulevitch RJ (2005) Limiting inflammatory responses during activation of innate immunity. Nat Immunol 6: 1198–1205.
- 3. Forrester JS, Bick-Forrester J (2005) Persistence of inflammatory cytokines cause a spectrum of chronic progressive diseases: implications for therapy. Med Hypotheses 65: 227–231.
- 4. Goronzy JJ, Weyand CM (2005) Rheumatoid arthritis. Immunol Rev 204: 55–73.
- 5. Ross R (1999) Atherosclerosis–an inflammatory disease. N Engl J Med 340: 115–126.
- 6. Willerson JT, Ridker PM (2004) Inflammation as a cardiovascular risk factor. Circulation 109: II2–10.
- 7. Ridker PM, Cushman M, Stampfer MJ, Tracy RP, Hennekens CH (1997) Inflammation, aspirin, and the risk of cardiovascular disease in apparently healthy men. N Engl J Med 336: 973–979.
- 8. Neuman MG (2007) Immune dysfunction in inflammatory bowel disease. Transl Res 149: 173–186.
- 9. James SP (2005) Prototypic disorders of gastrointestinal mucosal immune function: Celiac disease and Crohn's disease. J Allergy Clin Immunol 115: 25–30.
- 10. Coussens LM, Werb Z (2002) Inflammation and cancer. Nature 420: 860–867.
- 11. Ernst PB, Gold BD (2000) The disease spectrum of Helicobacter pylori: the immunopathogenesis of gastroduodenal ulcer and gastric cancer. Annu Rev Microbiol 54: 615–640.
- 12. Karin M, Cao Y, Greten FR, Li ZW (2002) NF-kappaB in cancer: from innocent bystander to major culprit. Nat Rev Cancer 2: 301–310.
- 13. Kuper H, Adami HO, Trichopoulos D (2000) Infections as a major preventable cause of human cancer. J Intern Med 248: 171–183.
- 14. Shacter E, Weitzman SA (2002) Chronic inflammation and cancer. Oncology (Williston Park) 16: 217–26, 229.
- 15. Ulrich CM, Bigler J, Potter JD (2006) Non-steroidal anti-inflammatory drugs for cancer prevention: promise, perils and pharmacogenetics. Nat Rev Cancer 6: 130–140.
- 16. Calder PC (2006) Polyunsaturated fatty acids and inflammation. Prostaglandins Leukot Essent Fatty Acids 75: 197–202.
- 17. Aganna E, Martinon F, Hawkins PN, Ross JB, Swan DC, et al. (2002) Association of mutations in the NALP3/CIAS1/PYPAF1 gene with a broad phenotype including recurrent fever, cold sensitivity, sensorineural deafness, and AA amyloidosis. Arthritis Rheum 46: 2445–2452.
- 18. Dode C, Le Du N, Cuisset L, Letourneur F, Berthelot JM, et al. (2002) New mutations of CIAS1 that are responsible for Muckle-Wells syndrome and familial cold urticaria: a novel mutation underlies both syndromes. Am J Hum Genet 70: 1498–1506.
- 19. Hoffman HM, Mueller JL, Broide DH, Wanderer AA, Kolodner RD (2001) Mutation of a new gene encoding a putative pyrin-like protein causes familial cold autoinflammatory syndrome and Muckle-Wells syndrome. Nat Genet 29: 301–305.
- 20. Touitou I (2001) The spectrum of Familial Mediterranean Fever (FMF) mutations. Eur J Hum Genet 9: 473–483.
- 21. Aoki T, Hirota T, Tamari M, Ichikawa K, Takeda K, et al. (2006) An association between asthma and TNF-308G/A polymorphism: meta-analysis. J Hum Genet 51: 677–685.
- 22. Lee YH, Harley JB, Nath SK (2006) Meta-analysis of TNF-alpha promoter -308 A/G polymorphism and SLE susceptibility. Eur J Hum Genet 14: 364–371.
- 23. Rahman P, Siannis F, Butt C, Farewell V, Peddle L, et al. (2006) TNFalpha polymorphisms and risk of psoriatic arthritis. Ann Rheum Dis 65: 919–923.
- 24. Lee YH, Rho YH, Choi SJ, Ji JD, Song GG, et al. (2007) The PTPN22 C1858T functional polymorphism and autoimmune diseases–a meta-analysis. Rheumatology (Oxford) 46: 49–56.
- 25. Graham RR, Kozyrev SV, Baechler EC, Reddy MV, Plenge RM, et al. (2006) A common haplotype of interferon regulatory factor 5 (IRF5) regulates splicing and expression and is associated with increased risk of systemic lupus erythematosus. Nat Genet 38: 550–555.
- 26. Smyth DJ, Cooper JD, Bailey R, Field S, Burren O, et al. (2006) A genome-wide association study of nonsynonymous SNPs identifies a type 1 diabetes locus in the interferon-induced helicase (IFIH1) region. Nat Genet 38: 617–619.
- 27. Howson JM, Dunger DB, Nutland S, Stevens H, Wicker LS, et al. (2007) A type 1 diabetes subgroup with a female bias is characterised by failure in tolerance to thyroid peroxidase at an early age and a strong association with the cytotoxic T-lymphocyte-associated antigen-4 gene. Diabetologia 50: 741–746.
- 28. Payne F, Cooper JD, Walker NM, Lam AC, Smink LJ, et al. (2007) Interaction analysis of the CBLB and CTLA4 genes in type 1 diabetes. J Leukoc Biol 81: 581–583.
- 29. Ueda H, Howson JM, Esposito L, Heward J, Snook H, et al. (2003) Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature 423: 506–511.
- 30. Oostenbrug LE, Nolte IM, Oosterom E, van der SG, te Meerman GJ, et al. (2006) CARD15 in inflammatory bowel disease and Crohn's disease phenotypes: an association study and pooled analysis. Dig Liver Dis 38: 834–845.
- 31. Duerr RH, Taylor KD, Brant SR, Rioux JD, Silverberg MS, et al. (2006) A genome-wide association study identifies IL23R as an inflammatory bowel disease gene. Science 314: 1461–1463.
- 32. Loza MJ, Chang B (2007) Association between Q551R IL4R genetic variants and atopic asthma risk demonstrated by meta-analysis. J Allergy Clin Immunol In press.
- 33. Breast Cancer Association Consortium (2006) Commonly studied single-nucleotide polymorphisms and breast cancer: results from the Breast Cancer Association Consortium. J Natl Cancer Inst 98: 1382–1396.
- 34. Gaudet MM, Egan KM, Lissowska J, Newcomb PA, Brinton LA, et al. (2007) Genetic variation in tumor necrosis factor and lymphotoxin-alpha (TNF-LTA) and breast cancer risk. Hum Genet.
- 35. Cox DG, Pontes C, Guino E, Navarro M, Osorio A, et al. (2004) Polymorphisms in prostaglandin synthase 2/cyclooxygenase 2 (PTGS2/COX2) and risk of colorectal cancer. Br J Cancer 91: 339–343.
- 36. Koh WP, Yuan JM, van den BD, Lee HP, Yu MC (2004) Interaction between cyclooxygenase-2 gene polymorphism and dietary n-6 polyunsaturated fatty acids on colon cancer risk: the Singapore Chinese Health Study. Br J Cancer 90: 1760–1764.
- 37. Lin HJ, Lakkides KM, Keku TO, Reddy ST, Louie AD, et al. (2002) Prostaglandin H synthase 2 variant (Val511Ala) in African Americans may reduce the risk for colorectal neoplasia. Cancer Epidemiol Biomarkers Prev 11: 1305–1315.
- 38. Siezen CL, van Leeuwen AI, Kram NR, Luken ME, van Kranen HJ, et al. (2005) Colorectal adenoma risk is modified by the interplay between polymorphisms in arachidonic acid pathway genes and fish consumption. Carcinogenesis 26: 449–457.
- 39. Siezen CL, Bueno-de-Mesquita HB, Peeters PH, Kram NR, van Doeselaar M, et al. (2006) Polymorphisms in the genes involved in the arachidonic acid-pathway, fish consumption and the risk of colorectal cancer. Int J Cancer 119: 297–303.
- 40. Shahedi K, Lindstrom S, Zheng SL, Wiklund F, Adolfsson J, et al. (2006) Genetic variation in the COX-2 gene and the association with prostate cancer risk. Int J Cancer 119: 668–672.
- 41. Hedelin M, Chang ET, Wiklund F, Bellocco R, Klint A, et al. (2007) Association of frequent consumption of fatty fish with prostate cancer risk is modified by COX-2 polymorphism. Int J Cancer 120: 398–405.
- 42. Zheng SL, Liu W, Wiklund F, Dimitrov L, Balter K, et al. (2006) A comprehensive association study for genes in inflammation pathway provides support for their roles in prostate cancer risk in the CAPS study. Prostate 66: 1556–1564.
- 43. Dinu V, Miller PL, Zhao H (2007) Evidence for association between multiple complement pathway genes and AMD. Genet Epidemiol.
- 44. Conley YP, Jakobsdottir J, Mah T, Weeks DE, Klein R, et al. (2006) CFH, ELOVL4, PLEKHA1 and LOC387715 genes and susceptibility to age-related maculopathy: AREDS and CHS cohorts and meta-analyses. Hum Mol Genet 15: 3206–3218.
- 45. Klein RJ, Zeiss C, Chew EY, Tsai JY, Sackler RS, et al. (2005) Complement factor H polymorphism in age-related macular degeneration. Science 308: 385–389.
- 46. Willer CJ, Scott LJ, Bonnycastle LL, Jackson AU, Chines P, et al. (2006) Tag SNP selection for Finnish individuals based on the CEPH Utah HapMap database. Genet Epidemiol 30: 180–190.
- 47. De Bakker PI, Graham RR, Altshuler D, Henderson BE, Haiman CA (2006) Transferability of tag SNPs to capture common genetic variation in DNA repair genes across multiple populations. Pac Symp Biocomput 478–486.
- 48. Montpetit A, Nelis M, Laflamme P, Magi R, Ke X, et al. (2006) An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population. PLoS Genet 2: e27.
- 49. Tenesa A, Dunlop MG (2006) Validity of tagging SNPs across populations for association studies. Eur J Hum Genet 14: 357–363.
- 50. Xu Z, Kaplan NL, Taylor JA (2007) Tag SNP selection for candidate gene association studies using HapMap and gene resequencing data. Eur J Hum Genet.
- 51. Mueller JC, Lohmussaar E, Magi R, Remm M, Bettecken T, et al. (2005) Linkage disequilibrium patterns and tagSNP transferability among European populations. Am J Hum Genet 76: 387–398.
- 52. Johansson A, Vavruch-Nilsson V, Cox DR, Frazer KA, Gyllensten U (2007) Evaluation of the SNP tagging approach in an independent population sample-array-based SNP discovery in Sami. Hum Genet 122: 141–150.