Spondyloarthritis (SpA) is a chronic inflammatory disorder with a strong genetic predisposition dominated by the role of HLA-B27. However, the contribution of other genes to the disease susceptibility has been clearly demonstrated. We previously reported significant evidence of linkage of SpA to chromosome 9q31–34. The current study aimed to characterize this locus, named SPA2. First, we performed a fine linkage mapping of SPA2 (24 cM) with 28 microsatellite markers in 149 multiplex families, which allowed us to reduce the area of investigation to an 18 cM (13 Mb) locus delimited by the markers D9S279 and D9S112. Second, we constructed a linkage disequilibrium (LD) map of this region with 1,536 tag single-nucleotide polymorphisms (SNPs) in 136 families (263 patients). The association was assessed using a transmission disequilibrium test. One tag SNP, rs4979459, yielded a significant P-value (4.9×10−5). Third, we performed an extension association study with rs4979459 and 30 surrounding SNPs in LD with it, in 287 families (668 patients), and in a sample of 139 cases and 163 controls. Strong association was observed in both familial and case/control datasets for several SNPs. In the replication study, carried with 8 SNPs in an independent sample of 232 cases and 149 controls, one SNP, rs6478105, yielded a nominal P-value<3×10−2. Pooled case/control study (371 cases and 312 controls) as well as combined analysis of extension and replication data showed very significant association (P<5×10−4) for 6 of the 8 latter markers (rs7849556, rs10817669, rs10759734, rs6478105, rs10982396, and rs10733612). Finally, haplotype association investigations identified a strongly associated haplotype (P<8.8×10−5) consisting of these 6 SNPs and located in the direct vicinity of the TNFSF15 gene. In conclusion, we have identified within the SPA2 locus a haplotype strongly associated with predisposition to SpA which is located near to TNFSF15, one of the major candidate genes in this region.
Spondyloarthritis (SpA) is a common variety of articular inflammatory disorder characterized by axial and/or peripheral arthritis, frequently associated with extra-articular manifestations such as psoriasis, uveitis, and inflammatory bowel diseases (ulcerative colitis or Crohn's disease (CD)). SpA is a complex disorder with high heritability. The MHC class I HLA-B27 allele is a very strong risk factor for its development, but other genetic factors located outside the MHC also play a role in disease susceptibility. By a previous whole-genome linkage investigation, we have demonstrated that a region located on the chromosome 9q31–34 was involved in SpA susceptibility. The present study aimed to further characterize this locus. Using a stepwise linkage and association approach, we identified a haplotype spanning 6 single-nucleotide polymorphisms strongly associated with SpA and located in a genomic region paralogous to the MHC, near to the TNFSF15 gene. Interestingly, polymorphisms of this gene have previously been shown to be associated with CD. This original finding offers a new research track for the understanding of SpA pathophysiology, which is still poorly understood, as well as new hope for diagnostic and therapeutic innovation.
Citation: Zinovieva E, Bourgain C, Kadi A, Letourneur F, Izac B, et al. (2009) Comprehensive Linkage and Association Analyses Identify Haplotype, Near to the TNFSF15 Gene, Significantly Associated with Spondyloarthritis. PLoS Genet 5(6): e1000528. doi:10.1371/journal.pgen.1000528
Editor: Kathleen Kerr, University of Washington, United States of America
Received: August 21, 2008; Accepted: May 19, 2009; Published: June 19, 2009
Copyright: © 2009 Zinovieva et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was financially supported by grants from the French Rheumatology Society (http://www.rhumatologie.asso.fr), from the National Research Program on Oseo-Articular Diseases (PRO-A), from the “Arthritis-Courtin” foundation (http://www.fondation-arthritis.org/), and by unrestricted grant from the Schering-Plough company. Elena Zinovieva benefited for three years from a fellowship grant from the “Arthritis-Courtin” foundation (http://www.fondation-arthritis.org/).The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Spondyloarthritis (SpA) is one of the most frequent varieties of articular inflammatory disorders with an estimated prevalence of 0.3% in the western European adult population . It is characterized by a predominant axial skeleton inflammation, by a frequent occurrence of enthesitis and peripheral arthritis, and also by a high rate of extra-articular features, the most characteristic of which are acute anterior uveitis, psoriasis, and inflammatory bowel diseases (such as ulcerative colitis or Crohn's disease (CD)) . Depending on its clinical features, SpA is classically subdivided into the following subsets: ankylosing spondylitis (AS), which is the prototypical form characterized by predominant axial skeletal involvement and advanced radiographic sacroiliitis, psoriatic arthritis (PsA), arthritis associated with inflammatory bowel disease (AIBD), reactive arthritis (ReA), and undifferentiated SpA (uSpA). Familial aggregation among these conditions has been well established. Notably, we have previously shown, by analyzing a large number of pedigrees with multiple cases of SpA, that all subtypes are likely to be determined by a core set of predisposing factors and may therefore be studied together in genetic studies –.
The HLA-B27 allele is the first genetic factor which was demonstrated to be associated with AS , and other SpA ,,. Although about 80% of Caucasian patients are HLA-B27 positive, as compared to only 6–8% in the general population, the exact mechanism for this association remains poorly understood .
Family and twin studies have demonstrated additional non-MHC susceptibility regions elsewhere in the genome . For example, concordance rates for HLA-B27 positive monozygotic twins are twice as high as the concordance rates for HLA-B27 positive dizygotic twins . Furthermore, the involvement of genetic factors arising from outside the HLA region is suggested by the large non-HLA component of the relative recurrence risk for the SpA estimated in sib-pairs (λnon-HLA). Indeed, if the overall relative recurrence risk in sibling (λs) has been estimated to be 40 , estimates of the λs component attributable to the HLA region (λHLA), based on previous affected sib-pairs linkage analyses, ranges from 5.2 to 6.25 ,. Variants in several genes such as the IL-1 family gene cluster ,, IL-23R , and ARTS1/ERAP1 , have recently been reported to be associated with AS based on a candidate-gene approach , or a non-synonymous single-nucleotide polymorphisms (SNPs) genome-wide association study .
Our team has previously reported results of the first genome-wide linkage screen and its extension study performed in SpA . Overall, 893 individuals from 120 multiplex families (families with several patients) comprising 336 affected relative pairs have been genotyped in this study. Non parametric multipoint linkage analysis of the whole dataset yielded evidence for significant linkage to the chromosomal region 9q31–34 (NPLmax = 4.87, P = 2×10−5). This locus overlapped with one of those identified by the genome-wide linkage screen performed in AS by a group from Oxford . We named this new susceptibility location SPA2, in reference to the MHC locus, which we considered as SPA1 . SPA2 encompasses a 23.95 cM region (17.44 Mb) containing 85 genes and predicted coding sequences as well as 110 pseudogenes. This locus is very appealing with regard to SpA susceptibility. First of all, it is one of three genomic regions paralogous to the MHC, which is the major SpA susceptibility region ,. Furthermore, it is syntenic to the Pgis2 susceptibility locus mapped in a murine model of SpA . Within its borders SPA2 contains both the TNFSF15 gene found to be associated with CD a condition belonging to the SpA spectrum ,, and the TRAF1-C5 locus associated with rheumatoid arthritis another inflammatory rheumatic disease ,.
The goal of the present study was to identify variants associated with the disease and located in the SPA2 locus. Using a comprehensive four-step linkage and association study in a total of 287 families including 668 affected individuals, followed by an independent case/control analysis (2 samples including a total of 371 cases and 312 controls), we identified a strongly associated six-SNPs haplotype, located at 28.6 kb from the TNFSF15 gene.
Linkage fine mapping
The initial step of our study aimed to refine the linkage signal in the 23.95 cM (17.44 Mb) SPA2 locus. To realise this investigation we selected a fine-grained set of 28 microsatellite markers (more than one marker per cM). These markers were genotyped in 149 independent multiplex families (including the 120 families studied in our initial genome-screen)  (Figure 1B) consisting of 1,065 individuals including 458 affected with SpA (Figure 1A, Table 1).
(A) shows the different stages of the study displayed according to their chronological order from left to right. Frames in blue describe the familial and case/control samples used for each different step. Frames in orange describe the sets of markers, the limits of the intervals covered by these markers on chromosome 9 (in distance from the p-telomere), and the techniques used for single-nucleotide polymorphisms initial genotyping. Frames in green describe the type of genetic analysis performed, the statistic, and the program used in each case. (B) shows compositions of family and case/control samples used in each step of the study. The three first columns show for each step of the study which familial, and case and/or control sample respectively, was used. The far right column displays, for each genotyped sample, the step of the study in which it was involved. The overlay parts of the samples are matched. Ex: Of the 136 families included in the LD mapping association study, 36 were also used in the fine mapping linkage study and in the initial whole genome and extension screen. NB: Both case/control samples were composed of independent individuals not tested elsewhere. Abbreviations: LD: linkage disequilibrium, SpA: spondyloarthritis, SNP: single-nucleotide polymorphism.
Non parametric multipoint linkage analysis allowed us to identify two prominent linkage peaks yielding a significant Zlr value>2.91 (nominal P<1.79×10−3) corresponding to a P<0.05 after correction for multiple testing (Table 2, Figure 2A). The highest peak of linkage was found for the marker D9S1824 at 120.1 cM from the p-telomere (Zlr = 3.20; nominal P = 6.94×10−4). At this stage of the study it was not possible to discriminate between these two peaks, thus we decided to pursue our investigations in the 13.1 Mb region surrounding them between D9S279 and D9S112 (Figure 2A).
(A) shows results of both fine mapping linkage step (right y axis) and LD mapping step (left y axis). Linkage results are shown in red, with the dashed straight line symbolizing the 0.05 Bonferroni corrected significance threshold. Family-based association results are displayed in blue, with each diamond corresponding to a studied tag SNP. The dotted straight line shows the 0.075 Bonferroni corrected significance threshold. In green are represented the locations of the TNFSF15 gene, associated with Crohn's disease and of the TRAF1-C5 locus associated with rheumatoid arthritis. (B) shows a zoomed view of the 135 kb region of interest delineated by frames in (A). Each diamond, corresponding to a studied tag SNP, appears coded by color and size, according to its linkage disequilibrium correlation coefficient (r2) with the most significantly associated tag SNP (rs4979459), as displayed in the legend. (C) shows linkage disequilibrium structure across the 135 kb region zoomed in (B), based on r2 coefficient calculated with the CEU HapMap database. The intron exon structure of the TNFSF15 gene lying within this locus is represented at the top of (C). The genotyped tag SNPs shown in (B) are indicated with red bars. Three of these tag SNPs lie within the TNFSF15 gene.
Linkage disequilibrium (LD) mapping
In the second part of our study we performed a linkage disequilibrium (LD) mapping of the 13.1 Mb region selected after the linkage fine mapping, using a family-based association test. We employed a tag SNP strategy that consisted of genotyping a set of 1,536 markers, extensively representative of genetic variability at the chromosomal region, in a sample of 136 families (Table 1). The sample was composed of 36 families with the highest linkage values in the selected 13.1 Mb region, and 100 novel families never tested before for either linkage or association (Figure 1). Among the 1,536 tag SNPs genotyped, 1,489 (96.9%) were suitable for family-based association testing. The remaining 47 tag SNPs were discarded from the analysis for genotyping failure or lack of polymorphism across the sample. Association analysis was performed using a transmission disequilibrium test (TDT) adapted for families larger than trios, and suitable for testing of association in the area of known linkage . Considering the number of tag SNPs tested, applying crude Bonferroni correction would set the nominal P-value corresponding to a global type I error of 5% at 3.4×10−5. However such threshold is overly conservative, since some of the 1,489 tag SNPs presented a weak level of LD and were therefore not totally independent. To date, there is no consensus on the best method to take into account the non independence between SNPs. A method such as that proposed by Nyholt , would set the nominal 5% threshold at P = 5.56×10−5. Alternatively, accepting a corrected global type I error of 7.5% with Bonferroni would set the nominal threshold at P = 5×10−5. Using such criteria, one single intergenic tag SNP, rs4979459, was found to be significantly associated, with SpA (P = 4.9×10−5, Figure 2A, Table S1). Suggestive association was also found for several additional markers in LD with rs4979459 (Figure 2B, Table S1). The evidence of association for these SNPs was also supported by the comparison of observed and expected distributions of association P-values (Figure 3A). The distribution of observed P-values was very suggestively skewed from the null distribution with 23 SNPs having P<0.01, versus 14 expected under the null hypothesis. All SNPs demonstrating significant or suggestive association were located within an 80 kb LD block downstream from the TNFSF15 gene (Figure 2A and 2C). There was no other region in the SPA2 locus presenting significant or even suggestive association with SpA. Notably the TRAF1-C5 locus previously identified as associated with rheumatoid arthritis did not show any association with SpA in our investigation (Figure 2A).
(A) shows the LD mapping Q-Q plot. The blue squares represent the observed P-values. The grey line represents the null hypothesis of no true association. (B) shows the extension study mapping Q-Q plot. The green circles and the red triangles represent the observed P-values for the family-based and case/control studies respectively. The grey line represents the null hypothesis of no true association.
Extension family-based and case/control association study
To refine the association signal identified by the LD mapping stage we genotyped the tag SNP rs4979459 and an additional panel of 30 surrounding SNPs in an extended sample of 287 families comprising 668 SpA patients (including the 149 families of the linkage fine mapping, the 100 families added for the LD mapping, and 38 additional families (Table 1, Figure 1B)), as well as in an independent set of 139 cases and 163 controls (Figure 1A and 1B). Association was investigated by the TDT described above for the family sample and by an allelic chi-square test for the case/control set. In keeping with the LD mapping stage, we used the Bonferroni correction for multiple testing. The nominal P-values to achieve global type I errors of 5% and 7.5% significance were 1.61×10−3 and 2.42×10−3 respectively. Of note, the Nyholt correction set the nominal 5% threshold at P = 3.93×10−3.
In the family-based association study, the 5% Bonferroni corrected significance threshold was reached for 3 SNPs: rs10817669, rs10739427, and rs10759734, and the 7.5% significance threshold for 2 additional SNPs: rs10733612, and rs7849556 (Table 3, Table S2). Several other markers including the tag SNP rs4979459 yielded low P-values (Table S2). For all these SNPs the major allele was overtransmitted to affected children.
In the case/control study two markers, in strong LD with each other, reached a Bonferroni corrected 0.005 significance threshold: rs6478105 and rs10982396 (Table 3, Figure 4). Odds ratios (ORs)<1 were observed for both of them, indicating that the major allele was more frequent in cases than in controls. Other markers yielded non-significant low P-values (Table S3).
LD calculations were based on pairwise r2 values of our entire case/control dataset. These values are displayed, in percentage, in the grey boxes. The red triangle indicates the LD block, computed by the program HAPLOVIEW, which was further used for haplotypic association with the program PLINK.
The evidence of association for these SNPs was also supported by the comparison of the observed and expected distributions of P-values for association (Figure 3B). The distribution of observed P-values was very suggestively skewed from the null distribution with 7 SNPs having P<0.01 in the family-based study and 2 in the case/control study, versus 0 expected under the null hypothesis.
All the markers significantly associated with SpA in all of our association studies belong to the same LD block (Figure 4).
The 5 SNPs located within the TNFSF15 gene, which were previously found to be associated with CD ,, did not reach a level of significant association in both the family-based design and the case/control study (Table S2 and Table S3).
Replication case/control association study
To replicate our results in an independent sample, we genotyped a set of eight SNPs consisting of the most strongly associated markers in the foregoing extension study (rs7849556, rs10817669, rs10759734, rs6478105, rs1982396, and rs10733612), the tag SNP (rs4979459) and one SNP located in the neighboring TNFSF15 gene (rs4246905) in an additional independent set of 232 cases and 149 controls (Figure 1, Table 3). Association was assessed using a chi-square test. After correction for multiple testing, one SNP, rs6478105, reached a suggestive association level (nominal P = 0.029), the Bonferroni corrected for multiple testing thresholds being 0.0063 (5%) and 0.0094 (7.5%).
Of note, OR<1 was observed for all eight SNPs. This trend was similar to that observed in the whole extension step of the study, for both family-based and case/control approaches (see above), suggesting that even if the P-value for significance was not reached here, the direction of association was consistent between both studies for all markers.
Pooled case/control analyses
The entire case/control sample, comprising the “extension” and the “replication” part (371 cases and 312 controls) was actually an independent set, as compared to the family sample (Figure 1). Thus, it made sense to perform an association analysis on this “pooled” sample. Results of this analysis are displayed in the Table 3. Significant association level was reached for two of the eight tested SNPs: rs6478105 (P = 3×10−5; OR = 0.5) and rs10982396 (P = 2×10−4; OR = 0.53). Furthermore, three other SNPs presented suggestive association with low P-values (rs7849556, rs10759734, and rs10733612). Notably, HLA-B27 conditioned exploratory analyses showed exactly the same trend of allelic distribution between cases and controls, suggesting that the detected association signal was independent of the presence of HLA-B27 (Table S4 and Table S5).
We also performed combined analysis of the genotyping issued from both the family extension sample and either the pooled case/control set or the extension case/control set for the same eight SNPs. The tests were performed using the Cochran-Mantel-Haenszel method  (Table 3 and Table S6, respectively). These investigations led us to confirm the strong association with SpA of the whole LD block containing these SNPs (Figure 4).
Of note the non-synonymous SNP, rs4246905, which changes a histidine into arginine, located in the fourth exon of TNFSF15 gene and previously identified as strongly associated with CD , reached a suggestive P-value of less than 0.01 in both combined analyses. Nonetheless, this was by far the weakest association of the eight tested markers.
Family-based and case/control haplotypic analyses
The six SNPs presenting the lowest P-values after the combined analysis were located in the same strong LD block (Figure 4). It is known that in the presence of multiple tightly linked markers a haplotype test may be more powerful to detect association.
Association results for the haplotypes comprising these six SNPs are displayed in the Table 4, together with their estimated frequencies.
The family-based haplotype association analysis identified the most frequent allele H1 as overtransmitted to affected children with a high significance level (Table 4; P = 8.81×10−6), the other three alleles being undertransmitted. The significance threshold corrected for the multiplicity of tested haplotypes and extrapolated with Bonferroni method was set to 0.013.
The case/control haplotype association analysis conversely showed that the rare haplotype H2 was very significantly more frequent in controls than in cases (Table 4; P = 8.75×10−5), with all the other haplotypes being more frequent in cases than in controls. Case/control HLA-B27 conditioned analyses showed that the frequency of H2 haplotype was significantly decreased in patients, independently of the presence of HLA-B27 (Table S4 and Table S5).
A significant omnibus haplotype association was detected in family-based sample (P = 4.5×10−5 at 4 degrees-of-freedom (df)), as well as in the case/control one (P = 7.0×10−4 at 3 df).
We report for the first time an association between several SNPs determining a haplotype near the TNFSF15 gene (9q32) and SpA. This was achieved by a comprehensive study with several linkage and association steps.
The basis of our investigations aimed to narrow down from our previous whole genome linkage screen the susceptibility region for SpA on chromosome 9q31–34 called SPA2 . Since initial linkage in this locus was spread over a long distance of 23.95 cM, our first attempt was to refine it using a fine-grained set of 28 microsatellite markers in an extended set of families. The results revealed two areas of statistically significant linkage, with the highest linkage peak located on the marker D9S1824 at 120.1 cM (115.9 Mb) from the p-telomere, only 1.3 cM apart from D9S1776, which corresponded to the linkage peak in our former screen. The second statistically significant area was located near a suggestive linkage peak reported in AS (D9S1682, P = 6×10−4) , supporting the validity of linkage between this region and SpA. Since non parametric linkage analyses usually do not characterize linkage localization with high precision , we decided to continue our investigations on the 13.1 Mb region between markers D9S279 and D9S112 comprising both significantly linked areas.
To identify whether one or several loci of the refined SPA2 region were associated with SpA we performed a family-based dense LD mapping of this 13.1 Mb area, using a tag SNP strategy. Our sample set was enriched in families with strong evidence of linkage, in order to minimize the risk of false negative results. After correction for multiple testing, significant association was observed for one single tag SNP, rs4979459 (P = 4×10−5).
Several arguments support the assumption that this is a true positive finding. Firstly, suggestive association was also found for several additional markers in LD with rs4979459, indicating that the identified significant association was unlikely to be explained by systematic genotyping error. Secondly, the use of a family-based design rules out the possible confounding effect of population stratification. Finally, rs4979459 was located in the direct vicinity of our highest linkage peak, between the microsatellite markers D9S279 and D9S1855.
None of the genes having a counterpart in the MHC, i.e. those that are theoretically good candidates for disease susceptibility, were shown to be associated with the disease in this study. Notably, the TRAF1-C5 locus which lies at 122.7 Mb from the p-telomere and was recently described as associated with rheumatoid arthritis ,, did not show any association with SpA in our study.
To refine the association pointed out by our LD map, we performed an extension study with 31 SNPs. Characteristically, the initial strong association observed with rs4979459, was not as strong in the extension stage, suggesting that rs4979459 is not the causal variant. Its effect was probably overestimated in the LD mapping step, since initial positive reports tend to overestimate found effects, while subsequent studies regress to the true value . Moreover, SpA is a complex disease with known genetic heterogeneity, and therefore enriching the LD mapping family set with strongly linked families is also likely to have contributed to this overestimation.
After extension and replication steps of our study, strong association was seen in the family sample as well as in the entire case/control set with SNPs located in the strong 40.3 kb LD block showed in the Figure 4. The markers that were found to be associated by these two approaches were not exactly the same. Several reasons could readily explain such an apparent discrepancy. First of all, the alleles of rs6478105 and rs10982396, which were found to be associated with SpA in the pooled case/control sample, were highly frequent (>0.89). Therefore there were a relatively modest number of informative families for these markers in the family-based study, since at least one parent has to be heterozygous to render the family suitable for association analysis. On the other hand, the power of our case/control study was much lower to demonstrate association with rs10817669 (the SNP significantly associated in the family-based study (OR of 0.82)), than with rs6478105 and rs10982396 (OR of 0.50, and 0.53, respectively).
Nevertheless, even if the significance level was not reached in every study for every marker (probably attributable to a lack of power), we can see that in three totally independent samples trends for all SNPs were exactly the same: the frequent allele of each marker was overtransmitted to affected children and noticeably more present in cases rather than in controls. Moreover the results of combined analysis of family and case/control samples also strongly support the association with all six SNPs, composing the 40.3 kb LD block. Finally, our haplotype investigations also confirmed this trend and showed that haplotypes composed of markers of this block were very significantly associated with the disease, for both family-based and case/control investigations.
The high risk haplotype (H1) identified in our study was too frequent to explain the strong linkage signal detected in SPA2 , and was likely only a surrogate for the causal variant(s). As the LD block containing this haplotype is located 28.6 kb from the TNFSF15 gene, one of the best candidate genes in the region, in particular because of its implication in CD ,, it made sense to test this gene directly. However, our LD mapping stage data did not implicate the TNFSF15 “strictly genic” region (introns and exons). Indeed, no tag SNP association was identified in this region and none of the five SNPs described as associated with CD were associated with SpA in our analyses. A full re-sequencing of all exons, 1st and 3rd introns, and additional intronic boundaries of the TNFSF15 gene in several patients and controls did not reveal unmatched variations either (data not shown).
We were also well aware of the limits presented by our extension/replication approach, which was focused only on the “LD mapping” of an association peak. Thus, we tested by a classic candidate-gene approach several genes located within the region surrounding the linkage peaks. We first performed variants screening in a group of independent SpA patients from families presenting a high linkage signal within the studied region, and unrelated controls. The most suggestive polymorphisms were subsequently genotyped. The association with the disease was therefore either assessed with TDT in a sample of independent trios or with a chi-square test in a large case/control sample. In this way, we tested the implication of Tenascine-Cytoactine gene (TNC), coding for an extracellular matrix glycoprotein and presenting a paralogous counterpart in the MHC class III locus. No association was found between polymorphisms in this gene and SpA . We also performed a more systematic candidate-gene approach, testing among others Tumor necrosis factor ligand superfamily member 8 - CD30 ligand (TNFSF8), Zinc finger protein 618 (ZNF618), Kinesin-like protein (KIF12), Alpha-1-acid glycoprotein 1 Precursor (ORM1), Alpha-1-acid glycoprotein 2 Precursor (ORM2) and alpha-1-microglobulin/bikunin precursor (AMBP) genes. The implication in the disease of polymorphisms tested within these genes was excluded by this approach, corroborating the results of our LD mapping, presented in this article (Zinovieva E et al. manuscript in preparation).
Despite the fact that SNPs within the TNFSF15 coding region were excluded by our combined tag SNP and candidate-gene approach, it is still possible that polymorphisms identified by the H1 haplotype play a role in the regulation of this gene. TNFSF15 belongs to the TNF superfamily of genes, otherwise implicated in SpA  and is specifically implicated in gut inflammation, a frequent SpA manifestation ,. Its product, TNFSF15, also called TLA1, is implicated in the modulation of T-helper 17 lymphocytes activation , the number of which has been shown to be increased in patients with SpA . Finally a recent study reported that polymorphisms within TNFSF15 to be associated with CD are playing a role in the transcriptional regulation of the gene . This report compounds with our hypothesis, since our findings could be consistent with an indirect role of TNFSF15 in SpA. However, whether the causal variant(s) tagged by the six SNPs haplotype is(are) related to TNFSF15 function and/or regulation or to any other gene in the SPA2 region will require further investigations.
Materials and Methods
This study was approved by the institutional ethics committee of Cochin Hospital (Paris, France) and of Ambroise Paré Hospital (Boulogne-Billancourt, France), and written informed consent was obtained from each participant.
Caucasian families consisting of one or several cases of SpA and additional parents were recruited throughout France by the “Groupe Français d'Etude Génétique des Spondylarthropathies” (GFEGS). In case/control panels, independent cases were recruited through the Rheumatology clinic of Ambroise Paré Hospital (Boulogne-Billancourt), or through the national self-help patients' organization: “Association Française des Spondylarthritiques”. Independent controls were obtained from the “Centre d'Etude du Polymorphisme Humain”, or were recruited as healthy spouses of cases with either no children or no affected children.
The phenotypic description of patients from familial and case/control samples is shown in Table 5. The diagnosis of SpA was made according to the classification criteria of Amor et al  and/or the European Spondylarthropathy Study Group (ESSG) . Within the group of SpA, AS was diagnosed according to the modified New York criteria . Regarding extra-articular manifestations, the diagnosis of psoriasis required the presence of typical lesions and/or a clinical diagnosis established by a dermatologist. The diagnosis of anterior uveitis required examination by an ophthalmologist. Inflammatory bowel disease diagnosis (including CD and ulcerative colitis) was based on endoscopic and histological examination of the gut. ReA was diagnosed according to the criteria published by Willkens . Finally, uSpA was diagnosed when SpA criteria were fulfilled, without any of the foregoing diagnosis.
DNA isolation and HLA-B typing
Genomic DNA was extracted from peripheral blood using standard methods. HLA-B typing was routinely performed by a polymerase chain reaction (PCR) - based sequence-specific method . For individuals already typed as positive for HLA-B27, retyping was not routinely performed.
Linkage fine mapping.
The fine mapping markers panel consisted of 28 microsatellites with maximal level of heterozygosity, regularly spaced over the SPA2 region, between D9S1677 and D9S112 (average marker spacing 0.89 cM and mean heterozygosity of 0.76). Markers were selected from the University of California Santa Cruz (UCSC) public database. They included 3 of the 5 microsatellites previously studied in our former screen , and 25 additional markers (Table 2, Figure 1). Typing of microsatellites was performed at the French National Genotyping Center (CNG, Evry, France) on a set of 149 multiplex families (Figure 1, Table 1). Multiplex PCRs were performed using fluorescently-labeled primers (FAM, HEX and NED) (Applied Biosystems, Foster City, CA). Amplimers generated by PCRs were loaded on a MegaBACE1000 (GE Healthcare, Chalfont St. Giles, UK) and allele calling of the fragments was performed with the GeneticProfiler software (Amersham Biosciences - GE Healthcare).
A customized chip containing 1,536 tag SNPs was designed using resources provided by the HapMap project . Tag SNP selection aimed at covering almost all the common variations (r2>0.8; more than 1 tag SNP/10 kb) of the 13.23 Mb target region, which was selected following the linkage fine mapping stage. High-throughput genotyping was performed on a customized Illumina BeadChips using the GoldenGate assay at the CNG. A detailed protocol for this assay is described by Illumina on their web site. The GoldenGate reaction is based on allele specific extension and universal PCRs at 1,536 targets . After amplification the GoldenGate assay products were hybridized on a Sentrix 96Array matrix , a fiber-optic gene array ,, and washed prior to being analyzed by fluorescence. The call rate obtained was ≥0.99. Patients DNA genotypes in 136 families by the mean of this methodology were investigated (Figure 1, Table 1).
Extension association study.
After the tag SNP screen described above was achieved, 25 SNPs were further selected through the HapMap and dbSNP NCBI databases according to the following criteria:
- The selected markers had a minor allele frequency (MAF) ≥0.1 in Caucasian populations, and were located nearby the tag SNP previously identified to be associated with SpA (up to a maximum of 30 kb upstream and of 106 kb downstream). These markers were also as far as possible in strong LD (D' = 1; strong r2) with the associated tag SNP. We genotyped 5 more additional SNPs within the TNFSF15 gene (rs4246905, rs6478108, rs7030574, rs6478109, and rs7848647), being the closest gene located in the vicinity of the peak of association. Of note, these 5 SNPs were chosen because they have been previously identified as associated with CD ,. In all, 31 SNPs were genotyped in this extension part of the study, including the tag SNP previously identified as associated with SpA. Genotyping was carried out at the CNG using TaqMan (assay-by-design) according to the manufacturer's recommendations with probes and Mastermix from Applied Biosystems (Courtaboeuf, France). End point fluorescence was detected using an ABI7900HT reader (Applied Biosystems). Genotypes were assigned with the SDS 2.1 software. Investigations were carried out on a set made of 287 families (Table 1) as well as on an independent sample of 139 cases and 163 controls (Figure 1).
Finally, a replication study focused on 8 SNPs located in the reduced 71.6 kb region was performed on a new case/control sample composed of 232 SpA patients and 149 controls. Genotyping was put through a production-scale 48-plex (SNPlex) assay (Applied Biosystems, Courtaboeuf, France) ,. Automatic allele assignment was achieved with the GeneMapper software v4.0 (Applied Biosystems), with the rules-clustering algorithm.
Verification of SNPs genotyping.
In the different stages of the study, several individuals have been genotyped by two or three different technologies (Illumina, Taqman, or SNPlex) for several SNPs. The level of genotyping concordance was set to 95%. For SNPs that did not reach this threshold (rs4979459, rs10759734, rs6478105, rs10982396, and rs4246905) an alternative method of genotyping was used in order to resolve the correct genotype. In this way, 436 individuals were genotyped using melting curve analyses (LightCycler System, Roche, Meylan, France). Both PCR primers and hybridization probes were synthesized by Tib MolBiol (Berlin, Germany). All observed discrepancies were solved.
For familial studies, Mendelian inheritance inconsistencies were identified with the PEDCHECK program . PLINK program  was used to assess the deviation from Hardy-Weinberg equilibrium in unrelated subjects. Family pairwise distributions among first and second degree relative pairs were accounted for with the PEDSTATS program .
For the fine mapping linkage study, allele frequencies were estimated using MENDEL software . Evidence for linkage was assessed using Zlr statistic  based on Spairs with the exponential model and multipoint identity-by-descent computation using ALLEGRO program . P-values were computed on the basis of large-sample theory; the distribution of Zlr statistic approximates a standard normal random variable under the null hypothesis ,. Whole fine map significance was extrapolated using Bonferroni correction for 28 tests.
Family-based allelic single-locus association analyses (1,536 tag SNP in the 136 families of the LD mapping and 31 SNPs in the 287 families of the extension study (Figure 1)) were carried out using FBAT . FBAT is a flexible program appropriate for analyses of family data larger than trios, allowing association tests that are robust to population cofounds in the case where parental data are missing and/or other offspring are included in the analysis . We specified the option to calculate the variance empirically (“-e” option) in order to provide valid tests of association in the presence of linkage . The global significance threshold for each set of SNPs was assessed using Bonferroni correction.
It has been shown that in some situations (such as when r2 factor between the risk variant and a particular multi-SNP haplotype is very strong) haplotypes may provide more information for association than corresponding single-locus tests . HBAT is an elaboration of FBAT that allows family-based association tests of haplotypes, even when the phasing is ambiguous. Family-based haplotype-specific association in the presence of known linkage was assessed with the “hbat -e” option of FBAT program, for a set of pre-selected tightly linked markers. This option allows one to perform two types of haplotype tests. In the first type, each haplotype allele is tested for association against all the others using a one degree-of-freedom (df) test. Significance of these tests must be extrapolated with a multiple testing correction; here we used the Bonferroni method. The second type of haplotype tests is a global multiallelic test with several df. In this case, there is no need to correct for multiple testing.
Case/control association studies were carried out using the standard chi-square test comparing allelic frequencies between cases and controls and giving asymptotic P-values, implemented in PLINK package . Allele frequencies, ORs, and their 95% confidence intervals were also estimated using this software. When needed, the adjustment for multiple testing was performed using the Bonferroni correction.
The quantile-quantile (Q-Q) plots (Figure 3) were constructed by ranking the sets of association P-values from the largest to the smallest and plotting them against their expected values. Under the null hypothesis the expected P-value for the ith SNP is i/n, where n is the total number of tested markers.
Haplotype-specific association was assessed in case/control samples for a set of pre-selected tightly linked SNPs (the same as for the family investigation described above) with the “hap-assoc” option of PLINK. This procedure takes into account the uncertainty of haplotype phase and performs both one df chi-square haplotype-specific tests, which significance must be extrapolated with a multiple testing correction, here Bonferroni correction, and an omnibus association statistic considering all the haplotypes.
The explorative combined analysis of the whole association data from the extension and replication studies (1 family sample and 2 case/control samples; Figure 1) was performed with the “dfam” option of PLINK. This particular test implements a sib-TDT  for nuclear families, to include sibships without parents as well as unrelated individuals and assesses the association via a clustered-analysis using the Cochran-Mantel-Haenszel test . It does not take into account the presence of linkage in the region.
LD plots were constructed using HAPLOVIEW program .
The URLs for data presented herein are as follows:
* Association Française de Spondylarthritiques (AFS): http://www.spondylarthrite.org/
* Centre d'Etude du Polymorphisme Humain (CEPH): http://www.cephb.fr/
* University of California Santa Cruz (UCSC) public database: http://genome.ucsc.edu/
* French National Genotyping Center (CNG): http://www.cng.fr/
* Detailed protocol for the Illumina BeadChips GoldenGate assay: http://icom.illumina.com/General/pdf/LinkageIV/GOLDENGATE_ASSAY_FINAL.pdf
* Ensembl database: http://www.ensembl.org/index.html
* NCBI dbSNP database: http://www.ncbi.nlm.nih.gov/projects/SNP/
* HapMap database: http://www.hapmap.org/
* PEDCHECK program: http://watson.hgen.pitt.edu/register/docs/pedcheck.html
* MENDEL program: http://www.genetics.ucla.edu/software/mendel
* ALLEGRO program: http://www.decode.com/software/
* FBAT program: http://www.biostat.harvard.edu/~fbat/fbat.htm
* PEDSTATS program: http://www.sph.umich.edu/csg/abecasis/PedStats/index.html
* PLINK program: http://pngu.mgh.harvard.edu/~purcell/plink/
* HAPLOVIEW program: http://www.broad.mit.edu/mpg/haploview/
Complete results of the family-based association linkage disequilibrium mapping study (Illumina BeadChip genotyping - 136 families).
(1.74 MB DOC)
Complete results of the family-based association extension study (TaqMan genotyping - 287 families).
(0.08 MB DOC)
Complete results of case/control association extension study (TaqMan genotyping - 139 SpA cases/163 healthy controls).
(0.07 MB DOC)
Single SNP (1), and haplotype-based (2) association results in HLA-B27 positive case/control sample (261 SpA cases/27 healthy controls).
(0.05 MB DOC)
Single SNP (1), and haplotype-based (2) association results in HLA-B27 negative case/control sample (73 SpA cases/255 healthy controls).
(0.05 MB DOC)
Results of combined family-based and case/control association-extension studies for eight SPA2 single-nucleotide polymorphisms (SNPs).
(0.04 MB DOC)
We acknowledge the contribution of the investigators of GFEGS (Groupe Français d'Etude Génétique des Spondylarthropathies) and of the AFS (Association Française des Spondylarthrites) to patient recruitment. We are indebted to the patients and their relatives for their dedication to our research program.
We thank Dr. Françoise Clerget-Darpoux and Dr. Jean-Pierre Hugot for fruitful discussions, as well as Dr. Joel D. Taurog and Miss Emma Walton for the proofreading of the manuscript.
Conceived and designed the experiments: EZ CB AK FL BI RSN CMR HJG SH DZ GC MB. Performed the experiments: EZ AK FL BI NL AV SJ SH CC DB AB. Analyzed the data: EZ CB AK FL BI NL NC AV HJG SH CC AB DZ GC MB. Contributed reagents/materials/analysis tools: CB FL RSN NL NC MB. Wrote the paper: EZ CB CC GC MB.
- 1. Saraux A, Guillemin F, Guggenbuhl P, Roux CH, Fardellone P, et al. (2005) Prevalence of spondyloarthropathies in France: 2001. Ann Rheum Dis 64: 1431–1435.
- 2. Breban M (2006) Genetics of spondyloarthritis. Best Pract Res Clin Rheumatol 20: 593–599.
- 3. Breban M, Miceli-Richard C, Zinovieva E, Monnet D, Said-Nahal R (2006) The genetics of spondyloarthropathies. Joint Bone Spine 73: 355–362.
- 4. Said-Nahal R, Miceli-Richard C, Berthelot JM, Duche A, Dernis-Labous E, et al. (2000) The familial form of spondylarthropathy: a clinical study of 115 multiplex families. Groupe Francais d'Etude Genetique des Spondylarthropathies. Arthritis Rheum 43: 1356–1365.
- 5. Said-Nahal R, Miceli-Richard C, D'Agostino MA, Dernis-Labous E, Berthelot JM, et al. (2001) Phenotypic diversity is not determined by independent genetic factors in familial spondylarthropathy. Arthritis Rheum 45: 478–484.
- 6. Brewerton DA, Hart FD, Nicholls A, Caffrey M, James DC, et al. (1973) Ankylosing spondylitis and HL-A 27. Lancet 1: 904–907.
- 7. Schlosstein L, Terasaki PI, Bluestone R, Pearson CM (1973) High association of an HL-A antigen, W27, with ankylosing spondylitis. N Engl J Med 288: 704–706.
- 8. Amor B, Feldmann JL, Delbarre F, Hors J, Beaujan MM, et al. (1974) Letter: HL-A antigen W27–a genetic link between ankylosing spondylitis and Reiter's syndrome? N Engl J Med 290: 572.
- 9. Breban M, Hacquard-Bouder C, Falgarone G (2004) Animal models of HLA-B27-associated diseases. Curr Mol Med 4: 31–40.
- 10. Breban M, Said-Nahal R, Hugot JP, Miceli-Richard C (2003) Familial and genetic aspects of spondyloarthropathy. Rheum Dis Clin North Am 29: 575–594.
- 11. Dernis E, Said-Nahal R, D'Agostino MA, Aegerter P, Dougados M, et al. (2009) Recurrence of spondylarthropathy among first-degree relatives of patients, a systematic cross-sectional study. Ann Rheum Dis 68: 503–507.
- 12. Laval SH, Timms A, Edwards S, Bradbury L, Brophy S, et al. (2001) Whole-genome screening in ankylosing spondylitis: evidence of non-MHC genetic-susceptibility loci. Am J Hum Genet 68: 918–926.
- 13. Miceli-Richard C, Zouali H, Said-Nahal R, Lesage S, Merlin F, et al. (2004) Significant linkage to spondyloarthropathy on 9q31–34. Hum Mol Genet 13: 1641–1648.
- 14. Sims AM, Timms AE, Bruges-Armas J, Burgos-Vargas R, Chou CT, et al. (2008) Prospective meta-analysis of IL-1 gene complex polymorphisms confirms associations with ankylosing spondylitis. Ann Rheum Dis 67: 1305–1309.
- 15. Timms AE, Crane AM, Sims AM, Cordell HJ, Bradbury LA, et al. (2004) The interleukin 1 gene cluster contains a major susceptibility locus for ankylosing spondylitis. Am J Hum Genet 75: 587–595.
- 16. Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, et al. (2007) Association scan of 14,500 nonsynonymous SNPs in four diseases identifies autoimmunity variants. Nat Genet 39: 1329–1337.
- 17. Danchin E, Vitiello V, Vienne A, Richard O, Gouret P, et al. (2004) The major histocompatibility complex origin. Immunol Rev 198: 216–232.
- 18. Danchin EG, Pontarotti P (2004) Towards the reconstruction of the bilaterian ancestral pre-MHC region. Trends Genet 20: 587–591.
- 19. Vegvari A, Szabo Z, Szanto S, Nesterovitch AB, Mikecz K, et al. (2005) Two major interacting chromosome loci control disease susceptibility in murine model of spondyloarthropathy. J Immunol 175: 2475–2483.
- 20. Tremelling M, Berzuini C, Massey D, Bredin F, Price C, et al. (2008) Contribution of TNFSF15 gene variants to Crohn's disease susceptibility confirmed in UK population. Inflamm Bowel Dis 14: 733–737.
- 21. Yamazaki K, McGovern D, Ragoussis J, Paolucci M, Butler H, et al. (2005) Single nucleotide polymorphisms in TNFSF15 confer susceptibility to Crohn's disease. Hum Mol Genet 14: 3499–3506.
- 22. Chang M, Rowland CM, Garcia VE, Schrodi SJ, Catanese JJ, et al. (2008) A large-scale rheumatoid arthritis genetic study identifies association at chromosome 9q33.2. PLoS Genet 4: e1000107. doi:10.1371/journal.pgen.1000107.
- 23. Plenge RM, Seielstad M, Padyukov L, Lee AT, Remmers EF, et al. (2007) TRAF1-C5 as a risk locus for rheumatoid arthritis–a genomewide study. N Engl J Med 357: 1199–1209.
- 24. Lake SL, Blacker D, Laird NM (2000) Family-based tests of association in the presence of linkage. Am J Hum Genet 67: 1515–1525.
- 25. Nyholt DR (2004) A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Hum Genet 74: 765–769.
- 26. Agresti A (1990) Categorical data analysis;. In: Wiley J, editor. New York: pp. 100–102.
- 27. Roberts SB, MacLean CJ, Neale MC, Eaves LJ, Kendler KS (1999) Replication of linkage studies of complex traits: an examination of variation in location estimates. Am J Hum Genet 65: 876–884.
- 28. Lander E, Kruglyak L (1995) Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet 11: 241–247.
- 29. Thomson G, Bodmer W (1977) The genetics of HLA and disease associations. In: Christiansen FB, Frenchel T, editors. Measuring Selection in Natural Populations. Berlin: Springer Verlag. pp. 545–564.
- 30. Zinovieva E, Lebrun N, Letourneur F, Laurent FX, Said-Nahal R, et al. (2008) Lack of association between Tenascin-C gene and spondyloarthritis. Rheumatology (Oxford) 47: 1655–1658.
- 31. Zhu X, Wang Y, Sun L, Song Y, Sun F, et al. (2007) A novel gene variation of TNFalpha associated with ankylosing spondylitis: a reconfirmed study. Ann Rheum Dis 66: 1419–1422.
- 32. Takedatsu H, Michelsen KS, Wei B, Landers CJ, Thomas LS, et al. (2008) TL1A (TNFSF15) regulates the development of chronic colitis by modulating both T-helper 1 and T-helper 17 activation. Gastroenterology 135: 552–567.
- 33. Young HA, Tovey MG (2006) TL1A: a mediator of gut inflammation. Proc Natl Acad Sci U S A 103: 8303–8304.
- 34. Jandus C, Bioley G, Rivals JP, Dudler J, Speiser D, et al. (2008) Increased numbers of circulating polyfunctional Th17 memory cells in patients with seronegative spondylarthritides. Arthritis Rheum 58: 2307–2317.
- 35. Kakuta Y, Ueki N, Kinouchi Y, Negoro K, Endo K, et al. (2009) TNFSF15 transcripts from risk haplotype for Crohn's disease are overexpressed in stimulated T cells. Hum Mol Genet 18: 1089–1098.
- 36. Amor B, Dougados M, Mijiyawa M (1990) [Criteria of the classification of spondylarthropathies]. Rev Rhum Mal Osteoartic 57: 85–89.
- 37. Dougados M, van der Linden S, Juhlin R, Huitfeldt B, Amor B, et al. (1991) The European Spondylarthropathy Study Group preliminary criteria for the classification of spondylarthropathy. Arthritis Rheum 34: 1218–1227.
- 38. van der Linden S, Valkenburg HA, Cats A (1984) Evaluation of diagnostic criteria for ankylosing spondylitis. A proposal for modification of the New York criteria. Arthritis Rheum 27: 361–368.
- 39. Willkens RF, Arnett FC, Bitter T, Calin A, Fisher L, et al. (1981) Reiter's syndrome. Evaluation of preliminary criteria for definite disease. Arthritis Rheum 24: 844–849.
- 40. Yoshida M, Kimura A, Numano F, Sasazuki T (1992) Polymerase-chain-reaction-based analysis of polymorphism in the HLA-B gene. Hum Immunol 34: 257–266.
- 41. HapMap-Consortium TI (2005) A haplotype map of the human genome. Nature 437: 1299–1320.
- 42. Gunderson KL, Kruglyak S, Graige MS, Garcia F, Kermani BG, et al. (2004) Decoding randomly ordered DNA arrays. Genome Res 14: 870–877.
- 43. Oliphant A, Barker DL, Stuelpnagel JR, Chee MS (2002) BeadArray technology: enabling an accurate, cost-effective approach to high-throughput genotyping. Biotechniques Suppl56–58, 60–51.
- 44. Steemers FJ, Ferguson JA, Walt DR (2000) Screening unlabeled DNA targets with randomly ordered fiber-optic gene arrays. Nat Biotechnol 18: 91–94.
- 45. Walt DR (2000) Techview: molecular biology. Bead-based fiber-optic arrays. Science 287: 451–452.
- 46. De la Vega FM, Lazaruk KD, Rhodes MD, Wenz MH (2005) Assessment of two flexible and compatible SNP genotyping platforms: TaqMan SNP Genotyping Assays and the SNPlex Genotyping System. Mutat Res 573: 111–135.
- 47. Tobler AR, Short S, Andersen MR, Paner TM, Briggs JC, et al. (2005) The SNPlex genotyping system: a flexible and scalable platform for SNP genotyping. J Biomol Tech 16: 398–406.
- 48. O'Connell JR, Weeks DE (1998) PedCheck: a program for identification of genotype incompatibilities in linkage analysis. Am J Hum Genet 63: 259–266.
- 49. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 50. Wigginton JE, Abecasis GR (2005) PEDSTATS: descriptive statistics, graphics and quality assessment for gene mapping data. Bioinformatics 21: 3445–3447.
- 51. Lange K, Weeks D, Boehnke M (1988) Programs for Pedigree Analysis: MENDEL, FISHER, and dGENE. Genet Epidemiol 5: 471–472.
- 52. Kong A, Cox NJ (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61: 1179–1188.
- 53. Gudbjartsson DF, Jonasson K, Frigge ML, Kong A (2000) Allegro, a new computer program for multipoint linkage analysis. Nat Genet 25: 12–13.
- 54. Laird NM, Horvath S, Xu X (2000) Implementing a unified approach to family-based tests of association. Genet Epidemiol 19: Suppl 1S36–42.
- 55. Laird NM, Lange C (2006) Family-based designs in the age of large-scale gene-association studies. Nat Rev Genet 7: 385–394.
- 56. Clayton D, Chapman J, Cooper J (2004) Use of unphased multilocus genotype data in indirect association studies. Genet Epidemiol 27: 415–428.
- 57. Spielman RS, Ewens WJ (1998) A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test. Am J Hum Genet 62: 450–458.
- 58. Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21: 263–265.