Figures
Abstract
Introduction
Osteoarthritis (OA) is the most common degenerative joint disease and one of the major causes of disability worldwide. It is a multifactorial disorder with a significant genetic component. The heritability of OA has been estimated to be 60% for hip OA and 39% for knee OA. Genetic factors behind OA are still largely unknown. Studying families with strong history of OA, facilitates examining the co-segregation of genetic variation and OA. The aim of this study was to identify new, rare genetic factors and novel candidate genes for OA.
Methods
Eight patients from three Finnish families with hip and knee OA were studied using whole exome sequencing. We focused on rare exonic variants with predicted pathogenicity and variants located in active promoter or strong enhancer regions. Expression of identified candidate genes were studied in bone and cartilage tissues and the observed variants were investigated using bioinformatic analyses.
Results
Two rare variants co-segregated with OA in two families. In Family 8 a missense variant (c.628C>G, p.Arg210Gly) was observed in the OLIG3 gene that encodes a transcription factor known to be associated with rheumatoid arthritis and inflammatory polyarthritis. The Arg210Gly variant was estimated to be pathogenic by Polyphen-2 and Mutation taster and the locus is conserved among mammals. In Family 12 the observed variant (c.-127G>T) was located in the transcription start site of the FIP1L1 gene. FIP1L1 participates in the regulation of polyadenylation. The c.-127G>T is located in the transcription start site and may alter the DNA-binding of transcription factors. Both, OLIG3 and FIP1L1 were observed in human bone and cartilage.
Citation: Skarp S, Kämäräinen O-P, Wei G-H, Jakkula E, Kiviranta I, Kröger H, et al. (2018) Whole exome sequencing in Finnish families identifies new candidate genes for osteoarthritis. PLoS ONE 13(8): e0203313. https://doi.org/10.1371/journal.pone.0203313
Editor: Obul Reddy Bandapalli, German Cancer Research Center (DKFZ), GERMANY
Received: June 11, 2018; Accepted: August 18, 2018; Published: August 29, 2018
Copyright: © 2018 Skarp et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This study was funded by the University of Oulu (university researcher recruitment funding, MM); The Connective Tissue Gene Tests (Allentown) provided support in the form of salaries for author [LAK], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific role of authors are articulated in the ‘author contributions’ section.
Competing interests: Leena Ala-Kokko is employed by The Connective Tissue Gene Tests. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Background
Osteoarthritis (OA) is the most common degenerative joint disease and one of the leading causes of disability in the world [1]. The incidence of OA is increasing as the population ages and OA has been predicted to become the greatest single cause of disability by 2030 [2].
OA is characterized by progressive degradation of the joint cartilage. It includes synovial inflammation and bone remodeling causing pain, and eventually leading to loss of joint function. Most often it affects the joints in the knee, hip, or hand, but can affect any joint. OA is a complex disease that has a strong genetic background with heritability estimations of 39% and 65% for hand and knee OA, respectively [3,4] and 60% for hip OA [5].
Genetic factors behind OA have been studied in both family and case-control datasets using linkage analyses and candidate gene approaches, as well as in genome-wide association studies (GWAS) carried out in large population cohorts. Although, several genetic loci have been associated with OA the underlying genetic variants and genes remain elusive. Based on these numerous studies the genetic background of OA is likely to be affected by common (minor allele frequency, MAF > 0.05), low frequency (0.01 < MAF < 0.05) and rare variants (MAF < 0.01) [6,7]. Large GWASs have usually focused on the common genetic variation, although study of Islandic population identified variants with rare or low frequency in n cartilage oligomeric matrix protein (COMP) gene and chondroadherin like (CHADL) gene [8]. The variant in COMP was specific to Islandic population [8]. Families with multiple early onset OA cases in several generations are of particular interest as there might be a stronger genetic effect. For example, candidate gene studies of families with an early onset of OA have previously identified variants in the genes encoding for collagens II and XI [9,10]. Studies of families with OA have identified rare or low frequency variants in loci 1p31 and 2q33.3 [11,12]. In general, selective pressure keeps pathogenic variants from reaching a high frequency in human populations and thus rare coding variants are typically more harmful and population specific compared to common variants [13,14].
The Finnish population is particularly advantageous for studying familial cases of common diseases due to its population history [15]. It has undergone multiple historic bottlenecks followed by population expansion [16]. Due to the resulting founder effect, functional and low-frequency variants have become enriched in the Finnish population [17,18].
The objective of this study was to identify rare variants co-segregating with OA in Finnish families.
Methods
Families
Three families diagnosed with an early onset hip and knee OA were studied using whole exome sequencing. These families were part of a previously described data set of 15 families [19] and were recruited during the years 1998–2000 and 2003–2006. For the whole exome sequencing we chose families with at least two affected family members in more than one generation (Families 8, 11 and 12, Fig 1). Exome sequencing was performed for three patients from Families 11 and 12, and two from Family 8. They had been diagnosed with OA of the hip under the age of 55, and seven of them had undergone total joint replacement surgery (TJR) due to OA. Characteristics of the families are presented in Table 1. None of the individuals had any other inflammatory joint disease such as rheumatoid arthritis (RA). There was also no evidence of a skeletal or chondrodysplasia.
The patients included in whole exome sequencing are marked with an asterisk and individuals included in genotyping are marked with a plus symbol. Deceased are marked with diagonal line through symbol. In Family 8 the carriers of the OLIG3 variant c.628C>G, p.Arg210Gly are marked with c.628C>G. In Family 12 the observed variant c.-127G>T is located on the FIP1L1 gene 5’UTR. Abbreviations: Hip OA = Osteoarthritis of the hip, Knee OA = Osteoarthritis of the knee, TJR = Total joint replacement surgery, Symptoms = Individual has no OA diagnosis, but experiences pain and/or joint stiffness.
In addition to the samples subjected to exome sequencing, DNA (deoxyribonucleic acid) samples were available from seven other family members from Family 8 (3 unaffected, 4 affected), two family members from Family 11 (both affected) and three family members from Family 12 (2 affected, 1 unaffected).
This study was approved by the Ethics Committee of the Northern Ostrobothnia Hospital District and family members as well as the donor of the osteochondral specimen gave their informed written consent.
Whole exome sequencing
Genomic DNA was extracted from 3 ml of whole blood treated with ethylenediaminetetraacetic acid (EDTA) using standard protocols. The exome sequencing data was acquired through a commercial service (BGI, Hong Kong), who provided the exome capture, alignment and variant calling. The SureSelect 51M Capture Kit (Agilent Technologies) was used for the exome capture. The targets were sequenced using the IlluminaHiSeq2000 100PE platform and the reads were aligned to the GRCh37/hg19 human reference genome. The Genome Analysis ToolKit (GATK) [20] was used for variant calling and filtering. Variants were filtered out according to quality parameters/threshold values, which for SNVs are the root mean square of the mapping quality (MQ) < 40, Haplotype score > 13 and genotype quality (GQ) < 20, and for indels are the quality by depth (QD) < 2.0, the Phred-scaled p-value using Fisher’s Exact Test (FisherStrand) > 200.0 and the u-based z-approximation from the Mann-Whitney Rank Sum Test for mapping qualities (MappingQualityRankSumTest) < -20.0.
Analysis of whole exome sequencing data
Variants shared by the affected family members were extracted and variants, which were found in the unaffected individuals or in the in-house exome set (N = 71) or had a MAF > 1% in the 1000Genomes database were filtered out.
Functional annotation was performed for the remaining rare variants using database for nonsynonymous SNPs' functional predictions (dbNSFP) [21] through ANNOVAR [22] (version Aug 2013) to identify variants with estimated pathogenicity (SIFT, PolyPhen-2, MutationTaster) and variants affecting active promoters or strong enhancer regions (the Encyclopedia of DNA Elements [ENCODE] databases for chromHMM estimations for HSMM and GM12878 cells). Thresholds stated in the dbNSFP and ANNOVAR guidelines were used to determine pathogenic estimation: SIFT≤0.05, Polyphen-2 ≥ 0.957 or MutationTaster ≥ 0.85. All annotations were done according to the ANNOVAR manual. Rare variants without any predictions were not excluded at this stage, however. The Sequencing Initiative Suomi (SISu) database (N = 6118) representing the Finnish population (http://www.sisuproject.fi/), the Exome Aggregation Consortium (ExAC) (http://exac.broadinstitute.org) and the Genome Aggregation Database (gnomAD) (http://gnomad.broadinstitute.org/) were used as references.
All variants with estimated pathogenicity and those affecting active promoters or strong enhancer regions were validated using capillary sequencing (ABI3500xL Genetic Analyzer, Applied Biosystems). DNA samples from additional family members were genotyped using the same method to study the co-segregation of variants in families.
Bioinformatic analyses
RegulomeDB [23] and Enhancer Element Locator (EEL) software [24] were used to predict whether the c.-127G>T variant in the FIP1L1 (Factor Interacting With PAPOLA And CPSF1) 5’ untranslated region (5’UTR) directly affects DNA-binding motifs of certain transcription factors (TF). The conservation of the Oligodendrocyte transcription factor 3 (OLIG3) protein sequence at the p.Arg2010Gly site was visualized using Constraint-based Multiple Alignment Tool (COBALT) [25].
Osteochondral specimen
Bone and cartilage tissues are not included in the public databases such as Genotype-Tissue Expression portal (GTEx, www.gtexportal.org) and Human protein atlas (www.proteinatlas.org). To study the expression of the identified genes in bone and cartilage, tissue samples were collected from one unrelated patient who had undergone total knee replacement therapy at Oulu University Hospital in 2014. The Ethics Committee of the Northern Ostrobothnia Hospital District approved of the study protocol for tissue collection and gene analysis and the patient gave a written informed consent in accordance with the Declaration of Helsinki. Tissue samples were stored at -70°C before RNA extraction.
Total ribonucleic acid (RNA) extraction from bone and cartilage tissue
Each tissue sample was cut into approximately 5 mm thick and 2 mm wide pieces and stored in an RNAlater solution (Ambion). Four pieces of each tissue sample were used for RNA extraction reaction. The samples were pulverized on a metal surface pre-cooled in liquid nitrogen and then homogenized in 4 M guanidinium isothiocyanate. A pellet was collected following cesium chloride (5.7 M) gradient centrifugation. The pellet was dissolved in 300 μl of diethyl pyrocarbonate treated water (DEPC-H2O) before adding 20 μl of proteinase K (20 mg/ml) and 500 μl of SET-buffer (1% sodium dodecyl sulfate (SDS), 4 mM of EDTA, 10 mM of tris(hydroxymethyl)aminomethane (Tris), and incubated with constant agitation for 1.5 hours in 37°C. Total RNA was extracted using 25:24:1 phenol:chloroform:isoamylalcohol. The samples were mixed and centrifuged at 20,000 rcf at room temperature for four minutes. The upper aqueous phase was collected and RNA was precipitated using 3 M sodium acetate (NaAc) and absolute ethanol. Samples were centrifuged for 15 min at + 4°C and dried using SpeedVac (Savant) before dissolving the RNA into 50 μl of DEPC-H2O.
Messenger RNA (mRNA) expression in bone and cartilage
The quality of the RNA was determined with the QIAxcel Advanced system and RNA screening kit (Qiagen). Some level of degradation was observed. Complementary DNA (cDNA) was synthesized from the extracted total RNA using the iScript cDNA Synthesis Kit (Bio-Rad). A 327 base pair (bp) region of the FIP1L1 cDNA and a 343 bp region of the OLIG3 cDNA were amplified using polymerase chain reaction (PCR) and an AmpliTaq gold DNA polymerase enzyme (Applied Biosystems). These PCR products were used as a template in further amplification, where the product sizes were 109 bp for FIP1L1 and 343 bp for OLIG3. The presence and size of the PCR product was determined by electrophoresis using the QIAxcel Advanced system and QIAxcel ScreenGel Software (Qiagen). Primer sequences are available on request.
Results
Whole exome sequencing
Variant calling resulted in an average of 78,638 SNVs and 8,922 short insertion/deletion variants (ins/del) per sample. The average sequencing depth in the target area was 67.06X. Filtering by quality resulted in an average of 71,199 SNVs and 8,846 ins/del variants per sample. All variants that were shared by the exome sequenced family members and not found in the in-house exome set (N = 71) are listed in S1 Table. Functional annotation identified 37 variants in Family 8, twelve variants in Family 11 and eleven variants in Family 12. After validation and genotyping in additional family members only one variant in Family 8 and one variant in Family 12 were found to be co-segregating with OA (Fig 1, Table 2). None of the variants co-segregated with OA in Family 11.
The observed variant in Family 8 was a missense variant c.628C>G, p.Arg210Gly (NM_175747) in the OLIG3 gene. It was carried by all the family members with diagnosed OA or showing some symptoms of OA. This was observed as heterozygous in one individual in the ExAC and gnomAD datasets (Table 2). Variant was not available in the SISu database.
In Family 12 one variant, c.-127G>T, in the 5’UTR of FIP1L1 gene was observed in five affected family members. This variant had frequency of 0.0035 in the gnomAD database (Table 2). Variant was not available in the SISu database.
Bioinformatic analyses
The area around FIP1L1 c.-127G>T harbors high signals for acetylation lysine 27 histone 3 (H3K27ac) and several transcription factor binding sites (TFBS) (Fig 2) that are characteristic for active transcription starting sites (TSS). H3K27ac is an epigenetic marker for the activation of gene expression [23] and the binding of TFs at TSS is a hallmark of the start of transcription [24]. The prediction using RegulomeDB showed that the FIP1L1 c.-127G>T variant is likely to affect TF binding, but the program was unable to match the exact TF DNA-binding motif. The EEL analysis of the region revealed that this variant may alter the DNA-binding of several transcription factors (Fig 3), including IRF1 (IRF Family), SPI1, ELF1, ELK1, GABPA (ETS Family) and TCF3 (TCF Family) that have ChIP-seq signals in this region, and thus their chromatin binding may be affected due to the observed variant.
A) The c.-127G>T variant is located at the transcription start site of FIP1L1. The location of the variant is marked with a vertical bar. Transcription factor binding sites are shown with black and grey bars below the layered H3K27ac signals. The results were visualized with the UCSC genome browser utilizing ENCODE project data sets. B) Protein sequence alignment of OLIG3. The locus of p.Arg210Gly is highlighted. The 210th arginine is conserved across mammals. Alignment shown from 160th amino acid to 239th amino acid. The alignment is drawn using Constraint-based Multiple Alignment Tool (COBALT).
The OLIG3 p.Arg210Gly variant was considered pathogenic by Polyphen-2 and Mutation taster, but benign by SIFT. The pathogenicity estimation by SIFT is based on conservation whereas the other two (Polyphen-2 and Mutation taster) consider also biochemical information available. The conservation of OLIG3 Arg210 locus is presented in Fig 2. The multiple alignment of OLIG3 protein sequence shows that Arg210 is conserved among mammals. However variation in the locus occurs in more distant vertebrata members.
mRNA expression in bone and cartilage
The expression of FIP1L1 and OLIG3 messenger RNAs (mRNA) in human bone and cartilage tissues was observed using PCR. Products of expected sizes (109 bp and 343 bp) were seen in both tissues (Fig 4).
The size of the PCR fragments was determined by electrophoresis using the QIAxcel Advanced system and QIAxcel ScreenGel Software (Qiagen). The 5,000 bp and 15 bp size markers are visible as green fragments in all the samples. The size of the FIP1L1 qPCR fragment was 109 bp and the size of OLIG3 PCR fragment was 343 bp. Abbreviations: SM = Size Marker, B = Bone sample, C = cartilage sample, NC = negative control, bp = base pair.
Discussion
In order to identify new susceptibility genes for OA we studied a set of Finnish OA families using whole exome sequencing. We identified rare variants that co-segregated with the disease in two of the three studied families (Families 8 and 12). In Family 8 c.628C>G, p.Arg210Gly variant in OLIG3 was located in the protein coding region, while in Family 12 the identified variant (c.-127G>T) was found in the 5’UTR region of the FIP1L1 gene.
The observed p.Arg210Gly variant in OLIG3 was estimated pathogenic by PolyPhen-2 and Mutation Taster. Arginine in this location is highly conserved in mammals and when replaced by glycine, a much smaller amino acid, the change is likely to affect the protein structure. OLIG3 is a TF that has previously been identified in association with increased rates of joint destruction in RA [26] and poor response to methotrexate treatment in early inflammatory polyarthritis [27]. The genes regulated by OLIG3 are not known, but the expression of OLIG3 has been linked with Wnt/β-catenin signaling during neuronal development [28]. Wnt/β-catenin signaling pathway is active in the development of synovial joint [29] and has also been implicated in the pathogenesis of RA [30] and OA [31].
FIP1L1 is a subunit of the cleavage and polyadenylation specificity factor (CPSF) protein complex, which contributes to the mRNA polyadenylation site recognition and stimulates polyadenylation. The polyadenylation of the mRNA 3’ end is an important part of the pre-mRNA processing and affects the efficiency and tissue specificity of gene expression [32,33]. Mutations interfering with 3’ end processing are known to affect human health [34]. FIP1L1 is also associated with a fusion gene contributing to the pathogenesis of leukemia [35] and hypereosinohpilic syndrome [36].
The observed c.-127G>T variant in FIP1L1 can affect the affinity of TFs and thus may alter the expression of FIP1L1 in target tissues. Changes in the FIP1L1 expression may have downstream effects on the expression of other genes via the 3’end processing. The location of variant c.-127G>T in the 5’UTR region of FIP1L1 suggests that it may affect the binding of TFs such as IRF1, SPI1, ELF1, ELK1, GABPA and TCF3. For example, IRF1 and SPI1 have previously been linked to musculoskeletal phenotypes: The mouse homolog for IRF1 has an important role in regulating bone metabolism [37] and SPI1 influences the differentiation of precursor myeloid cells to osteoclasts [38]. In the spi1-deficient mice, differentiation of both macrophages and osteoclasts is inhibited [38].
Bone and cartilage tissues are not represented in general databases for human tissue expression such as Human Protein Atlas and Genotype-Tissue Expression portal. Therefore, we carried out a PCR analysis and observed that FIP1L1 and OLIG3 are indeed expressed in these tissues supporting their potential role in musculoskeletal diseases. Unfortunately we were not able to assess if the relative expression levels of these genes is different between affected and unaffected individuals as it was not possible to obtain patient samples from these families. These genes have not been reported to be differentially expressed in OA patients in earlier studies [39–42], however the variants identified in our study are rare and may require considerably large data sets to reach statistical significance in general population.
Regulatory mechanisms are known to play an important role in the pathogenesis of OA. The most well-established finding is the SNV rs143383 in the GDF5 5’UTR [43]. It has been shown that rs143383 affects the binding affinity of TFs leading to a repressed expression [44]. FIP1L1 and OLIG3 both participate in the regulation of transcription. Mutations in regulators of gene expression can have wide downstream effects. Even subtle regulatory changes in the gene expression can, especially over time, predispose towards pathogenic effects. Identifying the target genes and pathways in joint tissues could provide new insight into molecular background of OA and possibly help in finding new biomarkers or therapeutic targets.
The whole exome sequencing method has been shown to be capable of identifying co-segregating variants in family datasets of common diseases. In the present study the identified variants in the OLIG3 and FIP1L1 genes were specific to the respective families. This is not unexpected in the light of the Finnish population history of local isolates and enrichment of rare, functional alleles [16–18]. We have previously shown that specific variants associate with vertebral endplate signal changes (modic changes) in two Finnish families [45]. Functional evidence is needed, however, to understand the biological effect of the identified variants. Ideally, to further study their role would require tissue samples from patients, which were not available from the studied families. Alternatively, in vitro studies with general cell lines and specifically generated gene constructs could be possible. Here we utilized a bioinformatic approach to study the function and frequency of the variants. Previous studies have identified rare and low-frequency variants in association with OA [11,12] and functional evidence of the role of these genes and variants have been reported in subsequent studies [46,47].
In Family 11 we did not identify any rare, pathogenic variants co-segregating with OA. Since the whole exome sequencing data is restricted to the coding regions of the genome, it is possible or even likely that some of the predisposing variants are located elsewhere in the genome and that there are other regulatory mechanisms involved. This was actually the case in another family in the same family dataset (Family 10) where a common variant (rs11446594) in TFBS on chromosome 2q21 was identified by linkage analysis and targeted re-sequencing [19].
Conclusions
In conclusion, we have identified two variants co-segregating with OA in Finnish families leading to the finding of novel candidate genes for OA. Based on their well-defined function in the transcription OLIG3 and FIP1L1 may participate in regulatory events leading to OA.
Supporting information
S1 Table. All variants that were shared by the exome sequenced family members and not found in the in-house exome set.
https://doi.org/10.1371/journal.pone.0203313.s001
(XLSX)
Acknowledgments
We would like to thank the patients and their families for participating in this study. We greatly appreciate the help and expertise of Dr. Riku Kiviranta and technician Merja Lakkisto (University of Turku, Finland) on RNA extraction protocols. We also acknowledge the excellent technical assistance received from laboratory technicians Anu Myllymäki and Aira Erkkilä (University of Oulu, Finland).
References
- 1. Vos T, Flaxman AD, Naghavi M, Lozano R, Michaud C, Ezzati M, et al. Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 2012 Dec 15;380(9859):2163–2196. pmid:23245607
- 2. Thomas E, Peat G, Croft P. Defining and mapping the person with osteoarthritis for population studies and public health. Rheumatology (Oxford) 2014 Feb;53(2):338–345.
- 3. Neame RL, Muir K, Doherty S, Doherty M. Genetic risk of knee osteoarthritis: a sibling study. Ann Rheum Dis 2004 Sep;63(9):1022–1027. pmid:15308512
- 4. Spector TD, Cicuttini F, Baker J, Loughlin J, Hart D. Genetic influences on osteoarthritis in women: a twin study. BMJ 1996 Apr 13;312(7036):940–943. pmid:8616305
- 5. MacGregor AJ, Antoniades L, Matson M, Andrew T, Spector TD. The genetic contribution to radiographic hip osteoarthritis in women: results of a classic twin study. Arthritis Rheum 2000 Nov;43(11):2410–2416. pmid:11083262
- 6. Gonzalez A. Osteoarthritis year 2013 in review: genetics and genomics. Osteoarthritis Cartilage 2013 Oct;21(10):1443–1451. pmid:23845519
- 7. Reynard LN, Loughlin J. The genetics and functional analysis of primary osteoarthritis susceptibility. Expert Rev Mol Med 2013 Feb 18;15:e2. pmid:23414786
- 8. Styrkarsdottir U, Helgason H, Sigurdsson A, Norddahl GL, Agustsdottir AB, Reynard LN, et al. Whole-genome sequencing identifies rare genotypes in COMP and CHADL associated with high risk of hip osteoarthritis. Nat Genet 2017 May;49(5):801–805. pmid:28319091
- 9. Jakkula E, Melkoniemi M, Kiviranta I, Lohiniva J, Raina SS, Perala M, et al. The role of sequence variations within the genes encoding collagen II, IX and XI in non-syndromic, early-onset osteoarthritis. Osteoarthritis Cartilage 2005 Jun;13(6):497–507. pmid:15922184
- 10. Vikkula M, Palotie A, Ritvaniemi P, Ott J, Ala-Kokko L, Sievers U, et al. Early-onset osteoarthritis linked to the type II procollagen gene. Detailed clinical phenotype and further analyses of the gene. Arthritis Rheum 1993 Mar;36(3):401–409. pmid:8452585
- 11. Styrkarsdottir U, Thorleifsson G, Helgadottir HT, Bomer N, Metrustry S, Bierma-Zeinstra S, et al. Severe osteoarthritis of the hand associates with common variants within the ALDH1A2 gene and with rare variants at 1p31. Nat Genet 2014 May;46(5):498–502. pmid:24728293
- 12. Meulenbelt I, Min JL, van Duijn CM, Kloppenburg M, Breedveld FC, Slagboom PE. Strong linkage on 2q33.3 to familial early-onset generalized osteoarthritis and a consideration of two positional candidate genes. Eur J Hum Genet 2006 Dec;14(12):1280–1287. pmid:16912703
- 13. Kryukov GV, Pennacchio LA, Sunyaev SR. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am J Hum Genet 2007 Apr;80(4):727–739. pmid:17357078
- 14. Marth GT, Yu F, Indap AR, Garimella K, Gravel S, Leong WF, et al. The functional spectrum of low-frequency coding variation. Genome Biol 2011 Sep 14;12(9):R84-2011-12-9-r84.
- 15. Peltonen L, Palotie A, Lange K. Use of population isolates for mapping complex traits. Nat Rev Genet 2000 Dec;1(3):182–190. pmid:11252747
- 16. Nevanlinna HR. Genetic markers in Finland. Haematologia (Budap) 1980;13(1–4):65–74.
- 17. Lim ET, Wurtz P, Havulinna AS, Palta P, Tukiainen T, Rehnstrom K, et al. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet 2014 Jul 31;10(7):e1004494. pmid:25078778
- 18. Chheda H, Palta P, Pirinen M, McCarthy S, Walter K, Koskinen S, et al. Whole-genome view of the consequences of a population bottleneck using 2926 genome sequences from Finland and United Kingdom. Eur J Hum Genet 2017 Apr;25(4):477–484. pmid:28145424
- 19. Taipale M, Jakkula E, Kamarainen OP, Gao P, Skarp S, Barral S, et al. Targeted re-sequencing of linkage region on 2q21 identifies a novel functional variant for hip and knee osteoarthritis. Osteoarthritis Cartilage 2016 Apr;24(4):655–663. pmid:26603474
- 20. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010 Sep;20(9):1297–1303. pmid:20644199
- 21. Liu X, Jian X, Boerwinkle E. dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Hum Mutat 2011 Aug;32(8):894–899. pmid:21520341
- 22. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 2010 Sep;38(16):e164. pmid:20601685
- 23. Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res 2012 Sep;22(9):1790–1797. pmid:22955989
- 24. Hallikas O, Palin K, Sinjushina N, Rautiainen R, Partanen J, Ukkonen E, et al. Genome-wide prediction of mammalian enhancers based on analysis of transcription-factor binding affinity. Cell 2006 Jan 13;124(1):47–59. pmid:16413481
- 25. Papadopoulos JS, Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 2007 May 1;23(9):1073–1079. pmid:17332019
- 26. Plenge RM, Cotsapas C, Davies L, Price AL, de Bakker PI, Maller J, et al. Two independent alleles at 6q23 associated with risk of rheumatoid arthritis. Nat Genet 2007 Dec;39(12):1477–1482. pmid:17982456
- 27. Plant D, Farragher T, Flynn E, Martin P, Eyre S, Bunn D, et al. A genetic marker at the OLIG3/TNFAIP3 locus associates with methotrexate continuation in early inflammatory polyarthritis: results from the Norfolk Arthritis Register. Pharmacogenomics J 2012 Apr;12(2):128–133. pmid:20921970
- 28. Zechner D, Muller T, Wende H, Walther I, Taketo MM, Crenshaw EB,3rd, et al. Bmp and Wnt/beta-catenin signals control expression of the transcription factor Olig3 and the specification of spinal cord neurons. Dev Biol 2007 Mar 1;303(1):181–190. pmid:17150208
- 29. Guo X, Day TF, Jiang X, Garrett-Beal L, Topol L, Yang Y. Wnt/beta-catenin signaling is sufficient and necessary for synovial joint formation. Genes Dev 2004 Oct 1;18(19):2404–2417. pmid:15371327
- 30. Diarra D, Stolina M, Polzer K, Zwerina J, Ominsky MS, Dwyer D, et al. Dickkopf-1 is a master regulator of joint remodeling. Nat Med 2007 Feb;13(2):156–163. pmid:17237793
- 31. Luyten FP, Tylzanowski P, Lories RJ. Wnt signaling and osteoarthritis. Bone 2009 Apr;44(4):522–527. pmid:19136083
- 32. Lou H, Gagel RF. Alternative RNA processing—its role in regulating expression of calcitonin/calcitonin gene-related peptide. J Endocrinol 1998 Mar;156(3):401–405. pmid:9582495
- 33. Takagaki Y, Manley JL. Levels of polyadenylation factor CstF-64 control IgM heavy chain mRNA accumulation and other events associated with B cell differentiation. Mol Cell 1998 Dec;2(6):761–771. pmid:9885564
- 34. Danckwardt S, Hentze MW, Kulozik AE. 3' end mRNA processing: molecular mechanisms and implications for health and disease. EMBO J 2008 Feb 6;27(3):482–498. pmid:18256699
- 35. Iwasaki J, Kondo T, Darmanin S, Ibata M, Onozawa M, Hashimoto D, et al. FIP1L1 presence in FIP1L1-RARA or FIP1L1-PDGFRA differentially contributes to the pathogenesis of distinct types of leukemia. Ann Hematol 2014 Sep;93(9):1473–1481. pmid:24763514
- 36. Cools J, DeAngelo DJ, Gotlib J, Stover EH, Legare RD, Cortes J, et al. A tyrosine kinase created by fusion of the PDGFRA and FIP1L1 genes as a therapeutic target of imatinib in idiopathic hypereosinophilic syndrome. N Engl J Med 2003 Mar 27;348(13):1201–1214. pmid:12660384
- 37. Salem S, Gao C, Li A, Wang H, Nguyen-Yamamoto L, Goltzman D, et al. A novel role for interferon regulatory factor 1 (IRF1) in regulation of bone metabolism. J Cell Mol Med 2014 Aug;18(8):1588–1598. pmid:24954358
- 38. Tondravi MM, McKercher SR, Anderson K, Erdmann JM, Quiroz M, Maki R, et al. Osteopetrosis in mice lacking haematopoietic transcription factor PU.1. Nature 1997 Mar 6;386(6620):81–84. pmid:9052784
- 39. Karlsson C, Dehne T, Lindahl A, Brittberg M, Pruss A, Sittinger M, et al. Genome-wide expression profiling reveals new candidate genes associated with osteoarthritis. Osteoarthritis Cartilage 2010 Apr;18(4):581–592. pmid:20060954
- 40. Soul J, Dunn SL, Anand S, Serracino-Inglott F, Schwartz JM, Boot-Handford RP, et al. Stratification of knee osteoarthritis: two major patient subgroups identified by genome-wide expression analysis of articular cartilage. Ann Rheum Dis 2018 Mar;77(3):423-2017-212603. Epub 2017 Dec 22.
- 41. Xu Y, Barter MJ, Swan DC, Rankin KS, Rowan AD, Santibanez-Koref M, et al. Identification of the pathogenic pathways in osteoarthritic hip cartilage: commonality and discord between hip and knee OA. Osteoarthritis Cartilage 2012 Sep;20(9):1029–1038. pmid:22659600
- 42. Ramos YF, den Hollander W, Bovee JV, Bomer N, van der Breggen R, Lakenberg N, et al. Genes involved in the osteoarthritis process identified through genome wide expression analysis in articular cartilage; the RAAK study. PLoS One 2014 Jul 23;9(7):e103056. pmid:25054223
- 43. Miyamoto Y, Mabuchi A, Shi D, Kubo T, Takatori Y, Saito S, et al. A functional polymorphism in the 5' UTR of GDF5 is associated with susceptibility to osteoarthritis. Nat Genet 2007 Apr;39(4):529–533. pmid:17384641
- 44. Syddall CM, Reynard LN, Young DA, Loughlin J. The identification of trans-acting factors that regulate the expression of GDF5 via the osteoarthritis susceptibility SNP rs143383. PLoS Genet 2013 Jun;9(6):e1003557. pmid:23825960
- 45. Kraatari M, Skarp S, Niinimäki J, Karppinen J, Männikkö M. A Whole Exome Study Identifies Novel Candidate Genes for Vertebral Bone Marrow Signal Changes (Modic Changes). Spine (Phila Pa 1976) 2017 Aug 15;42(16):1201–1206.
- 46. Shepherd C, Zhu D, Skelton AJ, Combe J, Threadgold H, Zhu L, et al. Functional characterisation of the osteoarthritis genetic risk residing at ALDH1A2 identifies rs12915901 as a key target variant. Arthritis Rheumatol 2018 May 6.
- 47. Min JL, Meulenbelt I, Kloppenburg M, van Duijn CM, Slagboom PE. Mutation analysis of candidate genes within the 2q33.3 linkage area for familial early-onset generalised osteoarthritis. Eur J Hum Genet 2007 Jul;15(7):791–799. pmid:17406639