Figures
Abstract
Objective
Knee osteoarthritis (OA) is a heterogeneous disease with different endotypes and phenotypes, resulting in patients’ varying clinical and structural progression. Several genomic markers have been associated with knee OA presence. This study aimed to find new associations of these genetic markers with knee OA progression and to investigate the risk of knee OA progression using a polygenic risk score (PRS).
Methods
Data from knee OA patients (n = 297) from the IMI-APPROACH cohort with detailed measurements on disease progression were used. Knee OA progression definitions were based on the decrease in minimum joint space width in mm (minJSW; primary outcome), increase in pain on the Knee injury and Osteoarthritis Outcome Score (KOOS), and presence of radiographic OA (based on the Kellgren-Lawrence score) over 24 months. 30 previously reported single nucleotide polymorphisms (SNPs) associated with presence of OA irrespective of affected joints or knee OA specifically were investigated. We performed a SNP based genome-wide association analysis using the disease progression definitions. Furthermore, a PRS was created using the 30 presence SNPs to predict knee OA progression.
Results
Existing genetic markers for knee OA presence were not found to be associated with knee OA progression. The PRS of the SNPs for knee OA presence did also not show significant predictive value for knee OA progression. Unexpectedly, nineteen different variants were associated significantly (P < 5 × 10−8) with minJSW decrease. Ten SNPs are located near protein coding genes PLCL2, CDYL2, and NTNG1, and several SNPs are located in or near long non-coding RNAs (lncRNA).
Conclusions
The 30 OA risk SNPs individually and combined in a PRS are not associated with progression of knee OA in the IMI-APPROACH cohort. 19 different SNPs were associated with minJSW decrease. We demonstrated how to employ multiple bioinformatics tools to, despite a limited dataset, still prioritise potential biomarkers for associations to knee OA progression.
Citation: Bentvelzen MLM, Welsing PM, Moingeon P, Mastbergen SC, Kloppenburg M, Blanco FJ, et al. (2025) Genetic markers for knee osteoarthritis presence are not associated with disease progression - data from the IMI-APPROACH cohort. PLoS One 20(6): e0325819. https://doi.org/10.1371/journal.pone.0325819
Editor: Germain Honvo, University of Liege: Universite de Liege, BELGIUM
Received: April 15, 2024; Accepted: May 19, 2025; Published: June 24, 2025
Copyright: © 2025 Bentvelzen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data may be obtained from a third party and are not publicly available. In order to gain and govern access to the central APPROACH databases, tranSMART and XNAT, access has to be approved by the APPROACH Steering Committee. Contact information: URCI@umcutrecht.nl.
Funding: The research leading to these results has received support from the Innovative Medicines Initiative Joint Undertaking under Grant Agreement no 115770, resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contribution. See www.imi.europa.eu and http://www.approachproject.eu. This manuscript reflects the views of the authors and neither IMI nor the European Union and EFPIA are liable for any use that may be made of the information contained herein. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: I have read the journal's policy and the authors of this manuscript have the following competing interests: Outside the current manuscript, PM is employed by Servier and declares no competing interests in relationship with this manuscript; MK reports grants from IMI-APPROACH, grants from Dutch Arthritis Association, royalties from Wolters Kluwer and Springer Verlag, consulting fees (all paid to institution) from Pfizer, Galapagos, CHDR, Novartis, [Peptinov] and UCB, and MK is a member of the EULAR council; FJB reports payment to the institution from Gedeon Richter Plc., Bristol-Myers Squibb International corporation (BMSIC), Sun Pharma Global FZE, Celgene Corporation, Janssen Cilag International N.V, Janssen Research & Development, Viela Bio, Inc., Astrazeneca AB, UCB BIOSCIENCES GMBH, UCB BIOPHARMA SPRL, AbbVie Deutschland GmbH & Co.KG, Merck KGaA, Amgen, Inc., Novartis Farmacéutica, S.A., Boehringer Ingelheim España, S.A, CSL Behring, LLC, Glaxosmithkline Research & Development Limited, Pfizer Inc, Lilly S.A., Corbus Pharmaceuticals Inc., Biohope Scientific Solutions for Human Health S.L., Centrexion Therapeutics Corp., Sanofi, MEIJI FARMA S.A., Kiniksa Pharmaceuticals, Ltd, Fundación para la Investigación Biomédica Del Hospital Clínico San Carlos, Grünenthal, Galapagos; IKH reports personal fees from Novartis, GSK and Grünenthal; FB reports consulting fees from Grünenthal, GSK, Eli Lilly, Novartis, Pfizer, Servier, honoraria for lectures from Viatris and Pfizer, personal fees from AstraZeneca, Sun Pharma, Nordic Pharma and Nordic Bioscience, and is shareholder of 4Moving Biotech and 4P Pharma. The other authors have declared no conflicts of interest. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
Knee osteoarthritis (OA) is one of the most prevalent types of OA and affects millions of people worldwide [1]. OA is a chronic joint condition with a highly variable disease course, and it is characterised by progressive joint deterioration. Despite significant efforts, no effective treatments are available that can delay or even halt cartilage degeneration [2]. The heterogeneous and unpredictable disease course makes clinical trials and timely personalised treatment approaches challenging. It is hypothesised that different endotypes exist in OA, which are related to variability in disease progression, affected joints, and symptoms [2,3]. To discover targeted drugs, it is essential to find the right pathobiological mechanisms underlying the disease. Therefore, identifying disease subtypes is a promising way forward in the OA field.
One approach to defining these endotypes is through studying genetic variations. Although the heritability of knee OA presence and progression is estimated at 40%−65% and 48%−70% [4,5,6], genetic loci identified so far account for only 10%–20% [7,8]. Over 300 genetic risk loci for OA or knee OA presence have been established, and some of these encode proteins that are active in pathways with available therapeutics. Consequently, these genetic loci may hold potential for therapeutic interventions [7,9,10]. While there is substantial evidence for an inherited genetic predisposition to OA or knee OA, the evidence for a predisposition to progressive disease within knee OA patients is unclear.
Many genetics studies on OA focus on the risk of OA in general or in specific joints rather than progression of disease in patients [7–10]. However, finding genetic variants associated with the risk of progression of OA is highly relevant for defining endotypes/phenotypic subtypes of OA. Some clinical risk factors for developing knee OA are also relevant risk factors for disease progression [11]. However, it is unknown whether genetic risk loci for presence of OA or knee OA are also relevant for progression of knee OA.
In this study, we used patient data from the IMI-APPROACH (Innovative Medicines Initiative Applied Public-Private Research enabling OsteoArthritis Clinical Headway) cohort. Patients with a clinical diagnosis of knee OA were prospectively followed for 2 years. A wide array of data was acquired, including genotyping data, with the aim to discover and predict progressive knee OA endotypes/phenotypes and to support the identification of suitable patients for future clinical trials of disease-modifying osteoarthritis drugs (DMOADs) or treatments (DMOATs) [12]. To gain insights into the heterogeneous nature of OA, the APPROACH project specifically focused on progression definitions regarding widely used clinical outcomes (based on the Knee injury and Osteoarthritis Outcome Score (KOOS) questionnaire [13]) and structural outcomes (based on radiographic minimum joint space width; minJSW [14]).
Our aim was to apply these progression outcomes to find novel associations of previously established genetic markers for presence of OA irrespective of affected joints or knee OA specifically with progression of knee OA. We calculated a polygenic risk score (PRS) using these previously established SNPs for OA risk to predict progression of knee OA. Finally, we examined novel significant SNPs found during the SNP association analysis. We created an additional PRS using these SNPs and explored their functional implications. Finding genetic loci, either individually or combined, that are associated with disease progression could provide insights into the pathways or molecular mechanisms involved in knee OA progression and enhance our understanding of knee OA pathogenesis.
Methods
Study design and participants
Data from the IMI-APPROACH observational cohort study (registered under ClinicalTrials.gov no.: NCT03883568, recruitment period: 2015–2019) was used [12]. The IMI-APPROACH study prospectively followed 297 patients with clinical and/or structural knee OA. All participants provided written informed consent. Demographic, clinical, and imaging data were collected at baseline, 12 months, and 24 months and patient characteristics are presented in Table 1. At the screening visit, an index knee was selected in every patient based on the clinical American College of Rheumatism classification criteria, using history and physical examination [15]. If both knees met the ACR criteria, the most affected knee was selected as indicated by the patient. Posterior-anterior weight-bearing semiflexed knee radiographs of the index knee were obtained to determine the minJSW using Knee Images Digital Analysis software and to grade the Kellgren Lawrence (KL) score [14,16,17]. The radiographs were acquired according to the protocol described by Buckland-Wright et al. at baseline and 24 months [18]. Pain was evaluated using the KOOS questionnaire (on a 0–100 scale, with a higher score indicating worse outcomes) [13]. More study details and baseline characteristics have been described previously [12,19].
Outcome variables
The outcome of interest is knee OA progression. Change in minJSW is a preferred measure for defining structural progression in OA [14,20]. Therefore, we defined the reduction in minJSW in mm over 2 years as the primary outcome for knee OA progression in this study. Secondary progression outcomes included the absolute change in pain score and binary outcomes concerning the presence of increasing pain (KOOS increase ≥ 5 or ≥ 10 per year with a minimum score of ≥ 35 or ≥ 40 at 2 years respectively, binary), stable high pain (KOOS ≥ 40 at baseline as well as at 24 months), radiographic OA (KL score ≥ 2 at 2 years), and of minJSW decrease of ≥ 0.3 mm/year. These knee OA progression outcomes are in line with the IMI-APPROACH outcome definitions [3].
Genetic analysis
Genotype quality control.
Genomic DNA was collected from baseline peripheral blood samples. Genotyping of 2.380.364 single-nucleotide polymorphisms (SNPs) was performed by Newcastle University using the Illumina Omni2.5Exome-8 array v1.5 kit. Quality thresholds can be summarised as follows: call rate per patient ≥ 95%, call rate per SNP ≥ 95%, minor allele frequency (MAF) ≥ 5% and Hardy-Weinberg equilibrium P-value cut-off level ≤ 1E-6. Patients with a heterozygosity rate of ≥±3 standard deviations (s.d.) from the mean heterozygosity rate were removed (n = 6). A set of 54.848 autosomal linkage disequilibrium (LD)-pruned SNPs with MAF > 40% were used to estimate identity by descent (IBD) using PLINK [21]. Patient relationships more closely related than second cousins (proportion of IBD sharing > 0.03125) were identified, and one from each pair was removed (n = 1). The same set of LD-pruned SNPs was used in the assessment of the patients’ ancestry. Principal component analysis of these SNPs against three sample populations from the 1000 Genome project (60 samples with European ancestry (CEU), 60 samples with African ancestry (Yoruba, YRI), 90 samples with East Asian ancestry (Japanese/Chinese, JPT)) were used to exclude samples of non-European ancestry (n = 13, see S1 Fig) [22]. Quality control was performed using PLINK v1.9 [21] and R version 3.5.1 (2018-07-02) [23].
Validation of genetic markers.
Previous Genome-wide association studies (GWAS) in large cohorts discovered over 100 OA risk SNPs [7,9]. We selected all SNPs from these GWAS studies that were significantly associated (P < 5 × 10−8) with the presence of OA irrespective of the joint affected or knee OA and that were also present in the genotyping dataset of the IMI-APPROACH study (from now on referred to as ‘OA presence SNPs’). This set of 30 OA presence SNPs was investigated in our cohort for an association with knee OA progression (see Table 2).
We performed a genome-wide association analysis using the OA progression outcomes. We included the 30 OA presence SNPs, as well as all other SNPs measured in the study, in the analysis (to include other SNPs that may be in linkage disequilibrium with or in the same haplotype block as one of the 30 OA presence SNPs). The analysis was performed using linear regression in PLINK 1.9 [21] while adjusting for sex, age, time since OA diagnosis, and baseline minJSW or KOOS pain, depending on the outcome. Binary progression outcomes were analysed with logistic regression using a similar approach. Associations were considered significant at genome-wide level P < 5 × 10−8. The ’qqman’ R package was used for generating Manhattan and quantile-quantile (QQ) plots [35].
Functional exploration.
We further explored the SNPs that were significantly associated with OA progression in our analyses using various bioinformatics data sources and open-source software to evaluate their potential (functional) effects. LocusZoom was used to visualise the regions around the significant SNPs that are in linkage-disequilibrium [36]. The effect of SNPs in or near genes on the gene expression levels was evaluated using the baseline RNA-sequencing data from IMI-APPROACH as well (see S1 File) [19]. Furthermore, the significant SNPs were cross-referenced (using their rsID) in the GTEx Portal (accession number phs000424.v8.p2 on 03/04/2023) to find expression quantitative trait loci (eQTL) that can show previously established associations of the SNPs with protein abundance (up- or down-regulation). Gene expression of genes near significant SNPs was related to the outcome using linear regression. Lastly, a protein-protein interaction (PPI) network was built using the STRING-db (Search Tool for the Retrieval of Interacting Genes, v11.5, http://string-db.org) database to identify networks of proteins coded by genes near the significant SNPs and near previously established SNPs related to knee OA (Table 2) [7,9].
Polygenic risk score.
A weighted polygenic risk score (PRS) combines the effect of multiple SNPs with a small effect across the genome by multiplying the dosage of the risk alleles (0, 1 or 2 risk alleles) by the effect size of the SNP (log(odds ratio)) and summing across all SNPs. We calculated a PRS combining the 30 OA presence SNPs (Table 2) to determine the association of this PRS with the progression of knee OA [7,9]. The PRS was calculated in PLINK v1.9 [21] using the effect size of the SNPs (as previously reported, see Table 2) multiplied by their presence and averaged over the number of non-missing SNPs in individual patients. Linear or logistic regression was then used to determine the association of the PRS with our knee OA progression outcomes while adjusting for sex, age, time since diagnosis of knee OA (of the index knee) and baseline minJSW or KOOS pain. A P-value < 0.05 was considered statistically significant.
Results
Participants and descriptive data
The participants in IMI-APPROACH were mostly women (77,4%) and 66.5 ± 7.1 years old on average (Table 1). The majority of participants (55%) had definite radiographic OA (KL score ≥ 2). The mean minJSW was 2.52 ± 1.3 mm at baseline and 2.41 ± 1.2 mm at 24 months; the mean decrease in minJSW over 2 years was 0.17 ± 0.69 mm. Four patients did not agree to genotyping. After data preprocessing, 1.263.798 SNPs and 273 samples (62 males, 211 females) were available for analysis.
Validation of genetic markers
None of the OA presence SNPs were found to be associated with any of the progression outcomes (highlighted in green in Fig 1A and S2A, S2C, S2E, S2G, and S2I Fig). However, 19 new variants showed an association with minJSW decrease and reached the significance threshold (P < 5 × 10−8), summarised in Table 3 and shown in Fig 1A and 1B. These 19 SNPs are further explored and associated with minJSW decrease in the next section.
(A) Manhattan plot of minJSW decrease at 24 months. The Manhattan plots show the -log10(P) values of all ~ 1,5 million SNPs against their position. The red line represents the significance threshold (P < 5 × 10−8). Green dots indicate SNPs that have been previously shown in GWAS to be associated with knee OA or OA in general [7,9]. (B) Q-Q plot of minJSW decrease at 24 months GWAS. The Q-Q plot shows the genome-wide –log10(P) values of the association analysis. The red line corresponds to the null hypothesis.
Functional exploration
While we did not find an association between the 30 OA presence SNPs and knee OA progression, 19 SNPs were associated with minJSW decrease. Three SNPs on chromosome 3 are in strong linkage disequilibrium: r2 > 0.8 (S3A Fig). LD describes the extent to which one SNP is inherited with or associated with another SNP within the population. These variants most likely represent one locus associated with knee OA progression. The SNPs are located near the PLCL2 gene, which is involved in cell signalling pathways [37]. On chromosome 16, six SNPs were also in strong LD and located near CDYL2 (S3B Fig). CDYL2 is involved in transcription regulation [37]. Several other significant SNPs were in or near NTNG1, LINC02619, LINC02379, and ARLNC1 and were not in LD with other significant SNPs (Table 3).
We examined the effects of the SNPs near PLCL2 and CDYL2 on the expression of these genes, but the SNPs were not significantly associated with expression levels (S4 Fig). In contrast, six SNPs near CDYL2 were found in the GTEx eQTL database to be significantly associated with lower gene expression in muscle skeletal tissues (S5 Fig). Furthermore, CDYL2 gene expression was significantly lower in patients who had a larger minJSW decrease (S6 Fig, p-value = 0.044). The expression of PLCL2 was not significantly associated with the minJSW decrease. NTNG1 gene expression data was not present in the IMI-APPROACH cohort or GTEx eQTL database.
A protein-protein interaction network did not predict any direct interactions between the proteins near the 19 significant SNPs: NTNG1, CDYL2, and PLCL2 (S7 Fig). Similarly, the protein-protein interaction network of the 3 newly identified genes and the genes near the previously identified OA presence SNPs (Table 2) did not show direct interactions (S8 Fig).
Polygenic risk score
The PRS based on the 30 OA presence SNPs had a mean ± standard deviation of 0.016 ± 0.006 (normally distributed), indicating a low average genetic predisposition for OA among individuals. The PRS was not significantly related to minJSW decrease or KOOS pain increase in linear regression analyses (for change in minJSW in mm: β = −1.403, 95% confidence intervals (CI) [−17.084, 14.278], p = 0.862, and for change in KOOS: β = −6.939, 95% CI [−371.357, 357.479], p = 0.970,). Similarly, no significant relations were found for the binary knee OA progression outcomes (S1 Table).
For the sake of completeness, we performed an exploratory analysis using the 19 SNPs newly found to be associated with the minJSW decrease to create a second PRS (−0.040 ± 0.053, mean ± sd). Based on a linear regression analysis, corrected for age, sex, disease duration, and baseline minJSW, a significant association of this PRS with the minJSW decrease was found (β = 7.61, 95% CI [6.07, 9.15], p = 6.899e-19, adjusted R2 = 0.26).
Discussion
Contrary to expectations, previous findings on SNPs associated with knee OA presence were not found to be related to the progression of knee OA. Given that the link between a phenotype and single SNPs is generally weak, we calculated a PRS, combining the effects of several OA presence SNPs. This PRS also showed no relation with the progression of knee OA. We hypothesise that different outcomes, i.e., OA presence or OA progression, and their structural or pain characteristics may have partly or even fully different genetic backgrounds.
An unexpected finding was that 19 other SNPs were significantly associated with minJSW decrease over time (Table 3). In reviewing the literature, no previous reports demonstrated an association of these SNPs with OA or knee OA. To interpret these candidate SNP findings, we further investigated them by incorporating various data sources. The GTEx eQTL database revealed that one locus near CDYL2, containing six SNPs, showed lower expression of CDYL2 in muscle-skeletal tissue. CDYL2 is involved in regulating genome stability, mitosis, and transcription repression [37,38]. Given its role in transcription repression, CDYL2 could indirectly influence muscle physiology by modulating gene expression of genes involved in muscle function or development. Moreover, skeletal muscle dysfunction has been implicated as a contributing factor to progression of OA [39]. Additionally, we observed a slightly lower expression of CDYL2 in patients with more decrease in minJSW. This further implies that CDYL2 might play a role in structural knee OA progression, although likely a small role. However, given the lack of studies linking CDYL2 to muscle function or OA progression, this hypothesis remains speculative.
We did not observe a significant association between the expression of PLCL2 and minJSW decrease. However, it was previously found that PLCL2 is downregulated in knee OA patients with damaged cartilage compared to those with undamaged cartilage [40]. PLCL2 should be further investigated to determine its role in the progression of knee OA.
Several identified SNPs are located in or near long non-coding RNAs (lncRNA). Although we were unable to investigate the implications of these SNPs, previous studies have shown regulatory effects of lncRNA on pathological processes of OA [41,42]. Further research into lncRNAs could help improve our understanding of molecular mechanisms responsible for knee OA progression.
Known risk factors for OA, such as age, obesity, and sex, are a source of heterogeneity that complicate genetic research in OA. Genetic factors might even affect other prognostic factors (effect modification), further complicating analysis [3]. Our study population included patients at different stages of OA, varying from early to more advanced OA (KL grade 4). While we adjusted for several confounding factors, additional stratification of the population could be beneficial. However, this was not considered feasible due to the relatively small sample size.
One limitation of our analysis is related to the outcome measures. Differences in minJSW are only detectable with certainty if they are > 0.7 mm [20], and the overall changes observed in the cohort were small (the mean decrease in minJSW was −0.16 mm). In addition, the KOOS pain score is a patient-reported outcome measure (PROM). The subjective nature of PROMS, as well as the variability between the patients’ perceptions of their symptoms, may have obscured any associations in the analysis.
Most genetic studies on OA focus on the presence of OA rather than looking at subtypes or progression of the disease. Because OA manifests in diverse clinical presentations, often a larger and more diverse study population is needed to find relevant genetic associations, and interpretation of results remains challenging. Future larger studies that focus on more specific OA traits could provide a better understanding of the genetic background underlying OA disease courses.
Although we did not find any associations between genetic factors related to the presence of OA and the progression of knee OA, new associations of SNPs with minJSW decrease were found. To verify that a PRS based on these SNPs would also be significant, an additional PRS was created using these SNPs and indeed showed a significant association with minJSW decrease. These results were calculated in our dataset and not validated externally, so they should be interpreted with caution. We analysed our primary progression outcome on the natural continuous scale (change in minJSW), rather than using a cut-off for presence of progression, to optimally and efficiently analyse the data. One could argue that identifying statistically significant SNPs associated with progression using a strict p-value (P < 5 × 10−8), even with this small sample size, may suggest that there are other SNPs related to progression. Carefully defining OA traits for genetic research may help increase the number of discoveries, statistical power, and clinical interpretability, and ultimately improve future studies and advance our understanding of knee OA. The analysis steps of our study are provided in a GitHub repository (https://github.com/MiekeBvz/IMI-APPROACH_Genomics.git using open-source software) and might be useful for research of genetic associations and further explorations and interpretation of the downstream (functional) effects of genetic markers.
In summary, we found no associations between SNPs related to OA presence and knee OA progression. However, 19 SNPs were newly associated with minJSW decrease and should be further validated to determine their role in knee OA. Given the rising incidence of OA and the lack of effective treatment options, continued efforts are needed to discover the basis of OA and OA progression, enabling patient stratification into more homogeneous subgroups relevant for treatment.
Supporting information
S1 Fig. Principal component analysis of the IMI-APPROACH samples against three sample populations from the 1000 Genome project.
A set of 54848 autosomal LD-pruned SNPs with MAF > 40% was used to perform the principal component analysis. Three sample populations from the 1000 Genome project were used to compare to the IMI-APPROACH samples [22]. In black = IMI-APPROACH samples, red = 60 samples with European ancestry (CEU), blue = 60 samples with African ancestry (Yoruba, YRI), and green = 90 samples with East Asian ancestry (Japanese/Chinese, JPT)).
https://doi.org/10.1371/journal.pone.0325819.s001
(DOCX)
S2 Fig. Validation of genetic markers for risk of OA for five knee OA progression outcomes.
Manhattan plots showing the -log10(P) values of all ~ 1,5 million SNPs. The x-labels 1–22 represent chromosomes 1–22, 23 is the X chromosome, 24 is the Y chromosome, 25 is the pseud-autosomal region of X, and 26 is mitochondrial chromosomes (0 are unplaced SNPs). The red line represents the genome-wide significance threshold (P < 5 × 10−8). Green dots indicate SNPs that were associated with knee OA or OA in general in previous GWAS (meta-analysis) [7,9]. The Q-Q plot shows the genome-wide –log10(P) values of the association analysis, where the red line corresponds to the null hypothesis. (A) Manhattan plot increasing pain (KOOS score increase of ≥5 or ≥ 10 per year with a minimum score of ≥ 35 or ≥ 40 at 2 years). (B) Q-Q plot increasing pain. (C) Manhattan plot stable high pain (KOOS score of ≥40 over 2 years). (D) Q-Q plot stable high pain. (E) Manhattan plot presence of decrease in minJSW (minimum decrease of ≤ 0.3 mm/year at 2 years). (F) Q-Q plot presence of decrease in minJSW. (G) Manhattan plot change in pain (change in KOOS score). (H) Q-Q plot change in pain. (I) Manhattan plot presence of radiographic OA (Kellgren Lawrence (KL) ≥ 2 at 2 years). (J) Q-Q plot radiographic OA.
https://doi.org/10.1371/journal.pone.0325819.s002
(DOCX)
S3 Fig. Regional plots of rs73146904 and rs2549732 based on the Genome Reference Consortium Human Build 37.
Association with minJSW decrease (-log10(P)) of SNPs is plotted against the genomic position. The colour represents the pairwise correlation coefficient (Linkage disequilibrium pattern) of each SNP with the most significant SNP (shown as the purple square, in Fig A. rs73146904 and Fig B. rs2549732). (A) The regional plot of rs73146904 shows that several SNPs lay within the near region of PLCL2. (B) Regional plot of rs2549732 shows that several SNPs lay within the near region of CDYL2.
https://doi.org/10.1371/journal.pone.0325819.s003
(DOCX)
S4 Fig. Comparison of expression levels of PLCL2 and CDYL2 associated with ten significant SNPs.
Comparison of gene expression levels at baseline is shown of PLCL2 and CDYL2 of the patients split into groups based on their genotype. None of the plots show significant up- or down-regulation. The number of patients within a group is noted below the boxplot. The patients with two minor alleles are underrepresented. The black lines indicate the median values. The following SNPs are located near PLCL2 and shown in the plots: (A) rs6442653 versus PLCL2. (B) rs6788010 versus PLCL2. (C) rs6777965 versus PLCL2. (D) rs73146904 versus PLCL2. (E) rs77953406 versus CDYL2. (F) rs79402702 versus CDYL2. (G) rs2549732 versus CDYL2. (H) rs2549726 versus CDYL2. (I) rs9933385 versus CDYL2. (J) rs75620223 versus CDYL2.
https://doi.org/10.1371/journal.pone.0325819.s004
(DOCX)
S5 Fig. Expression quantitative trait loci in muscle-skeletal tissue, associated with six significant SNPs located near CDYL2.
The violin plots of the expression quantitative trait loci (eQTLs) of muscle-skeletal tissue show up- and down-expression of the gene (normalised expression) based on the alleles for six SNPs located near CDYL2. The white line in the box plot (shown within the black boxes) represents the median value of the expression. All eQTLs show a significant effect of the genotype on the normalised expression of the gene (all p-values are below the gene p-value threshold of 0.00012).
https://doi.org/10.1371/journal.pone.0325819.s005
(DOCX)
S6 Fig. Scatter-plot of minJSW decrease versus expression of PLCL2 and CDYL2.
The minJSW decrease is in mm over 2 years. Each dot represents a patient. (A) Change in minJSW in mm at 2 years versus the normalised expression of PLCL2 at baseline. The x-coefficient (representing PLCL2) is not significant (P > 0.05). (B) Change in minJSW in mm at 2 years versus the normalised expression of CDYL2 at baseline. The x-coefficient (representing CDYL2) is 0.01 and is significant: P = 0.044.
https://doi.org/10.1371/journal.pone.0325819.s006
(DOCX)
S7 Fig. Protein-protein interaction network between the protein-coding genes near significant SNPs.
Protein-protein interaction network, created with STRING-db, between the protein-coding genes associated with the identified significant SNPs (in red). No known direct associations are shown.
https://doi.org/10.1371/journal.pone.0325819.s007
(DOCX)
S8 Fig. Protein-protein interaction network of proteins near known and newly identified SNPS.
The network, created with STRING-db, includes both protein-coding genes associated with the 19 newly identified significant SNPs (in red), as well as the protein-coding genes as reported along the SNPs identified to be associated with OA in previous GWAS. No direct links are found between genes related to the newly found SNPs and genes near previously established SNPs.
https://doi.org/10.1371/journal.pone.0325819.s008
(DOCX)
S1 Table. Polygenic risk score and OA progression association.
The PRS is calculated using 30 OA presence SNPs. The associations of the PRS with OA progression outcomes, as was found with linear and logistic regression, are summarised in this table. The reduction in minJSW in mm over 2 years and change in KOOS score were used in linear regression. The presence of increasing pain, stable high pain, radiographic OA, and minJSW decrease of ≥ 0.3 mm/year were evaluated using logistic regression.
https://doi.org/10.1371/journal.pone.0325819.s009
(DOCX)
References
- 1. Vos T, Flaxman AD, Naghavi M, Lozano R, Michaud C, Ezzati M, et al. Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990-2010: a systematic analysis for the Global burden of disease study 2010. Lancet. 2012;380(9859):2163–96. pmid:23245607
- 2. Oo WM, Little C, Duong V, Hunter DJ. The Development of Disease-Modifying Therapies for Osteoarthritis (DMOADs): the evidence to date. Drug Des Devel Ther. 2021;15:2921–45. pmid:34262259
- 3. Deveza LA, Nelson AE, Loeser RF. Phenotypes of osteoarthritis: current state and future implications. Clin Exp Rheumatol. 2019;37 Suppl 120(5):64–72. pmid:31621574
- 4. Zhai G, Hart DJ, Kato BS, MacGregor A, Spector TD. Genetic influence on the progression of radiographic knee osteoarthritis: a longitudinal twin study. Osteoarthritis Cartilage. 2007;15(2):222–5. pmid:17045816
- 5. Magnusson K, Turkiewicz A, Rydén M, Englund M. Genetic influence on osteoarthritis versus other rheumatic diseases. Arthritis Rheumatol. 2024;76(2):206–15. pmid:37691153
- 6. Spector TD, Cicuttini F, Baker J, Loughlin J, Hart D. Genetic influences on osteoarthritis in women: a twin study. BMJ. 1996;312(7036):940–3. pmid:8616305
- 7. Boer CG, Hatzikotoulas K, Southam L, Stefánsdóttir L, Zhang Y, Coutinho de Almeida R, et al. Deciphering osteoarthritis genetics across 826,690 individuals from 9 populations. Cell. 2021;184(18):4784-4818.e17. pmid:34450027
- 8. Sedaghati-Khayat B, Boer CG, Runhaar J, Bierma-Zeinstra SMA, Broer L, Ikram MA, et al. Risk assessment for hip and knee osteoarthritis using polygenic risk scores. Arthritis Rheumatol. 2022;74(9):1488–96. pmid:35644035
- 9. Aubourg G, Rice SJ, Bruce-Wootton P, Loughlin J. Genetics of osteoarthritis. Osteoarthritis Cartilage. 2022;30(5):636–49. pmid:33722698
- 10. Zhai G, Huang J. Genetics of osteoarthritis. Best Pract Res Clin Rheumatol. 2024;38(4):101972. pmid:38971692
- 11. Sarmanova A, Fernandes GS, Richardson H, Valdes AM, Walsh DA, Zhang W, et al. Contribution of central and peripheral risk factors to prevalence, incidence and progression of knee pain: a community-based cohort study. Osteoarthritis Cartilage. 2018;26(11):1461–73. pmid:30099115
- 12. van Helvoort EM, van Spil WE, Jansen MP, Welsing PMJ, Kloppenburg M, Loef M, et al. Cohort profile: The Applied Public-Private Research enabling OsteoArthritis Clinical Headway (IMI-APPROACH) study: a 2-year, European, cohort study to describe, validate and predict phenotypes of osteoarthritis using clinical, imaging and biochemical markers. BMJ Open. 2020;10(7):e035101. pmid:32723735
- 13. Roos EM, Roos HP, Lohmander LS, Ekdahl C, Beynnon BD. Knee Injury and Osteoarthritis Outcome Score (KOOS)--development of a self-administered outcome measure. J Orthop Sports Phys Ther. 1998;28(2):88–96. pmid:9699158
- 14. Ravaud P, Giraudeau B, Auleley GR, Chastang C, Poiraudeau S, Ayral X, et al. Radiographic assessment of knee osteoarthritis: reproducibility and sensitivity to change. J Rheumatol. 1996;23(10):1756–64. pmid:8895154
- 15. Altman R, Asch E, Bloch D, Bole G, Borenstein D, Brandt K, et al. Development of criteria for the classification and reporting of osteoarthritis. Classification of osteoarthritis of the knee. Diagnostic and therapeutic criteria committee of the American rheumatism association. Arthritis Rheum. 1986;29(8):1039–49. pmid:3741515
- 16. Kellgren JH, Lawrence JS. Radiological assessment of osteo-arthrosis. Ann Rheum Dis. 1957;16(4):494–502. pmid:13498604
- 17. Marijnissen ACA, Vincken KL, Vos PAJM, Saris DBF, Viergever MA, Bijlsma JWJ, et al. Knee Images Digital Analysis (KIDA): a novel method to quantify individual radiographic features of knee osteoarthritis in detail. Osteoarthritis Cartilage. 2008;16(2):234–43. pmid:17693099
- 18. Buckland-Wright JC, Ward RJ, Peterfy C, Mojcik CF, Leff RL. Reproducibility of the semiflexed (metatarsophalangeal) radiographic knee position and automated measurements of medial tibiofemoral joint space width in a multicenter clinical trial of knee osteoarthritis. J Rheumatol. 2004;31(8):1588–97. pmid:15290740
- 19. van Helvoort EM, Ladel C, Mastbergen S, Kloppenburg M, Blanco FJ, Haugen IK, et al. Baseline clinical characteristics of predicted structural and pain progressors in the IMI-APPROACH knee OA cohort. RMD Open. 2021;7(3):e001759. pmid:34426541
- 20. Ornetti P, Brandt K, Hellio-Le Graverand M-P, Hochberg M, Hunter DJ, Kloppenburg M, et al. OARSI-OMERACT definition of relevant radiological progression in hip/knee osteoarthritis. Osteoarthritis Cartilage. 2009;17(7):856–63. pmid:19230857
- 21. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
- 22. 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. pmid:26432245
- 23. R Core Team. R: a language and environment for statistical computing. Vienna, Austria. 2016. https://www.R-project.org/
- 24. Zengini E, Hatzikotoulas K, Tachmazidou I, Steinberg J, Hartwig FP, Southam L, et al. Genome-wide analyses using UK Biobank data provide insights into the genetic architecture of osteoarthritis. Nat Genet. 2018;50(4):549–58. pmid:29559693
- 25. Miyamoto Y, Shi D, Nakajima M, Ozaki K, Sudo A, Kotani A, et al. Common variants in DVWA on chromosome 3p24.3 are associated with susceptibility to knee osteoarthritis. Nat Genet. 2008;40(8):994–8. pmid:18622395
- 26. arcOGEN Consortium. Identification of new susceptibility loci for osteoarthritis (arcOGEN): a genome-wide association study. Lancet. 2012;380(9844):815–23. pmid:22763110
- 27. Tachmazidou I, Hatzikotoulas K, Southam L, Esparza-Gordillo J, Haberland V, Zheng J, et al. Identification of new therapeutic targets for osteoarthritis through genome-wide analyses of UK Biobank data. Nat Genet. 2019;51(2):230–6. pmid:30664745
- 28. Nakajima M, Takahashi A, Kou I, Rodriguez-Fontenla C, Gomez-Reino JJ, Furuichi T, et al. New sequence variants in HLA class II/III region associated with susceptibility to knee osteoarthritis identified by genome-wide association study. PLoS One. 2010;5(3):e9723. pmid:20305777
- 29. Casalone E, Tachmazidou I, Zengini E, Hatzikotoulas K, Hackinger S, Suveges D, et al. A novel variant in GLIS3 is associated with osteoarthritis. Ann Rheum Dis. 2018;77(4):620–3. pmid:29436472
- 30. Styrkarsdottir U, Lund SH, Thorleifsson G, Zink F, Stefansson OA, Sigurdsson JK, et al. Meta-analysis of Icelandic and UK data sets identifies missense variants in SMO, IL11, COL11A1 and 13 more new loci associated with osteoarthritis. Nat Genet. 2018;50(12):1681–7. pmid:30374069
- 31. Day-Williams AG, Southam L, Panoutsopoulou K, Rayner NW, Esko T, Estrada K, et al. A variant in MCF2L is associated with osteoarthritis. Am J Hum Genet. 2011;89(3):446–50. pmid:21871595
- 32. Hackinger S, Trajanoska K, Styrkarsdottir U, Zengini E, Steinberg J, Ritchie GRS, et al. Evaluation of shared genetic aetiology between osteoarthritis and bone mineral density identifies SMAD3 as a novel osteoarthritis risk locus. Hum Mol Genet. 2017;26(19):3850–8. pmid:28934396
- 33. Miyamoto Y, Mabuchi A, Shi D, Kubo T, Takatori Y, Saito S, et al. A functional polymorphism in the 5’ UTR of GDF5 is associated with susceptibility to osteoarthritis. Nat Genet. 2007;39(4):529–33. pmid:17384641
- 34. Valdes AM, Evangelou E, Kerkhof HJM, Tamm A, Doherty SA, Kisand K, et al. The GDF5 rs143383 polymorphism is associated with osteoarthritis of the knee with genome-wide statistical significance. Ann Rheum Dis. 2011;70(5):873–5. pmid:20870806
- 35. D. Turner S. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. J Open Sour Softw. 2018;3(25):731.
- 36. Boughton AP, Welch RP, Flickinger M, VandeHaar P, Taliun D, Abecasis GR, et al. LocusZoom.js: interactive and embeddable visualization of genetic association study results. Bioinformatics. 2021;37(18):3017–8. pmid:33734315
- 37. Sayers EW, Bolton EE, Brister JR, Canese K, Chan J, Comeau DC, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022;50(D1):D20–6. pmid:34850941
- 38. Siouda M, Dujardin AD, Dekeyzer B, Schaeffer L, Mulligan P. Chromodomain on Y-like 2 (CDYL2) implicated in mitosis and genome stability regulation via interaction with CHAMP1 and POGZ. Cell Mol Life Sci. 2023;80(2):47. pmid:36658409
- 39. Culvenor AG, Ruhdorfer A, Juhl C, Eckstein F, Øiestad BE. Knee extensor strength and risk of structural, symptomatic, and functional decline in knee osteoarthritis: a systematic review and meta-analysis. Arthritis Care Res (Hoboken). 2017;69(5):649–58. pmid:27563843
- 40. Snelling S, Rout R, Davidson R, Clark I, Carr A, Hulley PA, et al. A gene expression study of normal and damaged cartilage in anteromedial gonarthrosis, a phenotype of osteoarthritis. Osteoarthritis Cartilage. 2014;22(2):334–43. pmid:24361742
- 41. Xie F, Liu Y-L, Chen X-Y, Li Q, Zhong J, Dai B-Y, et al. Role of MicroRNA, LncRNA, and exosomes in the progression of osteoarthritis: a review of recent literature. Orthop Surg. 2020;12(3):708–16. pmid:32436304
- 42. Okuyan HM, Begen MA. LncRNAs in osteoarthritis. Clin Chim Acta. 2022;532:145–63. pmid:35667478