Since screening programs identify only a small proportion of the population as eligible for an intervention, genomic prediction of heritable risk factors could decrease the number needing to be screened by removing individuals at low genetic risk. We therefore tested whether a polygenic risk score for heel quantitative ultrasound speed of sound (SOS)—a heritable risk factor for osteoporotic fracture—can identify low-risk individuals who can safely be excluded from a fracture risk screening program.
Methods and findings
A polygenic risk score for SOS was trained and selected in 2 separate subsets of UK Biobank (comprising 341,449 and 5,335 individuals). The top-performing prediction model was termed “gSOS”, and its utility in fracture risk screening was tested in 5 validation cohorts using the National Osteoporosis Guideline Group clinical guidelines (N = 10,522 eligible participants). All individuals were genome-wide genotyped and had measured fracture risk factors. Across the 5 cohorts, the average age ranged from 57 to 75 years, and 54% of studied individuals were women. The main outcomes were the sensitivity and specificity to correctly identify individuals requiring treatment with and without genetic prescreening. The reference standard was a bone mineral density (BMD)–based Fracture Risk Assessment Tool (FRAX) score. The secondary outcomes were the proportions of the screened population requiring clinical-risk-factor-based FRAX (CRF-FRAX) screening and BMD-based FRAX (BMD-FRAX) screening. gSOS was strongly correlated with measured SOS (r2 = 23.2%, 95% CI 22.7% to 23.7%). Without genetic prescreening, guideline recommendations achieved a sensitivity and specificity for correct treatment assignment of 99.6% and 97.1%, respectively, in the validation cohorts. However, 81% of the population required CRF-FRAX tests, and 37% required BMD-FRAX tests to achieve this accuracy. Using gSOS in prescreening and limiting further assessment to those with a low gSOS resulted in small changes to the sensitivity and specificity (93.4% and 98.5%, respectively), but the proportions of individuals requiring CRF-FRAX tests and BMD-FRAX tests were reduced by 37% and 41%, respectively. Study limitations include a reliance on cohorts of predominantly European ethnicity and use of a proxy of fracture risk.
Our results suggest that the use of a polygenic risk score in fracture risk screening could decrease the number of individuals requiring screening tests, including BMD measurement, while maintaining a high sensitivity and specificity to identify individuals who should be recommended an intervention.
Why was this study done?
- Osteoporosis screening identifies only a small proportion of the screened population to be eligible for intervention.
- The prediction of heritable risk factors using polygenic risk scores could decrease the number of screened individuals by reassuring those with low genetic risk.
- We investigated whether the genetic prediction of heel quantitative ultrasound speed of sound (SOS)—a heritable risk factor for osteoporotic fracture—could be incorporated into an established screening guideline to identify individuals at low risk for osteoporosis.
What did the researchers do and find?
- Using UK Biobank, we developed a polygenic risk score (gSOS) consisting of 21,717 genetic variants that was strongly correlated with SOS (r2 = 23.2%).
- Using the National Osteoporosis Guideline Group clinical assessment guidelines in 5 validation cohorts, we estimate that reassuring individuals with a high gSOS, rather than doing further assessments, could reduce the number of clinical-risk-factor-based Fracture Risk Assessment Tool (FRAX) tests and bone-density-measurement-based FRAX tests by 37% and 41%, respectively, while maintaining a high sensitivity and specificity to identify individuals who should be recommended an intervention.
What do these findings mean?
- We show that genetic pre-screening could reduce the number of screening tests needed to identify individuals at risk of osteoporotic fractures.
- Therefore, the potential exists to improve the efficiency of osteoporosis screening programs without large losses in sensitivity or specificity to identify individuals who should receive an intervention.
- Further translational studies are needed to test the clinical applications of this polygenic risk score; however, our work shows how such scores could be tested in the clinic.
Citation: Forgetta V, Keller-Baruch J, Forest M, Durand A, Bhatnagar S, Kemp JP, et al. (2020) Development of a polygenic risk score to improve screening for fracture risk: A genetic risk prediction study. PLoS Med 17(7): e1003152. https://doi.org/10.1371/journal.pmed.1003152
Academic Editor: Christelle Nguyen, Univ. Paris Descartes, PRES Sorbonne Paris Cité, Hôpital Cochin, Assistance Publique - Hôpitaux de Paris, FRANCE
Received: December 5, 2019; Accepted: June 3, 2020; Published: July 2, 2020
Copyright: © 2020 Forgetta et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant summary-level data are within the manuscript and its Supporting Information files. All other relevant underlying individual-level data will be returned to UK Biobank in accordance with the signed Material Transfer Agreement. UK Biobank will then make this individual-level data available researchers in accordance with their data access policies. UK Biobank can be contacted by email at firstname.lastname@example.org.
Funding: This program was funded by the Canadian Institutes of Health Research. UK Biobank is funded by the Wellcome Trust, UK Medical Research Council, Department of Health, Scottish Government and the Northwest Regional Development Agency. It has also had funding from the Welsh Assembly Government and the British Heart Foundation. None of these funders had a role in the design, implementation or interpretation of this study. The Richards lab is supported by the Canadian Institutes of Health Research, the Canadian Foundation for Innovation, the Lady Davis Institute and the Fonds de Recherche Santé Québec (FRSQ). Dr. Richards is supported by a FRQS Clinical Research Scholarship. TwinsUK is funded by the Wellcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. J.P.K is funded by a University of Queensland Development Fellowship (UQFEL1718945), and a National Health and Medical Research Council (Australia) Investigator grant (GNT1177938). CLSA is funded by the Canadian Institutes of Health Research and the Canadian Foundation for Innovation. MrOS: The Osteoporotic Fractures in Men (MrOS) Study is supported by National Institutes of Health funding. The following institutes provide support: The National Institute on Aging (NIA), the National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), the National Center for Advancing Translational Sciences (NCATS), and NIH Roadmap for Medical Research under the following grant numbers: U01 AG027810, U01 AG042124, U01 AG042139, U01 AG042140, U01 AG042143, U01 AG042145, U01 AG042168, U01 AR066160, and UL1 TR000128. NIAMS provided funding for the MrOS ancillary study ‘Replication of candidate gene associations and bone strength phenotype in MrOS’ under the grant number R01 AR051124 and the MrOS ancillary study ‘GWAS in MrOS and SOF’ under the grant number RC2 AR058973. Dr. Nielson is supported by a K01 from NIAMS (K01AR062655). SOF: The Study of Osteoporotic Fractures (SOF) is supported by National Institutes of Health funding. The National Institute on Aging (NIA) provides support under the following grant numbers: R01 AG005407, R01 AR35582, R01 AR35583, R01 AR35584, R01 AG005394, R01 AG027574, and R01 AG027576. The National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS) provides funding for the SOF ancillary study ‘GWAS in MrOS and SOF’ under the grant number RC2AR058973. No funders had any influence over the study design, its implementation or interpretation.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: JAK reports grants from Amgen, Eli Lilly and Radius Health; consulting fees from Theramex. JAK is the architect of FRAX but has no financial interest. JBR reports investigator-initiated grants from Biogen, Eli Lilly and GlaxoSmithKline, for programs unrelated to the research presented here. JBR is an advisor to GlaxoSmithKline. DPK reports grants from Radius Health and the Dairy Council unrelated to the research presented here and consulting fees from Solarea Bio unrelated to the research presented here. CC reports personal fees (outside the submitted work) from Amgen, Danone, Eli Lilly, GSK, Kyowa Kirin, Medtronic, Merck, Nestle, Novartis, Pfizer, Roche, Servier, Shire, Takeda, UCB. NCH reports consultancy, lecture fees and honoraria (outside the submitted work) from Alliance for Better Bone Health, AMGEN, MSD, Eli Lilly, Servier, Shire, UCB, Kyowa Kirin, Consilient Healthcare, Radius Health and Internis Pharma.
Abbreviations: BMD, bone mineral density; BMD-FRAX, bone-mineral-density-based Fracture Risk Assessment Tool; CLSA, Canadian Longitudinal Study on Aging; CRF-FRAX, clinical-risk-factor-based Fracture Risk Assessment Tool; FRAX, Fracture Risk Assessment Tool; GWAS, genome-wide association study; LASSO, least absolute shrinkage and selection operator; NOGG, National Osteoporosis Guideline Group; SOF, Study of Osteoporotic Fractures; SOS, speed of sound
Screening programs are generally designed to identify a proportion of the screened population whose risk of a clinically relevant outcome is high enough to merit an intervention. However, usually only a small proportion of individuals who undergo screening is found to be at high risk, indicating that much of the screening expenditure is spent on individuals who will not qualify for intervention.
Osteoporosis is a common and costly disease that results in an increased predisposition to fractures . Many guidelines [2–6] aimed at the prevention of osteoporosis-related fractures incorporate the Fracture Risk Assessment Tool (FRAX) [7,8], a validated method to risk stratify individuals for treatment by assessing their 10-year probability of major osteoporotic fracture. Guidelines vary widely, but often recommend a staged process where individuals are first assessed with a clinical-risk-factor-based FRAX (CRF-FRAX), and those at increased risk of fracture are then additionally characterized with a more expensive bone mineral density (BMD)–based FRAX (BMD-FRAX) score. Such approaches are usually recommended in the setting of enhanced case-finding strategies, but recently, a large randomized controlled trial (SCOOP) demonstrated the potential benefit of community-based fracture risk assessment in reducing rates of hip fractures in elderly women . This trial used a strategy based on the National Osteoporosis Guideline Group (NOGG) screening strategy , which implements fracture risk stratification through the use of FRAX scores. In this trial, the entire screened population underwent FRAX assessment using clinical risk factors, and almost half (49%) had a sufficiently high probability of fracture to warrant further testing using a BMD-FRAX test. Yet, only 14% of the screened population had a resultant probability of fracture high enough to warrant intervention. This suggests that a method that improves screening efficiency and decreases the number of persons undergoing risk stratification, particularly BMD-FRAX assessments, would be a welcome addition to the screening strategy.
Skeletal measures that predict fracture risk are highly heritable (50%–85%) and include BMD and quantitative ultrasound speed of sound (SOS) measurements, which are highly correlated [10–13]. Recently, large cohort resources have enabled the genomic prediction of such heritable clinical risk factors from genotypes through polygenic risk scores [14–20], which capture information from many single nucleotide polymorphisms assayed from genome-wide genotyping. These assays assess common genetic variation at millions of single nucleotide polymorphisms and cost approximately $40 in a research context. However, the clinical utility of such polygenic risk scores is unclear, widespread replication of polygenic risk scores is currently lacking, and it is unknown whether they can aid in screening programs. Defining their clinical relevance may be particularly relevant in a British context, where the National Health Service aims to sequence 5 million individuals within 5 years .
Very large cohorts are required to train polygenic risk scores, and current cohorts lack sufficient sample size to generate useful BMD polygenic risk scores. However, since BMD is strongly correlated with SOS  and SOS has been measured in 341,449 individuals in UK Biobank, we developed a polygenic risk score for SOS termed “gSOS” (for “genetically predicted SOS”) that could be used to identify individuals unlikely to have low enough BMD to warrant a clinical intervention. To improve screening efficiency, such individuals could be removed from an osteoporosis screening program prior to measurement of BMD. We then tested the generalizability and potential benefit of incorporating gSOS into the NOGG guidelines using 5 cohorts, comprising 10,522 eligible individuals. Last, we tested if gSOS could decrease the number of people requiring more detailed assessments, such as BMD measurement, while still identifying those who require interventions to decrease their risk of fracture.
Overall study design and cohorts
The purpose of this study was not to predict fractures. Rather, the purpose of this study was to understand if genetic prescreening could reduce the number of screening tests needed to identify individuals at risk of osteoporotic fractures. This study included 3 phases (Fig 1). The first 2 phases were conducted in 2 distinct subsets of the UK Biobank study cohort, and the final phase in a further subset of UK Biobank combined with 4 other cohorts. Characteristics of the cohorts are shown in Table 1, with the cohorts described in detail in Table A in S1 Tables.
BMD, bone mineral density; CLSA, Canadian Longitudinal Study on Aging; GWAS, genome-wide association study; NOGG, National Osteoporosis Guideline Group; PRS, polygenic risk score; QC, quality control; SOF, Study of Osteoporotic Fractures; SOS, speed of sound; UKB, UK Biobank.
The first phase used least absolute shrinkage and selection operator (LASSO) regression  to train a set of polygenic risk score models to predict SOS in the UK Biobank Training Set (N = 341,449). In phase 2, the polygenic risk score model explaining the most variance in measured SOS in the UK Biobank Model Selection Set (N = 5,335) was selected and named gSOS. The ability of gSOS to explain variance in measured SOS was then tested in the UK Biobank Test Set (N = 84,768). In phase 3, gSOS was tested for its performance in a screening strategy, based on NOGG guideline thresholds of fracture risk, applied to a population of 10,522 individuals derived from 5 separate cohorts. Inclusion in the screening program required these individuals to be ≥50 years with at least 1 risk factor and available measurement of femoral neck BMD. This population comprised a further distinct subset of the UK Biobank Test Set (N = 2,445), as well as individuals from the Canadian Longitudinal Study on Aging (CLSA) (N = 2,931), the Study of Osteoporotic Fractures (SOF) (N = 2,094), Mr OS US (2,026), and Mr OS Sweden (N = 1,026). Together these 5 cohorts in phase 3 are referred to as the validation cohorts. Next, to test the effect of gSOS on fracture screening by age, we stratified the CLSA cohort by age, dividing the population into 3 age groups: 50–59 years, 60–69 years, and ≥70 years. The CLSA cohort was chosen for this age-stratified analysis, because it was the largest validation cohort and had the widest age range. To assess the performance of gSOS in ancestries other than White British, we tested it in individuals in the UK Biobank Test Set who were eligible for screening and were of non–White British ancestry, as defined by genotypes (see S1 Text for further details of definition of ancestry; Table B in S1 Tables shows the demographic and risk factor characteristics of the sub-population).
SOS and BMD measurement
We decided to use polygenic risk scores to predict SOS, rather than BMD, because polygenic risk scores require a large number of individuals with proper phenotyping and genome-wide genotyping. The largest dataset for SOS is approximately 10-fold larger than that for BMD [10,25]. SOS also predicts fracture, with similar performance characteristics compared to BMD, and the 2 measures are correlated (r = 0.4–0.6) . However, since femoral neck BMD is required for FRAX calculations used in screening programs , we required that all individuals in the phase 3 analysis have femoral neck BMD measure available. Details of SOS and BMD measurement are available in S1 Text. All analyses used SOS standardized to a mean of 0 and standard deviation of 1.
Development of machine learning model to predict SOS
Training, model selection, and test datasets.
To develop and test gSOS, we followed best practices in clinical prediction to ensure unbiased estimates of model performance by developing the models in datasets distinct from the datasets that were used to test model performance . Participants in the UK Biobank with White British ancestry (see S1 Text), measured SOS, and genotyping information (N = 426,811) were randomly assigned to the UK Biobank Training Set (80% of participants), the UK Biobank Model Selection Set (1.25% of participants), or the UK Biobank Test Set (18.75% of participants) (Fig 1; Table 1). Since BMD was measured in only 4,741 individuals in all of UK Biobank , these individuals were assigned to the UK Biobank Test Set to enable them to be used in phase 3 of the study.
Genome-wide association study (GWAS).
Using methods from our previous GWAS of estimated BMD in UK Biobank , but using a different sample size and SOS as the outcome, we undertook a GWAS for SOS in the UK Biobank Training Set (N = 341,449 individuals with White British ancestry). We tested the additive allelic effects of each of the 13.9 million SNPs passing quality control, separately, on SOS using a linear mixed model to adjust for cryptic relatedness and population stratification , as well as adjusting for age, sex, assessment center, and genotyping array (S1 Text). Linkage-disequilibrium-independent associations where obtained using PLINK by clumping SNPs in linkage equilibrium at a r2 > 0.05 and selecting a single most significant SNP from within each clumped set. To reduce potential bias due to population stratification, the UK Biobank Training, Model Selection, and Test Sets included only White British participants, while all other cohorts included only people of general European ancestry (as defined in S1 Text). Further, as stated above, the performance of gSOS-based screening was also tested in non–White British participants in UK Biobank.
Polygenic risk scores using LASSO.
Using the UK Biobank Training Set, we fitted 6 LASSO models  to predict SOS using only SNPs with p-values smaller than a chosen set of thresholds (Table C in S1 Tables). The UK Biobank Model Selection Set was then used to identify the p-value threshold and regularization parameter (λ) that resulted in the lowest root mean square error for the prediction of SOS. This p-value threshold and regularization parameter were then taken forward for testing in the UK Biobank Test Set. Hence, we ensured that the performance of only 1 final polygenic risk score was evaluated in the UK Biobank Test Set. We refer to this final predictor as gSOS, in which SOS is predicted only from genotype.
Traditional polygenic risk scores.
Traditional polygenic risk scores  were derived from the GWAS for SOS performed in the UK Biobank Training Set, without the use of LASSO, by including different sets of SNPs, selected by p-value threshold and linkage disequilibrium clumping as described in S1 Text (Table C in S1 Tables).
Generation of FRAX scores
FRAX risk scores for major osteoporotic fracture (hip, clinical vertebra, proximal humerus, or wrist) can be generated with or without BMD, referred to in this paper as BMD-FRAX and CRF-FRAX, respectively . Therefore CRF-FRAX and BMD-FRAX were calculated for all participants in each validation cohort . FRAX clinical risk factors were assessed at the baseline visit for each cohort and included age, sex, body mass index (BMI), previous fracture, smoking, glucocorticoid use, rheumatoid arthritis, and secondary causes of osteoporosis. Measures of more than 2 daily units of alcohol and parental history of hip fracture were not available in UK Biobank and were set to “no” for this cohort, as suggested by FRAX guidelines. Not all secondary causes of osteoporosis were available for the SOF, Mr OS US, and Mr OS Sweden cohorts, and these variables were also set to “no” for these cohorts, as recommended by FRAX. Age was recorded at baseline visit. Sex was self-reported and verified by genotype. Individuals with discordant sex between self-report and genotype were excluded. CRF-FRAX and BMD-FRAX were calculated for all participants in each of the clinical cohorts, using country-specific FRAX models .
Genomic screening in fracture risk screening
In the absence of an international consensus on fracture risk screening [2,4,5,30], we chose to use the assessment and management clinical algorithm developed by NOGG , since a screening program similar to the NOGG screening strategy is supported by randomized controlled trial evidence . The NOGG screening strategy uses 10-year absolute probability of fracture as calculated by FRAX and suggests treatment or reassurance based on thresholds of risk, which are age dependent and consider competing risks. The NOGG guidelines (Fig 2) also aim to identify individuals at risk for fracture in a cost-efficient manner by reserving clinical visits and more costly BMD testing for those at intermediate risk, i.e., where the FRAX score lies close to an intervention threshold. This intervention threshold is equivalent to the age-specific FRAX 10-year probability in women with a prior fragility fracture, since nearly all such women would be recommended an intervention . Individuals without any risk factors are excluded from the CRF-FRAX assessment. By applying CRF-FRAX, individuals can be recommended for either an intervention (high risk), a BMD-FRAX assessment (intermediate risk), or reassurance and no further participation in the screening program (low risk). Those having a BMD-FRAX assessment can then be recommended an intervention if their resulting 10-year probability of major osteoporotic fracture exceeds the age-specific threshold, or they can be reassured (see Fig 2).
Both CRF-FRAX and BMD-FRAX generate a 10-year probability of major osteoporotic fracture, which is used to designate risk of fracture. BMD-FRAX, bone-mineral-density-based Fracture Risk Assessment Tool; CRF-FRAX, clinical-risk-factor-based Fracture Risk Assessment Tool; NOGG, National Osteoporosis Guideline Group.
Despite the efficiencies gained by using this stepwise approach , false negatives can occur when interventions are not recommended to individuals who have a low CRF-FRAX-based probability and are discharged from subsequent screening, whereas if they had undergone BMD-FRAX, would have qualified for intervention. Likewise, false positives can arise when an individual is recommended for an intervention based on the CRF-FRAX score but would not have qualified for an intervention with BMD-FRAX.
To try to reduce the number of individuals undergoing testing, particularly more costly BMD testing, who would subsequently not require intervention, we introduced a gSOS-based screening step in the NOGG algorithm, where individuals were reassured if their gSOS was above a threshold (Fig 3). This is because individuals with a high SOS are likely to have a high BMD and are thus less likely to be recommended for an intervention. The trade-off of this strategy is that it could result in reassurance of individuals who, if their BMD was measured, would have been recommended an intervention. This would result in a decrease in sensitivity to identify individuals requiring an intervention. To calculate the sensitivity and specificity of the gSOS-modified NOGG algorithm, we used BMD-FRAX as a reference standard within the NOGG screening strategy (Fig 4). According to NOGG guidelines, women ≥50 years with a prior fragility fracture are recommended treatment without further FRAX testing. As a result, these individuals were assigned an intervention recommendation when calculating the sensitivity and specificity of correct treatment assignment (Fig 4).
Both CRF-based FRAX and BMD-based FRAX generate a 10-year probability of major osteoporotic fracture, which is used to designate risk of fracture. gSOS is standardized to have a mean of 0 and standard deviation of 1. BMD-FRAX, bone-mineral-density-based Fracture Risk Assessment Tool; CRF-FRAX, clinical-risk-factor-based Fracture Risk Assessment Tool; NOGG, National Osteoporosis Guideline Group.
BMD-FRAX, bone-mineral-density-based Fracture Risk Assessment Tool; NOGG, National Osteoporosis Guideline Group.
Since resources are often expended to measure BMD-FRAX in individuals whose final probability of fracture is too low to warrant intervention, we also estimated the number of CRF-FRAX and BMD-FRAX tests that were performed but led to the individual being reassured without a recommended intervention.
We chose the sex-specific thresholds of gSOS that reduced CRF-FRAX and BMD-FRAX testing but minimized the loss of sensitivity to identify individuals who would be recommended for treatment. This threshold was chosen using data from the UK Biobank Test Set (S4 Fig). The generalizability of the selected gSOS threshold was then tested in the remaining 4 validation cohorts (CLSA, SOF, Mr OS US, and Mr OS Sweden). The number of CRF-FRAX and BMD-FRAX tests performed but not leading to an intervention were counted. These analyses were conducted in each validation cohort, men and women separately, and in all groups combined. We also tested individuals of non–White British ancestry in UK Biobank (N = 350), i.e., the individuals who remain subsequent to filtering out the White British subset and who have available measurements of femoral neck BMD. The characteristics are provided in Table B in S1 Tables.
Table 1 describes the FRAX risk factors for all of the cohorts. There were few clinically relevant differences in any of the osteoporosis-related risk factors in the UK Biobank Training, Model Selection, and Test Sets, as expected, since these sets were generated randomly. As planned, all individuals from UK Biobank with BMD measures were included in the UK Biobank Test Set, to ensure availability of BMD-FRAX scores as the reference standard. There were few differences in demographics or clinical risk factors between individuals with and without BMD measured. The validation cohorts (CLSA, SOF, Mr OS US, and Mr OS Sweden) provided a range of characteristics, allowing for a better assessment of the generalizability of results (Table 1).
After quality control (see S1 Text), 13,958,249 SNPs were included in the GWAS. The GWAS in the training set identified 1,404 independent (r2 ≤ 0.05) genome-wide significant loci at a p-value threshold of <5 × 10−8. S1 Fig shows the Manhattan and QQ plots for this GWAS.
Variance explained in SOS in the UK Biobank Model Selection Set
The polygenic risk score models trained with LASSO explained at most 25.0% (95% CI 23.0%–27.0%) of the variance in SOS in the UK Biobank Model Selection Set (Table C in S1 Tables). S2 Fig provides detailed information on the optimal algorithm tuning parameters. None of the traditional polygenic risk scores performed better than the polygenic risk score derived from the LASSO regression. S3 Fig demonstrates that, as expected, the estimated effects of the activated SNPs from the LASSO algorithm were attenuated compared to the effects estimated from the GWAS.
Variance explained in SOS in the UK Biobank Test Set
Age, sex, and BMI explained 4.0% (95% CI 3.7%–4.2%) of the variance in SOS. Adding all available FRAX clinical risk factors increased the variance explained to 5.3% (95% CI 5.0%–5.6%). The polygenic risk score from the UK Biobank Model Selection Set explaining the most variance in measured SOS was designated as “gSOS” and was then tested for its correlation with SOS in the UK Biobank Test Set. This model explained 23.2% (95% CI 22.7%–23.7%) of the variance in measured SOS and included 21,717 SNPs activated from a total of 345,111 SNPs that had p-values for association with SOS of ≤5 × 10−4 (Table C in S1 Tables; Fig 5).
Available FRAX clinical risk factors included age, sex, BMI, smoking, previous fracture, use of glucocorticoids, rheumatoid arthritis, and secondary osteoporosis. BMI, body mass index; FRAX, Fracture Risk Assessment Tool; SOS, speed of sound.
Screening by NOGG guidelines in validation cohorts
The validation cohorts comprised 10,522 individuals eligible for fracture risk screening (Table 1). Both the sensitivity and specificity of the NOGG screening strategy to identify individuals at high enough risk to merit an intervention, compared to the reference standard, BMD-FRAX, were high (99.6% and 97.1%, respectively; Fig 6; Table D in S1 Tables). This high sensitivity and specificity required CRF-FRAX tests to be undertaken in 81% of the population eligible for screening, with BMD-FRAX tests subsequently recommended in 37% of the population. In total, 74% of those requiring CRF-FRAX tests were classified for reassurance, i.e., without a recommendation for an intervention. As well, just over one-third of all individuals who received a BMD-FRAX test were classified for reassurance without intervention (Fig 6; Table D in S1 Tables).
Screening incorporating a gSOS-based screening step
Using the UK Biobank Test Set, we selected the threshold of gSOS that would minimize the number of BMD tests done in persons who would ultimately be reassured rather than receiving an intervention, but also would minimize the number of false negatives (S3 Fig). Applying this threshold separately in men and women, we found that a threshold of standardized gSOS set to 0.5 and 0 for men and women, respectively, balanced these goals in the UK Biobank Test Set, and subsequently individuals above these thresholds were excluded from further screening in the validation cohorts, prior to receiving a CRF-FRAX or BMD-FRAX test (Fig 3). The utility of this threshold was then tested in all validation cohorts.
Fig 6 demonstrates that applying a gSOS screening step in the validation cohorts resulted in a small decrease in sensitivity to identify eligible participants for therapy, to 93.4%, but that the specificity increased slightly, to 98.5%. However, the proportion of screened individuals requiring CRF-FRAX testing decreased from 81% to 51% (representing a relative decrease of 37%) compared to NOGG-based screening without a gSOS screening step. Additionally, the proportion of screened individuals requiring BMD-FRAX testing decreased from 37% to 22% (representing a relative decrease of 41%) (Fig 6; Table D in S1 Tables).
The proportion of CRF-FRAX and BMD-FRAX tests that resulted in an individual being excluded from the screening program without a recommendation for an intervention also decreased from 74% to 46% and from 34% to 20%, respectively (Fig 6; Table D in S1 Tables). Cohort-specific results are shown in Tables E–I in S1 Tables.
The positive predictive value for correct treatment assignment in all validation cohorts was 91.8% without a gSOS screening step and increased to 95.4% with the gSOS screening step (Table D in S1 Tables; cohort-level results and subgroup results are available in Tables D–P in S1 Tables).
Women and men separately
The SOF cohort was composed of only women, while Mr OS US and Mr OS Sweden were composed of only men, providing the opportunity to explore performance characteristics by sex. Further, we divided the UK Biobank Test Set and CLSA into sex-specific datasets (Tables J–M in S1 Tables). Amongst 4,859 women who were eligible for screening in the cohorts (SOF, UK Biobank Test Set, and CLSA), the sensitivity and specificity for correct treatment assignment were high (99.9% and 95%, respectively). Nevertheless, 58% of the population required CRF-FRAX tests, and 43% required BMD-FRAX tests (Table N in S1 Tables).
When applying a gSOS screening step, the sensitivity decreased marginally, to 94.6%, and the specificity increased marginally, to 98.2%. The proportion of the population requiring a CRF-FRAX test reduced from 58% to 27% (representing a relative decrease of 55%), while the proportion requiring a BMD-FRAX test reduced from 43% to 20% (representing a relative decrease of 55%) (Table N in S1 Tables).
Amongst the 5,668 men eligible for screening, the sensitivity and specificity were 96.9% and 98.2%, respectively, using CRF-FRAX alone as the screening step. In order to achieve this performance, 100% of men had a CRF-FRAX test, and 31% required a BMD-FRAX test. The yield of high-risk individuals from these tests was low, such that 94% of men receiving a CRF-FRAX test were reassured, and 29% of those receiving a BMD-FRAX test were reassured (Table O in S1 Tables). Applying a gSOS screening step to these men reduced the sensitivity to 82% while maintaining a similar specificity at 99%. However, the proportion of men requiring a CRF-FRAX test reduced to 72% (representing a relative decrease of 28%), and the proportion undergoing BMD-FRAX reduced to 23% (representing a relative decrease of 25%).
Stratification by age
We next tested the performance of gSOS in different age strata to understand if the screening efficiency improved more for one age group than another. Using the largest cohort, with the largest variation in age (CLSA, N = 6,704), we found that gSOS had the highest performance in individuals aged ≥70 years. Specifically, the sensitivity and specificity to identify individuals who require an intervention remained high, at 99.6% and 94.9%, respectively. The proportion of screened individuals requiring CRF-FRAX testing decreased from 73% to 37% (representing relative decrease of 49%) compared to the NOGG screening strategy without a gSOS screening step. Additionally, the proportion of screened individuals requiring BMD-FRAX testing decreased from 24% to 12% (representing a relative decrease of 50%) (Table F in S1 Tables). In contrast, in individuals aged 50–59 years, sensitivity reduced to 86%, but specificity was 99.6%. The percent of individuals requiring CRF-FRAX and BMD-FRAX testing reduced by 51% and 50%, respectively. This demonstrates that gSOS pre-screening improves the efficiency of screening, but that the sensitivity to correctly identify individuals requiring therapy is maximized in older age groups.
Non–White British individuals
We then assessed the effect of a gSOS pre-screening in individuals from UK Biobank with dual-energy X-ray absorptiometry BMD measures who were of non–White British ancestry (Table B in S1 Tables). We found that the results were generally consistent with those in individuals of White British ancestry. Specifically, the proportion of screened individuals requiring CRF-FRAX testing decreased from 94% to 48% (representing a relative decrease of 49%) compared to NOGG-based screening without a gSOS screening step. Additionally, the proportion of screened individuals requiring BMD-FRAX testing decreased from 39% to 17% (representing a relative decrease of 57%) (Table P in S1 Tables).
The proportion of CRF-FRAX and BMD-FRAX tests that resulted in an individual being excluded from the screening program without a recommendation for an intervention also decreased from 92% to 47% and from 38% to 16%, respectively (Table P in S1 Tables).
By building a polygenic risk score using 341,449 individuals and validating its utility in fracture risk screening in 5 separate cohorts totaling 10,522 individuals, we determined that genomics-enabled fracture risk screening could reduce the proportion of people who require BMD-based testing by 41%, while maintaining a high overall sensitivity and specificity for correct treatment assignment. While these findings are not meant to be prescriptive, they indicate the possible utility of polygenic risk scores in screening programs that are dependent on heritable risk factors.
Fracture risk assessment is expensive, with estimates of approximately US$50,000 per quality-adjusted life year gained , but is less expensive, or even cost-saving, using NOGG assessment strategies [33,34], because NOGG decreases the number of individuals who require CRF-FRAX and BMD-FRAX testing. Current guidelines suggest testing a large proportion of the population [2,3,5], yet most screened individuals are not identified as having a clinically actionable level of fracture risk [9,35]. This circumstance provides an opportunity for genetically derived measures of risk to increase cost-efficiencies in healthcare systems where investments have been made in genome-wide genotyping. Already at least 7 large healthcare systems have invested in genome-wide genotyping of a large proportion of their population, among whom electronic health record data are available [36,37]. Since the costs associated with genome-wide genotyping have now dropped below those of several routine clinical tests, the use of polygenic risk scores could be particularly helpful in these environments since a one-time genotyping cost could be used to generate several polygenic risk scores. However, there is a clear need to study the translation of such polygenic risk scores to clinical applications —and the work presented here provides one example of how such scores could be translated to the clinic.
Previous attempts to predict osteoporosis from genomic data did not substantially increase discrimination compared to standard clinical measures alone, likely because the GWAS that underpinned these attempts was derived from 32,961 individuals and explained only 5.8% of the variance in BMD [39,40]. The improvement in variance explained in this study was attributable to the increase in sample size afforded by UK Biobank and to the LASSO regression’s ability learn SNP associations with SOS jointly, as opposed to summing over independently measured effects on BMD. Other attempts to predict BMD have been based on several dozen genome-wide significant SNPs , whereas our approach used machine learning to jointly consider the effects of 642,127 SNPs (Table C in S1 Tables). LASSO regression has recently been used to predict estimated BMD, but from a GWAS sample size that was one-third of that used here, explaining only 17.2% of the BMD variance, and it was not used in a screening program . Our work has improved the genomic prediction of BMD and demonstrated its potential clinical relevance.
We observed similar predictive performance across all LASSO models in the model selection step (Table C in S1 Tables); therefore, it remains possible that a more parsimonious model containing fewer SNPs would perform as well. As a result, further exploration of these LASSO models is warranted in a future technical study. However, should a more complex model with more SNPs prove to be ideal, the hinderance to clinical translation should be minimal, as the computational burden is limited to the training of the models, and is not in the prediction of an individual’s genetic risk.
The sensitivity and specificity to correctly assign intervention was maximized in individuals ≥70 years of age. This could be clinically relevant because this is the age range for which the SCOOP trial demonstrated that a community-based screening program could be effective in reducing hip fractures .
We acknowledge that for many practicing physicians, such as those in the UK, who have access to an automatically generated electronic-health-record-based CRF-FRAX test, the result of interest would be the reduction in BMD-FRAX tests. However, we observed no appreciable difference in the sensitivity and specificity to correctly identify individuals requiring therapy if the gSOS screening step was placed prior to the CRF-FRAX test or immediately after the CRF-FRAX test. Tables E–O in S1 Tables show the results for a reduction in BMD-FRAX tests by cohort and sex.
We have generated a polygenic risk score for SOS, rather than BMD, since there are insufficient data resources to generate such a score for BMD. Nevertheless, the correlation between SOS and BMD has enabled the identification of individuals unlikely to have a BMD low enough to warrant an intervention. Further refinement could improve the efficiencies presented here, including a polygenic risk score for BMD, when sample sizes are large enough to enable this. While nearly all FRAX risk factors were available for study, alcohol intake and parental history of fracture were not available from the UK Biobank cohorts. However, these were available in the other validation cohorts. Secondary causes of osteoporosis were not uniformly available in SOF, Mr OS US, and Mr OS Sweden. Nevertheless, CLSA provided similar results to other cohorts and had all required information. Like participants in most cohort studies, the participants used in these studies are, on average, healthier than the general population . Thus, external validation in a truly population-based study may provide helpful estimates of the real-world performance of genomics-enabled fracture risk screening. While we have tested the utility of gSOS in individuals of non–White British ancestry, the sample size available for study was relatively small, and thus results should be replicated in additional cohorts of different ancestry, underlining the need for large-scale GWAS datasets in individuals of non-European ancestry . We recognize that different approaches could be taken to incorporate polygenic risk scores into fracture risk screening, but here we offer a simple approach that could be readily implemented in a genotyped population with required FRAX risk factors using the NOGG strategy .
In summary, we have developed and tested gSOS, a polygenic risk score for SOS, which when introduced into a fracture risk screening program decreased the number of people requiring CRF-FRAX and BMD-FRAX assessments, while still maintaining a high sensitivity and specificity to identify individuals in whom an intervention should be recommended. These findings highlight the role that genetic prediction could play in screening programs that rely upon heritable risk factors.
S1 Fig. GWAS of SOS.
(A) Manhattan plot from GWAS of SOS. (B) QQ plot from GWAS of SOS.
S2 Fig. Performance of each SNP set using LASSO regression in the model selection set.
Each feature set consists of a set of SNPs associated with SOS at a specified p-value threshold (sub-panel titles). For each feature set, we fit a regularized model to the training set over a range of regularization constants (λ) (top left), with each λ resulting in a variable subset of activated features (bottom left). The model with the minimal root mean square error in the model selection set (top right) was selected to compare the variance explained (r2, bottom right among all feature sets.
S3 Fig. Correlation of effect estimates from the GWAS and the coefficients from the LASSO regression for activated SNPs.
Activated SNPs are those SNPs chosen by the machine learning algorithm to be in gSOS, the final selected model.
S4 Fig. Effects of gSOS threshold on treatment assignment.
Results stratified by women (top) and men (bottom).
This research has been conducted using the UK Biobank Resource under project number 24268. We appreciate the generosity of UK Biobank and validation cohort volunteers. We appreciate advice on the manuscript provided by Dr. Suzanne Morin.
- 1. Consensus development conference: diagnosis, prophylaxis, and treatment of osteoporosis. Am J Med. 1993;94: 646–50. pmid:8506892
- 2. Papaioannou A, Morin S, Cheung AM, Atkinson S, Brown JP, Feldman S, et al. 2010 clinical practice guidelines for the diagnosis and management of osteoporosis in Canada: summary. CMAJ. 2010;182:1864–73. pmid:20940232
- 3. Compston J, Cooper A, Cooper C, Gittoes N, Gregson C, Harvey N, et al. UK clinical guideline for the prevention and treatment of osteoporosis. Arch Osteoporos. 2017;12:43. pmid:28425085
- 4. Curry SJ, Krist AH, Owens DK, Barry MJ, Caughey AB, Davidson KW, et al. Screening for osteoporosis to prevent fractures us preventive services task force recommendation statement. JAMA. 2018;319:2521–31. pmid:29946735
- 5. Cosman F, de Beur SJ, LeBoff MS, Lewiecki EM, Tanner B, Randall S, et al. Clinician’s guide to prevention and treatment of osteoporosis. Osteoporos Int. 2014;25:2359–81. pmid:25182228
- 6. Kanis JA, Harvey N, Cooper C, Johansson H, Odén A, McCloskey E, et al. A systematic review of intervention thresholds based on FRAX: a report prepared for the National Osteoporosis Guideline Group and the International Osteoporosis Foundation. Arch Osteoporos. 2016;11:25. pmid:27465509
- 7. Kanis JA. Assessment of osteoporosis at the primary health care level. WHO Scientific Group Technical Report. Sheffield (UK): World Health Organization Collaborating Centre for Metabolic Bone Diseases; 2007. https://www.sheffield.ac.uk/FRAX/pdfs/WHO_Technical_Report.pdf.
- 8. Kanis JA, Johnell O, Oden A, Johansson H, McCloskey E. FRAX and the assessment of fracture probability in men and woman from the UK. Osteoporos Int. 2008;19:385–97. pmid:18292978
- 9. Shepstone L, Lenaghan E, Cooper C, Clarke S, Fong-Soe-Khioe R, Fordham R, et al. Screening in the community to reduce fractures in older women (SCOOP): a randomised controlled trial. Lancet. 2018;391:741–7. pmid:29254858
- 10. Zheng HF, Forgetta V, Hsu YH, Estrada K, Rosello-Diez A, Leo PJ, et al. Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture. Nature. 2015;526112–7. pmid:26367794
- 11. Kemp JP, Morris JA, Medina-Gomez C, Forgetta V, Warrington NM, Youlten SE, et al. Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis. Nat Genet. 2017;49:1468–75. pmid:28869591
- 12. Richards JB, Zheng HF, Spector TD. Genetics of osteoporosis from genome-wide association studies: advances and challenges. Nat Rev Genet. 2012;13:672.
- 13. Howard GM, Nguyen TV, Harris M, Kelly PJ, Eisman JA. Genetic and environmental contributions to the association between quantitative ultrasound and bone mineral density measurements: a twin study. J Bone Miner Res. 1998;13:1318–27. pmid:9718201
- 14. Kim SK. Identification of 613 new loci associated with heel bone mineral density and a polygenic risk score for bone mineral density, osteoporosis and fracture. PLoS ONE. 2018;13:e0200785. pmid:30048462
- 15. Evans DM, Visscher PM, Wray NR. Harnessing the information contained within genome-wide association studies to improve individual prediction of complex disease risk. Hum Mol Genet. 2009;18:3525–31. pmid:19553258
- 16. Khera AV, Chaffin M, Aragam K, Haas M, Roselli C, Choi SH, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50:1219–24. pmid:30104762
- 17. Inouye M, Abraham G, Nelson CP, Wood AM, Sweeting MJ, Dudbridge F, et al. Genomic risk prediction of coronary artery disease in nearly 500,000 adults: implications for early screening and primary prevention. bioRxiv 250712. 2018 Jan 19.
- 18. Thériault S, Lali R, Chong M, Velianou JL, Natarajan MK, Paré G. Polygenic contribution in individuals with early-onset coronary artery disease. Circ Genom Precis Med. 2018;11:e001849. pmid:29874178
- 19. Seibert TM, Fan CC, Wang Y, Zuber V, Karunamuni R, Parsons JK, et al. Polygenic hazard score to guide screening for aggressive prostate cancer: development and validation in large scale cohorts. BMJ. 2018;360:j5757. pmid:29321194
- 20. Khera AV, Chaffin M, Wade KH, Zahid S, Brancale J, Xia R, et al. Polygenic prediction of weight and obesity trajectories from birth to adulthood. Cell. 2019;177:587–96.e9. pmid:31002795
- 21. Office for Life Sciences. UK life sciences sector deal 2, 2018. London: HM Government; 2018.
- 22. Gonnelli S, Cepollaro C, Gennari L, Montagnani A, Caffarelli C, Merlotti D, et al. Quantitative ultrasound and dual-energy X-ray absorptiometry in the prediction of fragility fracture in men. Osteoporos Int. 2005;16:963–8. pmid:15599495
- 23. Tibshirani R. regression selection and shrinkage via the lasso. J R Stat Soc Series B Stat Methodol. 1996;58:267–88.
- 24. Janssens ACJW, Ioannidis JPA, van Duijn CM, Little J, Khoury MJ. Strengthening the reporting of genetic risk prediction studies: the GRIPS statement. PLoS Med. 2011;8:e1000420. pmid:21423587
- 25. Morris JA, Kemp JP, Youlten SE, Laurent L, Logan JG, Chai RC, et al. An atlas of genetic influences on osteoporosis in humans and mice. Nat Genet. 2019;51:258–66. pmid:30598549
- 26. Kanis JA, Oden A, Johnell O, Johansson H, De Laet C, Brown J, et al. The use of clinical risk factors enhances the performance of BMD in the prediction of hip and osteoporotic fractures in men and women. Osteoporos Int. 2007;18:1033–46. pmid:17323110
- 27. Riley RD, Ensor J, Snell KIE, Debray TPA, Altman DG, Moons KGM, et al. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges. BMJ. 2016;353:i3140. pmid:27334381
- 28. Harvey NC, Matthews P, Collins R, Cooper C. Osteoporosis epidemiology in UK Biobank: a unique opportunity for international researchers. Osteoporosis Int. 2013;24:2903–5. pmid:24057481
- 29. Loh P-R, Tucker G, Bulik-Sullivan BK, Vilhjálmsson BJ, Finucane HK, Salem RM, et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat Genet. 2015;47:284–90. pmid:25642633
- 30. Kanis JA. Assessment of fracture risk and its application to screening for postmenopausal osteoporosis: synopsis of a WHO report. WHO Study Group. Osteoporos Int. 1994;4:368–81. pmid:7696835
- 31. Johansson H, Kanis JA, Oden A, Compston J, McCloskey E. A comparison of case-finding strategies in the UK for the management of hip fractures. Osteoporos Int. 2012;23:907–15. pmid:22234810
- 32. Nayak S, Roberts MS, Greenspan SL. Cost-effectiveness of different screening strategies for osteoporosis in postmenopausal women. Ann Intern Med. 2011;155:751. pmid:22147714
- 33. Turner DA, Khioe RFS, Shepstone L, Lenaghan E, Cooper C, Gittoes N, et al. The cost-effectiveness of screening in the community to reduce osteoporotic fractures in older women in the UK: economic evaluation of the SCOOP study. J Bone Miner Res. 2018;33:845–51. pmid:29470854
- 34. Söreskog E, Borgström F, Shepstone L, Clarke S, Cooper C, Harvey I, et al. Long-term cost-effectiveness of screening for fracture risk in a UK primary care setting: the SCOOP study. Osteoporos Int. 2020 Apr 1. pmid:32239237
- 35. Richards JB, Leslie WD, Joseph L, Siminoski K, Hanley DA, Adachi JD, et al. Changes to osteoporosis prevalence according to method of risk assessment. J Bone Miner Res. 2006;22:228–34. pmid:17129177
- 36. Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13:395–405. pmid:22549152
- 37. Grzymski JJ, Coppes MJ, Metcalf J, Galanopoulos C, Rowan C, Henderson M, et al. The Healthy Nevada Project: rapid recruitment for population health study. bioRxiv 250274. 2018 Jan 19.
- 38. Hunter DJ, Drazen JM. Has the genome granted our wish yet? N Engl J Med. 2019;380:2391–3. pmid:31091368
- 39. Estrada K, Styrkarsdottir U, Evangelou E, Hsu Y-H, Duncan EL, Ntzani EE, et al. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nat Genet. 2012;44:491–501. pmid:22504420
- 40. Eriksson J, Evans DS, Nielson CM, Shen J, Srikanth P, Hochberg M, et al. Limited clinical utility of a genetic risk score for the prediction of fracture risk in elderly subjects. J Bone Miner Res. 2015;30:184–94. pmid:25043339
- 41. Allen N, Sudlow C, Downey P, Peakman T, Danesh J, Elliott P, et al. UK Biobank: current status and what it means for epidemiology. Health Policy Technol. 2012;1:123–6.
- 42. Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51:584–91. pmid:30926966