Skip to main content
Advertisement
  • Loading metrics

Polygenic risk scores and Parkinson’s disease in South Africa advancing ancestry informed disease prediction

  • Kathryn Step,

    Roles Conceptualization, Formal analysis, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa, South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa

  • Carene Anne Alene Ndong Sima,

    Roles Methodology, Writing – review & editing

    Affiliations Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa, South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa

  • Spencer Grant,

    Roles Methodology, Writing – review & editing

    Affiliations Center for Alzheimer’s and Related Dementias, National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland, United States of America, Department of Clinical Neurosciences, School of Clinical Medicine, The University of Cambridge, Cambridge, United Kingdom

  • Jonggeol Jeffrey Kim,

    Roles Methodology, Writing – review & editing

    Affiliation Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America

  • Emily Waldo,

    Roles Formal analysis, Writing – review & editing

    Affiliation Genomic Medicine, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America

  • Soraya Bardien,

    Roles Conceptualization, Project administration, Supervision, Writing – review & editing

    Affiliations Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa, South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa, South African Medical Research Council/Stellenbosch University Genomics of Brain Disorders Research Unit, Stellenbosch University, Cape Town, South Africa

  • Ignacio F. Mata ,

    Roles Conceptualization, Project administration, Supervision, Writing – review & editing

    * MATAI@ccf.org

    Affiliation Genomic Medicine, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America

  • on behalf of the Global Parkinson’s Genetics Program (GP2)

    Membership of Global Parkinson’s Genetics Program (GP2) is provided in S4 Table and at https://doi.org/10.5281/zenodo.7904831.

Abstract

Parkinson’s disease (PD) is a complex neurodegenerative disorder with environmental and genetic influences. Using genotyping array data of 661 South African PD cases and 737 controls, we conduct a polygenic risk score (PRS) analysis using PRSice-2. Summary statistics from two PD association studies have been used as base datasets. We split the target dataset into training (70%; n = 979) and validation (30%; n = 419) cohorts. We test various clumping window sizes, linkage disequilibrium thresholds, and p-value thresholds to determine the optimal combination for risk prediction. Additionally, we investigate the variance explained by different combinations of covariates. Overall, we observe modest predictive performance (AUC: 0.5847-0.6183). Age at recruitment emerges as the strongest individual predictor, while sex contributes the least. These findings provide the first evaluation of PRS performance for PD in a highly admixed South African cohort, underscoring the importance of including underrepresented populations in genetic risk prediction. By systematically assessing predictive performance across two base datasets, we highlight how ancestry composition and study design affect risk estimation in diverse populations. This work lays a foundation for refining genomic prediction in admixed populations and contributes to ongoing efforts to ensure that advances in precision medicine are globally relevant.

Author summary

Parkinson’s disease is a complex brain disorder influenced by both genes and environment. Most of what we know about the genetic contribution to this disease comes from studies in people of European ancestry, leaving major gaps in our understanding of how genetic risk works in other populations. In this study, we examined how well existing genetic risk models, known as polygenic risk scores, can predict Parkinson’s disease in people from South Africa, a population with a rich mix of ancestries. We compared different approaches to building these scores and tested how accurately they could identify individuals with the disease. Although we found that genetic scores alone are modest predictors compared to age, our results highlight important insights about the limits and possibilities of using genetic information in diverse populations. By including underrepresented groups in genetic research, our study takes an important step toward making precision medicine more inclusive and equitable worldwide.

Introduction

Parkinson’s disease (PD) is a complex neurodegenerative disorder characterized by motor and non-motor symptoms [1]. Once considered primarily sporadic and environmentally driven [2,3], research has revealed a multifactorial etiology involving monogenic causes, gene-environment, and gene-gene interactions [2]. Approximately 5–10% of cases are monogenic, caused by single-gene mutations with large effect sizes, identified through genetic linkage analysis in families [4,5]. Although often observed in familial PD, they can occur in sporadic cases due to incomplete penetrance [6]. Genome-wide association studies (GWAS) have further shown that sporadic PD has a polygenic basis, with many small-effect variants jointly contributing to risk [7].

These susceptibility variants can be used for risk prediction through polygenic risk scores (PRS) [8]. In a PRS analysis, the variants identified through GWAS, along with their effect sizes (whether conferring increased or decreased risk) are combined to estimate an individual’s genetic predisposition to a disease [9]. In 2016, the first report on polygenic risk and clinical outcomes for PD was published [10]. Since then, several studies have evaluated PRS for PD risk prediction, with predictive performance, assessed by the area under the receiver operating characteristic curve (AUC), ranging from ~60%-76% [7,1114]. This is largely dependent on the number of single-nucleotide polymorphisms (SNPs) included and the population characteristics [15].

PRS analysis classifies individuals by relative disease risk, supporting risk stratification, early intervention strategies, and tailored precision medicine approaches [8,15]. We conducted the first PRS analysis for PD in a South African cohort.

Results

PRSice-2 for disease status prediction

The analysis included 35,075,375 variants in the two target datasets: training (70%; n = 979) and validation (30%; n = 419) cohorts. We conducted the analysis using two sets of summary statistics as the base data. For each, we evaluated varying combinations of thresholds for SNP inclusion (S1 and S2 Figs). The analyses were performed on the training dataset to determine the optimal thresholds and replicated in the validation dataset. All analyses were adjusted for sex, age, and inferred local ancestry components (ANC) with linkage disequilibrium (LD) estimated using the European (EUR) reference panel from the 1000 Genomes project.

One of the main uses of PRS is to predict case status according to their genetic risk or predisposition, making it a useful prognostic tool (S3 Fig) [15]. Using the EUR-based summary statistics [7], we tested 48 combinations of parameters to identify the best-performing PRS model for PD status prediction (S4 Fig). The highest predictive performance was observed under the following parameter set: 100kb clumping window,  = 0.8, and a SNP inclusion threshold of p-value = 1 × 10−3. Here, 3,466 SNPs remained after clumping and the full R2 explained 35.84% of the variance in the disease phenotype (PRS R2 = 0.005), with a strong association (empirical p-value = 0.076). The PRS analysis was conducted on the validation dataset (30%) using the abovementioned parameter combinations and the 3,466 SNPs included in the model. When applied to the validation set, 37.90% of the variance in case-control status (PRS R² = 0.019).

Given the high genetic admixture in the South African cohort [16], we evaluated predictive accuracy using multi-ancestry summary statistics from Kim et al. (2024) [17]. We tested the same 48 parameter combinations as for the previous analysis (S5 Fig). The highest predictive performance was observed under the following parameter set: a 100kb clumping window,  = 0.8, and a p-value threshold of 1 × 10−3. The full R2 explained 36.93% of the variance (PRS R² = 0.015; adjusted R² = 0.016; p-value = 9.82 × 10−5; empirical p-value = 5.0 × 10−4). A total of 3,208 variants remained after clumping in this model. We performed PRS analysis on the validation dataset (30%), where the best-fitting model, defined by a p-value threshold of 1 × 10−3 and including 3,208 SNPs, explained 40.17% of the variance in disease phenotype (PRS R² = 0.042).

Assessment of model performance

The AUC, sensitivity, and specificity were assessed using the two base datasets as well as the training and validation cohorts of the target dataset (Table 1; Fig 1). We assessed the mean AUC (Nall et al 2019: 0.6038 ± 0.0103; Kim et al 2024: 0.6052 ± 0.0178) across 20 random data splits to assess the robustness of our data split into training and validation cohorts. The results were consistent with the original split presented below, indicating the predictive performance was stable and not notably influenced by the random split.

thumbnail
Table 1. Model performance across base and target datasets.

https://doi.org/10.1371/journal.pgen.1012064.t001

thumbnail
Fig 1. Receiver Operating Characteristic (ROC) Curve Comparing Polygenic Risk Score (PRS) Models for Case Status.

The ROC curves compare the predictive performance of two PRS models for Parkinson’s disease: one based on Nalls et al. (2019) (blue) and the other on Kim et al. (2024) (green). The area under the curve (AUC) indicates the discriminative ability of each model, with higher AUC values reflecting better classification of cases and controls. The AUC for the Nalls model is 0.585, while the AUC for the Kim model is 0.616. PRS were calculated and matched to phenotype data from the same sample set (N = 419).

https://doi.org/10.1371/journal.pgen.1012064.g001

Using the Nalls et al., 2019 summary statistics, the PRS demonstrated a moderate ability to distinguish between PD status, with an AUC of 0.6077 (95% CI: 0.5721-0.6433) in the training dataset and 0.5847 (95% CI: 0.5302-0.6391) in the validation dataset. At a fixed probability threshold of 0.5, classification accuracy was 57.00% (95% CI: 0.5383-0.6012) in the training dataset and 55.85% (95% CI: 0.5095-0.6067) in the validation dataset. The observed balanced accuracy varied slightly between datasets (53.43% and 55.60%). Sensitivity was higher in the validation dataset (63.43%) compared to the training dataset (14.16%), whereas specificity was higher in the training dataset (92.70% versus 47.78%).

In contrast, the PRS model constructed using summary statistics from Kim et al., 2024 yielded a comparable discriminative ability, with an AUC of 0.6183 (95% CI: 0.5830-0.6537) in the training dataset and 0.6159 (95% CI: 0.5624-0.6694) in the validation dataset. Classification accuracy at the 0.5 threshold was 60.16% (95% CI: 0.5702-0.6325) and 58.23% (95% CI: 0.5335-0.6300) in the training and validation datasets, respectively. Balanced accuracy was modest in both datasets (58.56% and 58.05%), with sensitivity again higher in the validation dataset (63.88%) compared to the training dataset (40.90%). Specificity followed the same pattern as observed in the EUR-based base dataset [7], with higher values in the training dataset (76.22%) relative to validation (52.22%).

Finally, we evaluated the predictive performance of the PRS using top percentile thresholds (S1 Table). The sensitivity increased as more cases were included when the threshold was lowered, while specificity decreased correspondingly. The positive predictive value remained low across all thresholds, whereas the negative predictive values were consistently high (>99%). The overall patterns observed were similar between the PRS derived from the two base datasets with minor differences in the number of cases captured at the top 5% threshold.

Covariate contribution to the variance observed

We evaluated the contribution of covariates to the explained variance by examining their effect on model performance (Table 2). We looked at the two variance models (Full R2 and Null R2) under seven different covariate inclusion scenarios, using each possible combination of covariates: age, sex, and ANC.

thumbnail
Table 2. Variance explained by polygenic risk score models with different covariate adjustments for the training dataset.

https://doi.org/10.1371/journal.pgen.1012064.t002

We assessed this using the Nalls et al., 2019 summary statistics as the base data. Here, the highest PRS R², representing the variance explained by the PRS after accounting for covariates, was observed when adjusting for sex only. The highest Null R2 was observed with age, sex, and ANC, indicating they explained the most variability independent of polygenic score. In contrast, the lowest Null R² was seen in the model including only sex, suggesting this alone has a limited contribution to the model variance and cannot be used alone to predict risk. The models that included age or sex, or both had statistically significant empirical p-values. Models with ANC alone or sex and ANC showed nominal significance, while the age, sex, and ANC as well as the age and ANC models were not significant. However, the magnitude of variance explained by genetic and covariate components varied depending on the covariate combination.

For Kim et al., 2024, the same seven covariate combinations were assessed. The highest PRS R2, showing the variance explained by the PRS model only, was observed when adjusting for sex (PRS R2 = 0.0552), while the lowest was seen when adjusting for age and ANC (PRS R2 = 0.0139). The Null R2 values are consistent with those observed in the EUR-based summary statistics, as expected, since they capture the variance explained by covariates alone and are independent of genetic influence. Finally, the variance explained by the PRS was dependent on the covariate structure, highlighting the impact of covariate selection on model performance.

We evaluated whether the inclusion of PRS improved the predictive performance for the models including only covariates (Table 3). For this, we looked at both base dataset summary statistics as well as the training and validation cohorts. The addition of the PRS consistently increased the AUC for all covariate combinations. The improvement was more pronounced in models with fewer covariates showing statistical significance (DeLong p-value <0.05). For models including the full set of covariates (age, sex, and ANC), the PRS still increased the AUC, though the change was smaller and, in some cases, not statistically significant (DeLong p-value >0.05). These results indicate that the PRS provides a meaningful addition to the predictive power beyond the covariates included.

thumbnail
Table 3. Effect of adding the polygenic risk score on predictive performance of covariate models.

https://doi.org/10.1371/journal.pgen.1012064.t003

Polygenic risk score analysis: age at onset

The PRS explained a small proportion of variance in PD AAO across all datasets. Specifically, for the Nalls et al. (2019) analysis, the PRS R² was 0.0019 in the training subset and 0.0054 in the validation subset. For the Kim et al. (2024) analysis, the PRS R² was 0.0005 in the training subset and 0.0243 in the validation subset. Overall, these results suggest that the selected PRS have limited explanatory power for AAO in this cohort.

Internal polygenic risk score analysis using South African summary statistics

We constructed an internal PRS for PD using 351 variants that reached a suggestive significance threshold in our South African GWAS. After LD clumping, 141 independent variants remained and were included in the PRS. In the training cohort, the PRS explained 33.9% of the phenotypic variance (R² = 0.3386) and achieved an AUC of 0.7736. When applied to the validation cohort, the PRS explained 36.0% of the variance (R² = 0.3598) with an AUC of 0.7667, indicating consistent discriminative performance across cohorts.

Discussion

To our knowledge, this is the first study to evaluate PRS for PD prediction in a South African cohort. We used a well-established PRS software, PRSice-2, and leveraged summary statistics from both EUR-ancestry and multi-ancestry GWAS for PD. Despite the summary statistics not fully matching the genetic background of our cohort, which is five-way admixed [16], our findings demonstrate that polygenic models can still capture a modest but significant proportion of the variance in PD susceptibility, highlighting the utility of PRS in diverse populations.

Using PRSice-2, we identified the best parameter sets for both the Nalls et al. (2019) and Kim et al. (2024) summary statistics. In the training dataset, we found that clumping parameters (e.g., r² and window size) and p-value thresholds had notable effects on predictive power. The optimal PRS derived from the Nalls et al. (2019) summary statistics achieved an AUC of 0.6077. Similarly, using the Kim et al. (2024) summary statistics with more diverse populations, the best-performing PRS model had an AUC of 0.6183. Although the PRS derived from Kim et al. (2024) explained more variance in disease risk, its slightly higher AUC compared to Nalls et al. (2019) highlights how R² and AUC capture different aspects of predictive performance, the underlying genetic predisposition versus classification accuracy, respectively. This may suggest that multi-ancestry meta-analysis PRS better models genetic predisposition for distinguishing cases from controls in the study population. However, the observed AUC is similar to those previously reported for PD PRS which range from 0.620 to 0.760 [15].

Notably, the models that achieved the highest predictive accuracy tended to favor moderate clumping thresholds and relatively lenient SNP inclusion thresholds, likely balancing the trade-off between including informative variants and controlling for LD-induced noise. Moreover, the validation of model performance in an independent subset of the cohort strengthens the robustness of these findings and supports the potential clinical utility of PRS in diverse populations.

Overall, our results highlight that while sex contributes modestly to the explained variance, accounting for approximately 3–5% across models, it cannot be used in isolation to predict disease status [15], as evidenced by the low full model R2 values when sex was the only covariate. In contrast, age appears to provide a more meaningful contribution, particularly when combined with other covariates such as ancestry. Importantly, the PRS R2 values were consistently low across all models and datasets, suggesting that genetic risk alone, as captured by current PRS methods, is insufficient for reliable disease prediction [18]. However, the DeLong test demonstrated that adding PRS to models with sex or ancestry alone can significantly improve AUC, whereas the incremental gain was not significant when age and ancestry covariates were already included. This underscores that the predictive contribution of PRS is dependent on demographic and ancestral factors. These findings underscore the importance of incorporating non-genetic covariates to enhance predictive performance, as previously illustrated in a PD context [19]. Ultimately, the limited variance explained by PRS alone constrains its current clinical utility for PD and emphasizes the need for integrative models that include both genetic and phenotypic information [8].

The additional sensitivity and specificity analyses support the overall performance of our PRS models, showing results that are consistent with previous work on multi-ancestry populations [20]. Our AUC values ranged from 0.5847 to 0.6183 across base datasets, with corresponding balanced accuracy values between 0.5343 and 0.5856. These values are comparable to those previously reported, whose AUC ranged from 0.505 to 0.651 across four ancestries [20]. Our model performs similarly to theirs, particularly in populations with higher predictive power, such as the EUR populations. Our sensitivity and specificity values also reflect expected trade-offs: for example, when sensitivity increased, specificity tended to decrease, consistent with typical classification dynamics. Together, these findings reinforce the idea that PRS models retain moderate predictive power in diverse populations, but that further optimization may be needed to improve performance, especially in underrepresented groups. This aligns with previous reports showing PRS transferability from EUR to AFR populations is substantially reduced, with performance estimated at 20–40% of that observed in EUR populations [21].

We generated an internal PRS using summary statistics from our South African GWAS. Of 351 suggestively associated variants, 141 remained after clumping. The PRS explained 33.9% of the variance in the training cohort (AUC = 0.774) and 36.0% in the validation cohort (AUC = 0.767). As this PRS was derived and tested in the same cohort, these estimates likely reflect overfitting and should be interpreted as cohort-specific rather than generalizable predictive performance.

Despite the moderate predictive performance observed, our study underscores both the promise and limitations of current PRS models in underrepresented populations. A key limitation of this study is the small sample size and the lower mean age of our control group relative to the case group. A further limitation for the study is the subsequent lack of an appropriate ancestry-matched validation cohort. Moreover, the AUC values achieved reflect modest discriminative power. This finding suggests that while PRS can contribute to risk stratification, they are not yet sufficient for clinical decision-making on their own. This is further illustrated by the AAO PRS, which explains only a modest proportion of variation in AAO, highlighting its limited predictive power for this phenotype. Future studies incorporating ancestry-specific GWAS, functional annotations, and integrative risk models may further improve PRS accuracy in AFR and admixed populations. As LD resources tailored to AFR and admixed genomes continue to improve, evaluating LD-aware methods such as PRS-CS or PRS-CSx will be critical to determine whether these approaches can further enhance PRS performance in this setting [22,23].

A key strength of the study is its novelty, representing the first evaluation of PRS for PD in a South African study collection, thereby addressing a critical gap in genetic risk research. By including both EUR-based and multi-ancestry summary statistics, we were able to compare the PRS transferability across ancestries and assess how base dataset ancestral composition influences predictive performance. Furthermore, the systematic evaluation of clumping thresholds, LD parameter, and p-value thresholds to identify the optimal input parameter combinations further strengthens the methodological approach of this analysis. Finally, the application of PRS across various diseases, including diabetes and cancer [24,25], has proven valuable for stratifying individuals at highest risk, rather than serving as a direct predictor of disease development [26]. In this context, our study contributes by refining PD risk prediction for a smaller subset of individuals most at risk for developing PD.

In conclusion, our results highlight the importance of including diverse ancestral cohorts and relevant covariates when constructing and evaluating PRS models. By systematically assessing the variance observed across different covariate combinations, we highlight the contributions of demographic and genetic factors to disease risk prediction. Future efforts should continue to refine ancestry specific risk-models to ensure equitable translation of PRS from research into clinical applications for early screening, disease risk prediction, and precision medicine [27].

Materials and methods

Ethics statement

Ethical approval was obtained from the Health Research Ethics Committee 1 (HREC 1), Stellenbosch University, South Africa. The HREC 1 reference number is: S23/10/251 (PhD) Sub Study 2002C/059. All participants provided a written statement of formal consent.

Participant demographics

Study participants were recruited from 2002 to 2020 as part of the South African PD Study Collection (S2 Table) [28]. Individuals living with PD (n = 691) were diagnosed in accordance with the Queen’s Square Brain Bank Criteria [29]. In total, 826 controls were recruited as part of the study collection [28].

Data preprocessing

Genotyping was completed through the Global Parkinson’s Genetics Program [30] using the NeuroBooster Array (v1.0, Illumina, San Diego, CA) [31]. Quality control (QC) was performed using PLINK v1.9 and v2.0 [32,33], as previously described [16]. Imputation was performed using the TOPMed Imputation Server [34]. The related individuals (n = 63) were identified using NAToRA [35] and a kinship coefficient of 0.0884 (second degree relation [36]) and excluded from the analysis. After QC, 661 PD cases and 737 controls remained (S2 Table). The South African population is five-way admixed [37], therefore, a reference panel was created using samples from the 1000 Genomes Project Phase III, including individuals of African (AFR), EUR, and South Asian (SAS) ancestries [38,39]. Additional individuals of Malay (MAL) ancestry and an indigenous hunter-gatherer Khoe-San (NAMA) population were included in the reference panel [40], as previously described [16]. The reference files were phased using the TOPMed Imputation Server [34]. For the South African dataset, local ancestry inference was performed using G-Nomix [41], as previously described [16] (S6 Fig).

Polygenic risk score calculation

A PRS analysis includes the following steps (Fig 2): [1] data QC and preparation, [2] PRS calculation, and [3] PRS performance assessment [9]. The analysis utilizes two independent datasets: discovery and target datasets [9]. The discovery dataset consists of GWAS summary statistics, including effect sizes for each variant. The target dataset contains individual-level genotype data, from which SNP dosages are derived for variants included in the PRS calculation. In general, PRS is computed for each individual as the sum of the dosages of risk alleles at selected SNPs, weighted by their corresponding effect sizes from the discovery dataset [42]. The PRS analysis was run using PRSice-2 v2.3.3 [43], which applies clumping and thresholding based on LD and p-values. The predictive PRS model is evaluated using Nagelkerke’s pseudo-R2 (R2) [9].

thumbnail
Fig 2. An overview of the methods followed in the present study.

LiftOver was specific to the summary statistics used as the base datasets. PRS, polygenic risk score. Image created with BioRender.com/. https://BioRender.com/19qh9dv.

https://doi.org/10.1371/journal.pgen.1012064.g002

Polygenic risk score: file preparation

The covariate file was created using the local ancestry analysis output containing the ancestry inferences for the parental populations for each individual and both haplotypes across inferred local ancestry windows. The ancestry proportions were calculated by extracting the ancestry window information, specifically calculating the total genomic span of each parental ancestry, and normalizing the values to determine relative ancestry contribution per individual. Additional covariates included age and sex. Full summary statistics were obtained from the NHGRI-EBI GWAS catalog [44] on 05/09/2024 for studies GCST009325 [7] and GCST90275127 [17]. These two base datasets were used to assess and compare the predictive power of the EUR dataset relative to a multi-ancestry dataset which is better matched to the admixture of the South African cohort. This approach was used to evaluate whether ancestry-matched summary statistics enhanced predictive performance in an admixed population, as traditional EUR-derived GWAS does not fully capture the genetic architecture in diverse populations, like the South African population. The summary statistics were converted to GRCh38 using LiftOver v1.0 [45]. The South African PD data, serving as the target dataset, was randomly split into two cohorts: 70% of the samples were in the training dataset (n = 979 individuals; n = 445 cases; n = 534 controls), and 30% in the validation dataset (n = 419 individuals; n = 216 cases; n = 203 controls). To assess the robustness of the data split, we ran the PRS analysis across 20 random splits and compared the distribution of the AUC values with the original split (S3 Table).

Polygenic risk score analysis: case status

For this analysis, clumping was conducted using window sizes of 100kb, 250kb, and 500kb; LD thresholds (r2) of 0.1, 0.2, 0.5, and 0.8; as well as the p-value thresholds of 1 × 10-3, 1 × 10-5, 1 × 10-6, and 5 × 10-8. The 1000 Genomes EUR reference panel was used for LD calculations in both base datasets, as the Nalls et al. (2019) study is EUR-based and approximately 62% of participants in the Kim et al. (2024) study were of EUR ancestry [7,17,46]. All analyses were performed using logistic regression, adjusting for sex, age, and ancestral components (AFR, EUR, NAMA, SAS). To prevent perfect multicollinearity, we excluded the MAL ancestry from the covariates, as it represented the smallest ancestral contribution. To obtain robust significance estimates, empirical p-values were calculated using 10,000 phenotype permutations. The initial search for optimal PRS parameters was conducted on 70% of the dataset (training cohort), and the best-performing parameters and variants included in the model were then applied to fit a new model on the remaining 30% (validation cohort). The AUC was used to evaluate the performance of each PRS model using the pROC package [47] in R v4.2.0, providing a quantitative metric for comparing models [48]. Additionally, predicted probabilities from a logistic regression model were converted to binary disease status using a fixed threshold of 0.5, and model performance was evaluated using accuracy, balanced accuracy, sensitivity, and specificity calculated at this threshold. Additionally, the positive predictive value and negative predictive values were calculated at multiple top-percentile thresholds (5%, 10%, and 20%) using the global PD prevalence of 1.386 × 10-4 [49].

Assessment covariate variance

We evaluated the contribution of different covariate combinations to the variance explained. Using PRSice-2 outputs, we examined the variance models across seven covariate inclusion scenarios, each incorporating a distinct combination of the following variables: age, sex, and ANC. This analysis allowed us to quantify the incremental variance explained by each covariate and their combinations, providing insight into their effects on risk prediction.

Polygenic risk score analysis: age at onset

PRS for PD age at onset (AAO) were generated using PRSice-2. The analysis followed the same 70/30 training-validation split and identical PRS parameters described above, with the only modification being that AAO was modelled as a continuous phenotype rather than a binary outcome (as seen in the PRS for case status). The aforementioned covariates were used and LD was calculated with the EUR reference panel. This analysis was performed using both base datasets as well as the training and validation cohorts.

Internal polygenic risk score analysis using South African summary statistics

We constructed a PD PRS using summary statistics from our internal GWAS [16]. Variants were selected based on a suggestive significance threshold (p < 5 × 10 ⁻ ⁶) to capture loci with potential contribution to PD risk within this population, and to see how predictive performance varies in comparison to the external summary statistics. The analysis was performed in the same 70.30 split using the optimal thresholds determined above. Predictive performance of the PRS was assessed by calculating the AUC.

Supporting information

S1 Fig. The following plots are using Nalls et al 2019 as the base dataset.

(A) PRS R² across clumping thresholds. The variance explained (R²) by the polygenic risk score at different GWAS p-value thresholds, stratified by linkage disequilibrium clumping parameters (r² and kb). Each panel corresponds to a clumping window size (kb), with points and lines indicating R² across p-value thresholds. (B) PRS significance versus predictive performance. Relationship between PRS model significance (PRS association p-value; log-scaled) and variance explained (R²). Colors indicate clumping window sizes (kb) and shapes indicate LD thresholds (r²), highlighting the trade-off between model fit and predictive power.

https://doi.org/10.1371/journal.pgen.1012064.s001

(TIFF)

S2 Fig. The following plots are using Kim et al 2024 as the base dataset.

(A) PRS R² across clumping thresholds. The variance explained (R²) by the polygenic risk score at different GWAS p-value thresholds, stratified by linkage disequilibrium clumping parameters (r² and kb). Each panel corresponds to a clumping window size (kb), with points and lines indicating R² across p-value thresholds. (B) PRS significance versus predictive performance. Relationship between PRS model significance (PRS association p-value; log-scaled) and variance explained (R²). Colors indicate clumping window sizes (kb) and shapes indicate LD thresholds (r²), highlighting the trade-off between model fit and predictive power.

https://doi.org/10.1371/journal.pgen.1012064.s002

(TIFF)

S3 Fig. Density plots showing the polygenic risk score distribution for cases versus controls.

(A) Density plot with Nalls et al 2019 as the base dataset and (B) the density plot with Kim et al 2024.

https://doi.org/10.1371/journal.pgen.1012064.s003

(TIFF)

S4 Fig. Bars plot showing the proportion of phenotypic variance (R²) explained by the polygenic risk score (PRS) at varying SNP inclusion p-value thresholds.

Each bar represents a PRS model fit calculated at a specific threshold. The optimal threshold (defined as the point with the highest R²) is highlighted, indicating the most predictive model. (A) Results shown are based on the training dataset using summary statistics from Nalls et al., 2019 for the strongest association. (B) Results shown are based on the training dataset using summary statistics from Nalls et al., 2019 for the highest predictive performance. (C) Results shown are based on the validation dataset using summary statistics from Nalls et al., 2019 based on the highest predictive performance thresholds from the training dataset.

https://doi.org/10.1371/journal.pgen.1012064.s004

(TIFF)

S5 Fig. Bars plot showing the proportion of phenotypic variance (R²) explained by the polygenic risk score (PRS) at varying SNP inclusion p-value thresholds.

Each bar represents a PRS model fit calculated at a specific threshold. The optimal threshold (defined as the point with the highest R²) is highlighted, indicating the most predictive model. (A) Results shown are based on the training dataset using summary statistics from Kim et al., 2019 for the strongest association. (B) Results shown are based on the training dataset using summary statistics from Kim et al., 2024 for the highest predictive performance. (C) Results shown are based on the validation dataset using summary statistics from Kim et al., 2024 based on the highest predictive performance thresholds from the validation dataset.

https://doi.org/10.1371/journal.pgen.1012064.s005

(TIFF)

S6 Fig. Karyograms showing the inferred local ancestry of three individuals from the South African Parkinson’s Disease Study Collection.

This highlights the complex admixture of the cohort. AFR, African; EAS, East Asian; EUR, European; MAL, Malaysian; NAMA, Nama; POP, Population; SAS, South Asian.

https://doi.org/10.1371/journal.pgen.1012064.s006

(TIFF)

S1 Table. Positive predictive value and negative predictive value of polygenic risk scores for Parkinson’s disease at multiple top-percentile thresholds.

https://doi.org/10.1371/journal.pgen.1012064.s007

(XLSX)

S2 Table. Study participants included in the polygenic risk score analysis.

https://doi.org/10.1371/journal.pgen.1012064.s008

(XLSX)

S3 Table. Stability of PRS performance estimates across repeated for 20 random splits.

https://doi.org/10.1371/journal.pgen.1012064.s009

(XLSX)

S4 Table. Global Parkinson’s Genetics Program banner author list.

https://doi.org/10.1371/journal.pgen.1012064.s010

(XLSX)

Acknowledgments

We would like to acknowledge and thank the study participants for their contribution. For a complete list of GP2 members see https://zenodo.org/records/17753486. All figures were created using BioRender (https://www.biorender.com/). We would like to acknowledge Lim Shen-Yang, Tan Ai-Huey, and Azlina Ahmad-Annuar for their efforts in recruiting study participants from Malaysia. We thank Kate Andersh, Laurel Screven, and Kim Paquette for their work as scientific project managers for this project. We would like to acknowledge Dr. Thiago Peixoto Leal for his work in developing the scripts used in the analysis. We also acknowledge the Centre for High Performance Computing (CHPC), South Africa, for providing computational resources.

This research was supported [in part] by the Intramural Research Program of the National Institutes of Health (NIH). The contributions of the NIH author(s) were made as part of their official duties as NIH federal employees, are in compliance with agency policy requirements, and are considered Works of the United States Government. However, the findings and conclusions presented in this paper are those of the author(s) and do not necessarily reflect the views of the NIH or the U.S. Department of Health and Human Services.

References

  1. 1. Bloem BR, Okun MS, Klein C. Parkinson’s disease. Lancet. 2021;397(10291):2284–303.
  2. 2. Chen H, Ritz B. The Search for Environmental Causes of Parkinson’s Disease: Moving Forward. J Parkinsons Dis. 2018;8(s1):S9–17. pmid:30584168
  3. 3. Ascherio A, Schwarzschild MA. The epidemiology of Parkinson’s disease: risk factors and prevention. Lancet Neurol. 2016;15(12):1257–72. pmid:27751556
  4. 4. Jia F, Fellner A, Kumar KR. Monogenic Parkinson’s disease: Genotype, phenotype, pathophysiology, and genetic testing. Genes. 2022;13(3).
  5. 5. Klein C, Westenberger A. Genetics of Parkinson’s disease. Cold Spring Harb Perspect Med. 2012;2(1):a008888.
  6. 6. Reed X, Bandrés-Ciga S, Blauwendraat C, Cookson MR. The role of monogenic genes in idiopathic Parkinson’s disease. Neurobiol Dis. 2019;124:230–9. pmid:30448284
  7. 7. Nalls MA, Blauwendraat C, Vallerga CL, Heilbron K, Bandres-Ciga S, Chang D, et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 2019;18(12):1091–102. pmid:31701892
  8. 8. Step K, Ndong Sima CAA, Mata I, Bardien S. Exploring the role of underrepresented populations in polygenic risk scores for neurodegenerative disease risk prediction. Front Neurosci. 2024;18:1380860. pmid:38859922
  9. 9. Ndong Sima CAA, Step K, Swart Y, Schurz H, Uren C, Möller M. Methodologies underpinning polygenic risk scores estimation: a comprehensive overview. Hum Genet. 2024;143(11):1265–80. pmid:39425790
  10. 10. Pihlstrøm L, Morset KR, Grimstad E, Vitelli V, Toft M. A cumulative genetic risk score predicts progression in Parkinson’s disease: Genetic Risk and Progression in Parkinson’s Disease. Mov Disord. 2016;31(4):487–90.
  11. 11. Li W-W, Fan D-Y, Shen Y-Y, Zhou F-Y, Chen Y, Wang Y-R, et al. Association of the Polygenic Risk Score with the Incidence Risk of Parkinson’s Disease and Cerebrospinal Fluid α-Synuclein in a Chinese Cohort. Neurotox Res. 2019;36(3):515–22. pmid:31209785
  12. 12. Foo JN, Chew EGY, Chung SJ, Peng R, Blauwendraat C, Nalls MA, et al. Identification of Risk Loci for Parkinson Disease in Asians and Comparison of Risk Between Asians and Europeans: A Genome-Wide Association Study. JAMA Neurol. 2020;77(6):746–54. pmid:32310270
  13. 13. Ibanez L, Dube U, Saef B, Budde J, Black K, Medvedeva A, et al. Parkinson disease polygenic risk score is associated with Parkinson disease status and age at onset but not with alpha-synuclein cerebrospinal fluid levels. BMC Neurol. 2017;17(1):198. pmid:29141588
  14. 14. Loesch DP, Horimoto ARVR, Sarihan EI, Inca-Martinez M, Mason E, Cornejo-Olivas M, et al. Polygenic risk prediction and SNCA haplotype analysis in a Latino Parkinson’s disease cohort. Parkinsonism Relat Disord. 2022;102:7–15. pmid:35917738
  15. 15. Dehestani M, Liu H, Gasser T. Polygenic Risk Scores Contribute to Personalized Medicine of Parkinson’s Disease. J Pers Med. 2021;11(10):1030. pmid:34683174
  16. 16. Step K, Leal TP, Waldo E, Madula L, Swart Y, Hernández CF, et al. Genome-wide association analyses reveal susceptibility variants linked to Parkinson’s disease in the South African population using inferred global and local ancestry. medRxiv. 2025. Available from:
  17. 17. Kim JJ, Vitale D, Otani DV, Lian MM, Heilbron K, 23andMe Research Team, et al. Multi-ancestry genome-wide association meta-analysis of Parkinson’s disease. Nat Genet. 2024;56(1):27–36. pmid:38155330
  18. 18. Staerk C, Klinkhammer H, Wistuba T, Maj C, Mayr A. Generalizability of polygenic prediction models: how is the R2 defined on test data? BMC Med Genomics. 2024 May 16;17(1):132.
  19. 19. Nalls MA, McLean CY, Rick J, Eberly S, Hutten SJ, Gwinn K, et al. Diagnosis of Parkinson’s disease on the basis of clinical and genetic classification: a population-based modelling study. Lancet Neurol. 2015;14(10):1002–9. pmid:26271532
  20. 20. Saffie-Awad P, Grant SM, Makarious MB, Elsayed I, Sanyaolu AO, Crea PW, et al. Insights into ancestral diversity in Parkinson’s disease risk: a comparative assessment of polygenic risk scores. NPJ Parkinsons Dis. 2025;11(1):201. pmid:40610451
  21. 21. Wang Y, Tsuo K, Kanai M, Neale BM, Martin AR. Challenges and Opportunities for Developing More Generalizable Polygenic Risk Scores. Annu Rev Biomed Data Sci. 2022;5:293–320. pmid:35576555
  22. 22. Ge T, Chen C-Y, Ni Y, Feng Y-CA, Smoller JW. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat Commun. 2019;10(1):1776. pmid:30992449
  23. 23. Ruan Y, Lin Y-F, Feng Y-CA, Chen C-Y, Lam M, Guo Z, et al. Improving polygenic prediction in ancestrally diverse populations. Nat Genet. 2022;54(5):573–80. pmid:35513724
  24. 24. Mavaddat N, Michailidou K, Dennis J, Lush M, Fachal L, Lee A. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am J Hum Genet. 2019;104(1):21–34.
  25. 25. Tremblay J, Haloui M, Attaoua R, Tahir R, Hishmih C, Harvey F, et al. Polygenic risk scores predict diabetes complications and their response to intensive blood pressure and glucose control. Diabetologia. 2021;64(9):2012–25. pmid:34226943
  26. 26. Ayoub A, McHugh J, Hayward J, Rafi I, Qureshi N. Polygenic risk scores: improving the prediction of future disease or added complexity? Br J Gen Pract. 2022;72(721):396–8. pmid:35902257
  27. 27. Roberts MC, Khoury MJ, Mensah GA. Perspective: The Clinical Use of Polygenic Risk Scores: Race, Ethnicity, and Health Disparities. Ethn Dis. 2019;29(3):513–6. pmid:31367172
  28. 28. van Rensburg ZJ, Abrahams S, Chetty D, Step K, Acker D, Lombard CJ, et al. The South African Parkinson’s Disease Study Collection. Mov Disord. 202;37(1):230–2.
  29. 29. Hughes AJ, Daniel SE, Kilford L, Lees AJ. Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J Neurol Neurosurg Psychiatry. 1992;55(3):181–4. pmid:1564476
  30. 30. Global Parkinson’s Genetics Program. GP2: The global Parkinson’s genetics program. Mov Disord. 2021;36(4):842–51.
  31. 31. Bandres-Ciga S, Faghri F, Majounie E, Koretsky MJ, Kim J, Levine KS, et al. NeuroBooster Array: A Genome-Wide Genotyping Platform to Study Neurological Disorders Across Diverse Populations. Mov Disord. 2024;39(11):2039–48. pmid:39283294
  32. 32. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
  33. 33. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. pmid:25722852
  34. 34. Das S, Forer L, Schönherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–7. pmid:27571263
  35. 35. Leal TP, Furlan VC, Gouveia MH, Saraiva Duarte JM, Fonseca PA, Tou R, et al. NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses. Comput Struct Biotechnol J. 2022;20:1821–8.
  36. 36. Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen W-M. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26(22):2867–73. pmid:20926424
  37. 37. Swart Y, Uren C, van Helden PD, Hoal EG, Möller M. Local Ancestry Adjusted Allelic Association Analysis Robustly Captures Tuberculosis Susceptibility Loci. Front Genet. 2021;12:716558. pmid:34721521
  38. 38. Byrska-Bishop M, Evani US, Zhao X, Basile AO, Abel HJ, Regier AA, et al. High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell. 2022;185(18):3426-3440.e19. pmid:36055201
  39. 39. Shriner D, Bentley AR, Gouveia MH, Heuston EF, Doumatey AP, Chen G, et al. Universal genome-wide association studies: Powerful joint ancestry and association testing. HGG Adv. 2023;4(4):100235. pmid:37653728
  40. 40. Ragsdale AP, Weaver TD, Atkinson EG, Hoal EG, Möller M, Henn BM, et al. Publisher Correction: A weakly structured stem for human origins in Africa. Nature. 2023;620(7972):E11. pmid:37460744
  41. 41. Hilmarsson H, Kumar AS, Rastogi R, Bustamante CD, Montserrat DM, Ioannidis AG. High resolution ancestry deconvolution for next generation genomic data. bioRxiv. 2021. http://dx.doi.org/10.1101/2021.09.19.460980
  42. 42. Choi SW, Mak TS-H, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020;15(9):2759–72. pmid:32709988
  43. 43. Choi SW, O’Reilly PF. PRSice-2: Polygenic Risk Score software for biobank-scale data. Gigascience. 2019;8(7).
  44. 44. Cerezo M, Sollis E, Ji Y, Lewis E, Abid A, Bircan KO, et al. The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity. Nucleic Acids Res. 2025;53(D1):D998–1005. pmid:39530240
  45. 45. Perez G, Barber GP, Benet-Pages A, Casper J, Clawson H, Diekhans M, et al. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res. 2025;53(D1):D1243–9.
  46. 46. 1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. pmid:26432245
  47. 47. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77. pmid:21414208
  48. 48. Konuma T, Okada Y. Statistical genetics and polygenic risk score for precision medicine. Inflamm Regen. 2021;41(1):18. pmid:34140035
  49. 49. Ou Z, Pan J, Tang S, Duan D, Yu D, Nong H, et al. Global trends in the incidence, prevalence, and years lived with disability of Parkinson’s disease in 204 countries/territories from 1990 to 2019. Front Public Health. 2021;9:776847.