Figures
Abstract
Background and objective
Prediction models enable healthcare providers to perform early risk stratification. This study aimed to develop and internally validate prediction models for 5- and 10-year risks of developing diabetic retinopathy (DR) in the Iranian individuals with type 2 diabetes.
Methods
This study utilized data from individuals with diabetes involved in the Shahroud Eye Cohort Study (ShECS), a prospective cohort study in Iran. The initial phase of ShECS began in 2009, with the second and third follow-up phases occurring in 2014 and 2019, respectively. Logistic regression developed prediction models, with bootstrap validation assessing internal validity. Model performance was evaluated using the discrimination and calibration.
Results
A total of 637 individuals with diabetes (35.0% men, mean (SD) of age: 53.0 (6.3 years)) were diagnosed. The five-year cumulative incidence of DR was 25.3% (95%CI: 21.8, 29.0%), and 17.0% (95%CI: 13.3, 21.0%) based on the second and third phases, respectively, while 10-year cumulative incidence was 40.0% (95%CI: 35.8, 44.0%). Incorporating various predictors, six models were developed with three recommended prediction models. Using mean blood pressure (MBP), non-fasting blood glucose (BG), and diabetes duration, Model-1 predicts 5-year risk indicating good calibration and discrimination with a c-statistic of 0.773 after bootstrap validation. The optimal statistical threshold was a predicted probability of 0.24. Model-2 predicts a 10-year risk incorporating diabetes duration, MBP, and BG, with a good calibration and a c-statistic of 0.687 after bootstrap validation showing moderate discrimination. The optimal statistical threshold was a predicted probability of 0.32. Model-3 predicts the 5-year risk using diabetes duration, MBP, glycated hemoglobin, high-density lipoprotein, triglycerides, and fasting blood glucose, showing good calibration and a c-statistic of 0.735 after bootstrap validation, indicating good discrimination. The optimal statistical threshold was a predicted probability of 0.20.
Citation: Feˈli SN, Emamian MH, Yaseri M, Riazi-Esfahani H, Hashemi H, Fotouhi A, et al. (2025) Development and validation of prediction models for diabetic retinopathy in type 2 diabetes patients. PLoS One 20(7): e0325814. https://doi.org/10.1371/journal.pone.0325814
Editor: Mohammad Reza Mahmoodi, Kerman University of Medical Sciences Physiology Research Center, IRAN, ISLAMIC REPUBLIC OF
Received: December 12, 2024; Accepted: May 19, 2025; Published: July 10, 2025
Copyright: © 2025 Feˈli et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data used in this study are owned by a third-party organization (The Shahroud Eye Cohort Study). Data can be available upon reasonable request to the Shahroud Eye Cohort Study team. Other researchers would be able to access the data in the same manner as the authors, and the authors did not have any special access privileges that other researchers would not have. Contact information: Email: oerc@shmu.ac.ir
Funding: The present study, as a Ph.D. dissertation in epidemiology was funded by Tehran University of Medical Sciences (project number: 57226). Moreover, the Shahroud Eye Cohort Study (the study that provided the data for the present study) was supported by Noor Ophthalmology Research Center and Shahroud University of Medical Sciences (Projects numbers: 8737, and 9826, respectively). Noor Ophthalmology Research Center, and Shahroud University of Medical Sciences provided funding for data collection of Shahroud Eye Cohort Study. Tehran University of Medical Sciences provided the dissertation funding.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Diabetic retinopathy (DR) is one of the most common complications of diabetes. Based on a meta-analysis of 59 population-based studies on the individuals with diabetes, the global prevalence of diabetic retinopathy, vision-threatening DR, and clinically significant macular edema was reported to be 22.3%, 6.2%, and 4.1%, respectively [1]. An analysis of a comprehensive nationwide U.S. registry showed that out of 53,535 eyes from newly diagnosed patients with DR who had good vision, 678 eyes (1.3%) progressed to sustained blindness over an average follow-up time of approximately 1.8 years [2].
Prediction models can assist clinicians in making individualized decisions regarding patient management by combining a set of patient characteristics as predictor variables [3,4]. Applications of DR prediction models are comparable to those of other predictive models. One of these models is the prediction model by Gong et al., which was developed and validated using data from a Chinese cohort study [5], and Nugawela et al. models, which were developed using a longitudinal study from the UK and externally validated using cohorts from Wales, the UK and India [6]. The factors such as age, sex, body mass index (BMI), systolic blood pressure (SBP), diabetes duration, glycated hemoglobin (HbA1c), antidiabetic medication use, and history of retinopathy were introduced as predictors of DR or sight threatening DR. To the best of our knowledge, only model available in Iran was developed by Azizi-Soleiman et al. [7]. However, the model was based on data from a cross-sectional study, which was inappropriate for prediction. Furthermore, while it is sometimes feasible to use an already-developed predictive model, it is preferred to create a tailored one for each particular population owing to variations in racial and ethnic backgrounds, the distribution of risk factors, and the incidence rate of outcomes [8,9]. Furthermore, a prediction model established using readily available predictors can be efficiently used in primary healthcare settings.
Considering the initial stages of DR are asymptomatic [10], early identification of DR risk (probability) can justify timely diagnosis and referral for a comprehensive ophthalmological examination. This study aimed to develop and internally validate models to predict 5- and 10-year risks of developing DR among Iranian individuals with type 2 diabetes mellitus (T2DM), which can enable primary healthcare providers, and physicians to perform early risk stratification of these patients.
Materials and methods
Study design and participants
This study was conducted using the data collected over 10 years in three phases of Shahroud Eye Cohort Study (ShECS). ShECS is a population-based prospective cohort study for detecting risk factors of visual impairments and major eye conditions started in 2009 in Shahroud, Iran. The first phase of the ShECS was completed by 5,190 participants, ages 40–64, who were selected using a random cluster sampling technique. All participants of the first phase were invited to return for a follow-up examination after a period of 5 and 10 years. Therefore, phases two and three of the follow-up were carried out in 2014 and 2019, respectively. The details of the study design were reported elsewhere [11]. Data for current study were accessed on 2022/06/25. The data collection contained no personal identifiers for the study participants.
Individuals with diabetes diagnosed in the first phase of ShECS were included in the study. Those with DR in the baseline phases were not included in the development of prediction models. Therefore, participants with DR in the first and second phases of the ShECS were excluded when the models were being developed.
TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) Statement was followed for reporting this study [12]. This study, which was part of a Ph.D. dissertation in epidemiology, was approved by Institutional Review Board of Tehran University of Medical Sciences (IRB no. IR.TUMS.SPH.REC.1401.048). All three phases of ShECS were separately reviewed and approved by the Institutional Review Board of Shahroud University of Medical Sciences. Moreover, a written informed consent was obtained from each of the participants.
Measurements
At the first phase of the ShECS, a non-fasting blood glucose level of 200 mg/dL or higher and/or taking glucose-lowering drugs were considered diabetes [13]. Considering low prevalence of type 1 diabetes in adults, we considered all individuals with diabetes as type 2 [14].
Outcome.
To determine if DR was present in the first phase of the ShECS, participants were asked about their DR diagnosis status. Morovere, participants recieved two occular examinations in the first and second phase of the ShECS. During these stages of the ShECS, ophthalmologists used direct or indirect ophthalmoscopy to examine the retina. To diagnose DR, fundus imaging was also used. A Nidek AFC-230 Fundus Camera (Nidek, Chiyoda-ku, Japan) was used for fundus photography.
Predictors.
As a review of the literature is a common approach for narrowing down the list of candidate predictors [15], the most relevant candidate predictors, that could be readily available in clinical practice and did not require an ophthalmological examination, were identified based on a review of the literature. All measurements were based on the ShECS protocol.
Participants were asked about their birth date, smoking status (cigarette, pip, or hookah), and diabetes duration (time since diabetes diagnosis) via a face-to-face interview [11]. The participants’ sex was determined based on the investigator’s observations. Trained nurses measured SBP and diastolic blood pressure (DBP) using a digital blood pressure sphygmomanometer [16]. Mean blood pressure (MBP) was defined as (2 × DBP + 1 × SBP)/3. Fasting blood glucose (FBG), HbA1c, and blood lipid levels were measured after 12–14 hours of fasting [17]. Non-fasting blood glucose (BG) was assessed [18]. Body weight was measured using digital scales, and body height was measured in a standing position [19] and BMI was subsequently calculated as the ratio of weight (in kilograms) to the square of height (in meters). Waist circumference (WC) was measured at the highest part of the iliac crest using a non-stretchable tape [17]. A non-elastic tape was used to measure hip circumference (HC) in the biggest pelvic circumference region while the subject was standing (personal communication). Then, WC divided by HC was used to compute the waist-to-hip ratio (WHR). Pulse rate was recorded as number of beats per minute (bpm) using a digital sphygmomanometer (personal communication).
Statistical analyses
Descriptive statistics.
For categorical variables, frequencies and percentages were used as the descriptive measures. Quantitative variables were described using mean, and standard deviation. In cases where quantitative variables were not near a normal distribution, the median and interquartile range were reported along with the mean and standard deviation.
Development and validation of the prediction models.
Because not all the predictors were measured simultaneously during the phase one of the ShECS, three separate models were developed, as illustrated in the Fig 1. Models M1 and M3 predict 5-year risk of DR, and M2 predicts the 10-year risk. Log-binomial, logistic, and Poisson regression models with robust standard error estimates were used to achieve the optimal risk prediction model. However, only the results of the logistic regression were presented since the log-binomial regression was unable to converge and there was no simple way to evaluate the calibration of the Poisson models. The predictors included in each model are described in the Fig 1. Since SBP in M1, DBP in M2, and both blood pressures in M3 were statistically significant predictors, each of the three models was fitted separately regarding SBP-DBP and MBP to utilize all available information on the blood pressure variable. We used a complete case analysis so that participants with missing values were excluded from the model-building and model-validation processes.
Predictors considered for developing M1 included: sex, smoking, age, diabetes duration, blood pressure, body mass index, and non-fasting blood glucose. Predictors considered for developing M2 included: sex, smoking, age, diabetes duration, blood pressure, body mass index, and non-fasting blood glucose. Predictors considered for developing M3 included: sex, smoking, age, diabetes duration, blood pressure, body mass index, HbA1c, total cholesterol, high-density lipoprotein cholesterol, triglycerides, fasting blood glucose, pulse rate, abdomen circumference, hip circumference, abdominal to hip ratio. a: systolic and diastolic blood pressure were included as separate predictors. b: mean blood pressure ((2*DBP + 1*SBP)/3) was included.
Model development.
A univariable regression analysis was conducted, following the criterion of a P-value0.2, to examine unadjusted associations. However, key predictors were not excluded solely based on statistical non-significance in the univariable analysis [20]. Subsequently, a multivariable analysis was performed using a backward elimination approach, focusing on the remaining predictors. In multivariable regression, predictors and interaction terms with P-value
0.05 were retained in the final models.
We considered a generalized additive model (GAM) and corresponding plots in univariable and multivariable analyses to check the nonlinearity of associations between outcome and continuous predictors. An appropriate polynomial term was supposed to be included in the model if there was evidence of a nonlinear association; if not, the predictors were thought to be linear. To find potentially influential observations, a modified Cook’s distance plot was used. If any influential observations were found, a sensitivity analysis was carried out, in which the final model was repeated without the influencing points. To detect multicollinearity, which is a high correlation among predictors, we used the variance inflation factor (VIF). A VIF above 10 was considered as high collinearity.
Model performance measures.
- Discrimination, that is the ability of a prediction model to discriminate among the patients with and without future outcomes was quantified using Harrell’s concordance (c) statistic, which was equal to the area under the Receiver Operating Characteristic curve (AUC).
- Calibration, which specifies the agreement between the observed, and predicted outcomes of all participants, was graphically illustrated using calibration plots and statistically tested using Hosmer–Lemeshow goodness-of-fit test. A p-value of greater than 0.05 indicated a good fit. Moreover, the expected (predicted) to observed (E/O) ratio was calculated by dividing the expected to observed outcome risks.
Model validation.
- Three distinct approaches were applied to assess internal validation of final models: 1) for apparent validation, performance measures were assessed directly on the data used to drive the models, resulting in optimistic performance estimates, 2) following the approach outlined by Kuhn and Johnson [21] cross-validated performance measures were derived using a stratified 10-fold cross-validation method, 3) following the approach outlined by Steyerberg, performance measures were additionally obtained using a bootstrap validation with 200 bootstrap samples, that reflect future performance in new patients [22]. However, there was a significant amount of random error in the performance measures in the 10-fold cross-validation since there weren’t many outcomes in certain folds. Rather, more stable measures were established using bootstrap validation. Therefore, only performance measures using bootstrap validation approach were reported.
- We compared the apparent c-statistic and calibration plots with those after bootstrap validation. In the case of a considerable drop in performance measures, it was intended to recheck the models in terms of the number of included predictors in relation to the sample size.
Decision-curve analysis.
A decision curve was formed using net benefit (NB) on y-axis, and thresholds of predicted probabilities on x-axis. The NB was defined as (TP –wFP)/N, where “w” indicated a weight equal to the odds of selected threshold (Pt/1-Pt), which is considered as the harm to benefit ratio of treatment. For example, at the threshold of 10%, the false-positives is valued at one-ninth of true-positives. “Pt” is the threshold probability, “TP” was the number of true-positives, “FP” is the number of false-positives, and N was the total number of subjects [23]. To determine the model line, we plotted NBs against a broad range (0–1) of predicted probability thresholds. Additionally, two additional lines were drawn to illustrate the default strategies of “treat none” and “treat all” without the use of the prediction model. We regarded NB as zero for the “treat none” line, and we drew the “treat all” line depending on the thresholds and event rate [24,25].
Using the decision curve of each model, the interval between the intersection points of model’s line with both lines of default strategies was considered the reasonable interval of thresholds for prediction. As is usual, the statistical optimum decision threshold was determined based on Youden’s J statistic (J = model sensitivity model specificity−1). Decision curves were plotted using both apparent and bootstrap validation approaches.
Regarding the use of cluster sampling, robust standard errors and p-values were considered to correct the design effect. STATA17 (StataCorp, USA), and R4.2.3 statistical software were used to perform the statistical analyses.
Results
Characteristics of the participants
A total of 5,190 individuals, of whom 5,183 participants underwent blood glucose tests completed the first phase of the ShECS. Among them, 637 were diagnosed with diabetes, representing a 12.3% prevalence. The mean age of those with diabetes was 53.0 years (standard deviation: 6.3 years), and 35.0% and 65% of them were men and women, respectively. As a baseline, 24 (3.8%) and 148 (23.2%) participants with DR in the first and second phases of the ShECS, respectively, were not included in the analyses. There were 148 and 64 incident cases of DR during the second and third phases, respectively. The five-year cumulative incidence of DR based on the second and third phase was 25.3% (95%CI: 21.8, 29.0%), and 17.0% (95%CI: 13.3, 21.0%), respectively. Moreover, the 10-year cumulative incidence was 40.0% (95%CI: 35.8, 44.0%). The predictors measured in each phase of ShECS are described in Table 1.
Diabetic retinopathy prediction models
Based on GAM plots no nonlinear association between predictors and outcomes was observed in any of the univariable or multivariable models. Besides, after performing sensitivity analyses, no outlier observations were identified as influential. Moreover, no severe multicollinearity was observed among the predictors (the results of these assumptions have not been reported).
The unadjusted univariable associations between candidate predictors and DR are reported in Table 2. Final risk prediction models are reported in Table 3 and 4.
Model performance
Considering the values of c-statistic after bootstrap validation, the discriminative power of all prediction models was satisfactory, ranging from 0.686 to 0.776, indicating a medium to good ability to distinguish between cases and non-cases (Table 5). Calibration curves, corresponding Hosmer–Lemeshow tests, and E/O ratios were presented in S1 Appendix.
Model presentation
We suggest M1 and M2, including MBP for predicting 5- and 10-year DR risk, respectively. Moreover, M3, including MBP is recommended to predict the five-year risk. Choosing between M1 and M3 can be based on the availability of predictors in clinical practice. Given that the models’ c-statistics after bootstrap validation were roughly similar, this recommendation was based on the models’ completeness and the c-statistic, which is a crucial metric for comparing predictive models. The algorithm of recommended models has the form of:
- M1 including MBP to predict five-year risk:
ln
−5.2855
(0.1412
diabetes duration)
(0.0175
MBP)
(0.0070
BG)
- M2 with MBP to predict 10-year risk:
ln
−4.2554
(0.0876
diabetes duration)
(0.0192
MBP)
(0.0048
BG)
- M3 including MBP to predict five-year risk:
ln
8.4990
(0.1827
diabetes duration)
(−0.0389
MBP)
(−0.8821
HbA1c)
(−0.3027
HDL-C)
(−0.0214
TG)
(0.0117
FBG)
(−0.0008
(diabetes duration
FBS))
(0.0345
(HbA1c
HDL-C))
(0.0002
(TG
MBP))
Where p is the risk of DR, and the right side of the algorithm is the linear predictor. Predicted absolute risk () for a specific individual can be estimated by inserting his/her predictor values into the algorithm, then transforming back to the probability scale using the equation of
. For example, an individual with T2DM for 10 years, at a routine clinical check-up has a MBP of 100.33 mmHg, and a BG of 301 mg/dL. Using M1, the probability that she/he will develop DR in the next five years is predicted to be 50%.
Clinical usefulness
Based on the decision curve of final suggested models after bootstrap validation, the interval of threshold probabilities in which M1 would be useful ranged from 0.12 to 0.60. The maximum value of Youden statistic occurred at a cut-off point of 0.24 which was equal to 0.45. Furthermore, the interval for M2 was 0.22 to 0.43. The maximum value of the Youden statistic occurred at a cut-off point of 0.32 which was equal to 0.43. Thus, the interval for M3 was 0.12 to 0.37. The corresponding maximum value of the Youden statistic occurred at the cut point of 0.20 which was equal to 0.45. The decision curves are shown in Fig 2.
The y-axis indicates the net benefit. The x-axis indicates the thresholds of predicted probabilities. The solid line represented treat all strategy. The dotted line represented the treat none strategy. The dashed line (pred) describes the net benefits of the prediction model at various thresholds using apparent validation. The long-dashed (pred.bootc) line describes the bootstrap corrected net benefits of the prediction model at various thresholds.
Discussion
This study aimed to develop a statistical model that can predict the risk of DR in 5 and 10 yearsˈ timeframe in the individuals with T2DM. Therefore, using data from a population-based prospective cohort study, we developed and validated six regression models including various predictors. Based on the availability of predictors in clinical settings, we discovered that three models could be recommended out of all the developed models: one for predicting 10-year risk and two for predicting 5-year risk. These three models showed that the duration of diabetes, BG, FBG, HbA1c, MBP, HDL-C, and TG levels were critical predictors of DR. Moreover, the models represented favorable predictive performance measures.
Major predictors of DR
Disease course.
Previous studies established that a longer diabetes duration is a major non-modifiable risk factor for the onset of DR; a recent systematic review revealed that diabetes duration was associated with DR development with a pooled odds ratio (OR) of 1.03 (95% CI: 1.02, 1.03) [26]. Other previous epidemiological studies confirmed a relationship between long duration of diabetes and DR [27–29]. The present study discovered a statistically significant association between the length of diabetes and DR, which is consistent with previous research. The impact of chronic exposure to hyperglycemia, which causes microvascular damage, is probably the cause of this connection. However, because of the impossibility of intervention in this factor, it can only be considered a significant predictor to identify high-risk patients and instead focus on modifiable risk factors.
Blood glucose.
After the diabetes duration, hyperglycemia is reported to be the most relevant risk factor for the development of DR [30]. In a systematic review, and meta-analysis, 2-hour postprandial blood glucose, FBG, and HbA1c were reported to be associated risk factors of DR with the pooled OR of 1.94 (95% CI: 0.81, 4.65), 1.33 (95% CI: 1.12, 1.59), and 1.15 (95% CI: 1.09, 1.20), respectively [31]. The results of our study revealed that BG was a statistically significant predictor across all multivariable models where this variable was included. A similar pattern was observed in the case of FBG. These results are in agreement with those of prior research, suggesting that elevated blood glucose levels may contribute to the development of DR. Furthermore, HbA1c is a widely recognized glycemic control indicator, and its influence on the development of DR has been thoroughly examined in prior research [26,31,32]. Nevertheless, our investigation demonstrated a detrimental association. One possible explanation is that HDL-C has a protective effect, and the regression models included an interaction effect between HbA1c and HDL-C. Therefore, HbA1c may have different effects on DR depending on HDL-C levels.
Blood pressure.
Several studies suggested that high blood pressure can contribute to the development of DR. A large population-based cross-sectional study discovered an association between hypertension and DR with an adjusted OR of 1.08 (95% CI: 1.04, 1.11) [33]. Moreover, in another population-based study, poorly controlled and untreated hypertension were associated with DR with adjusted OR of 1.97 (95% CI: 1.39, 2.83), and 2.01 (95% CI: 1.34, 3.05), respectively. Systolic blood pressure was associated with DR with an adjusted OR of 1.45 (95% CI: 1.28, 1.65) [34]. The results of present study showed that SBP in M1, DBP in M2, and both variables in M3 were significantly associated with DR. These findings indicate the predictive capability of both variables. Therefore, prediction models were developed by combining these two variables as the MBP. Mean blood pressure was a statistically significant predictor of both M1 and M2.
Lipid profile.
Dyslipidemia is a common comorbidity of diabetes [35,36], and can be involved in the development of DR [37]. Research findings on the association between HDL-C and DR are conflicting. While some studies suggest that HDL-C plays a protective role against DR development [38], others indicate a positive association between HDL-C levels and DR occurrence [39]. Therefore, a study revealed an inverted U-shaped association between HDL-C and DR [40]. Our findings indicate a statistically significant protective effect of HDL-C on DR in univariable and multivariable regression models. There is a discrepancy among the studies regarding the association of TG and DR [37,41,42]. Triglycerides was identified as a detrimental risk factor in our investigation. A positive association between TC and DR has been substantiated by numerous investigations [37,43]. In contrast, the current study did not demonstrate a statistically significant relationship between TC levels in either univariable or multivariable analyses. This discrepancy may be due to the small sample size and consequent lack of statistical power to detect a significant association.
Prediction models
Model development.
After incorporating the significant predictors from the univariable regression analyses of this study, three predictor variables, including diabetes duration, MBP, and BG constructed final suggested M1 and M2. In addition to these variables, HbA1c, FBG, HDL-C, and TG levels were involved in the development of suggested M3.
Several DR risk prediction models were developed and validated in various populations. A recent population-based cohort study in China developed a model to predict the risk of vision-threatening DR including predictors of age, BMI, SBP, diabetes duration, and HbA1c, with a c-statistic of 0.72 and 0.68, respectively, for the training and testing set, and a good calibration [44]. In addition, a model was fitted using diabetes duration, HbA1c, FBG, SBP, proteinuria, BMI, and education level, as determined by cross-sectional study data from Iran. The model exhibited a c-statistic of 0.76 and a high degree of agreement between the predicted and observed risk [7]. Another risk prediction algorithm by Aspelund et al. was created to calculate the risk of sight-threatening DR in a time period ranging from 6 to 60 months based on clinical data, including type and duration of diabetes, HbA1c, SBP, presence and DR grade, from an Icelandic type 1 and 2 population with diabetes. The c-statistic of the algorithm was found to be 0.76 [45]. A validation study by Rao et al., applying the data of T2DM patients from Australia, New Zealand, and Finland, showed that the Aspelundˈs algorithm predicted two- and five-year risk of sight-threatening DR well with a c-statistic of 0.86 and 0.86, respectively [46].
Several predictors in our risk prediction models are common to existing models, and their significance in DR development has been previously documented in the literature, as mentioned earlier.
Model performance.
Discrimination analysis of all models in this study, using the c-statistic, demonstrated acceptable power to discriminate between low- and high-risk individuals, even after correcting for optimism. The c-statistic shows the probability that, of two people, one will have DR within the next 5 or 10 years and the other will not, with the patient who will acquire DR having a larger predicted risk than the patient who will not. The discriminatory ability of prediction models in our study was consistent with that in previous evidence [7,44,45]. Including key predictors may be a reason for achieving moderate-to-good discrimination power in this study, and consistency with other studies. However, we observed lower discrimination than that in the model validated by Rao et al. [46], which may be due to differences among populations and outcome spectrum. Furthermore, in all the suggested models, the mean predicted values were close to the mean observed outcomes (E/O ratio close to 1), indicating good calibration, except in higher risks (graphically tested). The latter implies a relatively poor calibration at higher risks, which may be due to the lower number of observed DR cases.
Clinical usefulness.
In addition to discrimination and calibration, we assessed the ability of models to make better clinical decisions than settings without using the model. For M1, no difference was observed between the “treat all” and “treat none” strategies and the model line at predicted risk levels below 0.12 and above 0.6, with a proposed cut-off point of 0.24. Using this model in clinical settings, individuals with a predicted risk over 0.24 should be categorized as positive for developing DR in the next five years and need care, while other patients will be classed as negative. This cut-off point was thought of as a decision threshold. Moreover, due to M2, the appropriate threshold interval was found to be 0.22 to 0.43 with a proper cut-off point of 0.32 to classify patients based on 10-year predicted risk. Decision curve of M3 also showed that in the predicted risk of less than 0.12, there is no difference between “treat all” approach and the prediction model. Hence, in the predicted risk higher than 0.37, there is no difference between “treat none” and the model line. The most appropriate decision threshold for M3 was 0.20. Previously, a clinically useful predicted risk threshold interval of 21–51% by Mo et al. [47], 2–85% by Wang et al. [48], and 11–95% by Liu et al. [49] were proposed for Chinese population. However, to our knowledge, such an interval has not been reported in the Iranian population. We suggested the most suitable cut-off point based on statistical considerations to categorize people as future cases or non-cases of DR, but healthcare providers are free to select any other reasonable classification cutoff within the clinically useful interval based on the patient’s condition or the importance of false positives or false negatives.
Strengths and limitations of the study
The strengths of this study include the development of predictive models using data from a prospective cohort study with 10-year follow up and a representative sample from the general population. Thus, easily obtainable predictors were considered to develop the models, which could facilitate their use in clinical settings. Furthermore, as far as we are aware, this is the first research to evaluate the clinical usefulness of a DR risk prediction model in the Iranian community in order to establish a cutoff point for categorizing people as positive or negative.
The study limitation was the diagnosis of diabetes based on BG. Another limitation was the unavailability of FBG, HbA1c, and lipid profile in the first phase of ShECS. Moreover, Determining the duration of diabetes based on the time since diagnosis may have been underestimated, because diabetes may have remained undiagnosed for some time. However, considering that most individuals are checked up periodically and the prominent symptoms of diabetes that lead people to refer, it is expected that the effect of this underestimation on the study results is small and negligible.
Potential impact on clinical practice
Early identification of DR is crucial for effective treatment and preservation of vision [50]. Clinicians may use our prediction algorithms to identify high-risk individuals who might need extra care or intervention by using easily accessible clinical indicators. Consequently, preventive strategies can be enhanced by focusing the modifiable risk factors. Furthermore, determining the interval of risk thresholds allows healthcare providers and clinicians to select the optimal cutoff point for classifying individuals as potential future positive or negative cases within a narrowed, reasonable span of predicted probabilities.
Conclusions
Three prediction models for DR risk were introduced using logistic regression analysis: two for 5-year risk and one for 10-year risk. Among five-year risk prediction models, one incorporated diabetes duration, MBP, and BG, while another included these factors, along with FBG, HbA1c, HDL-C, and TG measurements. Moreover, 10-year risk prediction model included diabetes duration, MBP, and BG. The models demonstrated sufficient predictive performance in terms of discrimination and calibration after internal validation. Through decision curves, the clinically useful interval of the predicted risk threshold was identified based on NB for each final model and a single cutoff point was proposed from a statistical perspective.
Supporting information
S1 Appendix. Calibration of the prediction models.
The calibration plots indicate the mean of the predicted risks versus observed outcome proportions in deciles of the predicted risks. Both apparent and optimism-corrected calibration were reported. The 45° diagonal line is the line of identity that represents perfect predictions. Circles show predicted risk deciles. The bold line indicate the Lowess smoother line of agreement between observed and expected (predicted) risks.
https://doi.org/10.1371/journal.pone.0325814.s001
(PDF)
References
- 1. Teo ZL, Tham YC, Yu M, Chee ML, Rim TH, Cheung N. Global prevalence of diabetic retinopathy and projection of burden through 2045: systematic review and meta-analysis. Ophthalmology. 2021;128(11):1580–91.
- 2. Wykoff CC, Khurana RN, Nguyen QD, Kelly SP, Lum F, Hall R, et al. Risk of blindness among patients with diabetes and newly diagnosed diabetic retinopathy. Diabetes Care. 2021;44(3):748–56. pmid:33472864
- 3. Moons KG, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how?. BMJ. 2009;338(b375):1317–20.
- 4. Steyerberg EW, Moons KG, van der Windt DA, Hayden JA, Perel P, Schroter S. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Medicine. 2013;10(2):e1001381.
- 5. Gong D, Fang L, Cai Y, Chong I, Guo J, Yan Z, et al. Development and evaluation of a risk prediction model for diabetes mellitus type 2 patients with vision-threatening diabetic retinopathy. Front Endocrinol (Lausanne). 2023;14:1244601. pmid:37693352
- 6. Nugawela MD, Gurudas S, Prevost AT, Mathur R, Robson J, Sathish T, et al. Development and validation of predictive risk models for sight threatening diabetic retinopathy in patients with type 2 diabetes to be applied as triage tools in resource limited settings. EClinicalMedicine. 2022;51:101578. pmid:35898318
- 7. Azizi-Soleiman F, Heidari-Beni M, Ambler G, Omar R, Amini M, Hosseini S-M. Iranian risk model as a predictive tool for retinopathy in patients with type 2 diabetes. Can J Diabetes. 2015;39(5):358–63. pmid:25837808
- 8. Moons KG, Altman DG, Vergouwe Y, Royston P. Prognosis and prognostic research: application and impact of prognostic models in clinical practice. BMJ. 2009;338(b606):1487–90.
- 9. Sullivan LM, Massaro JM, D’Agostino RB. Presentation of multivariate data for clinical use: The Framingham Study risk score functions. Stat Med. 2004;23(10):1631–60.
- 10. Shah CA. Diabetic retinopathy: A comprehensive review. Indian J Med Sci. 2008;62(12):500–19. pmid:19265246
- 11. Fotouhi A, Hashemi H, Shariati M, Emamian MH, Yazdani K, Jafarzadehpur E, et al. Cohort profile: Shahroud Eye Cohort Study. Int J Epidemiol. 2013;42(5):1300–8. pmid:23081880
- 12. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. pmid:25569120
- 13. Association AD. Standards of medical care in diabetes--2013. Diabetes Care. 2013;36(Suppl 1):S11-66.
- 14. Xu G, Liu B, Sun Y, Du Y, Snetselaar LG, Hu FB, et al. Prevalence of diagnosed type 1 and type 2 diabetes among US adults in 2016 and 2017: population based study. BMJ. 2018;362:k1497. pmid:30181166
- 15.
Steyerberg EW. Clinical Prediction Models. 2nd ed. Switzerland: Springer; 2019: 192.
- 16. Khosravi A, Emamian MH, Shariati M, Hashemi H, Fotouhi A. The prevalence of pre-hypertension and hypertension in an Iranian urban population. High Blood Press Cardiovasc Prev. 2014;21(2):127–35. pmid:24272061
- 17. Ebrahimi H, Emamian MH, Hashemi H, Fotouhi A. Dyslipidemia and its risk factors among urban middle-aged Iranians: A population-based study. Diabetes Metab Syndr. 2016;10(3):149–56. pmid:27033172
- 18. Ebrahimi H, Emamian MH, Shariati M, Hashemi H, Fotouhi A. Diabetes mellitus and its risk factors among a middle-aged population of Iran, a population-based study. Int J Diabetes Dev Ctries. 2016;36(2):189–96.
- 19. Khabazkhoob M, Emamian MH, Hashemi H, Shariati M, Fotouhi A. Prevalence of overweight and obesity in the middle-age population: a priority for the health system. Iran J Public Health. 2017;46(6):827–34. pmid:28828326
- 20. Cowley LE, Farewell DM, Maguire S, Kemp AM. Methodological standards for the development and evaluation of clinical prediction rules: a review of the literature. Diagn Progn Res. 2019;3(1):1–23.
- 21.
Max Kuhn KJ. Applied Predictive Modeling. 3rd ed: Springer; 2016:69–71.
- 22.
Steyerberg EW. Clinical Prediction Models. 2nd ed. Switzerland: Springer; 2019:106–8.
- 23. Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decision Making: An Int J Soc Med Decision Making. 2006;26(6):565–74.
- 24. Vickers AJ, Cronin AM, Elkin EB, Gonen M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. 2008;8(53):1–17.
- 25. Van Calster B, Wynants L, Verbeek JFM, Verbakel JY, Christodoulou E, Vickers AJ, et al. Reporting and interpreting decision curve analysis: a guide for investigators. Eur Urol. 2018;74(6):796–804. pmid:30241973
- 26. Xuan J, Wang L, Fan L, Ji S. Systematic review and meta-analysis of the related factors for diabetic retinopathy. Ann Palliat Med. 2022;11(7):2368–81. pmid:35927772
- 27. Ghaem H, Daneshi N, Riahi S, Dianatinasab M. The prevalence and risk factors for diabetic retinopathy in Shiraz, Southern Iran. Diabetes Metab J. 2018;42(6):538–43. pmid:30113147
- 28. Heydari B, Yaghoubi G, Yaghoubi MA, Miri MR. Prevalence and risk factors for diabetic retinopathy: an Iranian eye study. Eur J Ophthalmol. 2012;22(3):393–7. pmid:21928255
- 29. Javadi MA, Katibeh M, Rafati N, Dehghan MH, Zayeri F, Yaseri M, et al. Prevalence of diabetic retinopathy in Tehran province: a population-based study. BMC Ophthalmol. 2009;9:12. pmid:19835608
- 30. Solomon SD, Chew E, Duh EJ, Sobrin L, Sun JK, VanderBeek BL, et al. Diabetic retinopathy: a position statement by the American Diabetes Association. Diabetes Care. 2017;40(3):412–8. pmid:28223445
- 31. Song P, Yu J, Chan KY, Theodoratou E, Rudan I. Prevalence, risk factors and burden of diabetic retinopathy in China: a systematic review and meta-analysis. J Glob Health. 2018;8(1):010803. pmid:29899983
- 32. Shiferaw WS, Akalu TY, Desta M, Kassie AM, Petrucka PM, Assefa HK, et al. Glycated hemoglobin A1C level and the risk of diabetic retinopathy in Africa: A systematic review and meta-analysis. Diabetes Metab Syndr. 2020;14(6):1941–9. pmid:33039936
- 33. Zhang M, Wu J, Wang Y, Wu J, Hu W, Jia H, et al. Associations between blood pressure levels and diabetic retinopathy in patients with diabetes mellitus: A population-based study. Heliyon. 2023;9(6):e16830. pmid:37484372
- 34. Liu L, Quang ND, Banu R, Kumar H, Tham Y-C, Cheng C-Y, et al. Hypertension, blood pressure control and diabetic retinopathy in a large population-based study. PLoS One. 2020;15(3):e0229665. pmid:32134944
- 35. Iglay K, Hannachi H, Joseph Howie P, Xu J, Li X, Engel SS, et al. Prevalence and co-prevalence of comorbidities among patients with type 2 diabetes mellitus. Curr Med Res Opin. 2016;32(7):1243–52. pmid:26986190
- 36. Lin P-J, Kent DM, Winn A, Cohen JT, Neumann PJ. Multiple chronic conditions in type 2 diabetes mellitus: prevalence and consequences. Am J Manag Care. 2015;21(1):e23–34. pmid:25880265
- 37. Li Z, Yuan Y, Qi Q, Wang Q, Feng L. Relationship between dyslipidemia and diabetic retinopathy in patients with type 2 diabetes mellitus: a systematic review and meta-analysis. Syst Rev. 2023;12(1):148. pmid:37620980
- 38. Tomić M, Vrabec R, Bulum T, Ljubić S. HDL cholesterol is a protective predictor in the development and progression of retinopathy in type 1 diabetes: A 15-year follow-up study. Diabetes Res Clin Pract. 2022;186:109814. pmid:35248652
- 39. Sasso FC, Pafundi PC, Gelso A, Bono V, Costagliola C, Marfella R, et al. High HDL cholesterol: A risk factor for diabetic retinopathy? findings from NO BLIND study. Diabetes Res Clin Pract. 2019;150:236–44. pmid:30904748
- 40. Xu W, Xu X, Zhang M, Sun C. Association between HDL cholesterol with diabetic retinopathy in diabetic patients: a cross-sectional retrospective study. BMC Endocr Disord. 2024;24(1):65. pmid:38730329
- 41. Zhang G, Chen H, Chen W, Zhang M. Prevalence and risk factors for diabetic retinopathy in China: a multi-hospital-based cross-sectional study. Br J Ophthalmol. 2017;101(12):1591–5. pmid:28855195
- 42. Jeong H, Maatouk CM, Russell MW, Singh RP. Associations between lipid abnormalities and diabetic retinopathy across a large United States national database. Eye (Lond). 2024;38(10):1870–5. pmid:38521836
- 43. Yin L, Zhang D, Ren Q, Su X, Sun Z. Prevalence and risk factors of diabetic retinopathy in diabetic patients: A community based cross-sectional study. Medicine (Baltimore). 2020;99(9):e19236. pmid:32118727
- 44. Gong D, Fang L, Cai Y, Chong I, Guo J, Yan Z, et al. Development and evaluation of a risk prediction model for diabetes mellitus type 2 patients with vision-threatening diabetic retinopathy. Front Endocrinol (Lausanne). 2023;14:1244601. pmid:37693352
- 45. Aspelund T, Thornórisdóttir O, Olafsdottir E, Gudmundsdottir A, Einarsdóttir AB, Mehlsen J, et al. Individual risk assessment and information technology to optimise screening frequency for diabetic retinopathy. Diabetologia. 2011;54(10):2525–32. pmid:21792613
- 46. Rao BN, Quinn N, Januszewski AS, Peto T, Brazionis L, Aryal N, et al. Retinopathy risk calculators in the prediction of sight-threatening diabetic retinopathy in type 2 diabetes: A FIELD substudy. Diabetes Res Clin Pract. 2022;186:109835. pmid:35314259
- 47. Mo R, Shi R, Hu Y, Hu F. Nomogram-based prediction of the risk of diabetic retinopathy: a retrospective study. J Diabetes Res. 2020;2020:7261047. pmid:32587869
- 48. Wang Q, Zeng N, Tang H, Yang X, Yao Q, Zhang L. Diabetic retinopathy risk prediction in patients with type 2 diabetes mellitus using a nomogram model. Front Endocrinol. 2022;13:993423.
- 49. Liu Y, Zhou Z, Wang Z, Yang H, Zhang F, Wang Q. Construction and clinical validation of a nomogram-based predictive model for diabetic retinopathy in type 2 diabetes. Am J Transl Res. 2023;15(10):6083–94. pmid:37969206
- 50. Upadhyay T, Prasad R, Mathurkar S. A narrative review of the advances in screening methods for diabetic retinopathy: enhancing early detection and vision preservation. Cureus. 2024;16(2):e53586. pmid:38455792