Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Developing a novel evaluation model to assess the performance of base hospitals in China National Stroke Screening Surveys

  • Xuedong Liu,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft, Writing – review & editing

    Affiliations Department of Medical Administration, The First People’s Hospital of Neijiang, Shizhong, Neijiang, People's Republic of China, School of Public Health, Chongqing Medical University, Shapingba, Chongqing, People's Republic of China

  • Jian Cao,

    Roles Supervision, Writing – original draft

    Affiliation Department of Medical Administration, The First People’s Hospital of Neijiang, Shizhong, Neijiang, People's Republic of China

  • Ou Jiang

    Roles Supervision, Validation, Writing – review & editing

    professorjiangou@163.com

    Affiliation Department of Medical Administration, The First People’s Hospital of Neijiang, Shizhong, Neijiang, People's Republic of China

Abstract

Background

The China National Stroke Screening Surveys (CNSSS), a decade-long national public health initiative, aims to reduce stroke morbidity and mortality through early detection and intervention. This study develops a novel evaluation model to systematically assess the performance of base hospitals (BHS) participating in the CNSSS program in Sichuan Province, China.

Methods

Sixteen BHS in Sichuan Province participated in the CNSSS program in 2024. We utilized eleven performance indicators to develop the evaluation model. Subjective weights were derived from scores assigned to indicators, while objective weights were calculated using the Entropy Weight Method (EWM). A Multiplicative Synthesis with Normalization (MSN) method was adopted to generate combined weights. Based on the subjective, objective, and combined weights, we generated weighted data matrices, determined the corresponding BHS rankings separately, and compared BHS performance in CNSSS implementation across these three ranking systems.

Results

Among the evaluation indicators, Task Completion Rate received the highest subjective weight (0.3125), whereas Intervention Rate dominated both objective (0.1594) and combined (0.2303) weights. Notable weight changes were observed: Task Completion Rate exhibited the largest reduction (−92.45%) from subjective to objective weights, followed by Age Deviation Degree (−76.11%), Follow-up Completion Rate (−73.31%), and Hypertension Awareness Rate (−3.31%). Conversely, Diabetes Detection Rate displayed the most significant increase (+313.65%), followed by Dyslipidemia Detection Rate (+249.44%), Hypertension Detection Rate (+227.79%), Stroke High-Risk Detection Rate (+119.52%), Stroke High-Risk Intervention Rate (+107.90%), Intervention Rate (+48.77%), and Risk Factor Control Rate (+42.52%). BHS A and B ranked top 3 across all weighting methodologies. BHS A ranked first under original, subjective and combined weights, while BHS D led in the objective ranking. Compared to the original methodology, the combined weighting methodology has the highest discrimination degree (0.1166).

Conclusions

Weighting methodologies significantly influence BHS performance evaluations. Subjective approaches emphasize expert expertise, whereas objective methods prioritize data variability. Compared to single weighting method, combined weighting effectively balances discrepancies between expert subjective priorities and data-driven objectivity, thereby addressing limitations of single-method designs. For the CNSSS program, our model underscores the need to shift quality focus from high-risk screening to targeted management, including timely post-screening interventions and effective risk factor control. These targeted interventions are expected to significantly reduce stroke incidence, recurrence, and mortality among screened populations, while enhancing the overall quality of the CNSSS program in the region.

Introduction

According to the 2021 Global Burden of Disease (GBD) study, stroke remains the second leading cause of death and the third leading cause of death and disability combined among non-communicable diseases (NCDS) globally [1]. Between 1990 and 2021, there was a marked increase in the number of people affected by stroke worldwide. Specifically, stroke incidence rose by 70% (95% Uncertainty Interval [UI]: 66–75), mortality by 44% (95% UI: 32–56), and disability-adjusted life years (DALYS) by 32% (95% UI: 22–43) [2]. The majority of the global stroke burden is concentrated in low-income and lower-middle-income countries (LMICS). It is estimated that the global cost of stroke exceeds 890 billion USD annually. Furthermore, stroke mortality is projected to increase by 50% from 2020 to 2050, with DALYS rising from 144.8 million (95% UI: 133.9–156.9 million) in 2020 to 189.3 million (95% UI: 161.8–224.9 million) in 2050 [3].

In China, the most recent “Brief Report on Stroke Prevention and Treatment in China, 2021” revealed that stroke ranked as the leading cause of death and disability among the Chinese population [4]. The epidemiology of stroke in China exhibits several distinct characteristics: 1) Ischemic stroke incidence has shown a gradual upward trend, increasing from 0.129% in 2010 to 0.145% in 2019, while hemorrhagic stroke incidence has shown a gradual downward trend, decreasing from 0.061% in 2010 to 0.045% in 2019; 2) The prevalence of ischemic stroke has risen from 1.100% in 2010 to 1.256% in 2019, whereas the prevalence of hemorrhagic stroke has declined from 0.232% in 2010 to 0.215% in 2019; 3) The crude mortality rate of stroke among rural residents has shown an upward trend with fluctuations between 2010 and 2019 and has consistently been higher than that of urban residents during the same period; 4) The DALYS for ischemic stroke have declined from 1.209% in 2010 to 1.148% in 2019, while those for hemorrhagic stroke have shown a notable downward trend, decreasing from 1.167% in 2010 to 1.142% in 2019; 5) The average inpatient medical expenditures per ischemic and hemorrhagic stroke patient increased by 37% and 82%, respectively, from 2010 to 2019 [4]. These findings underscore that stroke has become a major public health challenge in China.

As the largest developing country globally, China accounts for one-fifth of the world’s population. The burden of stroke in China is intensifying due to population aging and inadequate control of stroke-related risk factors such as hypertension, diabetes, and dyslipidemia [5]. In response to these urgent challenges, the Chinese government has prioritized stroke prevention and treatment within the Healthy China Initiative and the Healthy China 2030 campaign [6]. In 2011, the China Stroke Prevention Project Committee (CSPPC) was established by the then Ministry of Health (CMH), with the goal of reducing stroke incidence by less than 5% and decreasing mortality from cardiovascular and cerebrovascular diseases by 10% by 2020 [6]. To achieve these objectives, the CSPPC has implemented several measures, including formulating stroke prevention and treatment guidelines, accrediting a number of demonstration stroke centers, higher-level stroke centers, stroke prevention and treatment centers, and stroke map hospitals, as well as organizing national stroke education and training for healthcare professionals [6]. To further enhance the concept of “prevention and treatment coordination”, the CSPPC launched the China National Stroke Screening Survey (CNSSS) in 2011 [4,6]. The CNSSS is an ongoing community-based screening program funded by the central government, targeting residents aged 40 and above across all provinces in mainland China [7].

Hospitals have been a key force in implementing the CNSSS program. By 2021, the CSPPC had designated 327 tertiary hospitals as stoke screening and prevention base hospitals (BHS) [4]. The responsibilities of BHS include: 1) Pre-hospital promotion of an integrated prevention and treatment model, focusing on screening high-risk stroke populations and providing coordinated emergency care for acute stroke patients; 2) In-hospital implementation of high-risk stroke screening and multidisciplinary joint diagnosis and treatment; and 3) Post-hospital follow-up and intervention for patients [4]. Moreover, BHS are tasked with training and guiding healthcare professionals from community health service centers and township health centers to conduct high-risk stroke population screening, follow-up, and risk factor intervention. Under the supervision and guidance of BHS, the CNSSS completed 1.256 million stroke screening and prevention activities, identified 268,000 high-risk stroke individuals, and implemented 662,000 follow-up and intervention in 2020.

The quality of BHS work directly impacts the effectiveness of the CNSSS program. High-quality BHS work ensures accurate identification, timely follow-up, and appropriate intervention for high-risk stroke populations. To evaluate BHS performance, the Office of the CSPPC constructed an evaluation matrix consisting of 11 indicators [8], aiming to improve the quality of stroke screening and prevention among residents aged 40 and older. However, the existing methodology of calculating the total score for each BHS-by simply aggregating the scores of individual indicators (assigning equal weights to each)-may, to some extent, introduce biases and lead to distorted evaluation results. It is widely recognized that incorporating weights is crucial for obtaining scientifically robust and accurate evaluation results [9]. Our literature review revealed that no evaluation model has been developed in China to assess the performance of BHS in conducting the CNSSS program.

To address this gap, we developed a novel combined-weighted evaluation model, integrating subjective weights from expert judgment and the objective weights from the Entropy Method, to systematically evaluate performance disparities among BHS in conducting the CNSSS program. Firstly, subjective weights of the indicators were determined by the scores given by experts from the CSPPC; secondly, the Entropy Method was utilized to ascertain the objective weights of the indicators; thirdly, the multiplicative synthesis with normalization (MSN) method was applied to calculate the combined weights of the indicators; lastly, the performance of each BHS participating in the CNSSS program was re-ranked based on the combined weighted scores (CWS). The findings of this study may: 1) Fill the literature gap regarding the theoretical implications of weight determination for evaluating BHS performance during CNSSS implementation; 2) Offer practical insights to enhance the quality of the CNSSS program; 3) Provide useful experience in conducting performance evaluation of similar programs.

Methods and materials

Study design

This study was a retrospective cross-sectional study conducted at the First People’s Hospital of Neijiang, Sichuan Province, China. The study did not involve human participants, nor did it use human tissue samples or human biomedical information (e.g., name, ID number, admission number, diagnostic results, examination data). Therefore, ethical approval were waived by the Ethics Committee of the First People’s Hospital of Neijiang.

Data sources

In 2024, sixteen BHS across Sichuan Province, participated in the CNSSS program. In March, 2025, the Office of the CSPPC issued the 2024 annual quality control reports on BHS participating in the CNSSS program. The data for this study were derived from these reports, and all data were complete, valid, and endorsed by the Office of the CSPPC. All BHS were tertiary A general hospitals (the highest level in China’s hospital ranking system), distributed across 13 municipalities in Sichuan Province. Specifically, Chengdu (hospitals E and I), Nanchong (hospitals A and B), and Yibin (hospitals L and O) each occupied two BHS, while Dazhou (hospital N), Deyang (hospital J), Leshan (hospital F), Luzhou (hospital G), Mianyang (hospital D), Neijiang (hospital K), Panzhihua (hospital P), Suining (hospital H), Yaan (hospital M), and Zigong (hospital C) each had one BHS. The geographical distribution of these BHS is shown in Fig 1.

thumbnail
Fig 1. The geographical distribution of BHS in Sichuan Province.

BHS: base hospitals.

https://doi.org/10.1371/journal.pone.0336499.g001

Indicators

The Office of the CSPPC proposed eleven evaluation indicators to assess the annual performance of BHS. Specifically, Task Completion Rate, Risk Factor Control Rate, Hypertension Awareness Rate, Stroke High-Risk Intervention Rate, Intervention Rate, and Follow-up Completion Rate were categorized as positive indicators, whereas Stroke High-Risk Detection Rate, Hypertension Detection Rate, Diabetes Detection Rate, Dyslipidemia Detection Rate, and Age Deviation Degree were defined as moderate indicators. The detailed evaluation matrix for BHS performance assessment is presented in Table 1.

thumbnail
Table 1. The evaluation matrix for BHS performance assessment.

https://doi.org/10.1371/journal.pone.0336499.t001

Procedures for the construction of the evaluation model

Data processing.

Data were processed using Excel 2024. All calculations were completed using functions built in Excel.

Determining the weights of the evaluation indicators.

Weights are widely recognized as a crucial element in multi-criteria decision-making process due to its significant influence on the outcomes of comprehensive evaluations [10,11]. The selection of appropriate weighting methodologies directly affects the accuracy and reliability of evaluation results [12]. Currently, weighting methodologies are categorized into three types: subjective, objective, and combined (integrated) [10]. To ensure the robustness and trustworthiness of evaluation results, it is recommended to employ multiple methodologies when determining weights [13]. Given these considerations, our study adopted a combined weighting approach to determine the weights of the evaluation indicators.

Subjective weights.

CSPPC experts evaluated the relative importance of each evaluation indicator and assigned corresponding scores. The subjective weights (WS) for the evaluation indicators were derived from these scores using Equation (1).

(1)

where m denotes the number of indicators and corresponds to the score assigned to the j-th indicator.

Objective weights.

Various methodologies can be applied to calculate the objective weights for evaluation indicators, such as principal component analysis (PCA) [14], Entropy Weight Method (EWM) [15], and coefficient of variation (CV) [16]. In this study, we adopted the EWM to determine the objective weights for the evaluation indicators. Detailed procedures were as follows:

  1. 1. The original data matrix was standardized using Equations (2)–(4) [17], where i = 1,2,…,n; j = 1,2,…,m; n represents the number of BHS; m represents the number of evaluation indicators.

Positive indicators:

(2)

Negative indicators:

(3)

Moderate indicators:

(4)
  1. 2. The standardized data matrix was added 0.001 to eliminate the influence of zero values.
(5)
  1. 3. The value of was calculated.
(6)
  1. 4. The value of was calculated.
(7)
  1. 5. The objective weights were calculated.
(8)

Combined weights.

After obtaining the subjective and objective weights of the evaluation indicators via the aforementioned approaches, we then integrated these weights appropriately to obtain the combined weights for the evaluation indicators. Commonly applied weight combination methodologies include least squares linear fusion (LSLF), MSN, and multi-weight fusion based on maximum membership degree (MWFMMD) [18]. In our study, we assumed equal importance of subjective and objective weights [18], aimed to enlarge the disparities in evaluation indicators to enhance differentiation between evaluation results, and ensured interpretation and horizontal comparability of results through normalization [19]. Thus, we utilized the MSN methodology to calculate the combined weights for the evaluation indicators. The main calculation function was as follows:

(9)

where denotes the combined weight of the j-th indicator; =1, 2,...L, L represents the number of weighting methodologies; represents the product of weights of the j-th indicator using L weighting methodologies; and represents the sum of such products across all indicators.

Calculating the total weighted scores (TWS) and rankings for each BHS.

The original data matrix was recalculated according to the subjective, objective, and combined weights of the evaluation indicators, respectively. Subsequently, the performance of each BHS conducting the CNSSS program was re-ranked based on the TWS values. The calculation functions were as follows:

(10)(11)

where j = 1,2,…,m, m represents the number of evaluation indicators; represents the weighted score of the j-th indicator; and TWSS, TWSO, and TWSC represent the total weighted scores for each BHS, respectively.

Comparison of different evaluation methodologies.

Setting the original scores as the baseline, three weighting methodologies were compared using two indicators (head-tail consistency rate and discrimination degree) proposed by professor Yu LP, et al [11,20]. The head-tail consistency rate measures the proportion of top 20% and bottom 20% BHS consistently identified by different evaluation methodologies. Its value ranges between 0 and 1, with a higher value indicating stronger consistency across methodologies. The discrimination degree quantifies the ability of an evaluation methodology to magnify performance differences between BHS. A higher value means the methodology can more clearly distinguish “superior” from “inferior” BHS. The calculation functions were as follows:

Head-tail consistency rate:

(12)

where x represents the number of BHS consistently ranked in the top 20%; y denotes the number of BHS consistently ranked in the bottom 20%; n represents the total number of BHS.

Discrimination degree:

(13)

where n represents the number of BHS; 1 ≤ i ≤ n; represents the original score, is the largest score and is the smallest score; represents the standardized value corresponding to .

Results

Subjective weights of evaluation indicators

The highest subjective weight was assigned to Task Completion Rate, with a value of 0.3125. Four evaluation indicators, including Risk Factor Control Rate, Intervention Rate, Follow-up Completion Rate, and Age Deviation Degree, received the second-highest and equal subjective weights (0.1071), reflecting that experts perceived them as equally important. Followed by Stroke High-Risk Intervention Rate (0.0714), Hypertension Awareness Rate, and Stroke High-Risk Detection Rate (0.0536). Three evaluation indicators: Hypertension Detection Rate, Diabetes Detection Rate, and Dyslipidemia Detection Rate were allocated the smallest and equal weights (0.0268). Details are shown in Table 2.

thumbnail
Table 2. The subjective weights of the evaluation indicators.

https://doi.org/10.1371/journal.pone.0336499.t002

Objective weights of evaluation indicators

Contrasts to the subjective weights, the Intervention Rate received the highest objective weight, with a value of 0.1594. Followed by Risk Factor Control Rate (0.1527), Stroke High-Risk Intervention Rate (0.1485), Stroke High-Risk Detection Rate (0.1176), Diabetes Detection Rate (0.1108), Dyslipidemia Detection Rate (0.0936), Hypertension Detection Rate (0.0878), Hypertension Awareness Rate (0.0518), Follow-up Completion Rate (0.0286), and Age Deviation Degree (0.0256). Notably, the Task Completion Rate obtained the smallest weight, with a value of 0.0236. Details are displayed in Table 3.

thumbnail
Table 3. The objective weights of the evaluation indicators.

https://doi.org/10.1371/journal.pone.0336499.t003

Combined weights of evaluation indicators

Among the eleven evaluation indicators, the Intervention Rate obtained the highest combined weight (0.2303), followed by the Risk Factor Control Rate (0.2207), Stroke High-risk Intervention Rate (0.1430), and Task Completion Rate (0.0995). Six indicators: Follow-up Completion Rate, Diabetes Detection Rate, Hypertension Awareness Rate, Age Deviation Degree, Dyslipidemia Detection Rate, and Hypertension Detection Rate exhibited relatively small and stable combined weights, with values of 0.0413, 0.0401, 0.0375, 0.0370, 0.0339, and 0.0318, respectively. Details are presented in Table 4.

thumbnail
Table 4. The combined weights of the evaluation indicators.

https://doi.org/10.1371/journal.pone.0336499.t004

Comparison of subjective, objective, and combined weights of evaluation indicators

With subjective weights as the baseline, four indicators: Task Completion Rate, Follow-up Completion Rate, Age Deviation Degree, and Hypertension Awareness Rate exhibited declining objective weights. Among these, Task Completion Rate displayed the most significant reduction (92.45%; 0.3125 vs. 0.0236), whereas Hypertension Awareness Rate remained relatively stable, with only a 3.36% decrease (0.0536 vs. 0.0518). Conversely, seven indicators: Risk Factor Control Rate, Intervention Rate, Stroke High-Risk Intervention Rate, Stroke High-Risk Detection Rate, Hypertension Detection Rate, Diabetes Detection Rate, and Dyslipidemia Detection Rate demonstrated increasing trends. Notably, Diabetes Detection Rate, Dyslipidemia Detection Rate, Hypertension Detection Rate, and Stroke High-Risk Detection Rate experienced the largest proportional increases: 313.43% (0.0268 vs. 0.1108), 249.25% (0.0268 vs. 0.0936), 227.61% (0.0268 vs. 0.0878), and 119.40% (0.0536 vs. 0.1176), respectively. Detailed results are presented in Fig 2.

thumbnail
Fig 2. Comparison of subjective, objective, and combined weights of evaluation indicators.

(A) The results of subjective weights. (B) The results of objective weights. (C) The results of combined weights. X1: Task Completion Rate; X2: Risk Factor Control Rate; X3: Stroke High-Risk Detection Rate; X4: Hypertension Awareness Rate; X5: Hypertension Detection Rate; X6: Diabetes Detection Rate; X7: Dyslipidemia Detection Rate; X8: Stroke High-Risk Intervention Rate; X9: Intervention Rate; X10: Follow-up Completion Rate; X11: Age Deviation Degree.

https://doi.org/10.1371/journal.pone.0336499.g002

Scores and rankings of BHS under different evaluation methodologies

The rankings of BHS were determined based on the respective scores corresponding to each BHS. The top three BHS under the original methodology were BHS A,B, and C, with original scores (99.0000 vs. 96.2500 vs. 91.0000). However, the order under the subjective, objective, and combined methodologies were different, with subjective: A, B, G (16.3824 vs. 15.9006 vs. 15.4345); objective: D, B, A (7.6998 vs. 7.6021 vs. 7.3770); combined: A, B, D (11.4458 vs. 11.2582 vs. 11.0085). Similarly, the bottom three BHS varied by methodology, with original: N, O, P (62.8000 vs. 56.8000 vs. 56.0000); subjective: O, P, J (12.6569 vs. 12.4106 vs. 12.3149); objective: P, M, O (2.1406 vs. 2.1389 vs. 2.0551); and combined: M, O, P (4.6854 vs. 4.5789 vs. 4.5001). Detailed results are presented in Table 5.

thumbnail
Table 5. Scores and rankings of BHS under different evaluation methodologies.

https://doi.org/10.1371/journal.pone.0336499.t005

Comparison of different evaluation methodologies

For the first five comparison pairs (Subjective vs. Original, Objective vs. Original, Combined vs. Original, Objective vs. Subjective, and Combined vs. Subjective), the head-tail consistency rate remained stable at 0.6250. This indicates consistent alignment in the head-tail distribution patterns of these evaluation methodologies. Notably, in the Combined vs. Objective pair, this rate surged to the highest value of 0.9375, reflecting a substantially enhanced consistency between the combined and objective methodologies. The discrimination degree exhibited a more dynamic trend. It started at 0.0336 (Subjective vs. Original), then rose to the highest value 0.1166 (Combined vs. Original), suggesting a notable difference between the combined and original methodologies. Subsequently, it decreased to 0.0820 (Objective vs. Subjective), slightly rebounded to 0.0830 (Combined vs. Subjective), and finally dropped to 0.0010 (Combined vs. Objective). The extremely low discrimination degree in the Combined vs. Objective pair implied a high level of consistency between the two methodologies, which aligned with the highest head-tail consistency rate in this pair. Collectively, these results demonstrated that the combined methodology aligned well with the objective methodology in both head-tail consistency and discrimination degree, whereas the original methodology differed most from the combined approach. Detailed comparison results are presented in Fig 3.

thumbnail
Fig 3. Comparison of original, subjective, objective, and combined evaluation methodologies.

Ori: Original; Sub: Subjective; Obj: Objective; Com: Combined.

https://doi.org/10.1371/journal.pone.0336499.g003

Discussion

Necessity of establishing a model to evaluate the performance of BHS

In recent decades, the epidemiological characteristics of stroke have shifted dramatically, with incidence, mortality, and DALYS increasing significantly [1]. These changes underscore the urgency of implementing effective stroke screening and prevention programs, such as the CNSSS, which relies heavily on BHS performance for high-quality implementation. Since 2011, the CNSSS has evolved into a national key public health initiative [8]. After over a decade of practice, developing a scientific and practical model to evaluate BHS performance in CNSSS implementation has become increasingly crucial. This model serves as an indispensable component of the CNSSS’s sustainable development [21]. Notably, however, to our knowledge, there remains a lack of literature that attempts to develop a model to assess BHS performance in CNSSS implementation. Previous studies have mainly focused on identifying stroke prevalence and stroke-related risk factors based on CNSSS data [2224].

Our study is the first to develop a novel combined-weighted evaluation model, integrating experts’ subjective judgments and the objective Entropy Methodology, to systematically evaluate performance disparities among BHS in CNSSS implementation in Sichuan Province, China. The model addresses a critical limitation of the current CNSSS program: its practice of assigning equal weights to all evaluation indicators when assessing BHS performance. Our findings not only address the literature gap regarding the theoretical implications of weight determination for BHS performance evaluation in CNSSS implementation but also offer practical insights for enhancing the quality of the CNSSS program.

Discrepancies between weights under different weighting methodologies

Subjective weighting methodologies, despite potential bias, reflect experts’ clinical and practical experience. This is exemplified by the highest subjective weight assigned to Task Completion Rate, an outcome that reflects administrative mandates and emphasizes compliance with the CNSSS program. In contrast, objective weighting methodologies mitigate bias by leveraging empirical data variability. This is evidenced by the Intervention Rate being assigned the highest objective weight, which surpasses its subjective counterpart, indicating that data-driven variability underscores its operational significance.Such complementarity aligns with debates in relevant literature: while subjective weighting methodologies (e.g., Analytic Hierarchy Process, AHP) excel at incorporating theoretical frameworks, objective methodologies (e.g., Entropy Method) specialize in capturing dynamic performance patterns [25]. Compared to single methodology, combined weighting methodologies balance discrepancies between experts’ subjective priorities and data-driven objectivity, thereby addressing limitations of single-method evaluations [26].

Our study identified notable discrepancies between subjective and objective weights of evaluation indicators. Compared to subjective weights, four indicators: Task Completion Rate, Follow-up Completion Rate, Age Deviation Degree, and Hypertension Awareness Rate exhibited reduced objective weights. The most pronounced finding is the 92.45% reduction in Task Completion Rate’s weight, dropping from 0.3125 (subjective) to 0.0236 (objective). This divergence highlights a fundamental difference in weight determination: subjective weights (derived from expert scoring) prioritize administrative compliance, whereas objective weights quantify indicators based on data discriminability. Specifically, indicators with higher entropy values are assigned higher objective weights [27], as they are independent of expert subjectivity. The low objective weight of Task Completion Rate reflects minimal performance variation across BHS, which had the smallest coefficient of variation (0.08), rendering it less informative for data-driven performance evaluations. This aligns with prior research showing that process-oriented indicators (e.g., Task Completion Rate) often exhibit ceiling effects in healthcare settings, limiting their ability to distinguish institutional work performance [28]. Importantly, the core mandate for BHS in the CNSSS program is to meet annual stroke screening and prevention targets and complete follow-up for these populations. BHS must use all accessible approaches to reach these mandatory missions, thus assigning higher weights to such indicators may fail to reflect genuine performance disparities among BHS.

In contrast, the objective weighting methodology (Entropy Method) prioritizes indicators with higher variation degrees (lower information entropy), as they better distinguish discrepancies between evaluated objects [29]. Compared to subjective weights, seven indicators: Risk Factor Control Rate, Intervention Rate, Stroke High-Risk Intervention Rate, Stroke High-Risk Detection Rate, Hypertension Detection Rate, Diabetes Detection Rate, and Dyslipidemia Detection Rate showed increased objective weights. Notably, detection rates for hypertension, diabetes, dyslipidemia, and stroke high-risk populations exhibited the largest average weight increase (227.60%,) followed by intervention rates for previously screened populations and stroke high-risk groups (78.34%), with Stroke Risk Factor Control Rate showing the smallest increase (42.52%). These weight increases indicate high performance variation across Sichuan BHS in stroke risk factor detection, population screening, intervention, and control. Specifically, stroke risk factor detection rates (diabetes: 313.43%, dyslipidemia: 249.25%, and hypertension: 227.61%) demonstrated the largest weight surges, signaling significant variation in detection performance and pronounced disparities in stroke high-risk factor screening implementation among BHS.

The limitations of single weighting methodology are partially illustrated by Follow-up Completion Rate, whose weight dropped from 0.1071 (subjective) to 0.0286 (objective), highlighting potential misalignment between expert assumptions and empirical data [30]. To address such inconsistencies, methodological transparency is essential: subjective methodologies should include consistency verification (e.g., ensuring AHP consistency ratio below 0.1) [31], while objective methodologies must validate data sufficiency [32]. The combined weights: Intervention Rate (0.2303) and Risk Factor Control Rate (0.2207) exemplify effective reconciliation of these inconsistencies, as they integrate theoretical primacy and empirical discriminability, pinpointing indicators that are both conceptually pivotal and practically informative.

Discrepancies between rankings among BHS under different weighting methodologies

Using the original methodology as a baseline, discrepancies have been observed in BHS ranking across subjective, objective, and combined weighting methodologies, which highlight the critical role of weight determination in multi-criteria decision-making [18]. Under the subjective methodology, BHS A and B maintain their 1st and 2nd ranks, aligning with the original methodology’s recognition of their overall standing. However, other BHS (e.g., D, falling from 4th to 9th; G, rising from 7th to 3rd) exhibit dramatic ranking shifts in their performance evaluations, reflecting the experts’ valued qualitative indicators may be balanced differently by the original methodology. The objective methodology shows even more dramatic changes: BHS D surges from 4th to 1st due to its excellence in indicators with high objective weights (e.g., Intervention Rate, Detection Rates for Hypertension, Diabetes, and Dyslipidemia), while BHS A (falling from 1st to 3rd) and C (falling from 3rd to 4th) drop in rank, as these BHS have inferior performance in such high-objective-weight indicators (e.g., Intervention Rate, Age Deviation Degree, Detection Rates for Hypertension, Diabetes, and Dyslipidemia). This underscores the objective methodology’s sensitivity to statistical patterns over expert priorities [22]. Conversely, rankings from the combined weighting methodology occupy a middle ground, illustrating its role in balancing subjective expertise and objective data. For example, it keeps BHS A and B in the top two (consistent with the original methodology) but adjusts BHS D’s rank from 1st (objective) to 3rd (combined)-a change driven by BHS D’s poor performance in Task Completion Rate and Follow-up Completion Rate, which have the highest and second-highest subjective weights, respectively. This demonstrates that combined weights can reconcile conflicting perspectives by aligning the relative importance of qualitative and quantitative factors. In general, the original methodology implicitly blends qualitative and quantitative considerations to some extent; the subjective and objective methodologies explicitly prioritize either expert-driven or data-driven criteria; and the combined methodology optimizes multi-criteria decision-making by harmonizing these diverse perspectives.

Practical implications for evaluating BHS performance in the CNSSS program

By integrating subjective and objective weights, we have developed an evaluation model to scientifically assess the performance of BHS in conducting the CNSSS program. Compared to the original methodology (which uses equal weights), the combined weighting methodology has the highest discrimination degree, which means that it has a robust ability to distinguish superior BHS from inferior ones. The combined weighting results highlight the critical importance of three evaluation indicators: Intervention Rate for Previously Screened Populations, Stroke Risk Factor Control Rate, and Intervention Rate for Stroke High-Risk Populations. These findings align with both the objectives of stroke screening and prevention and the core mandates of the CNSSS program. Beyond identifying stroke high-risk individuals in the region, the model emphasizes the need to implement targeted management for these individuals, control preexisting risk factors, and ultimately reduce stroke incidence, recurrence, and mortality.

Limitations

Several limitations of this study should be acknowledged. Firstly, the objective Entropy Weighting Methodology relies on the premise of indicator independence, which may not hold for inherently correlated indicators such as Hypertension Detection Rate, Diabetes Detection Rate, and Dyslipidemia Detection Rate. Secondly, the sample size of this study was relatively small and geographically limited. The evaluation model was developed based on data from only 16 BHS in Sichuan Province, which may reduce its generalizability to other regions of China. Therefore, caution is advised when extrapolating these findings to broader contexts. Thirdly, the study uses cross-sectional data from 2024, thus longitudinal validation across multiple years is therefore needed to assess the stability of the results.

Future directions

Future research could address these limitations through the following directions. Firstly, expand the sample size to include BHS from diverse geographical regions. Secondly, incorporate multi-year longitudinal data to enhance temporal validity and verify the long-term stability of the evaluation model. Thirdly, integrate multiple subjective-objective weighting methodologies to further refine the precision of performance evaluation results.

Conclusions

To our knowledge, this study presents the first novel evaluation model for assessing BHS performance in CNSSS implementation. The findings highlight how weighting methodologies profoundly influence performance rankings, demonstrating that combined weighting effectively balances expert-driven subjectivity and data-driven objectivity. The discrepancy between subjective and objective weights underscores the necessity of transparent, robust multi-methodological evaluation frameworks for public health initiatives. For the CNSSS program, these results provide a scientific basis for refining evaluation systems to prioritize evidence-based interventions and risk factor management. Future research should focus on three key directions: extending this model to nationwide BHS contexts, enhancing its generalizability through sample size expansion and longitudinal data integration, and ensuring robust results through the integration of multiple methodologies.

Supporting information

S1 File. The original de-identified data set.

https://doi.org/10.1371/journal.pone.0336499.s001

(XLS)

Acknowledgments

We thank Center for Disease Prevention and Control of Sichuan Province, for their support in providing the quality control data of BHS to guarantee the successful conduction of this study.

References

  1. 1. Feigin VL, Abate MD, Abate YH, Abd ElHafeez S, Abd-Allah F, Abdelalim A, et al. Global, regional, and national burden of stroke and its risk factors, 1990–2021: a systematic analysis for the Global Burden of Disease Study 2021. The Lancet Neurology. 2024;23(10):973–1003.
  2. 2. Feigin VL, Brainin M, Norrving B, Martins SO, Pandian J, Lindsay P, et al. World stroke organization: global stroke fact sheet 2025. Int J Stroke. 2025;20(2):132–44. pmid:39635884
  3. 3. Feigin VL, Owolabi MO, World Stroke Organization–Lancet Neurology Commission Stroke Collaboration Group. Pragmatic solutions to reduce the global burden of stroke: a World Stroke Organization-Lancet Neurology Commission. Lancet Neurol. 2023;22(12):1160–206. pmid:37827183
  4. 4. Tu W-J, Wang L-D, Special Writing Group of China Stroke Surveillance Report. China stroke surveillance report 2021. Mil Med Res. 2023;10(1):33. pmid:37468952
  5. 5. Wang S. Epidemiological survey and risk prediction study of stroke based on million population (in Chinese). Hunan: Central South University; 2023.
  6. 6. Chao B-H, Yan F, Hua Y, Liu J-M, Yang Y, Ji X-M, et al. Stroke prevention and control system in China: CSPPC-Stroke Program. Int J Stroke. 2021;16(3):265–72. pmid:32223541
  7. 7. Tu W-J, Hua Y, Yan F, Bian H, Yang Y, Lou M, et al. Prevalence of stroke in China, 2013-2019: A population-based study. Lancet Reg Health West Pac. 2022;28:100550. pmid:36507089
  8. 8. Bigdata Observatory Platform for Stroke of China. Evaluation indicators for stroke high-risk population screening and prevention (2024 edition) (in Chinese).[cited 2025 Jun 9] https://www.chinasdc.cn/
  9. 9. Yang Y. Analysis of weighting methods in multi-index comprehensive evaluation (in Chinese). Stat Decis. 2006;7:17–9.
  10. 10. Ayan B, Abacıoğlu S, Basilio MP. A comprehensive review of the novel weighting methods for multi-criteria decision-making. Information. 2023;14(5):285.
  11. 11. Liu X, Ye M. Medical service performance evaluation of tertiary general hospitals in Sichuan Province in China based on diagnosis-related groups. BMC Health Serv Res. 2025;25(1):563. pmid:40247309
  12. 12. Singh M, Pant M. A review of selected weighing methods in MCDM with a case study. Int J Syst Assur Eng Manag. 2020;12(1):126–44.
  13. 13. Do DT, Nguyen N-T. Applying Cocoso, Mabac, Mairca, Eamr, Topsis and Weight Determination Methods for Multi-Criteria Decision Making in Hole Turning Process. Strojnícky časopis - J Mech Eng. 2022;72(2):15–40.
  14. 14. Wu RMX, Zhang Z, Yan W, Fan J, Gou J, Liu B, et al. A comparative analysis of the principal component analysis and entropy weight methods to establish the indexing measurement. PLoS One. 2022;17(1):e0262261. pmid:35085274
  15. 15. Zhu Y, Tian D, Yan F. Effectiveness of entropy weight method in decision-making. Math Probl Eng. 2020;2020:1–5.
  16. 16. Wang MH, Li SY, Li JS, Yu XQ. Experts’ questionnaires survey on COPD therapeutic evaluation outcome indexes weight. China J Tradition Chin Med Pharm. 2013;28(8):2401–4.
  17. 17. Wang JJ. A new method for multi-index comprehensive evaluation and decision-making (in Chinese). Stat Decis. 2012;24:32–5.
  18. 18. Song DM, Liu CX, Shen C, Shi XF, Zang L, Feng WQ. Multiple objective and attribute decision making based on the subjective and objective weighting (in Chinese). J Shandong Univ (Eng Sci). 2015;45(4):1–9.
  19. 19. Yu LP, Pan YT, Wu YS. A comparative study of utility function synthesis methods in science and technology evaluation (in Chinese). Sci Technol Progr Policy. 2010;27(1):106–10.
  20. 20. Yu LP, Pan YT, Wu YS. Two new indicators to compare different evaluation methods’ effect-based on times higher-qs world university rankings (in Chinese). J Nanjing Norm Univ (Nat Sci Ed). 2008;31(3):135–40.
  21. 21. Meng X. The quality evaluation of high-risk population screening project for strokein Henan province (in Chinese). Henan: Zhengzhou University; 2013.
  22. 22. Yi X, Luo H, Zhou J, Yu M, Chen X, Tan L, et al. Prevalence of stroke and stroke related risk factors: a population based cross sectional survey in southwestern China. BMC Neurol. 2020;20(1):5. pmid:31910820
  23. 23. Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L, et al. Prevalence, incidence, and mortality of stroke in China: results from a nationwide population-based survey of 480 687 adults. Circulation. 2017;135(8):759–71. pmid:28052979
  24. 24. Xia X, Yue W, Chao B, Li M, Cao L, Wang L, et al. Prevalence and risk factors of stroke in the elderly in Northern China: data from the National Stroke Screening Survey. J Neurol. 2019;266(6):1449–58. pmid:30989368
  25. 25. Politi MC, Shacham E, Barker AR, George N, Mir N, Philpott S, et al. A comparison between subjective and objective methods of predicting health care expenses to support consumers’ health insurance plan choice. MDM Policy Pract. 2018;3(1):2381468318781093. pmid:30288450
  26. 26. Li G, Li JP, Sun XL, Wu DS. Research on a combined method of subjective-objective weighting based on the ordered information and intensity information (in Chinese). Chin J Manage Sci. 2017;25(12):179–87.
  27. 27. Huang I-C, Frangakis C, Atkinson MJ, Willke RJ, Leite WL, Vogel WB, et al. Addressing ceiling effects in health status measures: a comparison of techniques applied to measures for people with HIV disease. Health Serv Res. 2008;43(1 Pt 1):327–39. pmid:18211533
  28. 28. Thomson EE, Kristan WB. Quantifying stimulus discriminability: a comparison of information theory and ideal observer analysis. Neural Comput. 2005;17(4):741–78. pmid:15829089
  29. 29. Guo XG. Application of improved Entropy Method in evaluation of economic result (in Chinese). Syst Eng-Theor Pract. 1998;12:98–102.
  30. 30. Saaty TL. Decision making with the analytic hierarchy process. Int J Serv Sci. 2008;1(1):83.
  31. 31. Zeshui X, Cuiping W. A consistency improving method in the analytic hierarchy process. Eur J Operat Res. 1999;116(2):443–9.
  32. 32. Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27(3):379–423.