Correction
2 Dec 2025: Ang IYH, Rahman N, Wong SH, Ng SHX, Tan KXQ, et al. (2025) Correction: Development and validation of the health segment classification of population encompassed within Singapore (HealthSCOPES) framework. PLOS ONE 20(12): e0337925. https://doi.org/10.1371/journal.pone.0337925 View correction
Figures
Abstract
Introduction
The population is heterogeneous with varying levels of healthcare needs. Clustering individuals into health segments with more homogeneous healthcare needs allows for better understanding and monitoring of health profiles in the population, which can support data-driven resource allocation.
Methods
Using the developed criteria, data from several of Singapore’s national administrative datasets were used to classify individuals into the various health segments. Cross-sectional analysis of healthcare utilization charges was conducted. Validation was done for the framework’s prognostic ability of clinically relevant outcomes measured in the following year.
Results
The framework is comprised of twelve segments classed within four broad groups. The segments comprising individuals with cancer, with transitional care needs, and in the last year of their lives had the highest mean per resident healthcare charges. The segments comprising adults and seniors with complex chronic conditions and with transitional care needs had the highest percentage of individuals historically diagnosed with obesity. The framework was able to distinguish varying tiers of healthcare utilization charges and relative risk of death in the following year.
Citation: Ang IYH, Rahman N, Wong SH, Ng SH-X, Tan KXQ, Eh KX, et al. (2025) Development and validation of the Health Segment Classification of Population Encompassed within Singapore (HealthSCOPES) framework. PLoS ONE 20(1): e0317016. https://doi.org/10.1371/journal.pone.0317016
Editor: Nenad Filipovic, University of Kragujevac, SERBIA
Received: June 13, 2024; Accepted: December 19, 2024; Published: January 13, 2025
Copyright: © 2025 Ang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data cannot be shared publicly because there are legal restrictions on sharing the de-identified national administrative data consolidated and owned by the Government of Singapore. Access to the data can be requested through the TRUST Platform with approvals from the Data Access Committee (https://trustplatform.sg/collaborate/access-trust-data/).
Funding: The author(s) received no specific funding for this work.
Competing interests: NO authors have competing interests.
Introduction
As populations age and prevalence of chronic diseases increase, there is a greater demand for healthcare resources and an increased burden on healthcare systems [1, 2]. Efficient utilization of finite healthcare resources is thus essential for long-term sustainability. Populations are inevitably heterogeneous, with different individuals having varying levels of healthcare needs. Clustering individuals in the population into health segments that are likely to have more homogeneous healthcare needs is a first step in better understanding and monitoring of health profile trends in the population [3–5]. This would facilitate health systems administrators and policy-makers to plan ahead with resource and infrastructure needs, and develop more targeted programs, interventions, and financial schemes that are integrated across the various settings in the care continuum [3–7].
Population health segmentation frameworks could take either a primarily quantitative data-driven approach or a qualitative expert-driven approach in their development [8]. Most of the prior framework development work has taken a hybrid approach of combining both data-driven and expert-driven approaches. With increased adoption of centralized electronic health records, and availability of hospital administrative data and insurance claims data, the use of the data-driven approach has become increasingly popular in health segmentation work. Previous models developed have however been limited to specific populations, such as elderly [9–11], or specific high-needs or high-hospital utilizer patients [12–14], which are not generalizable to the full population in a nation.
Using only hospital utilization data for population health segmentation is not person-centered, particularly since it ignores other social, economic, psychological, and behavioral determinants that influence health and that also better inform healthcare needs. Hospital utilization data is also restricted by the current healthcare structures and services provided, with most centered around hospitals without inclusion of healthcare provided outside of the hospital setting. Homogeneity achieved in hospital utilization-based clusters therefore would not necessarily allow translation of the model to anticipate and plan for future needs of the greater population. Specifically, the remaining segments of the population that have not come to contact with the hospitals or the healthcare system might be wrongly classed as healthy because there is insufficient information available. In planning ahead for unmet needs and disease prevention work, further segmentation of risk is required of those usually considered generally healthy and that have yet to have significant healthcare utilization.
An expert-driven approach to the development of population health segmentation framework often involves a workgroup of relevant personnel. This could include clinicians, healthcare administrators, public health experts, and/or policy makers with domain knowledge in healthcare systems, public health, and healthcare policy planning [5, 15–17]. The workgroup would develop and refine the framework based on their experience and research of published theoretical and experimental works. Frameworks by Lynn et al. (2007), the Ministry of British Columbia, Canada [5], the North West London Whole Systems Integrated Care [17], and by Low et al. (2017) can be considered to primarily use an expert-driven approach. The framework by Lynn et al. (2007) has eight segments, while that by the Ministry of British Columbia, Canada [5] has 14 segments arranged into 4 broad groups. The framework by North West London Whole Systems Integrated Care [17] used a matrix form to arrange 10 segments across a dimension of three age groups. The framework by Low et al. (2017) had further split one of the five segments outlined by Singapore’s Ministry of Health (MOH), generating an extension with six segments.
As part of the efforts to better understand population health in Singapore, the Health Segment Classification of Population Encompassed within Singapore (HealthSCOPES) framework was developed to segment the national population. This paper will describe the development of the framework from conceptualization through refinement to the finalized framework and the practical criteria to classify individuals into segments. This paper will also present the descriptive and the utilization charges across different key healthcare settings for each segment. Unlike other previous efforts that have focused on only a cross-sectional snapshot of the population, the objective of this paper is also to investigate the prognostic ability of the framework for relevant outcomes in the following year.
Material and methods
Study design
In the development and refinement of the HealthSCOPES framework, an iterative process with literature reviews, expert consultations, and data analyses was used to develop and refine the framework and its segment classification criteria. Data used was from a combination of several national administrative datasets with individual-level socio-demographic, mortality, morbidities, and healthcare utilization information. The data was used to classify individuals into the various segments based on the developed criteria. Cross-sectional analysis of healthcare utilization charges was conducted for 2016 and 2017 calendar years. This study obtained exemption approval from National Healthcare Group Domain Specific Review Board (reference number: 2018/00892) and there was no need to obtain consent as it was research that involved analysis of dataset without identifiers.
Setting
Singapore is a city-state nation located in Southeast Asia with a total population of 5.6 million, of which over 3.9 million are citizens and permanent residents residing in the nation within the last 12 months [18]. The population has an ethnicity composition of 74% Chinese, 13% Malays, 9% Indians, and 3% Others [19], and a life expectancy of 82.9 years [18]. There are three public integrated healthcare systems in Singapore, each with a geographical region that they are tasked by the government to oversee: Central, Western, and Eastern.
Database
National data from several administrative datasets held in MOH that covered the period from 1990 to 2017 was used. The datasets contained individual-level socio-demographic information, mortality, chronic disease diagnoses, as well as past healthcare utilization, claims, and referrals. The data held in MOH were de-identified but linked within and between datasets using a uniquely generated identification code. The datasets were accessed in years 2018 to 2022, and the authors did not have any information that could identify individual participants during or after data collection. In total, the datasets contained information on 10,243,409 unique individuals (alive and deceased). Of which, 6,862,023 were Singaporeans or Permanent Residents.
The datasets contained healthcare utilization data submitted through the Central Claims Processing System and data submitted for subvention purposes from (i) public sector hospitals for inpatient episodes, emergency department (ED) attendances, and specialist outpatient clinic (SOC) attendances, (ii) public sector primary care centers (polyclinics), and (iii) private sector Community Health Assist Scheme (CHAS) clinics. CHAS is a scheme introduced in 2014 aimed to make primary healthcare more accessible and affordable to lower- to middle-income Singaporean households by providing higher subsidies to these households. Before November 2019, CHAS benefits were tiered depending on household income with CHAS Blue cardholder receiving higher subsidies than CHAS Orange cardholder. From November 2019, CHAS Green was introduced for Singaporean with chronic conditions who do not qualify for CHAS Blue and Orange.
The public acute hospitals and polyclinics, which handle about 80% and 20% respectively of all the patient load in Singapore [20, 21]. The data contained diagnosis codes and visit specialties, which allowed for classifying individuals with illnesses such as mental health conditions, cancer, and cognitive decline. The national chronic disease registry, Chronic Disease Management Programme (CDMP), also contained information of patients diagnosed with a list of twenty chronic diseases and managed in hospitals, polyclinics, and/or participating private primary care General Practitioner (GP) clinics. Historical diagnosis of obesity was captured in this registry, with obesity set as having body mass index of 27.5 kg/m2 or higher, the criteria for Asian populations [22].
The datasets also contained claims and referrals for some nationwide standardized services and programs that had specific eligibility criteria. These criteria reflected the health and social needs of individuals who were referred to and/or enrolled in them. For instance, the eligibility criteria of admission to nursing home included being physically or mentally disabled due to illness, being semi-mobile, and/or requiring assistance in activities of daily living such as using the toilet. Being admitted to nursing homes was therefore used as a proxy to identify those who were frail and that required long-term care. A program with nationally standardized eligibility criteria was the Hospital2Home (H2H) program, which was a post-discharge transition care program. A list of potentially high healthcare utilizers generated from an artificial intelligence predictive model by the national technology agency, Synapxe (previously named Integrated Health Information Systems), was used to enroll patients into the H2H program.
The datasets also contained socio-demographic information, which included age, gender, ethnicity, housing type, as well as financial assistance schemes that an individual qualified for. These data allowed for identification of factors [23] commonly associated of having high risk of chronic diseases and increased utilization of healthcare services, such as housing type [24–28], living alone [29, 30], and requiring financial assistance [26].
Outcome measures
Outcome measures of interest are utilization charges (unsubsidized full gross bill amounts) and utilization frequencies across different healthcare settings in Singapore. These measures of interest covered settings for acute hospital inpatient admissions, day surgery inpatient admissions, emergency department (ED) visits, and specialist outpatient clinic (SOC) visits, as well as polyclinic visits. For inpatient admissions, the number of inpatient bed days was also included.
The HealthSCOPES segmentation criteria was applied to Singapore residents to derive 2016 broad groups and segments. Only segment criteria with available data in MOH administrative dataset at point of analysis were applied. The 2016 broad groups and segments were described in terms of demographic information and healthcare utilization. The categorical variables were summarized by frequency and percent and, continuous variables were summarized by descriptive statistics.
Framework development
The framework was first conceptualized using the broad segment groups outlined by the Singapore MOH [16] as a basis: mostly healthy, serious acute illness but curable, stable chronic, complex chronic, and end-of-life. Using these five broad groups as a reference, four broad groups were generated titled using the care goals in mind for each of the groups: “Staying Healthy”, “Living Well with Illness”, “Getting Well”, and “Maximizing Quality Life Years”. These names for the broad groups were also designed to reframe and shift the messaging away from being negative and disease-centric to one that is positive and salutogenic. The matrix dimension of age [15, 17] was added, with those aged 0–18 years old grouped as children (and adolescents), 19–64 years old grouped as adults, 65 years old and above grouped as seniors.
Existing expert-driven frameworks that were published in peer-reviewed journals or as reports publicly available online were reviewed. Segments from expert-driven frameworks by Lynn et al. (2007) [3], the Ministry of British Columbia, Canada [5], and the North West London [17] were combined and collapsed, and a segment of “at risk of developing a chronic condition” from Langton et al. (2016) [31] added. Based on the approach recommended by Lynn et al.’s “Bridges to Health” model [3], the number and definitions for each segment were intentionally kept limited and simple with the segments include every individual, and within each included people with sufficiently similar healthcare needs and priorities. As also highlighted by the London Health Commission [15], the population health segments generated should also be stable over time, be applied in the real-world based on good judgment, and allow for budgets to be set for the entire segment.
An initial framework was developed and consultations with five local clinical experts were also conducted to get their input. The clinical experts were at the consultant level and above, and covered specialties of oncology, palliative care, pediatrics, endocrinology, and preventive medicine. The expert inputs from the consultations were to ensure that the framework was best contextualized to Singapore’s population and facilitated development of practical care goals, actionable care programs, and measurable outcomes for each segment. As each clinical expert focused on segments relevant to their specialty, there were no consensus procedures taken. The classification criteria for the different segments were then developed and refined based on available data in the national-level healthcare and social administrative datasets and expert inputs. A final framework was developed with 12 segments, [A] to [L], in the four broad segment groups and across the dimension of three age groups (Fig 1).
Segments [A] to [C] are the three segments that make up the “Staying Healthy” broad group, who have no major health needs. Segment [A] comprises individuals of all ages that are mostly healthy and with no known chronic conditions and no known clinical and social risk factors. Segments [B] and [C] comprise children and adults, and seniors, respectively, that are considered to be at risk. These individuals all have no known chronic conditions but do have at least one clinical or social risk factor (such as low socio-economic status or obesity).
Segments [D] to [H] are the five segments that make up the “Living Well with Illness” broad group, who have been diagnosed in the system with at least one chronic illness. Segment [D] comprises children with known chronic conditions that are fairly stable and that require minimal follow-up at acute care settings. Segment [E] includes adults and seniors with simple chronic conditions that can be managed primarily in primary care with minimal support from specialist care. Segment [F] comprises adults and seniors with complex chronic conditions that require some management by specialist care. Segment [G] comprises adults and seniors with mental health conditions, while segment [H] includes adults and seniors with cancer.
Patients with transient, acute healthcare needs fall into Segment [I] and [J]. These individuals would have transitional care needs that help transition their care from hospital to community or home setting to aid in their continued recovery to baseline function. Segment [I] comprises children that have transitional and long-term care needs and that require additional specialist care to return baseline functions and/or to delay the progression of their diseases. Segment [J] comprises adults and seniors that have experienced a major acute health episode and thus require additional support after discharge from hospital.
Segments [K] and [L], together Segment [I], make up the “Maximizing Quality Life Years” broad group. Segment [K] comprises adults and seniors that have diminished physical and/or cognitive functions and are considered frail, and/or that are living in a long-term care institution such as a nursing home. Segment [L] comprises individuals of all ages that are in the final year of their lives or were referred to palliative care services, and so require advanced care planning and services that support their end-of-life preferences.
Classification criteria
Together, all the segments covered the entire population and were tiered from highest to lowest healthcare needs (Fig 2). At any point, individuals would fall into only one of the segments based on the definition. As their health state changed across time points, individuals could be reclassified into another segment, but never in more than one segment at a time. The classification of individuals was based on meeting criteria of a segment with the highest healthcare needs in one calendar year, and that overrode classification into a segment with lower healthcare needs. The developed hierarchy of classification is presented visually as a flowchart as shown in Fig 2.
Statistical analyses
Descriptive analyses of demographics were performed for each segment. Yearly healthcare utilization charges were calculated for each of the different settings for the 2016 calendar year. Total yearly healthcare utilization charges were calculated from summing yearly healthcare utilization charges across the various healthcare settings and presented as a total dollar amount incurred for each segment (total segment healthcare charges) and as a mean dollar amount per person of the segment (mean per resident healthcare charges).
In order to validate the HealthSCOPES framework, the prognostic ability of the 2016 broad groups and segments, on clinically relevant outcomes measured in 2017, were studied. Prognostic ability of segments was used as a proxy indicator of clinically meaningful, needs-based segmentation. Differing expected healthcare utilization and mortality among the different segments shows that the segmentation framework could be a potential tool to facilitate planning of healthcare resources and interventions. Only residents with assigned broad groups and segments in 2016 who continued to live in 2017 were included in the validation. The outcomes studied were (i) number of polyclinic visits, (ii) number of SOC visits, (iii) number of ED visits, (iv) number of inpatient admissions, (v) total inpatient bed days (BD), (vi) mortality, and (vii) admission to intensive care unit (ICU).
Two-part models (twopm) with offset term and broad groups or segments as independent variable were used to model dependent variables (i)-(v) in order to calculate the marginal effects (ME). Twopm was used as a large fraction of the residents did not have any healthcare utilization. Twopm consists of two parts. The first part is a binary component which models the probability of using the specific healthcare service. The second part is a continuous component which models non-zero healthcare utilization using generalized linear model (GLM). Logit-link and log-link were used for the first part and second part of the twopm respectively. Offset term was used to account for patients who died in 2017 and hence less opportunity for utilization. For each dependent variable, Poisson, Gamma, Negative Binomial, and Gaussian family were considered for the second part of the twopm. The distribution family with the lower Akaike information criterion (AIC) was selected and the results reported. Marginal effects can be interpreted as the average change in 2017 utilization when there was a change (hypothetical) in broad group or segment in 2016 from reference.
With broad groups or segments as independent variable, GLM with logit link and binomial family was used to model presence of ICU admission in 2017 and to generate odds ratio (OR). Mortality was modelled using Cox-proportional hazard models to derive hazard ratio with broad groups or segments as independent variable without any other covariate.
For both twopm and GLM, the group or segment with the least healthcare needs was used as the reference group (i.e. “Staying Healthy” group and [A] Mostly Healthy Children, Adults & Seniors segment). As deaths are expectedly rare in the “Staying Healthy” group and [A] Mostly Healthy Children, Adults & Seniors segment, the “Living Well with Illness” group and [E] Adults & Seniors with Simple Chronic Condition segment were used as reference group in the survival analyses for a more informative comparison. For the broad group analysis, [I] Children with Transitional & Long-Term Needs was classed only within the “Getting Well” group and not in the “Maximizing Quality Life Years”. Moreover, the segments encompassing solely children were dropped in the segment-level survival analysis as deaths were rare in those segments. There were 13 deaths in [D] Children with Chronic Condition and 24 deaths in [I] Children with Transitional & Long-Term Needs.
Statistical significance was assessed using a threshold of 0.05. Point estimates and 95% confidence intervals (CI) of the marginal effects, odds ratios and hazard ratios will be reported. STATA version 14.1 (Stata Corp, College Station, Texas, USA) was used to segment the population and perform twopm. R Version 3.6 was used to perform the logistic regression and survival analyses. The R package ‘survival’ was specifically used to perform the latter analyses. Forest plots found in this paper were produced using the R package ‘forestplot’.
Results
Demographics
Using the database master list of individuals, exclusion criteria were applied to form 2016 base population for segmentation (Fig 3). Those without race information were excluded. To exclude individuals who might be missing death date, we excluded those aged above 110 in 2016. There was a total of 4,671,465 residents that were alive at any point in 2016 (Table 1). The size of each of the broad groups and segments followed the order of healthcare needs–the size decreased with increasing healthcare needs. The majority of the population was part of the “Staying Healthy” group (n = 3,489,881, percentage of total = 74.7%). The sizes of the two other broad groups (with segment [I] classed within the “Getting Well”) and 12 segments can be found in Table 2. Based on percentages of the total in each geographical region, there was no significant difference in distribution of broad groups and segments across the different regions, indicating homogeneity within Singapore.
Table 2 also provides summary statistics on demographic information with no missing data. Examining only the segments including adults, segment [K] had the highest mean age followed by segments [J], [F], [H], [E], and [G]. There was higher percentage of males in all segments except for segments [E], [G], [H], [J], and [K]. Compared to segment [A], there was higher percentage (more than 5%) of ethnic Chinese in segments [C], [E], [F], [G], [H], [K], and [L]. Compared to segment [A], there was also a higher percentage of ethnic Malay in the segments [B], [D], [I], [J], and [L]. Although requiring financial assistance is not a classification criterion for segments [J], [K], and [L], more than 40% of the residents in the segments were beneficiaries of Community Health Assist Scheme (CHAS). Similarly, segments [F] and [J] had the highest percentage of residents historically diagnosed with obesity even though historical diagnosis of obesity is not a classification criterion. The other patterns in demographics statistics were an artefact of the framework classification criteria.
Healthcare utilization
The total segment healthcare charges (with breakdown of yearly healthcare utilization charges across different settings) and the mean per resident healthcare charges are shown in Fig 4 with no further missing data. Segment [A] had the highest total segment healthcare utilization charges standing at about $1 billion but the lowest mean per resident healthcare charges. The mean healthcare charges per resident were the highest for segment [J], followed by segment [L]. As a result, even though the population size of segments [J] and [L] were about 1% that of segment [A], these segments respectively incurred about 31% and 40% of segment [A]’s total healthcare charges. Although the mean per resident healthcare charges for segments [H]-[L] were about 4–9 times higher than that of segment [F], the population size of segment [F] was about 20-fold the size of segments [H] and [L]. This led to the total segment healthcare charges of [F] to be the second highest at close to $790 million, and over 2–18 times that of segments [H]-[L].
Looking at the contribution of different settings to the total segment healthcare charges, a sizeable proportion of the charges for segments [E]-[H] could be attributed to the inpatient admissions, SOC visits, and day surgery admissions. There was also a sizable contribution of healthcare utilization charges from polyclinic visits in segment [F]. In contrast, for segments [J]-[L], the bulk of the charges were driven by inpatient admissions, with some proportion incurred at the SOCs, while there was minimal relative contribution from polyclinics and day surgery.
Validation
Of the 4,701,074 residents segmented for 2016, 4,653,020 were alive in 2017. These 4,653,020 residents were included in the validation analyses. There was no further missing data.
Broad group validation.
Gamma was selected as the family for the second part of the twopm modeling number of SOC visits (AIC = 4.56), total inpatient bed days (AIC = 5.90) and number of ED visits (AIC = 2.77). Poisson was selected as the family for the second part of the twopm modeling number of polyclinic visits (AIC = 4.24) and number of inpatient admissions (AIC = 2.77).
The estimated marginal effects and odds ratios showed that there was variation in 2017 healthcare utilization amongst the broad groups segmented for 2016 (Table 3). With the “Staying Healthy” broad group as reference, the marginal effects were all larger than 0 and the odds ratios were all larger than 1. Larger variation was observed in SOC visits (ME: 1.80 to 4.40), total inpatient bed days (ME: 0.98–8.88) and ICU admission (OR: 6.59 to 27.84).
The “Getting Well” group had the highest number of polyclinic visits, SOC visits, inpatient admissions and ED visits when compared to the reference group of “Staying Healthy”. For instance, the “Getting Well” group on average utilized 6.05 (95% CI: 5.97–6.14) more SOC visits than the “Staying Healthy” group.
The “Maximizing Quality Life Years” group had the highest total inpatient bed days in the following year. The group on average utilized 8.88 (95% CI: 8.66–9.11) more bed days in the following year than the “Staying Healthy” group. The odds of ICU admission in 2017 for the “Maximizing Quality Life Years” group was the highest, standing at 27.84 times (95% CI: 26.35–29.40) that of the “Staying Healthy” group.
Hazard ratios were higher in broad groups with higher health care needs (Fig 5). For instance, the death rate per unit time in the “Maximizing Quality Life Years” group was 17.48 (95% CI: 16.94–18.03) times that of the “Living Well with Illness” group.
[A]-[C]: staying healthy; [D]-[H]: living well with illness; [I]-[J]: getting well; [K]-[L]: maximizing quality life years.
Segment validation.
Gamma was selected as the family for the second part of the twopm modeling number of SOC visits (AIC = 4.54), total inpatient bed days (AIC = 5.86) and number of ED visits (AIC = 2.77). Poisson was selected as the family for the second part of the twopm modeling number of polyclinic visits (AIC = 4.16) and number of inpatient admissions (AIC = 2.76). The estimated marginal effects and odds ratios showed that there was variation in 2017 healthcare utilization amongst the segments for 2016 (Table 4).
With segment [A] as reference, the marginal effects were all larger than 0 and the odds ratios were all larger than 1. Larger variation was observed in SOC visits (ME: 0.18 to 8.36), total inpatient bed days (ME: 0.11–11.69) and ICU admission (OR: 1.42–69.59). Differentiated MEs and ORs were also observed in segments belonging to the same broad groups. An example would be the number of SOC visits of the segments [D]-[H] in “Living Well with Illness” group, which was on average 0.91 to 8.36 higher when compared to the reference segment [A].
Segment [F] had the highest number of polyclinic visits when compared to the reference segment. On average, the segment was estimated to have visited the polyclinics 2.93 (95% CI: 2.93–2.94) more in the following year when compared to the reference segment. Segment [H] had the highest number of SOC visits, with an estimated 8.36 (95% CI: 8.24–8.49) more SOC visits when compared to the reference segment. Segment [L] had the highest inpatient admissions, total inpatient bed days, and ED visits when compared to the reference segment. For instance, the segment on average utilized 11.69 (95% CI: 11.22–12.17) more bed days in the following year than the reference segment. The odds of ICU admission in 2017 for segment [L] were the highest, standing at 69.59 times (95% CI: 64.98–74.47) that of the reference segment.
An overall increasing trend in hazard ratios were observed in segments with higher health care needs (Fig 6). For instance, the death rate per unit time in segment [L] was 89.27 (95% CI: 82.77–96.28) times that of [E]. However, some exceptions were observed. With segment [E] as reference, the hazard ratio of segment [C] was larger than 1 (i.e. 3.63 (95% CI: 3.19–4.13)). The hazard ratio of segment [G] was lower than segment [F]. The hazard ratio of segment [H] was similar to that of segment [J].
[A]: mostly healthy children, adults & seniors; [B]: at risk children & adults; [C]: at risk seniors; [E]: adults & seniors with simple chronic condition; [F]: adults & seniors with complex chronic condition; [G]: adults & seniors with mental health condition; [H]: adults & seniors with cancer; [J]: adults & seniors with transitional care needs; [K]: frail adults & seniors; [L]: end-of-life.
Discussion
We developed a framework to segment the Singaporean resident population into four broad groups and 12 segments, based on a set of hierarchical criteria operationalizable in administrative data. Segments comprising individuals with cancer, with transitional care needs, and in the last year of their lives had the highest mean per resident healthcare charges. These findings mirror that of a previous study using data from an academic medical center in Singapore that found cancers and inpatient deaths as being major drivers of hospital healthcare expenditures [32]. This indicates that when individuals move into these segments, the health utilization charges for them would be expected to escalate. At current state, these segments make up about 2% of the population. Growth in size of these segments would thus majorly add to the extended burden on the healthcare system and potentially cripple it.
Strategic policies and scalable care models need to be designed to provide continued optimal care population-wide, while also working on preventing the growth of size of these segments in the future. Existing interventions, such as post-discharge transition care programs, might help better integrate care and decrease mortality, but might not be significantly decreasing healthcare utilization charges as intended [33]. Individuals with chronic diseases are at greater risk of having acute health episodes and/or complications that incur additional healthcare resources. National efforts would therefore need to focus on sustainable long-term monitoring and management of chronic diseases in the community to ensure that these individuals can continue to live well and not worsen. Going further upstream, early detection and prevention of chronic diseases through screening and educational programs would be crucial to ensure that the generally healthy individuals continue to stay healthy.
Strengths
One of the key strengths of this effort was the hybrid approach of being both qualitatively expert-driven and quantitatively data-driven. The consultations with local clinical experts ensured that the framework was best contextualized to Singapore’s population. The use of comprehensive national data from several administrative datasets extended beyond only the patient pool in the acute hospital setting and allowed coverage of the entire nation’s population to include those that have not come to contact by hospitals or the healthcare system. However, the datasets could benefit from having additional sources of data, such as health behaviors and psychosocial factors, which would usually only be available if routinely collected systematically with using questionnaires for these patient-reported outcomes.
Another strength of this effort was the validation of the framework on both the broad groups and specific segments. The analysis showed that the framework was able to distinguish varying tiers of healthcare utilization charges as well as relative risk of death, in the following year. The practical tiering of population segments with the framework can thus be directly applied for use in program or policy design, evaluation, and cost-effectiveness analyses.
Limitations
The healthcare utilization data used primarily covered public healthcare settings and did not include most private healthcare settings. There was a lack of complete data from private acute hospitals and some private primary care, and from community hospitals, nursing homes, hospices, and other intermediate and long-term care (ILTC) services, which were more commonly utilized by individuals with cancer, with transitional care needs, who are frail, and in the last year of their lives. As such, (1) individuals who only utilized private healthcare with no contact with the public healthcare system, and (2) individuals with only utilization of ILTC services would not be accurately assigned to the correct segments. The healthcare utilization and charges are also an underestimation.
Another limitation is that the HealthSCOPES framework was developed specifically with Singapore’s setting and healthcare system in mind. The overall conceptual framework and the general classification approach could be translated to and tweaked for another nation’s population. However, the exact classification criteria and detailed analyses of healthcare utilization charges might not be applicable unless similar data exist and can be linked at a national level.
The outcome measure of focus in this paper was on healthcare utilization charges and did not include other societal and personal costs incurred. This was again limited by data availability, with societal and personal costs incurred difficult and thus not routinely collected systematically on a national level. Future work for the HealthSCOPES framework would be to incorporate indicators of interests specific to each segment that measures short- and long-term outcomes, as well as process indicators of any nation-wide programs. The long-term stability of the health segments over a longer-term duration, which was not assessed here due to limited data in subsequent years, could also be explored in future work.
Conclusion
The HealthSCOPES framework developed was used to systematically analyze existing data and could also allow for integration of new data to continue tracking of the health of population. The framework was designed to cover the entire population in Singapore while maintaining relevance to the healthcare needs and setting. It was found that segments comprising individuals with cancer, with transitional care needs, and in the last year of their lives had the highest mean per resident healthcare charges. The use of such a health segmentation framework and analysis allows for better understanding and monitoring of health profiles in the population, which can support data-driven resource allocation based on population health needs and is an anchoring pathway for sustainable and equitable care delivery.
Acknowledgments
We would like to thank Deanette Pang and Zheng Yi Lau for supporting the access to the databases. We would also like to thank John Wong, Noreen Chan, Jeremy Lin, Milawaty Nurjono, Keith Chong, Teresa Quek, Xu Weiming, and Jaslyn Wong for their feedback during the development of the framework.
References
- 1. Bodenheimer T, Chen E, Bennett HD. Confronting the growing burden of chronic disease: can the U.S. health care workforce do the job? Health Aff (Millwood). 2009;28(1):64–74. pmid:19124856
- 2. Vogeli C, Shields AE, Lee TA, Gibson TB, Marder WD, Weiss KB, et al. Multiple chronic conditions: prevalence, health consequences, and implications for quality, care management, and costs. J Gen Intern Med. 2007;22 Suppl 3:391–5. pmid:18026807
- 3. Lynn J, Straube BM, Bell KM, Jencks SF, Kambic RT. Using population segmentation to provide better health care for all: the "Bridges to Health" model. Milbank Q. 2007;85(2):185–208; discussion 9–12. pmid:17517112
- 4. North West London Whole Systems Integrated Care. Why should we group? 2018 [Available from: http://integration.healthiernorthwestlondon.nhs.uk/section/why-should-we-group-.
- 5. Health Sector Planning and Innovation Division BCMoH. The Health System Matrix 6.1: Understanding the Health Care Needs of the British Columbia Population through Population Segmentation. 2015.
- 6. The Better Care Fund. ’How to’ Guide: The BCF Technical Toolkit. Section 1: Population Segmentation, Risk Stratification and Information Governance. 2014.
- 7. Chong JL, Matchar DB. Benefits of Population Segmentation Analysis for Developing Health Policy to Promote Patient-Centred Care. Ann Acad Med Singapore. 2017;46(7):287–9. pmid:28821893
- 8. Jean-Baptiste DO’Malley AS, Shah T. Population Segmentation and Targeting of Health Care Resources: Findings from a Literature Review 2017 [Available from: https://www.mathematica-mpr.com/download-media?MediaItemId=%7BFEB2C9F5-5B7B-429F-A9AB-0922A9354D03%7D.
- 9. Eissens van der Laan MR, van Offenbeek MA, Broekhuis H, Slaets JP. A person-centred segmentation study in elderly care: towards efficient demand-driven care. Soc Sci Med. 2014;113:68–76. pmid:24852657
- 10. Lafortune L, Beland F, Bergman H, Ankri J. Health status transitions in community-living elderly with complex care needs: a latent class approach. BMC Geriatr. 2009;9:6. pmid:19192295
- 11. Liu LF, Tian WH, Yao HP. Utilization of health care services by elderly people with National Health Insurance in Taiwan: the heterogeneous health profile approach. Health Policy. 2012;108(2–3):246–55. pmid:23022286
- 12. Bates DW, Saria S, Ohno-Machado L, Shah A, Escobar G. Big data in health care: using analytics to identify and manage high-risk and high-cost patients. Health Aff (Millwood). 2014;33(7):1123–31. pmid:25006137
- 13. Lewis GH. "Impactibility models": identifying the subgroup of high-risk patients most amenable to hospital-avoidance programs. Milbank Q. 2010;88(2):240–55. pmid:20579284
- 14. Rinehart DJ, Oronce C, Durfee MJ, Ranby KW, Batal HA, Hanratty R, et al. Identifying Subgroups of Adult Superutilizers in an Urban Safety-Net System Using Latent Class Analysis: Implications for Clinical Practice. Med Care. 2018;56(1):e1–e9. pmid:27632768
- 15. London Health Commission. Better Health for London 2014 [Available from: https://www.healthylondon.org/resource/better-health-london-report/.
- 16. Low LL, Kwan YH, Liu N, Jing X, Low ECT, Thumboo J. Evaluation of a practical expert defined approach to patient population segmentation: a case study in Singapore. BMC Health Serv Res. 2017;17(1):771. pmid:29169359
- 17. North West London Whole Systems Integrated Care. What approach should we take? 2018 [Available from: http://integration.healthiernorthwestlondon.nhs.uk/section/what-approach-should-we-take-.
- 18. Department of Statistics Singapore. Latest Data 2017 [Available from: https://www.singstat.gov.sg/statistics/latest-data#16.
- 19. Department of Statistics Singapore. Census of Population 2010 Statistical Release 1: Demographic Characteristics, Education, Language and Religion. 2011.
- 20. Health Information Division MoH, Republic of Singapore,. Primary Care Survey 2014 2014 [Available from: https://www.moh.gov.sg/docs/librariesprovider5/resources-statistics/reports/moh-primary-care-survey-2014-report.pdf?sfvrsn=6e773b6d_0.
- 21. Ministry of Health Singapore. Hospital Services 2018 [Available from: https://www.moh.gov.sg/our-healthcare-system/healthcare-services-and-facilities/hospital-services.
- 22. Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies. Lancet. 2004;363(9403):157–63.
- 23. Chan CQH, Lee KH, Low LL. A systematic review of health status, health seeking behaviour and healthcare utilisation of low socioeconomic status populations in urban Singapore. Int J Equity Health. 2018;17(1):39. pmid:29609592
- 24. Low LL, Wah W, Ng MJ, Tan SY, Liu N, Lee KH. Housing as a Social Determinant of Health in Singapore and Its Association with Readmission Risk and Increased Utilization of Hospital Services. Front Public Health. 2016;4(109). pmid:27303662
- 25. Seng JJB, Lim VZK, Kwan YH, Thumboo J, Low LL. Outpatient primary and tertiary healthcare utilisation among public rental housing residents in Singapore. BMC Health Serv Res. 2019;19(1):227. pmid:30987617
- 26. Wee LE, Koh GC. Individual and neighborhood social factors of hypertension management in a low-socioeconomic status population: a community-based case-control study in Singapore. Hypertens Res. 2012;35(3):295–303. pmid:22089533
- 27. Wee LE, Yeo WX, Yang GR, Hannan N, Lim K, Chua C, et al. Individual and Area Level Socioeconomic Status and Its Association with Cognitive Function and Cognitive Impairment (Low MMSE) among Community-Dwelling Elderly in Singapore. Dement Geriatr Cogn Dis Extra. 2012;2(1):529–42. pmid:23277785
- 28. Wee LE, Yong YZ, Chng MW, Chew SH, Cheng L, Chua QH, et al. Individual and area-level socioeconomic status and their association with depression amongst community-dwelling elderly in Singapore. Aging Ment Health. 2014;18(5):628–41. pmid:24392759
- 29. Ng TP, Jin A, Feng L, Nyunt MS, Chow KY, Feng L, et al. Mortality of older persons living alone: Singapore Longitudinal Ageing Studies. BMC Geriatr. 2015;15:126. pmid:26470796
- 30. Wee LE, Low LL, Thumboo J, Chan A, Lee KH. Factors associated with emergency room visits and hospitalisation amongst low-income public rental flat dwellers in Singapore. BMC Public Health. 2019;19(1):713. pmid:31174499
- 31. Langton JM, Wong ST, Johnston S, Abelson J, Ammi M, Burge F, et al. Primary Care Performance Measurement and Reporting at a Regional Level: Could a Matrix Approach Provide Actionable Information for Policy Makers and Clinicians? Healthc Policy. 2016;12(2):33–51.
- 32. Rahman N, Ng SH, Ramachandran S, Wang DD, Sridharan S, Tan CS, et al. Drivers of hospital expenditure and length of stay in an academic medical centre: a retrospective cross-sectional study. BMC Health Serv Res. 2019;19(1):442. pmid:31266515
- 33. Ang IYH, Tan CS, Nurjono M, Tan XQ, Koh GC, Vrijhoef HJM, et al. Retrospective evaluation of healthcare utilisation and mortality of two post-discharge care programmes in Singapore. BMJ Open. 2019;9(5):e027220. pmid:31122989