Figures
Abstract
HIV remains a major public health challenge, with 1.4 million new infections and 31.6 million people accessing antiretroviral treatment in 2024. In many countries, HIV services have been provided through vertical programs, which, although highly successful in bringing treatment to people living with HIV since the early 2000s, are unlikely to sustain progress beyond donor dependency. The increasing push towards Universal Health Coverage (UHC), while facing reduction in international assistance, are prompting countries search for innovative strategies to effectively deliver HIV services through national systems, supported by domestic financing. Developing country-tailored HIV financing and service delivery approaches will be critical to reaching the end of AIDS as a public health threat and sustaining gains by and beyond 2030. Our study aims to develop an innovative data-driven approach to identify clusters of countries with similar challenges in defining their HIV response sustainability pathways. These clusters provide a framework for mutual learning, allowing countries to co-develop context-specific solutions to HIV financing and service delivery. We utilized unsupervised machine learning (ML) methods, including partitional, hierarchical, spectral, and density-based algorithms, to identify clusters of countries based on HIV epidemic and response data among 134 LMICs. We pooled open-source data from repositories covering indicators related to HIV epidemic and response, UHC commitment and progress, legislation surrounding human rights and HIV response, wealth and equity. We identified four country clusters, which did not align with conventional global regions but instead cut across them, revealing more nuanced groupings. Clusters were located in (1) South Asia, East Africa, and Oceania; (2) Sub-Saharan Africa and the Caribbean; (3) Eastern Europe, Middle East, Latin and Southern Africa; (4) Latin America, Asia, Middle East, North Africa, Oceania and the Caribbean. Our study is an early example of how ML techniques can be applied to health policy and (public) health performance.
Citation: Alexiou ZW, Dixit S, Ogbuoji O, Kohler S, Atun R, Terris-Prestholt F, et al. (2025) Using country-level variables to discover country clusters beyond traditional health policy and performance metrics: An unsupervised machine learning approach for HIV healthcare delivery and financing. PLOS Glob Public Health 5(12): e0004583. https://doi.org/10.1371/journal.pgph.0004583
Editor: Julia Robinson, PLOS: Public Library of Science, UNITED STATES OF AMERICA
Received: February 4, 2025; Accepted: October 30, 2025; Published: December 2, 2025
Copyright: © 2025 Alexiou et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data used in the study is secondary data from open-access sources. Instructions for access to the data have been provided in the manuscript.
Funding: In addition, the financial statement should be updated the financial statement to: “This work was supported by the Joint United Nations Programme on HIV/AIDS (UNAIDS) (project number PR2020/1050056 to CAB; project number PR2020/1050056 to TB) and the Dutch Research Council (NWO) Rubicon program (project number 452022313 to CAB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
HIV continues to be one of the most significant public health challenges, with 1.4 million new infections and 31.6 million people accessing antiretroviral treatment in 2024 [1]. In many low- and middle-income countries (LMICs), HIV services are delivered through donor-funded vertical programs. These programs have enabled a rapid scale-up of HIV prevention and treatment – saving millions of lives, particularly in sub-Saharan Africa [2,3]. However, as disease burdens evolve and non-communicable diseases such as diabetes, hypertension, and depression rise, health systems must adapt to deliver a broader range of services [4,5]. At the same time, roughly 400 million people worldwide still lack access to essential health services or face catastrophic health costs, highlighting the urgency of achieving Universal Health Coverage (UHC) as part of the Sustainable Development Goals (SDGs) [6]. A reorientation of health systems towards investment in primary health care, offering a broad range of essential health services, will be required [4].
Despite remarkable progress, projections suggest that the number of people living with HIV will continue to increase until at least 2039, underscoring the need for sustainable, country-led systems capable of providing lifelong care [7]. Yet, recent cuts in U.S. global health funding and international assistance, including President’s Emergency Plan for AIDS Relief (PEPFAR) support, threaten to undermine both HIV programs and broader health system transformation across LMICs [8]. Many people living with HIV cannot access life-saving services due to pervasive stigma, inequity in health care, sudden funding reductions, and emerging health threats [9]. Achieving ambitious global goals will therefore require reimagining the HIV response—shifting from donor-driven, vertical programs to sustainable, nationally owned systems that deliver people-centred, affordable services financed through domestic mechanisms [5,6].
To date, over 30 countries have initiated developing pathways to transition the HIV programmes to sustained HIV responses, delivered through national systems, domestically-financed, and politically supported self-reliance [10,11]. However, the implementation of such mechanisms is fractured, characterized by different programs, funding sources, and phases across various countries [12].
Progress towards sustaining HIV response impact, however, will require context-specific strategies shaped by political leadership, service quality, health care system capacity, supportive policies, and available financing [10,13]. The pace and type of solutions will vary based on how factors across these five areas interact within each country. Effective and equitable delivery of the HIV response through national health care systems depends on overall system performance, primary health care coverage, streamlined integrated delivery models, and person-centred care [12]. Increasing domestic financing will depend on countries’ fiscal capacity, political commitment to HIV and health investments, and competing national priorities. Stigma and human rights violations, including against key and vulnerable populations at risk of or infected with HIV, perpetuate inequitable service access, driving new infections to increase and poor viral suppression rates [9].
Given these differences, there is a growing need for a more nuanced understanding of where countries stand in their transition toward sustainable, nationally led HIV responses. Currently, policymaking is largely done by region, and based on epidemic burden and available funding sources, which overlooks the fact that countries within the same region may exhibit distinct epidemic and systemic characteristics [14,15]. Sustainable financing for the HIV response requires customized approaches—“one size fits all” will not suffice. For example, while some countries prioritize integrating HIV services into health insurance benefit packages, others focus on strengthening primary care infrastructure. Identifying clusters of countries with similar challenges can help foster South-to-South learning and guide strategic investments to sustain epidemic control.
Machine learning (ML) methods are increasingly utilized in global health and policy domains for their ability to identify patterns in complex data, enabling more effective, context-specific interventions at macro level. This includes the use of unsupervised ML to update existing typologies to describe and categorize increasingly difficult to describe health systems [16–20]. This study applies unsupervised ML to cluster countries using selected multi-criteria critical to enabling sustained epidemic control through national systems beyond donor dependency. The objectives are to: (1) identify distinct country clusters; (2) assess their robustness; and (3) interpret these clusters through key contributing variables and comparison with existing typologies.
Methods
Data
Data sources and indicators.
We pooled open-source country-level data from various data repositories for all 134 LMICs. We defined domains that could impact sustainable HIV service delivery and financing, as shown in Table 1. These domains were derived from UNAIDS’ framework for achieving sustainable HIV response results and corresponding criteria to guide country actions (Text A in S1 Text) [21]. The UNAIDS multi-criteria framework to guide country actions includes areas on rights and equity Status, ability to pay, disease Burden, pace of progress towards control of the epidemic, and health care system robustness, measured through progress in UHC. We included indicators that represent these areas in the UNAIDS framework and could facilitate the diagnosis of a country’s key challenges and help group countries according to their contexts and needs. The selected indicators were across (i) healthcare, Universal Health Coverage commitments and progress, (ii) the HIV/AIDS epidemic, (iii) HIV/AIDS response: healthcare and prevention, (iv) healthcare financing, including for HIV/AIDS, and (v) legislation about human rights and the HIV response. We chose equity indicators, such as disease burdens among vulnerable populations and wealth inequalities, in line with the SDG’s and the UNAIDS global AIDS strategy 2021–2026 [22,23]. In addition, we included general country-level information related to (vi) demographic and socioeconomic indicators, (vii) other disease burdens, and (vii) overall health and mortality. We created a cross-sectional database with 120 variables. Details of various variables included in each domain can be found in Table A in S1 Text.
Data processing.
If available, we included the estimate for each variable from 2017 or later. If such current estimates were not available, we expanded the time window to include data back until 2000 to be able to use most of the pre-selected relevant variables. We transformed some variables into ratios (e.g., people using at least basic sanitation services (% of population): Q1/Q5) to take relative differences into account. Variables with data availability in less than 75% of countries were dropped. We used the k-nearest neighbour algorithm for the remaining variables to impute the missing information [24]. Details on the strategies adopted to manage the data missingness issue can be found in Table A in S1 Text.
Data analysis
We used unsupervised ML algorithms to cluster country-level data, using several approaches including partitional (Partitioning Around Medoids; PAM), hierarchical, spectral, and density-based (DBSCAN) methods [25]. As a pre-processing step, Gower distances were computed to handle mixed data types, followed by Uniform Manifold Approximation and Projection (UMAP), a non-linear dimensionality reduction technique using default parameters (15 neighbours and a minimum distance of 0.1) [26].
PAM clustering was applied using the Euclidean distance metric, with default initialization and iterative optimization until convergence. We used the Silhouette Index (with values above 0.50 considered acceptable based on reference standard) and the Gap Statistic (with an substantial increase indicating better clustering) to assess the optimal number of clusters for PAM, iterating between solutions with 2:5 clusters [13]. Cluster stability was evaluated using bootstrapping with mean Jaccard bootstrap values across 1000 samples (>0.70 considered stable).
Hierarchical (HC) and spectral clustering were applied specifying four clusters, and DBSCAN was run with eps = 0.5 and minPts = 5, resulting in 4 clusters and no points unclustered. The Adjusted Rand Index (ARI) was used to compare consistency across all four clustering algorithms. An ARI above 0.70 indicates strong consistency, 0.50–0.70 suggests moderate agreement, and below 0.50 indicates poor consistency [14,15]. We then used silhouette plots, 2D UMAP plots and country maps to visualize country typologies across algorithms.
The most robust and easy-to-interpret cluster solution was then chosen and used for downstream analyses to aid interpretation of clusters. Fuzzy clustering was additionally used to examine membership probabilities at cluster boundaries, which were visualized using country maps [20].
Random Forest, an ensemble of decision trees, was applied post-hoc to identify the variables most important for distinguishing clusters, out of the box error (OOB) was used to estimate the prediction error. Variable importance was quantified using MeanDecreaseAccuracy, which measures how much model accuracy decreases when a variable is excluded, and MeanDecreaseGini, which reflects how much a variable contributes to reducing node impurity across the trees [27]. The top 20 key input indicators were then selected based on the Random Forest results. We calculated correlation coefficients (Cramer’s V Coefficient (φc)) to assess if clusters represent groupings that are recognizably different from existing typologies by world region and HIV/AIDS epidemic type.
Finally, each cluster was summarized into a policy‑interpretable “profile name” by extracting original mean or median values of key variables identified by the Random Forest. These profiles were then refined by consensus within our multidisciplinary team, incorporating both HIV/AIDS prevalence and region alongside new insights from this study.
We used R version 4.1.2 (The R Foundation, Vienna, Austria) and RStudio version 2022.02.2 + 485 (RStudio, Boston, Massachusetts) for data pre-processing and analysis. The complete code for the analysis can be found on GitHub [28].
Results
Using PAM clustering, we found four clusters (k = 4) solution robust and interpretable, as the silhouette score remained above 0.50, with acceptable scores for k = 2 (0.671), k = 3 (0.583), and k = 4 (0.527), coupled with a favourable visual assessment of the Gap Statistic metrics (Fig 1; Fig A in S1 Text). Bootstrap validation showed the following mean Jaccard bootstrap values: cluster 1 (0.779), cluster 2 (0.928), cluster 3 (0.934), and cluster 4 (0.887).
The silhouette score represents how well an observation fits within its assigned cluster compared to neighbouring clusters, with higher values indicating better-defined and more cohesive clusters. PAM = partitioning around medoids, HC = hierarchical, DBSCAN = density-based spatial clustering of applications with noise.
Comparison between the four clustering algorithms showed that PAM and hierarchical clustering (HC) had very similar average silhouette scores (0.53 and 0.53, respectively), indicating good cluster separation, while spectral (0.46) and DBSCAN (0.44) showed slightly lower, but still acceptable, silhouette values. Adjusted Rand Index (ARI) values demonstrated strong agreement between PAM and HC (0.929), as well as between PAM and spectral clustering (0.835). In contrast, agreement with DBSCAN was lower, particularly with PAM (0.710) and HC (0.672), indicating moderate consistency (Table B in S1 Text; Fig B in S1 Text).
Visual inspection of 2D UMAP plots shows that PAM and HC produce well-separated clusters with clear gaps between groups. Spectral clustering yields similar groupings but with slightly more overlap, while DBSCAN identifies dense regions with moderate consistency and some overlap (Fig 2).
Each point represents a country positioned along the first and second UMAP dimensions. The spatial distribution reflects similarity across all input indicators, with clusters appearing as groups of points that are closer together in this reduced-dimensional space. PAM = partitioning around medoids, HC = hierarchical, DBSCAN = density-based spatial clustering of applications with noise.
Given that PAM clustering was the most robust solution and corresponded closely to other algorithms, we further examined the geographical distribution of the clusters in Fig 3. The clusters did not correspond strictly to global regions but cut across them. Cluster 1 (blue) included countries in South Asia (e.g., Afghanistan, Pakistan, Nepal, Bangladesh), East Africa (e.g., Eritrea, Somalia, Tanzania, Sudan), and a few from Oceania (e.g., Myanmar, Cambodia). Cluster 2 (green) was dominated by Sub-Saharan Africa (e.g., Nigeria, Kenya, Zambia, Zimbabwe, South Africa) along with Haiti and Jamaica. Cluster 3 (yellow) brought together a wide geographical range, including Eastern Europe (e.g., Ukraine, Belarus, Serbia), the Middle East and North Africa (e.g., Egypt, Jordan, Algeria), Latin America (e.g., Brazil, Mexico, Argentina), and several Asian countries (e.g., China, Iran, Turkey). Cluster 4 (orange) combined Latin American countries (e.g., Peru, Venezuela, Dominican Republic), Asian countries (e.g., India, Mongolia, Indonesia), North African and Middle Eastern countries (e.g., Iraq, Morocco), and multiple small island states in Oceania and the Caribbean (e.g., Kiribati, Samoa, Tonga, Grenada). Country assignments for the other clustering algorithms can be found in Tables C-F in S1 Text.
Each country is assigned to one of the identified clusters based on similarities across indicators. The map visualizes how countries group together according to these patterns. Source of the basemap shapefile: a SpatialPolygonsDataFrame object containing a simplified world map with polygons attributed to 244 countries, based on Natural Earth data (© Natural Earth, www.naturalearthdata.com), public domain.
Fig 4 shows countries colour-coded by the degree of belonging to each cluster based on fuzzy PAM clustering, with darker colours representing a higher likelihood of belonging to the cluster. It showed that most countries had a high probability of belonging to a single cluster, such as Mozambique (81.5% cluster 2) and Sri Lanka (75.8% cluster 3). Others, however, displayed more mixed memberships, for example Democratic Republic of Congo (41.1% in cluster 1, 43.1% in cluster 2) (full distributions can be found in Table G in S1 Text). This indicates overlap between typologies and highlights that discrete clustering may underestimate uncertainty at cluster boundaries as displayed in Fig 3.
The colour gradient indicates each country’s degree of membership (ranging from 0.00 to 1.00) in a given cluster. Higher values represent a stronger association with that cluster, showing that countries can partially belong to multiple clusters rather than being assigned to just one. Source of the basemap shapefile: a SpatialPolygonsDataFrame object containing a simplified world map with polygons attributed to 244 countries, based on Natural Earth data (© Natural Earth, www.naturalearthdata.com), public domain.
The Random Forest classifier achieved good performance in predicting the PAM clusters (OOB error 11%). Cluster 3 was classified perfectly, while clusters 1 and 4 showed some misclassification, mainly with cluster 2 (Tables H and I in S1 Text). The Random Forest classifier also identified the key variables driving cluster assignment. The most influential variables spanned the subject domains Universal Health Coverage (dentists density, physician density, nurse-initiated ART access), HIV/AIDS epidemic (adult HIV prevalence), healthcare financing (international and public funding ratios for HIV, e.g., for prevention, treatment and vertical transmission, Global Fund resources), demographic and socioeconomic factors (population age structure, access to basic sanitation, per capita GDP), and health and mortality outcomes (life expectancy at birth and age 60, maternal and under-five mortality rates) (Table 2, Table J in S1 Text). Indicators from the subject domains ‘HIV response’ and ‘Legislation about human rights/the HIV response’ were not included among the top 20 variables but showed measurable contributions to cluster assignment.
The correlation between identified groupings and world region (φc = 0.54) and HIV/AIDS prevalence indicators in general and key populations (φc = 0.15- 0.46) were low to moderate (Table 2).
Discussion
Applying unsupervised ML in this context is non-trivial due to the difficulty of integrating different and multi-dimensional data sources. Using a complex and heterogeneous dataset, our study successfully extracted distinct clusters using unsupervised ML algorithms, which offer an alternative to traditional country-clustering by world region or HIV disease prevalence and incidence. Based on multidimensional open-source data, we identified four country profiles of HIV service delivery and financing. Our post-hoc analysis showed that countries cluster primarily according to HIV prevention and treatment coverage, health system resources (e.g., GDP per capita, Global Fund support, and UHC service coverage), service delivery capacity (e.g., health workforce, access to basic sanitation), and demographic structure, and are notably different from existing typologies based solely on HIV/AIDS prevalence and country region.
The analysis delineates distinctive clusters across regions: the Middle East, Latin America, and China form one cluster (cluster 4) characterized by low HIV burden, average high annual GDP, and high UHC. This suggests the potential to incorporate HIV services into general healthcare systems. Meanwhile, clusters one and two, encompassing nations in sub-Saharan Africa, exhibit variations in epidemic nature—concentrated versus generalized. This distinction signifies opportunities for tailored service delivery strategies. Notably, disparities in clusters one and two may stem from factors like access to care for key populations and the level of service integration. At the same time, our results indicate that, using the currently available indicators, clustering largely mirrors regional groupings adjusted for income and epidemiology—factors already considered in policymaking—while also revealing crossovers that offer opportunities for further investigation.
By considering many indicators across relevant domains, we aimed to reduce bias introduced in more traditional analyses by handpicking a few indicators relevant to HIV service delivery and financing. A recent study by Kavanagh et al. [29] suggested significant variation exists in the content and adoption of HIV-related laws and policies between countries, regions, and policy areas. Our study adds to their work by considering the relevant HIV dimensions beyond laws and policies (i.e., healthcare financing, HIV/AIDS burden) and considering possible confounders (i.e., demographic and socioeconomic indicators and health and mortality indicators).
Compared to existing UNAIDS and WHO classifications, which often group countries by geographic region or epidemic burden [14,15], our ML-derived clusters highlight cross-regional similarities and divergences. For example, while sub-Saharan Africa largely groups together under both approaches, our analysis unexpectedly clusters China with Latin America, and identifies outliers such as Sri Lanka and Papua New Guinea—patterns not visible in standard typologies. This illustrates how ML approaches can supplement existing classification models by dynamically incorporating financing and service delivery dimensions. For instance, Zimbabwe and the Caribbean both fall under different regions and face different local epidemics, but both have extensive community healthcare worker programs and currently learning from each other to further strengthen their local health responses [30].
The added value of ML lies in its capacity to incorporate multi-criteria data—such as legal environments for key populations, integration of HIV services into UHC, and health spending efficiency—which could yield typologies directly applicable to policymaking. In the future, moving towards the cascade targets, countries that have already achieved the 95–95–95 targets [1], such as Botswana and Rwanda, may offer lessons to others within their cluster that face similar system constraints but are at an earlier stage of transition.
Comparable studies that assessed health system performance suggest that unsupervised ML analysis is a helpful tool for identifying typologies through a mix of quantitative and institutional indicators [17,31]. A study on long-term care systems in OECD countries underlines that when systems transform rapidly, previous regional-based typologies become outdated and can be supplemented by ML approaches [17]. Kavanagh et al. demonstrated, through the work with the HIV Policy Lab, that HIV-related law and policy environments vary widely across countries and regions, underscoring the limitations of relying solely on geographic or epidemic classifications [29]. Allel et al. (2022) further showed how efficiency in HIV/AIDS spending differs significantly across 78 countries, highlighting the value of multi-dimensional approaches that integrate financing and service delivery factors [32]. Together, these studies support the use of unsupervised methods to generate typologies that reflect not only epidemiological and regional patterns but also legal, financial, and systemic dimensions of the HIV response.
A key benefit of our approach is that analyses can be completed in days using minimal computing resources and open-source software like R. We provide a reproducible workflow—including data preparation, dimensionality reduction, clustering, and validation—available on GitHub [28]. While expert input is still needed to select relevant indicators and interpret clusters in a policy context, this method greatly accelerates and simplifies the generation of actionable typologies.
Our study has several limitations. The unsupervised ML methods used in this study are explorative; no gold standard exists to validate the clustering process and results. The results from the ML models should be interpreted in light of the chosen indicators, as indicators directly drive the results. To increase usefulness for policy and practice, selected indicators are based on existing dimensions of relevant indicators to (sub)national HIV healthcare delivery and financing strategies as defined by UNAIDS [21]. The selected indicators are comparable to those chosen in other HIV strategic planning studies [32,33]. In addition, we included indicators that reflected the status of UHC, such as health worker density and out-of-pocket spending on health. We could not include some valuable indicators for which no data were currently available, for example, user fees, which are known to impact health outcomes in LMICs and could reveal further insights [34].
Overall, the clusters were relatively robust, though stability varied. Clusters 2 and 3 were highly stable, while cluster 1 was less cohesive. Fuzzy clustering confirmed this pattern, with most countries clearly assigned in one cluster, but a few exhibited mixed affiliations, highlighting areas of overlap and underlying heterogeneity. Although most studies do not explicitly report cluster statistics, we compared our methodology to that of Carrillo-Larco et al. [19], who used five country-level variables (among the UHC service coverage index) to classify countries by COVID-19 status. They considered silhouette scores of 0.42–0.44 acceptable, which is below our cluster quality score (0.53). Altogether, while the data being a mix of demographic, epidemiological and institutional/political indicators adds to the study’s strength in terms of completeness, it also makes information less precise and structured for ML analysis. For future analyses, using more granular data inputs, for example about legislation about human rights, or differentiated by sub-national level, ML could generate more actionable typologies, moving beyond broad regional categories to provide a more efficient tool for identifying cross-country innovations and decision-making.
Conclusions
In this study, we applied ML methods to identify distinctive country profiles of HIV service delivery and financing, providing a “proof of principle” for the use of ML in strategic public health analysis. While our current results largely reproduce regional groupings adjusted for income and epidemiology, they also reveal unexpected cross-regional similarities and outliers not captured by existing UNAIDS or WHO classifications. This study illustrates the potential of ML methods to generate useful insights and to support policy and intervention planning in the global HIV response. The clusters identified here provide a basis for comparative case studies among countries with similar profiles, while stakeholder engagement will be essential to translate these to best policy practices. While exploratory, our analysis demonstrates both the feasibility of ML approaches in global health and the importance of investing in richer datasets to enhance their policy relevance. In the current context of drastic funding reduction, however, country partners and global health stakeholders have a short time to pursue urgent reforms to safeguard the future of this vital program and save millions of lives. Identifying groups of countries with similar entry points and challenges across the five areas could help accelerate identifying innovative models tailored to a country’s needs that transcend the traditional regional grouping.
Supporting information
S1 Text. Supporting information.
Supporting information for Using country-level variables to discover country clusters beyond traditional health policy and performance metrics: An unsupervised machine learning approach for HIV healthcare delivery and financing.
https://doi.org/10.1371/journal.pgph.0004583.s001
(DOCX)
References
- 1.
Joint United Nations Programme on HIV/AIDS. Understanding measures of progress towards the 95–95–95 HIV testing, treatment and viral suppression targets. Geneva. 2023. https://www.unaids.org/sites/default/files/media_asset/progress-towards-95-95-95_en.pdf
- 2. Joint United Nations Programme on HIV/AIDS. UNAIDS Data 2019. 2019. https://www.unaids.org/sites/default/files/media_asset/2019-UNAIDS-data_en.pdf_aidsinfo.unaids.org
- 3. Bor J, Herbst AJ, Newell M-L, Bärnighausen T. Increases in adult life expectancy in rural South Africa: valuing the scale-up of HIV treatment. Science. 2013;339(6122):961–5. pmid:23430655
- 4. UHC2030 Initiative. UHC2030 launches “Key Asks from the UHC Movement” for the UN High-Level Meeting on Universal Health Coverage. https://www.uhc2030.org/blog-news-events/uhc2030-news/uhc2030-launches-key-asks-from-the-uhc-movement-for-the-un-high-level-meeting-on-universal-health-coverage-544846/. 2019.
- 5. Bekker L-G, Alleyne G, Baral S, Cepeda J, Daskalakis D, Dowdy D, et al. Advancing global health and strengthening the HIV response in the era of the Sustainable Development Goals: the International AIDS Society-Lancet Commission. Lancet. 2018;392(10144):312–58. pmid:30032975
- 6. Holmes CB, Rabkin M, Ford N, Preko P, Rosen S, Ellman T, et al. Tailored HIV programmes and universal health coverage. Bull World Health Organ. 2020;98(2):87–94. pmid:32015578
- 7. Feinstein MJ, Hsue PY, Benjamin LA, Bloomfield GS, Currier JS, Freiberg MS, et al. Characteristics, Prevention, and Management of Cardiovascular Disease in People Living With HIV: A Scientific Statement From the American Heart Association. Circulation. 2019;140(2):e98–124. pmid:31154814
- 8. Ratevosian J, Millett G, Honermann B, Bennett S, Connor C, Bekker L-G, et al. PEPFAR under review: what’s at stake for PEPFAR’s future. Lancet. 2025;405(10479):603–5. pmid:39929219
- 9. Stackpool-Moore L, Logie CH, Cloete A, Reygan F. What will it take to get to the heart of stigma in the context of HIV? J Int AIDS Soc. 2022;25 Suppl 1(Suppl 1):e25934. pmid:35818934
- 10. Joint United Nations Programme on HIV/AIDS. HIV Response Sustainibility Primer. 2024. https://sustainability.unaids.org/wp-content/uploads/2024/06/HIV-response-sustainability-response-primer_web.pdf
- 11.
Joint United Nations Programme on HIV/AIDS. Sustainability of the HIV response. Geneva. 2024. https://sustainability.unaids.org/
- 12. Matanje B, Masha RL, Rwibasira G, Ngure K, Yahaya HB, Anam FR, et al. The global HIV response at a crossroads: protecting gains and advancing sustainability amid funding disruptions. Lancet HIV. 2025;12(7):e532–6. pmid:40354797
- 13. Oberth G, Whiteside A. What does sustainability mean in the HIV and AIDS response?. Afr J AIDS Res. 2016;15(1):35–43. pmid:26785676
- 14. Elendu C, Amaechi DC, Elendu TC, Amaechi EC, Elendu ID, Akpa KN, et al. Shaping sustainable paths for HIV/AIDS funding: a review and reminder. Ann Med Surg (Lond). 2025;87(3):1415–45. pmid:40213211
- 15. Hontelez JAC, Bulstra CA, Yakusik A, Lamontagne E, Bärnighausen TW, Atun R. Evidence-based policymaking when evidence is incomplete: The case of HIV programme integration. PLoS Med. 2021;18(11):e1003835. pmid:34752470
- 16. Maugeri A, Barchitta M, Basile G, Agodi A. Applying a hierarchical clustering on principal components approach to identify different patterns of the SARS-CoV-2 epidemic across Italian regions. Sci Rep. 2021;11(1):7082. pmid:33782519
- 17. Ariaans M, Linden P, Wendt C. Worlds of long-term care: A typology of OECD countries. Health Policy. 2021;125(5):609–17. pmid:33715875
- 18. Grant RW, McCloskey J, Hatfield M, Uratsu C, Ralston JD, Bayliss E, et al. Use of Latent Class Analysis and k-Means Clustering to Identify Complex Patient Profiles. JAMA Netw Open. 2020;3(12):e2029068. pmid:33306116
- 19. Carrillo-Larco RM, Castillo-Cara M. Using country-level variables to classify countries according to the number of confirmed COVID-19 cases: An unsupervised machine learning approach. Wellcome Open Res. 2020;5:56. pmid:32587900
- 20. Luczak A, Kalinowski S. Fuzzy clustering methods to identify the epidemiological situation and its changes in European countries during COVID-19. Entropy. 2021;24(1).
- 21.
UNAIDS Programme Coordinating Board. Way forward to achieving sustainable AIDS results; UNAIDS/PCB (43)/18.25. 2018.
- 22.
Global AIDS strategy 2021–2026 end inequalities. end AIDS. 2021.
- 23.
UNAIDS Programme Coordinating Board. Evidence Review - Implementation of the 2016–2021 Unaids Strategy: on the Fast-Track To End Aids. 2020; UNAIDS/PCB (47)/CRP3.; 2020 1 December 2020.
- 24. Zhang Z. Introduction to machine learning: k-nearest neighbors. Ann Transl Med. 2016;4(11):218. pmid:27386492
- 25. Gao CX, Dwyer D, Zhu Y, Smith CL, Du L, Filia KM, et al. An overview of clustering methods with guidelines for application in mental health research. Psychiatry Res. 2023;327:115265. pmid:37348404
- 26. Allaoui M, Kherfi ML, A C. Considerably improving clustering algorithms using UMAP dimensionality reduction. Image and Signal Processing. 2020;12119:317–25.
- 27. Strobl C, Malley J, Tutz G. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol Methods. 2009;14(4):323–48. pmid:19968396
- 28. Alexiou Z, Dixit S, Bulstra C. AIDS-HIV-clustering-typology code. https://github.com/AIDS-HIV-clustering-typology/HIV-Clustering
- 29. Kavanagh MM, Graeden E, Pillinger M, Singh R, Eaneff S, Bendaud V, et al. Understanding and comparing HIV-related law and policy environments: cross-national data and accountability for the global AIDS response. BMJ Glob Health. 2020;5(9):e003695. pmid:32999052
- 30. Solar O, Valentine N, Castedo A, Brandt GS, Sathyandran J, Ahmed Z, et al. Action on the social determinants for advancing health equity in the time of COVID-19: perspectives of actors engaged in a WHO Special Initiative. Int J Equity Health. 2023;21(Suppl 3):193. pmid:36694195
- 31. Braithwaite J, Tran Y, Ellis LA, Westbrook J. Inside the black box of comparative national healthcare performance in 35 OECD countries: Issues of culture, systems performance and sustainability. PLoS One. 2020;15(9):e0239776. pmid:32986783
- 32. Allel K, Abou Jaoude GJ, Birungi C, Palmer T, Skordis J, Haghparast-Bidgoli H. Technical efficiency of national HIV/AIDS spending in 78 countries between 2010 and 2018: A data envelopment analysis. PLOS Glob Public Health. 2022;2(8):e0000463. pmid:36962380
- 33. Stover J, Glaubius R, Teng Y, Kelly S, Brown T, Hallett TB, et al. Modeling the epidemiological impact of the UNAIDS 2025 targets to end AIDS as a public health threat by 2030. PLoS Med. 2021;18(10):e1003831. pmid:34662333
- 34. Qin VM, Hone T, Millett C, Moreno-Serra R, McPake B, Atun R, et al. The impact of user charges on health outcomes in low-income and middle-income countries: a systematic review. BMJ Glob Health. 2019;3(Suppl 3):e001087. pmid:30792908