Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A two-tiered latent class and spatial analytical approach to identify clusters of neonatal mortality among very low birth weight infants: A population-based cohort study

  • Daniela Testoni Costa-Nobre ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Conceptualization, Formal analysis, Writing – original draft

    danielatestoni@gmail.com

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Adriana Sanudo ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Formal analysis, Methodology, Validation, Writing – original draft

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Kelsy Nema Areco ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Ana Sílvia Scavacini Marinonio ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Milton Harumi Miyoshi ,

    Roles Data curation, Investigation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Tulio Konstantyner ,

    Roles Investigation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Carina Nunes Vieira e Oliveira ,

    Roles Investigation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Rita de Cassia Xavier Balda ,

    Roles Investigation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Mandira Daripa Kawakami ,

    Roles Investigation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Paulo Bandiera-Paiva ,

    Roles Data curation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Rosa Maria Viera Freitas ,

    Roles Data curation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Fundação Sistema Estadual de Análise de Dados (SEADE Foundation), São Paulo, São Paulo, Brazil

  • Mônica La Porte Teixeira ,

    Roles Data curation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Fundação Sistema Estadual de Análise de Dados (SEADE Foundation), São Paulo, São Paulo, Brazil

  • Bernadette Cunha Waldvogel ,

    Roles Data curation, Writing – review & editing

    ‡ These authors also contributed equally to this work.

    Affiliation Fundação Sistema Estadual de Análise de Dados (SEADE Foundation), São Paulo, São Paulo, Brazil

  • Maria Fernanda de Almeida ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Conceptualization, Formal analysis, Visualization, Writing – original draft

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  • Ruth Guinsburg ,

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Conceptualization, Formal analysis, Funding acquisition, Methodology, Visualization, Writing – review & editing

    Affiliation Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil

  •  [ ... ],
  • Carlos Roberto Veiga Kiffer

    Contributed equally to this work with: Daniela Testoni Costa-Nobre, Adriana Sanudo, Kelsy Nema Areco, Ana Sílvia Scavacini Marinonio, Maria Fernanda de Almeida, Ruth Guinsburg, Carlos Roberto Veiga Kiffer

    Roles Conceptualization, Formal analysis, Methodology, Supervision, Writing – original draft

    Affiliations Escola Paulista de Medicina – Universidade Federal de São Paulo (UNIFESP), São Paulo, São Paulo, Brazil, CEPID – ARIES (Antimicrobial Resistance Institute of São Paulo), São Paulo, São Paulo, Brazil

  • [ view all ]
  • [ view less ]

Abstract

Objective

To identify and analyze patterns of neonatal deaths among very low birth weight (VLBW) infants in the most socioeconomically developed state of Brazil, from 2004 to 2020, using a two-tiered probabilistic approach that combines Latent Class Analysis (LCA) and spatial analysis.

Study design

This historical population-based cohort study included 137,224 live births with birthweight of 400-1499g to mothers residing in São Paulo State, using linked birth and death certificate data.

Results

Among 42,230 neonatal deaths, five distinct latent classes were identified: infection-dominant, intrapartum event-dominant, malformation-dominant, respiratory-dominant, and other. Survival analysis showed differences in timing of death across classes, with intrapartum-related deaths concentrated in the first hours of life, and infection-related deaths occurring later. Spatial analysis revealed geographic clustering especially for infection, malformation, and respiratory-related deaths, primarily in southern municipalities of the State.

Conclusion

The combined use of LCA and spatial analysis identified distinct patterns of neonatal mortality. LCA differentiated clinically meaningful profiles with specific timing of death, while spatial analysis revealed municipal-level clustering and overlap of these patterns. These findings showed how neonatal mortality is shaped by both diagnostic profiles and territorial context, providing actionable evidence to guide targeted improvements in perinatal and neonatal care to reduce preventable deaths among VLBW infants.

Introduction

Understanding the multifocal nature of neonatal mortality requires analytical approaches that can account for diagnostic uncertainty, overlapping clinical conditions, and geographic disparities in access and quality of maternal neonatal care. Despite overall improvements in health indicators in Brazil, the reduction in neonatal mortality has been slower and uneven across regions, particularly in areas with limited access to high-quality health services [1]. Neonatal mortality accounts for approximately 70% of all infant deaths in the country, reflecting persistent inequalities in perinatal care [2].

Secondary data from vital statistics systems offer valuable insights into the complex interplay of biological, health systems, and socio-environmental determinants of neonatal deaths. When appropriately analyzed, these data enable large-scale assessments of mortality patterns, the burden of specific conditions and spatio-temporal disparities [3]. However, traditional approaches often struggle to disentangle these overlapping factors and to account for the variability and uncertainty in diagnostic reporting. To address these limitations, probabilistic classification methods such as Latent Class Analysis (LCA) can classify deaths into mutually exclusive groups based on co-occurring diagnostic information, refining interpretation of overlapping causes [4]. Each major category of neonatal death has distinct etiological, clinical, and programmatic implications [5]. LCA accommodates this complexity by grouping cases according to shared diagnostic profiles, rather than relying solely on underlying cause-of-death coding.

Moreover, integrating LCA with spatial analysis and geoprocessing can identify geographic areas with elevated probabilities of clustering for specific latent classes (LC) of death, revealing spatial heterogeneity and potential hotspots of systemic failure [6]. Stratifying deaths by both diagnostic patterns and geographic location thus enhances understanding of health system performance and facilitates the design of targeted interventions. In addition, different causes of death are often associated with distinct timing of death. Intrapartum-related deaths, for instance, may reflect deficiencies in labor and delivery care, while deaths from infections may indicate failures in neonatal care or infection control [7]. Early neonatal deaths (0–6 days) are often linked to birth asphyxia or extreme prematurity, whereas late neonatal deaths (7–27 days) are more frequently associated with infections, congenital anomalies, and postnatal complications [79].

Therefore, this study aimed to identify and analyze latent patterns of neonatal deaths among very low birth weight (VLBW) infants in the state of São Paulo, Brazil, over the period from 2004 to 2020, applying a two-tiered probabilistic approach that combines LCA and spatial analysis.

Methods

Population and case definition

This was a historical population-based cohort study using data from deterministic linkage between death and live birth certificates. The study included all live births ≥22 weeks gestational age and birth weight between 400 and 1499 grams, born from 2004 to 2020 to mothers residing in São Paulo State, Brazil. A deterministic linkage process was applied to link live births and neonatal deaths (<28 days). [10] Data were received from the State Data Analysis System Foundation (Fundação SEADE) on 17/01/2024, which compiles vital statistics with over 99% coverage of births and deaths in São Paulo [11]. We had full access to the de-identified population-based database used to create the study population. Procedures to ensure data adequacy, including data preparation and data validation prior to data use, were undertaken and have been previously published [12].

Diagnostic groups were defined a priori to represent major etiological pathways of neonatal mortality and were intentionally broad to capture overlapping diagnoses, allowing latent class analysis to probabilistically identify clinically meaningful patterns of co-occurring conditions [5,79]. Diagnoses from the death certificates were categorized using the 10th revision of International Classification of Diseases (ICD-10) [13] codes into the following groups (full list in S1 Table):

  • Infection: bacterial and fungal sepsis, meningitis, pneumonia (A32.7; A40; A41; B37.7; G00-01; G04; J13-18; M86; N30; N39.0; P23; P36; P37.2; P38-39).
  • Intrapartum events: birth injury, intrauterine hypoxia, birth asphyxia and meconium aspiration syndrome (P10-15; P20; P21; P24.0).
  • Malformations: any malformation listed in the death certificate (Q00-99).
  • Respiratory failure or diseases: respiratory conditions of the perinatal period, pneumothorax, respiratory failure, extreme immaturity, respiratory distress syndrome of newborn, interstitial lung diseases (J43; J81; J84; J93; J96; P07.2; P22; P25; P27-28).

Multiple diagnoses could be assigned to a single live birth. The study was approved by the Ethics Committee of the Federal University of São Paulo (#65016822.1.0000.5505). The requirement for informed consent was waived by the Institutional Review Board due to the use of secondary data and a de-identified database.

First-tier: Latent class analysis

The diagnostic groups served as the basis for the first-tier approach of LCA. LCA was applied to identify distinct subgroups within the cohort of neonatal deaths in the entire population [14]. Models ranging from one to six LCs were estimated, using 600 random starting values and 120 final stage optimizations to ensure model stability and avoid local maxima. Model selection was guided by multiple fit indices, including the Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), sample-size-adjusted BIC (SSABIC), the Voung‐Lo–Mendell–Rubin test (VMLR), the bootstrap likelihood ratio test (LMRadjLRT), and entropy [15].

Among the information criteria AIC, BIC, and SSABIC indicated better model fit; in cases where the criteria suggested different solutions, preference was given to the model with the lowest BIC, in accordance with previous recommendations [15]. The model with the highest entropy was also favored, as entropy provides a summary measure of classification precision, with values closer to one indicating greater certainty in class assignment and separation between LCs. When the VLMR and bootstrap LMRadjLRT tests yield divergent results, the latter was prioritized due to its superior accuracy in model selection [15]. While all statistical criteria were considered, the final model choice also incorporated expert judgment, prioritizing solutions that were conceptually meaningful with internally consistent and distinct LCs.

Once the number of LCs had been calculated and the final model chosen, the probability of each neonate’s class was estimated, and they were classified into the most appropriate LC. Finally, for clarity each LC was numbered and named based on the predominant diagnosis of its specific class.

Second-tier: Spatial analysis

The neonatal mortality rate for each LC was calculated at the municipal level, based on the municipality of death, by dividing the total number of neonatal deaths in each class by the total number of live births, expressed as per cent of live births. These rates were calculated for each municipality and a smoothing method was applied with a local Bayesian estimator for reducing the impact of outlier areas with low event counts by considering the mean rates of its neighbors [16,17]. Spatial dependency and autocorrelation of the neonatal mortality rates for each LC per municipality were assessed using Global Moran’s Indicator (I) and Local Indicators of Spatial Association (LISA). Finally, a visual correlation was performed between neonatal mortality clusters for each pair of LCs to identify overlapping patterns across municipalities. Different colors were used to represent distinct cluster combinations (e.g., both LCs showing “high-high” or “low-low”, or one LC showing “high-high” and the other “low-low”). This visual representation enabled the identification of spatial overlaps across municipalities.

Validation of early and late mortality categorization

The Kaplan–Meier survival analysis was conducted as a sensibility test to estimate and compare the timing of death (in days of life) among neonates assigned to each LC identified in the final model. This approach enabled visualization and statistical comparison of survival curves, providing objective evidence to confirm whether specific LC were predominantly associated with early neonatal deaths (0–6 days) or late neonatal deaths (7–27 days).

Data management

Data management was performed using STATA software (version 18.5, StataCorp LP, College Station, TX, USA). LCA analysis was conducted using the Mplus program (version 8.11) [18] and spatial analysis was conducted with TerraView software (version 4.2.2, Instituto Nacional de Pesquisas Espaciais, São José dos Campos, Brazil). Geographic coordinates and spatial files for São Paulo State and its municipalities were obtained from the Brazilian Institute of Geography and Statistics [19].

Results

Population

From 2004 to 2020, a total of 10,265,105 live births were recorded. Of these, 9,713,206 had a birth weight of 1500g or more, 410,221 had an unknown birth weight, 2341 had birth weight < 400g, and 2113 had a gestational age < 22 weeks, resulting in 137,224 newborns included in the study. Among them, 42,230 (31%) died during the neonatal period. Maternal age was under 20 years in 17%, 60% were born by cesarean section and 34% were born before 28 weeks of gestation.

First-tier: Latent class analysis

A total of seven LC models (1–7 classes) were evaluated (Table 1). As expected, the 1-class solution showed the poorest fit to the data. Although the 7-class model yielded the lowest AIC, BIC, and SSABIC values, its classes were not conceptually distinct or interpretable. The entropy of the 7-class model (0.879) was only marginally higher than that of the 5-class model (0.875), and comparable to that of the 6-class model (0.845). Despite the slightly lower SSABIC observed in the 6-class model, the 5-class model also demonstrated adequate fit indices.

thumbnail
Table 1. Comparison of model fit statistics for selection of optimal latent class model.

https://doi.org/10.1371/journal.pone.0343004.t001

The 5-class model was chosen, and each LC was named based on the predominant diagnosis: class 1 as infection-dominant, class 2 as intrapartum event-dominant, class 3 as malformation-dominant, class 4 as respiratory-dominant, and class 5 as other. Regarding the distribution of the population among LCs, 28% (n = 11,758) belonged to LC 1; 4% (n = 1834) to class 2; 9% (n = 3643) to class 3; 46% (n = 19,601) to class 4 and 13% (n = 5394) to class 5. LC 1 comprised exclusively of infants with an infection-related diagnosis in any line of the death certificate, with 8% also having intrapartum events, 8% malformations, and 27% respiratory issues. LC 2 had no infection-related diagnoses, but all deaths were associated with intrapartum events, with 1% also presenting malformations and 3% respiratory issues. LC 3 included 7% of infants with an infection-related diagnosis, 15% with intrapartum events, 100% with malformations, and 32% with respiratory issues. In LC 4 all cases had respiratory issues, and 23% of infants also with an infection-related diagnosis, 14% with intrapartum events, and 2% with malformations. LC 5 had infants only with other causes of death, with no diagnoses related to infection, intrapartum events, malformations, or respiratory issues (Fig 1). Demographic characteristics for each class are described in Table 2.

thumbnail
Table 2. Demographic characteristics across five latent classes of death and survival.

https://doi.org/10.1371/journal.pone.0343004.t002

thumbnail
Fig 1. Overlay plot of latent classes categorized by causes of death.

IE: intrapartum event.

https://doi.org/10.1371/journal.pone.0343004.g001

Second-tier: Spatial analysis

The spatial analysis focused on the four most significant LC, i.e.,: infection-dominant (1), intrapartum event-dominant (2), malformation-dominant (3) and respiratory-dominant (4). Class 5 (other) comprised diverse infrequent and/or not discriminative diagnoses and was not used in the second-tier analysis. Descriptive maps displaying neonatal mortality rates by quintiles for each LC suggested the presence of clustered areas across all classes (Fig 2). Moran’s Global Index, applied to the smoothed rates for each class, confirmed significant spatial autocorrelation: I = 0.50 for infection-dominant LC (p = 0.001), I = 0.30 for intrapartum event-dominant LC (p = 0.001), I = 0.43 for malformation-dominant LC (p = 0.001) and I = 0.44 for respiratory-dominant LC (p = 0.001).

thumbnail
Fig 2. Distribution of neonatal mortality rates by latent classes after smoothing with local Bayes estimates (Quintiles, %).

Latent classes: A) Infection-dominant, B) Respiratory-dominant, C) Malformation-dominant, and D) Intrapartum Event-dominant. Data from 645 São Paulo Municipalities (Brazil), from 2004-2020.

https://doi.org/10.1371/journal.pone.0343004.g002

The LISA Cluster Maps highlighted areas of high and low rates for each LC (Fig 3). Infection, malformation and respiratory-dominant classes exhibited an apparently similar clustering pattern, which differed from that of intrapartum event-dominant LC. High-rate clusters for infection, malformation, and respiratory-dominant classes were predominantly found in the southern region of the state, whereas low-rate clusters for intrapartum event-dominant class were concentrated in the southern and central areas. In the northern and northwestern regions, alternating areas of high- and low-rate clusters were observed for each LC.

thumbnail
Fig 3. LISA cluster map of neonatal mortality rates by latent classes in São Paulo State, Brazil (2004–2020).

Latent classes: A) Infection-dominant, B) Respiratory-dominant, C) Malformation-dominant, and D) Intrapartum Event-dominant. Data from 645 São Paulo Municipalities (Brazil), from 2004-2020.

https://doi.org/10.1371/journal.pone.0343004.g003

Fig 4 confirms a correlation between infection and malformation-dominant cluster areas, with a positive correlation in 40 municipalities and a negative correlation in only 4. The findings also reinforce the occurrence of a negative correlation between intrapartum event and infection, malformation and respiratory-dominant classes clusters.

thumbnail
Fig 4. Correlation of municipalities with overlapping latent class clusters of neonatal mortality in São Paulo State, Brazil (2004–2020).

Correlation among latent classes: A) Infection-dominant & Respiratory-dominant, B) Infection-dominant & Malformation-dominant, C) Malformation-dominant & Respiratory-dominant, D) Infection-dominant & Intrapartum Event-dominant, E) Respiratory-dominant & Intrapartum Event-dominant, and F) Malformation-dominant & Intrapartum Event-dominant.

https://doi.org/10.1371/journal.pone.0343004.g004

Validation of early and late mortality categorization

As shown by Fig 5, time of death varied across LCs. Deaths occurred significantly later in the infection-dominant class, with a median of 8 days or 190 hours (interquartile range, IQR: 77–339 hours), while they occurred earlier in the intrapartum event-dominant class, with a median of 7 hours (IQR: 1–48 hours). The median time to death was 11 hours (IQR: 1–96 hours) for the malformation-dominant class and 37 hours (IQR: 8–96 hours) for the respiratory-dominant class.

thumbnail
Fig 5. Survival curve by latent class of neonatal mortality in São Paulo State, Brazil (2004–2020).

https://doi.org/10.1371/journal.pone.0343004.g005

Discussion

This study applied a two-tiered probabilistic framework, combining LCA and spatial analysis, to investigate diagnostic and geographic patterns of neonatal mortality among VLBW infants in São Paulo State, Brazil. Five distinct LCs of neonatal death were identified based on diagnostic profiles – each reflecting co-occurring clinical conditions rather than single-cause attributions – enhancing interpretability beyond conventional death certificate categorizations. Notably, one class (intrapartum-dominant) was clearly linked to early neonatal mortality, while the infection-dominant class was associated with later deaths. The malformation- and respiratory-dominant classes presented an intermediate pattern, with median time to death under 48 hours, but survival distributions that more closely resembled the infection class. Spatial clustering of mortality by class revealed significant geographic heterogeneity, particularly for infection-, malformation-, and respiratory-related deaths, which concentrated in southern municipalities of São Paulo state. These findings underscore the interaction between cause-specific mortality patterns and regional disparities in health system performance.

From a methodological perspective, the use of LCA enabled the classification of neonatal deaths into internally consistent and conceptually distinct LCs. Similar approaches have been employed in adult and children mortality studies in other settings, demonstrating LCA’s utility in uncovering hidden diagnostic patterns and clarifying heterogeneous mortality profiles [20,21]. These classes reflected co-occurring diagnostic profiles and allowed for more nuanced interpretation beyond the limitations of single-cause attribution from death certificates [14,22]. The advantage of LCA is that it defines subgroups by considering multiple variables concurrently, independent of the outcome [14].

The use of entropy and fit indices (AIC, BIC, SSABIC) provided statistical rigor, while the incorporation of conceptual coherence ensured the interpretability of the resulting classes [15]. The selection of the 5-class model was guided by the principle of parsimony and, most importantly, by epidemiological interpretation of the resulting classes in the context of neonatal mortality. While the 6- and 7-class solutions offered marginal statistical improvements, they lacked meaningful differentiation between classes. The 5-class model offered the best balance between statistical adequacy and substantive interpretability, making it the most appropriate choice for this analysis. The final five-class model included distinct profiles: infection-dominant, intrapartum event-dominant, malformation-dominant, respiratory-dominant, and other causes.

The second analytical tier revealed spatial heterogeneity in the distribution of these LCs across municipalities. The use of Moran’s I and LISA statistics, commonly applied in spatial epidemiology, strengthened the detection of local clustering and regional disparities in mortality burden [16,23,24]. The empirical Bayesian method was used to calculate the smoothed rates, to stabilize and reduce the high variability observed in the crude rates [25]. The spatial autocorrelation findings confirmed the presence of geographic clustering, particularly in the southern region of the state for infection, malformation, and respiratory-dominant classes. In contrast, intrapartum-related deaths showed a distinct pattern, with low-rate clusters concentrated in the south-central areas. These spatial insights suggest that the burden and likely contributing factors for each class vary geographically, underscoring the need for regionally tailored public health strategies [23,25].

The spatial analysis indicated that latent classes dominated by intrapartum-related conditions and infections were the main contributors to geographic disparities in neonatal mortality, as they showed the most consistent and concentrated high–high clusters across municipalities. These patterns likely reflect deficiencies in the quality of antenatal and intra-partum care, delays in referral, and limitations in neonatal intensive care and infection prevention practices [2325]. In contrast, classes characterized predominantly by congenital malformations and respiratory conditions exhibited more diffuse or overlapping spatial patterns, suggesting a lesser contribution to overall spatial inequality. By linking class-specific spatial clustering to plausible health system gaps, these findings provide actionable guidance for targeted interventions aimed at reducing preventable neonatal deaths.

The spatial analysis may be influenced by referral bias, as clustering based on municipality of death can overestimate rates in municipalities hosting referral centers, particularly for infection- and respiratory-dominant classes associated with later neonatal deaths. Conversely, early intrapartum-related deaths may be underestimated in these areas, as deaths often occur before inter-municipality transfer. Importantly, class-specific spatial clustering offers actionable insights: intrapartum-dominant clusters point to the need for strengthening antenatal and intrapartum care, whereas infection-dominant clusters highlight priorities for infection prevention bundles, antimicrobial stewardship, and enhanced surveillance within neonatal intensive care units.

A conceptual strength of the study lies in its dual approach to distinguishing early and late neonatal deaths – drawing from both epidemiological literature and empirical survival analysis. Early neonatal deaths (0–6 days) are typically associated with perinatal complications such as extreme prematurity, birth asphyxia, and delivery-related trauma, while late neonatal deaths (7–27 days) more frequently reflect infections or congenital anomalies [5,79,26]. The Kaplan-Meier analysis broadly aligned with these expectations but also revealed important nuances across classes. Specifically, the intrapartum-dominant class showed a sharply concentrated distribution of deaths in the first hours of life. In contrast, the infection-dominant class exhibited a more delayed pattern, consistent with late neonatal mortality and the known pathophysiology of hospital-acquired or postnatal infections [27]. Interestingly, the malformation-dominant and respiratory-dominant classes demonstrated a median time to death of less than 48 hours, yet with distributions less accurately skewed toward the first hours than the intrapartum class. These profiles may be best characterized as occupying a transitional space between early and late neonatal deaths. Despite the earlier median timing of the malformation and respiratory dominant classes, their overall survival curves more closely resembled that of the infection class than the intrapartum class – suggesting that, although pathophysiologically distinct, these deaths may share causal pathways such as insufficient access to early specialized care, limited postnatal stabilization care, lack of timely surgical approach for congenital anomalies or inadequate ventilatory support for respiratory failure in VLBW infants [2830]. These findings reinforce the critical role of immediate newborn care, special newborn care with dedicated staff, specialized equipment in preventing avoidable clinical deterioration among high-risk neonates [29].

In contrast, the late timing of infection-related deaths suggests these infants likely survived the initial high-risk perinatal period, only to succumb to complications arising during hospitalization. This finding highlights the importance of ongoing high-quality care within neonatal intensive care units [27,29]. Factors such as breaches in infection control practices, delays in recognizing clinical deterioration, and inappropriate empirical antimicrobial regimens may contribute significantly to these outcomes [29]. Ensuring optimal sepsis prevention, surveillance protocols and timely therapeutic interventions within neonatal intensive care units is essential in reducing late neonatal mortality [29].

These findings have important implications for neonatal care policies. The predominance of late deaths in the infection-dominant class highlights the need for improved postnatal surveillance, early diagnosis, and infection control practices, particularly in municipalities with identified clusters. Conversely, the concentration of intrapartum-related deaths in the early period underscores gaps in obstetric and delivery care that require urgent attention [5,26]. The geographic clustering of specific mortality profiles highlights the need for spatially target interventions, especially in the context of Brazil’s decentralized health system, where health service delivery and resource allocation vary widely across municipalities [31]. Prior studies in Brazil have documented the impact of structural health inequalities, such as inconsistent access to neonatal intensive care and maternal care networks, in shaping neonatal outcomes [1]. This study also demonstrates the value of applying advanced analytical techniques to routinely collected data. This spatially informed approach is essential for pinpointing areas of concern, enabling more strategic intervention efforts and resource distribution, thus strengthening the study’s implications for public policy. By leveraging probabilistic modeling and spatial epidemiology, it is possible to extract meaningful patterns that are otherwise obscured in aggregated mortality statistics. The results support a more refined understanding of neonatal mortality, offering a framework for evidence-based, regionally responsive strategies to reduce preventable deaths in vulnerable populations such as VLBW infants.

Nonetheless, the study has limitations. The use of death certificate data introduces potential for misclassification, though this was partially mitigated by incorporating the causes found in any line of the death certificate. Moreover, the spatial analysis was based on the municipality of death rather than residence, which may limit the ability to fully capture upstream social or systemic factors. This approach, however, was chosen intentionally to approximate the quality and effectiveness of perinatal care, under the assumption that, in most cases, death occurs near the (or at the same) place of birth. Nonetheless, we acknowledge that inter-municipality transfers may occur, potentially introducing some misclassification in the spatial attribution of care-related outcomes. As this study relied on routinely collected secondary data not originally designed to address the specific research question, potential misclassification, unmeasured confounding, and temporal changes in care practices should be considered when interpreting the findings. Future studies may benefit from integrating additional data sources, such as maternal and neonatal health records, to further contextualize mortality patterns.

In conclusion, by applying a two-tiered approach this study highlights how the combination of methodological rigor and epidemiological insight can uncover diagnostic and geographic patterns in neonatal mortality. LC analysis identified distinct and clinically meaningful diagnostic patterns, differentiating deaths predominantly related to intrapartum events from those associated with infections, respiratory conditions and congenital malformations, with clear differences in timing of death. Spatial analysis further demonstrated that these classes were unevenly distributed across municipalities, revealing geographic clustering and overlap of specific mortality patterns that reflect heterogeneity in health system performance. Interventions must address both the immediate quality of perinatal care, particularly for conditions associated with rapid deterioration, and the continuity of care in neonatal intensive care units, especially to prevent deaths from nosocomial infections and complications emerging after the early neonatal period.

Supporting information

S1 Table. ICD-10-coded diagnoses from death certificates and their frequency in the study population.

https://doi.org/10.1371/journal.pone.0343004.s001

(DOCX)

References

  1. 1. Victora CG, Aquino EML, do Carmo Leal M, Monteiro CA, Barros FC, Szwarcwald CL. Maternal and child health in Brazil: progress and challenges. Lancet. 2011;377(9780):1863–76. pmid:21561656
  2. 2. Secretaria de Vigilância em Saúde e Ambiente. Análise epidemiológica e vigilância de doenças não transmissíveis. 2025. https://svs.aids.gov.br/daent/centrais-de-conteudos/paineis-de-monitoramento/mortalidade/infantil-e-fetal/
  3. 3. de Frias PG, Szwarcwald CL, de Lira PIC. Evaluation of information systems on live births and mortality in Brazil in the 2000s. Cad Saude Publica. 2014;30(10):2068–280. pmid:25388310
  4. 4. Weller BE, Bowen NK, Faubert SJ. Latent Class Analysis: A Guide to Best Practice. Journal of Black Psychology. 2020;46(4):287–311.
  5. 5. Lawn JE, Kerber K, Enweronu-Laryea C, Cousens S. 3.6 million neonatal deaths--what is progressing and what is not?. Semin Perinatol. 2010;34(6):371–86. pmid:21094412
  6. 6. Feng X, Sarma H, Seubsman S-A, Sleigh A, Kelly M. Spatial Analysis of patterns of Multimorbidity in the Thai Cohort Study Using Latent Class Analysis. J Epidemiol Glob Health. 2025;15(1):24. pmid:39937408
  7. 7. Lawn JE, Blencowe H, Oza S, You D, Lee ACC, Waiswa P, et al. Every Newborn: progress, priorities, and potential beyond survival. Lancet. 2014;384(9938):189–205. pmid:24853593
  8. 8. Lehtonen L, Gimeno A, Parra-Llorca A, Vento M. Early neonatal death: A challenge worldwide. Semin Fetal Neonatal Med. 2017;22(3):153–60. pmid:28238633
  9. 9. Liu L, Oza S, Hogan D, Chu Y, Perin J, Zhu J, et al. Global, regional, and national causes of under-5 mortality in 2000-15: an updated systematic analysis with implications for the Sustainable Development Goals. Lancet. 2016;388(10063):3027–35. pmid:27839855
  10. 10. Waldvogel BC, Morais LCC, Perdigão ML, Teixeira MLP, Freitas RMV, Aranha VJ. Ensaio & Conjuntura. 2019;1:1–25.
  11. 11. Instituto Brasileiro de Geografia e Estatística IBGE. Sistemas de estatísticas vitais no Brasil: avanços, perspectivas e desafios. Rio de Janeiro: IBGE. 2018.
  12. 12. Areco KN, Konstantyner T, Bandiera-Paiva P, Balda RCX, Costa-Nobre DT, Sanudo A, et al. Operational Challenges in the Use of Structured Secondary Data for Health Research. Front Public Health. 2021;9:642163. pmid:34211950
  13. 13. World Health Organization. International statistical classification of diseases and related health problems. 10th rev. ed. Geneva: World Health Organization. 2010.
  14. 14. Hagenaars JA, McCutcheon AL. Applied latent class analysis. Cambridge (UK): Cambridge University Press. 2002.
  15. 15. Nylund KL, Asparouhov T, Muthén BO. Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study. Structural Equation Modeling: A Multidisciplinary Journal. 2007;14(4):535–69.
  16. 16. Carvalho MS, Câmara G, Cruz OG, Correa V. Análise de dados de área. In: Druck S, Carvalho MS, Câmara G, Monteiro AMV. Análise espacial de dados geográficos. Brasília: Embrapa. 2004. 155–205.
  17. 17. Marshall RJ. Mapping Disease and Mortality Rates Using Empirical Bayes Estimators. Applied Statistics. 1991;40(2):283.
  18. 18. Muthén LK, Muthén BO. Mplus user´s guide. 7th ed. Los Angeles, CA: Muthén & Muthén. 2017.
  19. 19. Instituto Brasileiro de Geografia e Estatística BR. Bases e referenciais: malhas digitais. https://mapas.ibge.gov.br/bases-e-referenciais.html
  20. 20. Belza C, Szentkúti P, Horváth-Puhó E, Ray JG, Nelson KE, Grandi SM, et al. Use of Latent Class Analysis to Predict Intensive Care Unit Admission and Mortality in Children with a Major Congenital Anomaly. J Pediatr. 2024;270:114013. pmid:38494089
  21. 21. Nazar G, Díaz-Toro F, Concha-Cisternas Y, Leiva-Ordoñez AM, Troncoso-Pantoja C, Celis-Morales C, et al. Latent class analyses of multimorbidity and all-cause mortality: A prospective study in Chilean adults. PLoS One. 2023;18(12):e0295958. pmid:38113219
  22. 22. Gordijn SJ, Korteweg FJ, Erwich JJHM, Holm JP, van Diem MT, Bergman KA, et al. A multilayered approach for the analysis of perinatal mortality using different classification systems. Eur J Obstet Gynecol Reprod Biol. 2009;144(2):99–104. pmid:19272694
  23. 23. Fottrell E, Osrin D, Alcock G, Azad K, Bapat U, Beard J, et al. Cause-specific neonatal mortality: analysis of 3772 neonatal deaths in Nepal, Bangladesh, Malawi and India. Arch Dis Child Fetal Neonatal Ed. 2015;100(5):F439-47. pmid:25972443
  24. 24. Maugeri A, Barchitta M, Schillaci G, Agodi A. Spatial patterns and temporal trends in stillbirth, neonatal, and infant mortality: an exploration of country-level data from 2000 to 2021. J Glob Health. 2025;15:04034. pmid:39977670
  25. 25. Rodríguez EYA, Rodríguez ECA, Marins FAS, da Silva AF, Nascimento LFC. Spatial patterns of mortality in low birth weight infants at term and its determinants in the State of São Paulo, Brazil. Rev Bras Epidemiol. 2023;26:e230034. pmid:37436330
  26. 26. de Almeida MFB, Guinsburg R, Martinez FE, Procianoy RS, Leone CR, Marba STM, et al. Perinatal factors associated with early deaths of preterm infants born in Brazilian Network on Neonatal Research centers. J Pediatr (Rio J). 2008;84(4):300–7. pmid:18528584
  27. 27. Harrison ML, Dickson BFR, Sharland M, Williams PCM. Beyond Early- and Late-onset Neonatal Sepsis Definitions: What are the Current Causes of Neonatal Sepsis Globally? A Systematic Review and Meta-analysis of the Evidence. Pediatr Infect Dis J. 2024;43(12):1182–90. pmid:39264197
  28. 28. Bhutta ZA, Das JK, Bahl R, Lawn JE, Salam RA, Paul VK, et al. Can available interventions end preventable deaths in mothers, newborn babies, and stillbirths, and at what cost?. Lancet. 2014;384(9940):347–70. pmid:24853604
  29. 29. Lawn JE, Bhutta ZA, Ezeaka C, Saugstad O. Ending Preventable Neonatal Deaths: Multicountry Evidence to Inform Accelerated Progress to the Sustainable Development Goal by 2030. Neonatology. 2023;120(4):491–9. pmid:37231868
  30. 30. Sitkin NA, Ozgediz D, Donkor P, Farmer DL. Congenital anomalies in low- and middle-income countries: the unborn child of global surgery. World J Surg. 2015;39(1):36–40. pmid:25135175
  31. 31. Brasil M da S. Pesquisa nacional de saúde, 2013: percepção do estado de saúde, estilos de vida e doenças crônicas: Brasil, grandes regiões e unidades da federação. Rio de Janeiro: Ministério da Saúde. 2014. https://biblioteca.ibge.gov.br/index.php/biblioteca-catalogo?view=detalhes&id=291110