Examining spatial heterogeneity in built environment and climate factors affecting motorcycle crashes

Yaqiu Li; Junyi Zhang; Haoran Li; Lon Virakvichetra

doi:10.1371/journal.pone.0346916

Abstract

Motorcycle crashes are a major contributor to road traffic fatalities in Cambodia, where motorcycles represent the dominant mode of transportation. Given the spatial dependence and heterogeneity inherent in crash data, this study examines spatial associations between built environment characteristics, climatic factors, and motorcycle crash frequency across 197 districts in Cambodia in 2019. Global Moran’s Index was used to assess spatial autocorrelation in crash frequency and explanatory variables. After evaluating the distributional properties of crash counts and multicollinearity among predictors, several regression models were estimated and compared, including Ordinary Least Squares regression (OLS), Poisson regression (PR), Negative Binomial regression (NBR), and Geographically Weighted Negative Binomial Regression (GWNBR). The results indicate that the GWNBR model outperforms global models by more effectively capturing spatial heterogeneity in the relationships between environmental factors and motorcycle crash frequency. Several variables exhibit relatively consistent spatial association patterns across districts: road length, road density, residential land use proportion, and precipitation are positively associated with motorcycle crash frequency in many locations, whereas population density, intersection density, and the number of annual rainy days are predominantly negatively associated. By revealing spatially varying association patterns in motorcycle crashes, this study provides evidence to support geographically differentiated approaches to motorcycle safety analysis and planning in Cambodia and other low- and middle-income countries.

Citation: Li Y, Zhang J, Li H, Virakvichetra L (2026) Examining spatial heterogeneity in built environment and climate factors affecting motorcycle crashes. PLoS One 21(4): e0346916. https://doi.org/10.1371/journal.pone.0346916

Editor: Niranga Amarasingha, Sri Lanka Institute of Information Technology, SRI LANKA

Received: November 13, 2025; Accepted: March 25, 2026; Published: April 27, 2026

Copyright: © 2026 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Built environment data in Cambodia were obtained from OpenStreetMap (OSM) through the Geofabrik download service (https://download.geofabrik.de/asia/cambodia-200101.osm.pbf). Climate data were obtained from the Visual Crossing Weather API (https://www.visualcrossing.com/weather-api/). Historical weather data for Cambodia for the year 2019 were retrieved using the API, including precipitation, and wind speed. Crash data used in this study were obtained from the Cambodian National Road Safety Committee. Due to governmental data-sharing restrictions, these data cannot be publicly released. The Cambodian National Road Safety Committee imposed data sharing restrictions on the crash dataset used for this study. Researchers may request access to the relevant data through the Cambodian National Road Safety Committee (nrsc@nrsc.gov.kh), subject to institutional approval and a data use agreement. All other derived variables used in the analysis were generated from these sources as described in the manuscript.

Funding: The authors received the following fundings for this work: (1) the National Natural Science Foundation of China (Grant No. 5721002303), (2) the Startup Research Fund of Southeast University, China (Grant No. 4021002332), (3) the Innovative Research Fund of Jiangsu Province, China (Grant No. 4221002403), (4) the Natural Science Foundation of Jiangsu Province (Grant No. BK20231197, BK20220243), (5) the Science and Technology Program of Suzhou (Grant No. SYG2024057, SYC2022078), (6) the Hubei Science and Technology Talent Service Enterprise Project (Grant No. 2023DJC084, 2023DJC195), (7) the Hubei Science and Technolgy Project (Grant No. 2024BAB087), (S) the Ministry of Education Industry-University Cooperation Collaborative Education Project (Grant No. 231007228312034), (9) the Hubei Provincial Natural Science Foundation (Grant No. JCZRMS202600143), and (10) sponsored by Tsinghua-Toyota Joint Research Institute Interdisciplinary Program.

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

According to the World Health Organization [1], road traffic fatalities and injuries continue to be a major health and development concern worldwide, with pedestrians, motorcyclists, and cyclists accounting for over half of these deaths. These vulnerable road users’ susceptibility to traffic incidents is markedly higher relative to automobile drivers, attributable to their increased exposure to environmental risk factors. WHO [1] also demonstrates that Southeast Asia Region accounts for 28% of global road traffic deaths, with a staggering 92% of these fatalities occurring in countries with low to middle income levels. Within the Southeast Asian region, Cambodia exhibits the highest incidence of motorcycle-related fatalities, surpassing neighboring nations such as Thailand, Malaysia, and Myanmar [2]. Fig 1 indicates that more than 65% of road traffic related fatalities in Cambodia involve motorcyclists. Motorcycles dominate as the primary mode of transportation in Cambodia, with annual motorcycle registrations consistently accounting for over 75% of total registrations between 2011 and 2019 [3]. Additionally, there has also been a substantial increase of 66.67% in motorcycle registrations during the same period [3], which highlights the importance of conducting research on motorcycle crashes and implement measures to mitigate them.

Download:

Fig 1. Annual mortality rate by transportation mode in Cambodia.

https://doi.org/10.1371/journal.pone.0346916.g001

The Cambodia National Road Safety Committee Report provides a comprehensive analysis of road crash fatalities across the twenty-five provinces of Cambodia for the years 2016 and 2017 [4]. This report highlights a disparate spatial distribution of fatalities, with Phnom Penh experiencing the highest number, followed by the Kandal and Kompong Thom provinces. Notably, while Phnom Penh observed a 9% decline in road fatalities from 2016 to 2017, the Kandal province experienced a substantial 45% increase during the same period. Concurrently, Kampong Thom province also witnessed a 6% escalation in road crash fatalities. This data underscores the geographical variability in road safety trends across Cambodia’s provinces.

The spatial distribution of motorcycle crash frequency by each district across Cambodia for the year 2019 is shown in Fig 2. This Figure reveals that the central region of Cambodia, which includes parts of Phnom Penh municipality and its surrounding districts, exhibits a notably higher frequency of motorcycle crashes, as well as the southwest coastal area. This is contrasted starkly with the majority of the country’s provinces, which predominantly fall within the lower crash frequency categories of 0–10 and 11–20, indicating a lower occurrence of motorcycle crashes. It is apparent that certain districts, particularly those in the proximity of Phnom Penh, Kandal, and Kampong Thom provinces, are marked by a darker shade. This highlights a moderate to high level of motorcycle crash occurrences, which may warrant further investigation. This spatial distribution further underscores the need for targeted road safety measures and interventions in high-frequency zones, to mitigate the risk of motorcycle crashes.

Download:

Fig 2. Spatial distribution of motorcycle crash frequency across Cambodia in 2019.

(Base map data © OpenStreetMap contributors, licensed under the Open Database License (ODbL); Analytical layers were created by the authors.).

https://doi.org/10.1371/journal.pone.0346916.g002

To further check the spatial autocorrelation of motorcycle crash frequency across Cambodia, its global Moran’s Index (as shown in Equation 1) was calculated using ArcGIS Pro 2.7 software. The derived Moran’s Index suggests a significant clustering pattern within the motorcycle crash frequency, evidenced by a Moran’s I value of 0.447 (as shown in Table 2). Crucially, the statistical significance of this clustering is supported by a p-value of 0.000, which is notably below the 0.01 threshold, and a z-score of 10.302, far exceeding the critical value of 2.58. These results provide a 99% confidence level in rejecting the null hypothesis that the observed spatial distribution is random, thereby indicating a less than 1% probability of a random distribution in the motorcycle crash frequency.

Fig 3 shows the spatial distribution of district-level crash frequency in Cambodia using Local Moran’s Index. This index is a statistical measure of spatial autocorrelation, which means it evaluates whether the pattern expressed is clustered, dispersed, or random across the map. The High-High Cluster (Pink) means regions with high crash frequency surrounded by other areas with high crash frequency. These are clusters where the high crash frequency is similar in a location and its neighbors, representing a significant concentration of crashes. This might be due to factors such as heavy traffic, higher rates of pedestrian and vehicle interaction, or possibly inadequate road safety measures. The Low-Low Cluster (Light Blue) represents regions with low crash frequency surrounded by other areas with low crash frequency, forming a cluster of similar low incidence locations. The light blue areas might correspond to rural or less densely populated regions where the frequency of crashes is relatively low. This could be due to lower traffic volumes, fewer intersections, or potentially less reporting of incidents. The Low-High Outlier (Blue) are areas that have a low crash frequency but are surrounded by areas with high crash frequency. These are considered spatial outliers because their condition is different from their neighbors. The blue areas could be safer zones surrounded by high-risk areas, perhaps due to effective local traffic management, better road conditions, or community-driven safety initiatives. Areas in grey indicate regions, which represents large part of the total 197 districts in Cambodia, the crash frequency does not show significant spatial autocorrelation, meaning that the distribution neither forms clusters nor outliers with statistical significance. The grey areas do not show a clear pattern and might represent regions where the crash frequency is random, or the data is inconclusive. This might be due to inconsistent data collection or diverse conditions that do not lead to a clear spatial correlation. For Cambodia, a developing country with a mix of urban and rural areas, these patterns highlight the necessary of exploring spatial effect of factors on motorcycle crash frequency.

Download:

Fig 3. Local indicators of spatial association (LISA) map of motorcycle crash frequency across Cambodia.

(Base map data © OpenStreetMap contributors (ODbL); Analytical layers were created by the authors.).

https://doi.org/10.1371/journal.pone.0346916.g003

In traditional statistical modeling process, it’s typically assumed that observations are independent and that the relationships between explanatory variables and a response variable remain constant across different spatial locations [5]. However, the reality of crash occurrences contradicts this assumption, as they often exhibit spatial correlation [6–9]. For spatial analyses, crash event data are often aggregated into various levels, from broader regional zones to more specific traffic analysis zones [10], and even down to individual road segments to investigate their spatial correlation with several crash indicators [11]. Zonal crash analyses can provide crucial insights, such as enabling comparative assessments across various zones or pinpointing safety issues within specific areas. Consequently, this information can guide the implementation of targeted safety measures to enhance local traffic safety [12].

Therefore, considering the spatial variation of motorcycle crash frequency, the primary objective of this study is to investigate the spatial pattern of motorcycle crash frequency across Cambodia. More specifically, this study aims to: (a) explore the spatial characteristics and distribution of motorcycle crashes; (b) develop and compare the non-spatial and spatial zonal motorcycle crash frequency regression models; (c) examine the impact of built environments and climate characteristics on motorcycle crash frequency. The findings from this study are intended to not only provide estimates of motorcycle crash frequency by incorporating spatial differences in explanatory variables but also to support efforts in planning, engineering, enforcement, and educational initiatives aimed at improving road safety.

2. Literature review

2.1. Research on the statistical analysis of crash frequency

Numerous empirical research has evaluated the effect of various attributes on crash frequency in a non-spatial statistical framework, such as Generalized Linear models (GLM), Poisson regression models (PR), Negative Binomial models (NB). These studies have investigated the influence of built environmental and transportation infrastructure on the probability of crash occurrences [13–18], as well as the role of the road characteristics such as arterial roads, rural roads, freeways [19–22], and specific locations like junctions and controlled crossroads [23–25]. Additionally, the impact of climatic attributes including wind strength, visibility, precipitation, light, and annual wet days on crash frequency has also been a subject of research [26–29]. The inclusion of the number of rainy days as a variable has helped in understanding the influence of factors like exposure and pavement condition on crash occurrences [28,30]. However, there has been a lack of comprehensive exploration into spatial variations of these explanatory factors affecting road crash frequency.

Although such non-spatial global regression models are useful to provide information about road crashes, they often assume the independence of variables. Therefore, they fail to adequately address the crucial aspect of spatial autocorrelation, potentially leading to biased results and reduced accuracy when examining the relationship between explanatory and dependent variables. Spatial dependence and spatial heterogeneity are two common spatial elements when considering spatial autocorrelation [6]. Spatial dependence refers to the possibility that the intensity of events at one location may affect the intensity of events at surrounding locations. Spatial heterogeneity means that events may not be evenly distributed across different spaces, such as the incidence of traffic accidents at different locations. These two issues often arise when collecting data by geographical units such as provinces, districts/cities, or villages. Scholars have also shown that the relationship between independent and dependent variables in collision accidents may change with spatial changes. Therefore, analyzing without considering spatial differences may lead to misleading conclusions [31]. Hence, there exists a critical need for further investigation utilizing spatially methodologies to obtain more precise insights into the relationship between the explanatory variables and overall crash occurrences [32].

2.2. Research on spatial analysis methodology of crash frequency

Previous studies have introduced various methodologies to assess the spatial impact of explanatory variables on crash frequency. Among these, geographically weighted regression (GWR) stands out as an effective approach, rooted in the fundamental principles of geography [33,34]. GWR enhances analysis by applying weights to nearby observations based on their proximity to a focal point, allowing for the adjustment of regression coefficients to reflect local contexts. This approach enables the exploration of location-specific relationships between dependent and independent variables, offering subtle insights into spatial dependence and heterogeneity. The application of GWR in road safety research has gained considerable recognition in recent years [35,36]. For instance, Mathew et al. [37] demonstrated GWR’s superiority in identifying spatial dependencies and variations in risk factors affecting teen crash frequency, compared to traditional generalized linear models. Similarly, Pirdavani et al. [38] also found that GWR outperforms the OLSR model in spatial analyses, underscoring its value in road safety studies.

Previous research has developed various types of GWR models, including Geographically Weighted Lasso regression (GWLR), Geographically Weighted Poisson regression (GWPR), Geographically Weighted Poisson Quantile regression (GWPQR), and Geographically Weighted Negative Binomial regression (GWNBR) [37,39–41]. These models are particularly useful in handling spatial effects while suitable for various types of data and problems. In dealing with the spatial effect of collision accident data, the GWPR and GWNBR models have shown better results than global non-spatial models [39,42,43]. However, Poisson distribution’s premise is that the mean is equal to the variance, which is not always true for crash data. To address this issue, GWNBR was introduced as a method for modeling counting data with spatial heterogeneity and excessive dispersion [44–45]. It has been widely used in some regional studies to explore the spatial changes of the relationship between explanatory variables and dependent variables [46–48]. In the field of accident analysis, several scholars have carried out research using GWNBR models. Gomes et al. [10] investigated the impact of spatial factors on crash frequency. Their findings indicated that the GWNBR model was better at capturing the spatial heterogeneity of crash frequency than GWPR model. Mathew et al. [37] used GWNBR to study the impact of road environmental factors such as road network, population characteristics and land factors on teen crash frequency. Oluwajana et al. [49] used seven goodness-of-fit indicators to compare the GWPR and GWNBR models, and the results showed that the GWNBR model with fixed bandwidth had the best predictive performance.

Although there are numerous studies on geographic risk factors related to crashes on the regional level, Lee et al. [50] applied Negative Binomial regression models to crash data from the United States and Italy to assess the transferability of models between these two countries. Their findings indicated that models for crashes were not transferable between the two nations, despite sharing several significant variables [51]. Considering this non-transferability between crash frequency analysis in various countries, it is important to explore the motorcycle crash frequency in developing countries like Cambodia.

Overall, research on spatial analysis of motorcycle crash frequency is very limited. Given the challenges in transferring crash frequency analysis findings across different countries, this study seeks to fill the existing research gaps by examining the spatial influence of built environment and climatic factors on motorcycle crash frequency in Cambodia, employing GWR-based methodologies.

3. Methodology

In this study, we utilize Poisson and Negative Binomial regression methods to model motorcycle crash frequency due to their proficiency in handling count data. This modelling procedure begins with an assessment of motorcycle crash frequency distribution, emphasizing its mean and variance across the 197 districts of Cambodia. As indicated in Table 1, the average motorcycle crash frequency is 16.152, with a standard deviation of 25.908. Notably, the variance significantly surpasses the mean, leading us to adopt the Negative Binomial distribution for modeling motorcycle crash frequency in Cambodia. Additionally, an enhanced GWNBR model is applied to accurately discern the spatial influences of various explanatory factors on motorcycle crash frequency, thereby offering a more comprehensive understanding of the crash data.

Download:

Table 1. Descriptive statistics of the variables.

https://doi.org/10.1371/journal.pone.0346916.t001

3.1. Data preparation

In this section, a spatial motorcycle crash database is compiled with the terms and conditions for the source of the data. The motorcycle crash frequency data, which includes property-damage-only, injury, and fatal crashes, were extracted from Road Crash Victim and Information System (RCVIS) by Cambodian National Road Safety Committee, and the data were analyzed in aggregated, district-level form only. No personal, identifiable, or sensitive information was accessed or processed. For this study, we focused on crashes that occurred in 2019. The frequency of motorcycle crashes within each district was the dependent variable in our analysis. The selection of explanatory variables was guided by prior research and their anticipated influence on motorcycle crashes. This study adopts the “3Ds” framework of built environments – density, diversity, and design [52]. Density encompasses variables like population density, male population proportion, and household density, all of which are commonly used in zonal level traffic safety analyses [37,53–55]. Diversity is represented by land use proportions, residential areas, tourist attractions, green spaces, and cultural and sports facilities, which are identified as significant crash pattern influencers by previous research [56]. The design component includes road network characteristics such as road density, number of intersections, intersection density, and the lengths of major and minor roads, which are key factors affecting crash frequency [57]. Climate and built environment data were obtained from publicly available sources and used in accordance with their respective usage policies.

In this research, the primary spatial unit is the district level, which represents the second-tier administrative division in Cambodia, encompassing a total of 197 districts nationwide. Data regarding the road network, bus stops, administrative boundaries, land use types, and Points of Interest (POI) were sourced from OpenStreetMap (OSM). Using this OSM data, the road length, number of intersections were calculated with ArcGIS software. Population information was obtained from the 2019 General Population Census of the Kingdom of Cambodia, conducted by the National Institute of Statistics under the Ministry of Planning. Furthermore, climate-related data in 2019 were obtained from the VisualCrossing Weather API [58], which provides station-based historical weather observations. To spatially associate climate information with traffic analysis units, climate station data were matched to each district using the nearest-distance principle, whereby each district is assigned the observations from the geographically closest weather station.

Then, the dataset for this study underwent a rigorous data cleaning process for each variable, where observations with missing or omitted values were removed. The final dataset contained 3182 motorcycle crashes in total. The minimum, maximum, mean, and standard deviation of this dependent variable and other 17 explanatory variables are shown in Table 1. The motorcycle crash frequency ranges from 0 to 162 with an average of 16.152 and a standard deviation of 25.908. Demographic data show a male population portion averaging 0.488 with a low standard deviation, suggesting minor variation across districts. Population and household densities vary widely among districts. Built environment characteristics provide details on road network, bus station, residential areas, and amenities like schools and public services, while road network attributes with high stand deviation indicate significant variation among districts. Due to the fact that commercial area and industrial area are not available in many districts in our database, we do not include the commercial area and industrial area in our research. Finally, climate variables like average precipitation, annual rainy days, wind speed, temperature and humidity are reported, both of which could potentially impact crash frequency.

Fig 4 shows the spatial distribution of the built environments and climate characteristics. We could conclude that road density, population density, residential area proportion, and intersection density show large numbers in areas around the capital Phnom Penh, Krong Siem Reap in the northwest area, and the southwest Kampong Som area.

Download:

Fig 4. Spatial distribution of built environments and climate characteristics.

(Base map data © OpenStreetMap contributors (ODbL); Analytical layers were created by the authors.).

https://doi.org/10.1371/journal.pone.0346916.g004

3.2. The measurement of spatial dependency - Moran’s Index

In this section, we use Moran’s Index to measure the spatial dependency of each potential explanatory variable. The Moran’s Index is calculated as follows:

(1)

where denotes the spatial weight of sample i and j, representing the spatial proximity or connection between them. N denotes the number of spatial units indexed by i and j. W is the sum of all spatial weights .

Moran’s Index value is a measure that ranges from −1 to +1, which indicates perfect dispersion and perfect correlation, respectively. A value of 0 suggests that the spatial pattern is random, with no apparent clustering or dispersion. Positive values suggest that similar values occur near each other (spatial autocorrelation), while negative values suggest that dissimilar values occur near each other (spatial dispersion). After checking the value of Moran’s Index, the bus station number, school number, public service number and tourism place number variables were removed from our explanatory dataset due to their non-significant variation among districts. The Moran’s Index test results of the remaining variables are shown in Table 2. The positive z-score values indicate that the data are spatially clustered.

Download:

Table 2. Moran’s Index and VIF of each variable.

https://doi.org/10.1371/journal.pone.0346916.t002

3.3. Geographically weighted negative binomial regression

The Geographically Weighted Negative Binomial Regression (GWNBR) is a statistical model developed by de Silva for analyzing spatial data where the response variable is count-based and potentially over-dispersed (i.e., the variance exceeds the mean) [59]. For the over dispersed data, the GWNBR can model data in a non-stationary manner. The negative binomial distribution has an overdispersion coefficient α. The formula for negative binomial regression is as follows:

(2)

where is an offset variable, , α is an excessive dispersion coefficient, is a parameter related to the explanatory variable , and , is the dependent variable, NB represents negative binomial. This model considers a logarithm link function

GWNBR allows parameters and α to vary with spatial variation. The formula is as follows:

(3)

where are the locations (coordinates) of the data points j.

Considering the weight of the data near point i, the local log-likelihood of GWNBR at i has the following formula:

(4)

where is the weight matrix and is an integral term, which can be calculated by the following equation.

(5)

When converging, the following equation can be used to estimate :

(6)

where is the coefficient vector estimated from region i, is a matrix composed of explanatory variables, is the GLM diagonal weighting matrix at convergence, is the adjusted dependent variable vector, calculated by the following equation:

(7)

The best fit of the model was assessed using Akaike Information Criterion (AIC). The lower value of AIC indicates a better fit of the model [60].

Another important parameter in evaluating the GWNBR model is the bandwidth. The selection of bandwidth and corresponding kernel function will affect the performance of the model. The rate at which bandwidth control data point weights decrease with increasing distance regression points. When the bandwidth is large, the weight slowly decays, and vice versa. The AIC and Cross-validation (CV) are two common evaluation indicators for finding the optimal bandwidth. The optimum AIC aims to identify the most suitable model by selecting a bandwidth that minimizes both the discrepancies between observed and predicted values and the complexity of the model [61]. Concurrently, the optimal CV method aims to pinpoint the best model by choosing a bandwidth that reduces the divergence between observed and fitted values through the cross-validation technique [61]. Therefore, when searching for the optimal bandwidth, it is necessary to reference the values of both indicators.

3.4. Measurement of goodness of fit

The three commonly used indicators for evaluating and comparing the performance of GWNBR models are AIC, Mean Absolute Deviation (MAD), and Root Mean Squared Error (RMSE). The AIC, MAD and RMSE indicators are defined as follows:

(8)

where is the maximum estimate of log likelihood and k is the degree of freedom.

(9)

(10)

where n is the number of observations, is the observed number of crashes, is the predicted number of crashes. The lower the values of AIC, MAD, and RMSE, the better the model performance.

4. Results and discussion

To further enhance the reliability of our models, assessing the multi-collinearity among independent variables was deemed crucial. High correlation among independent variables in a regression model can lead to skewed results and reduce the model’s reliability [62]. In this study, we employed the Variance Inflation Factor (VIF) to test multi-collinearity. VIF values, which are indicative of the level of collinearity, suggest that higher values correspond to stronger collinearity [63,64]. A VIF range between 0–10 is generally considered indicative of low collinearity. The multi-collinearity test results for the initially chosen 16 explanatory variables are presented in Table 2. Due to their VIF values exceeding 10, “Household density” and “Health service number” were excluded from the preliminary explanatory variables [64].

4.1. The overall results of the global model and GWNBR models

In our research, non-spatial Ordinary Least Squares regression (OLSR), Poisson regression (PR), and Negative Binomial Regression (NBR) models were established as foundational comparisons for GWNBR models. Key variables identified in the NBR model informed the development of GWNBR models. These GWNBR models were particularly effective in assessing how various factors differently influence motorcycle crash frequency across Cambodia’s districts, highlighting the spatial variability of the relationship between the dependent variable and key explanatory variables.

Selecting the appropriate bandwidth and kernel function is pivotal in the calibration of GWNBR models [38]. GWNBR offers two kernel options: fixed or adaptive. With a fixed kernel, neighboring data points are selected based on a predefined distance criterion, such as including all neighbors within a 50 km radius. Conversely, the adaptive kernel’s selection of neighbors is guided by optimal values determined by the AIC or CV [61]. In this study, we opted for the adaptive kernel due to its flexibility in adjusting the spatial extent to encompass a broader geographic area, ensuring a consistent number of observations within the kernel’s scope.

The SAS/IML© macros [65] were employed in our study to develop the GWNBR models. The GWNBR models require the determination of the optimum bandwidth for model calibration. Thus, in this study, we determined the optimum bandwidth of GWNBR by adopting both the optimum AIC and CV indicators. The golden section search method was employed to find the optimal bandwidth in our study.

Table 3 shows the performance metrics for both non-spatial and spatial models, focusing on Likelihood, AIC, RMSE, and MAD. The results highlight the GWNBR model with adaptive bandwidth and optimized AIC as superior compared to other models, as evidenced by its lower MAD, RMSE, and AIC values. This confirms the necessity of considering the spatial variation and over dispersion that exists in the motorcycle crash data. The MAD values increase from 3.031 in the GWNBR model with adaptive optimized CV indicator to 4.224 in the NBR model, which further marks a notable enhancement in the model’s ability to account for variability. Additionally, the AIC values for the GWNBR model with adaptive AIC are markedly lower compared to those of the global models, highlighting the superior efficiency of the GWNBR approach. This suggests that incorporating spatial variation into the analysis can notably enhance the explanatory power of the model. The detailed outcomes derived from the GWNBR model with adaptive AIC will be thoroughly examined in the subsequent section.

Download:

Table 3. Performance comparison of each model.

https://doi.org/10.1371/journal.pone.0346916.t003

Diagnostic statistics further justify the progressive transition from PR to NBR framework. For the PR model, the dispersion statistic (χ²/df = 12.3) substantially exceeds the conventional threshold of unity, indicating severe overdispersion and rendering the Poisson assumption inappropriate for the motorcycle crash data. To address this issue, a NBR model was estimated; however, Moran’s I test on the global NBR residuals showed significant positive spatial autocorrelation (Moran’s I = 0.5007, z = 12.0523, p < 0.01), suggesting that spatial heterogeneity remained inadequately captured. In contrast, the GWNBR model with an adaptive bandwidth selected by the AIC criterion effectively mitigated residual spatial dependence, with Moran’s I reduced to 0.0301 and no longer statistically significant (p = 0.265). This indicates that allowing regression parameters to vary spatially substantially improves model adequacy. Moreover, relative to the global NBR model, the GWNBR model achieved a 44.2% reduction in RMSE (from 14.872 to 7.795) and a marked decrease in AIC (from 1394.088 to 965.323). These results demonstrate that the incorporation of both overdispersion and spatial non-stationarity yields improvements that are not only statistically significant but also practically meaningful, thereby providing strong empirical support for the adoption of the GWNBR framework.

4.2. Local estimates

This section explores the spatial association variations between various built environments, climate characteristics variables and motorcycle crash frequency are analyzed and presented in Table 4 and Fig 5. Only the local estimates of each explanatory variable by the best GWNBR with adaptive bandwidth and optimal AIC model will be presented. Five statistical measures are utilized to show the non-stationary spatial effect of each explanatory variable: the minimum value (MIN), 25th percentile lower quartile (LQ), median, 75th percentile upper quartile (UQ), and maximum value (MAX), as detailed in Table 4. The local coefficients across various districts exhibit diverse and sometimes surprising directional tendencies, demonstrating the GWNBR model’s capability at capturing spatial non-stationarity. In some districts, a particular independent variable may show a statistically significant relationship with motorcycle crash frequency, whereas in others, the relationship may not be significant. Additionally, the LQ, Median, and UQ values reveal that some built environment and climate characteristic estimates fluctuate from negative to positive, a complexity not observable in non-spatial models such as the OLSR, PR, and NBR models. This phenomenon is consistently observed in the outcomes of the GWNBR models, where both positive and negative coefficients were estimated across all zones, indicating the complex and variable nature of the spatial relationships being analyzed.

Download:

Table 4. Estimation results of the GWNBR model for local variables.

https://doi.org/10.1371/journal.pone.0346916.t004

Download:

Fig 5. Local estimates of built environments and climate from GWNBR model.

(Base map data © OpenStreetMap contributors (ODbL); Analytical layers were created by the authors.).

https://doi.org/10.1371/journal.pone.0346916.g005

4.2.1. Built environments.

Table 4 and Fig 5 present the spatial distribution of local GWNBR parameter estimates and their statistical significance for built environment variables. By examining the proportion of districts with positive versus negative coefficients, this study assesses the degree of spatial non-stationarity across explanatory variables. When a high proportion of districts (approximately 70% or more) exhibit coefficients with the same sign, this suggests a relatively homogeneous association pattern between the built environment factor and motorcycle crash frequency, although the strength and significance of these associations may still vary spatially. Based on the GWNBR results, seven built environment variables demonstrate statistically significant associations with motorcycle crash frequency in at least part of the study area, and these patterns are discussed below with explicit reference to statistical significance.

With respect to the exposure variable, road length, exhibits strong positive associations with motorcycle crash frequency, and statistically significant associations are observed primarily in southeast districts (Fig 5a). Roads often coincide with economic corridors or intercity connections, and districts with higher major road length tend to exhibit stronger associations with motorcycle crash frequency in these regions. In contrast, several districts with extensive road infrastructure show weak or insignificant associations, suggesting that factors such as enforcement intensity, road design standards, or safety interventions may moderate this relationship. These findings underscore the spatially heterogeneous nature of the association rather than implying a uniform effect of major roads on crash occurrence.

Road density shows predominantly positive coefficients across most districts, with a large proportion reaching statistical significance (Fig 5b). This spatial pattern aligns with previous findings [10], indicating that districts with denser road networks tend to exhibit higher motorcycle crash frequencies. In the current analysis, higher road density is associated with increased crash frequency, particularly in and around Phnom Penh and other major urban areas. While denser networks may coincide with more complex traffic environments and higher exposure levels, this study does not infer causality, and the observed associations likely reflect a combination of network structure, traffic volume, and urban activity intensity.

Intersection density exhibits both positive and negative local coefficients, with statistically significant associations observed in spatially clustered districts. Although higher intersection density is often expected to be associated with increased crash occurrence, the predominantly negative associations identified in this study—particularly in districts where the estimates are statistically significant—suggest a more nuanced relationship in the Cambodian context. The lower and upper quartile values indicate that negative associations dominate overall, consistent with previous research [57,66–69]. One plausible interpretation is that higher intersection density reflects a finer-grained street network characterized by shorter block lengths, lower operating speeds, and more frequent points of conflict awareness. In such environments, riders may adopt more cautious behavior due to frequent stopping, increased visual scanning, and the presence of traffic control devices, which may be associated with lower motorcycle crash frequencies. This interpretation is consistent with urban planning literature that highlights the role of dense intersection networks in traffic calming and speed moderation, particularly in mixed-traffic environments [70–72]. Conversely, positive associations observed in a limited number of districts, which are consistent with findings reported by Barua et al. [73] and Jeon et al. [74], indicating that the relationship between intersection density and motorcycle crashes is spatially non-stationary and context-dependent. Differences in intersection design quality, enforcement levels, and rider compliance may partially explain these variations. Overall, the results suggest that intersection density does not exert a uniform influence on motorcycle crash frequency, and its association appears to be shaped by localized network and behavioral conditions rather than a single dominant mechanism.

Population density exhibits consistently negative coefficients across all 197 districts, with statistically significant effects concentrated mainly in the northwestern regions. This spatial pattern is broadly consistent with some previous studies [32,75], although it differs from others [14,17]. The magnitude of the negative association gradually weakens moving outward from the northwest, indicating spatial heterogeneity in the relationship. Importantly, interpretation is limited to districts where coefficients are statistically significant. In these areas, higher population density is associated with lower motorcycle crash frequency. One possible interpretation—offered cautiously—is that higher-density districts may feature urban environments with greater reliance on walking or four-wheel vehicles rather than motorcycles, as well as lower average travel speeds due to congestion. Such contextual factors may contribute to the observed association but cannot be confirmed within the scope of this cross-sectional analysis. Future research incorporating additional indicators, such as sidewalk length or modal share, would help further clarify these relationships.

The male population percentage is associated with predominantly negative coefficients in most districts; however, statistically significant effects are observed in only three districts nationwide. Given this limited spatial significance, interpretations are necessarily restrained. The spatial comparison of Figs 4 and 5e suggests that districts with higher male population shares often coincide with lower motorcycle crash frequencies, particularly in peripheral regions. This pattern may reflect differences in regional socioeconomic structure, travel behavior, or motorcycle usage rates, but these explanations remain speculative. Overall, the limited statistical significance indicates that male population percentage plays a relatively minor and spatially localized role in explaining motorcycle crash frequency.

Regarding residential land use, Table 4 and Fig 5f indicate positive coefficients in several districts, with statistically significant associations observed primarily in eastern and central areas. These results are consistent with prior studies reporting positive associations between residential land use intensity and traffic crashes [76,77], as well as between household concentration and motorcycle crashes [37]. In the present study, districts with a higher proportion of residential land use are associated with higher motorcycle crash frequencies in areas where coefficients are statistically significant. The spatial variation in coefficient magnitude suggests that this relationship is not uniform across Cambodia. For example, strong associations in certain northwest districts may reflect local characteristics such as concentrated residential development combined with limited public service infrastructure. However, these interpretations should be viewed as contextual explanations rather than causal conclusions.

4.2.2. Climate characteristics.

In this research, due to the climatic data availability and multicollinearity between explanatory variables, we only focus on the average precipitation, annual rainy days percentage, and average wind speed variables.

Average precipitation shows a clear south-to-north spatial gradient (Fig 5g), consistent with regional climatic patterns. Statistically significant positive associations with motorcycle crash frequency are concentrated in northern districts. In these areas, higher precipitation levels are associated with higher crash frequencies, in line with prior research identifying rainfall as a correlate of crash occurrence [78–80]. However, in districts where coefficients are not statistically significant, no substantive interpretation is offered. The spatial heterogeneity suggests that precipitation-related effects on motorcycle crashes vary with local travel behavior and infrastructure conditions.

Annual rainy days percentage exhibits predominantly negative associations with motorcycle crash frequency across much of Cambodia (Fig 5h), although positive and statistically significant associations emerge in several districts, particularly in the northwest. In districts with significant negative coefficients, a greater number of rainy days is associated with lower crash frequency, potentially reflecting reduced motorcycle usage or more cautious riding behavior during frequent rainfall. Conversely, in districts with significant positive associations—consistent with previous studies [28,30,81]—terrain characteristics and prolonged exposure to wet conditions may play a role. These mixed patterns highlight the importance of spatial context in understanding weather–crash relationships.

However, the contrasting associations observed for annual rainy days percentage and average precipitation highlight an important behavioral dimension in motorcycle crash occurrence. While precipitation intensity shows a positive association with motorcycle crash frequency in several districts, the number of annual rainy days is predominantly negatively associated with crashes in many areas where the coefficients are statistically significant. This contrast suggests that these two climate indicators may capture different underlying processes. A higher frequency of rainy days may be associated with adaptive or preventive travel behavior, such as reduced motorcycle usage [82,83], increased caution, or lower riding speeds over prolonged periods of exposure to wet conditions. In contrast, precipitation intensity represents an immediate physical hazard, potentially affecting road surface friction and visibility, which may be associated with increased crash risk during rainfall events. The coexistence of negative associations for rainy day frequency and positive associations for precipitation intensity underscores the importance of distinguishing between behavioral adaptation and physical exposure when interpreting climate–crash relationships. These findings indicate that riders may adjust their behavior in response to frequent adverse weather, whereas intense rainfall events pose short-term risks that are less amenable to behavioral compensation. The spatial variability of these associations further suggests that local environmental and infrastructural conditions mediate how climate factors relate to motorcycle crash frequency.

Average wind speed demonstrates both positive and negative associations with motorcycle crash frequency across districts (Fig 4i). Significant negative associations are mainly observed in northern and northeastern regions, while significant positive associations are concentrated in southwestern districts. These contrasting patterns suggest that wind-related effects on motorcycle crashes are highly context-dependent and may be influenced by local topography, urban form, and exposure conditions. Similar associations between wind conditions and crash occurrence have been reported by Hermans et al. [81], although causal mechanisms remain uncertain.

Finally, the spatially varying intercepts in the GWNBR model may capture the influence of unobserved factors not explicitly included in the analysis, such as income level [38], unemployment [84], education [85], or traffic volume [86,87]. The relatively small adaptive bandwidths further indicate strong local sensitivity of motorcycle crash occurrence to spatial context [56].

5. Conclusions and policy recommendations

This study examined the spatially varying associations between built environment characteristics, climatic conditions, and motorcycle crash frequency across 197 districts in Cambodia. By comparing global models (OLS, Poisson regression, and negative binomial regression) with a geographically weighted negative binomial regression (GWNBR) model, the analysis demonstrates that motorcycle crash–environment relationships are characterized by pronounced spatial non-stationarity. The superior performance of the GWNBR model highlights its suitability for modeling overdispersed crash count data with spatially heterogeneous effects, offering a more nuanced understanding of local safety patterns than conventional non-spatial approaches.

The results indicate that the direction, magnitude, and statistical significance of associations between explanatory variables and motorcycle crash frequency vary considerably across districts. While many built environment and climate variables exhibit locally differentiated associations, several factors display relatively consistent patterns in a large proportion of districts. Higher road density, longer major and minor road networks, greater residential land-use intensity, and higher precipitation levels are generally associated with higher motorcycle crash frequencies in districts where effects are statistically significant. In contrast, higher population density, greater intersection density, and a higher number of annual rainy days tend to be associated with lower crash frequencies in many districts. These findings reinforce the importance of accounting for spatial heterogeneity when analyzing motorcycle safety outcomes, particularly in low- and middle-income country contexts.

From a policy perspective, the findings suggest that motorcycle safety interventions in Cambodia should move beyond uniform, nationwide strategies and instead adopt a spatially differentiated approach. Districts characterized by dense road networks and extensive road infrastructure may benefit from targeted enforcement, speed management measures, and context-sensitive road design interventions. In areas where residential land use is strongly associated with higher crash frequency, integrating road safety considerations into land-use planning—such as traffic calming in residential zones and improved access management—may be particularly relevant. Conversely, districts where higher intersection density is associated with lower crash frequency may reflect the presence of traffic-calming effects or increased rider vigilance, suggesting that intersection design and control strategies could play a protective role in certain contexts. Climate-related findings further indicate that safety strategies should distinguish between long-term exposure to frequent rainfall, which may be associated with adaptive riding behavior, and high-intensity precipitation events that pose immediate physical hazards. Weather-responsive enforcement, public awareness campaigns, and infrastructure measures such as improved drainage and surface maintenance could therefore be tailored to local climatic conditions.

Several limitations should be acknowledged when interpreting these findings. First, the analysis relies on district-level zonal data, which may give rise to ecological fallacy, whereby associations observed at the aggregate level do not necessarily reflect individual-level risk relationships [88]. Second, although the crash data are aggregated and comprehensive at the district scale, under-reporting—particularly for property-damage-only motorcycle crashes—may affect the observed crash frequencies and introduce bias. Third, as with all geographically weighted models, the GWNBR framework identifies spatially varying associations rather than causal effects; the results should therefore be interpreted as indicative of spatial patterns rather than definitive causal mechanisms. Fourth, the analysis focuses on variables exhibiting spatial variation across districts; future work could explore semi-parametric GWNBR specifications to jointly model spatially varying and spatially invariant factors. Fifth, the absence of detailed traffic exposure data, such as average annual daily traffic (AADT) [28,37,89], limits the ability to directly control for traffic volume effects. Improved availability of exposure and land-use data—including enhanced OpenStreetMap coverage in Cambodia—would further strengthen future analyses.

Despite these limitations, this study demonstrates the value of spatially explicit modeling for understanding motorcycle crash patterns in Cambodia. By highlighting localized associations between the built environment, climate, and motorcycle crashes, the findings provide a foundation for more targeted, context-sensitive safety planning and offer methodological insights applicable to other low- and middle-income countries facing similar road safety challenges.

References

1. World Health Organization (WHO). Global status report on road safety 2023. Geneva: WHO; 2023.
2. Roehler DR, Ear C, Parker EM, Sem P, Ballesteros MF. Fatal motorcycle crashes: a growing public health problem in Cambodia. Int J Inj Contr Saf Promot. 2015;22(2):165–71. pmid:24499413
- View Article
- PubMed/NCBI
- Google Scholar
3. Ministry of Public Works and Transport (MPWT), General Department of Land Transport (GDLT), Land Transport Department (LTD). Traffic safety in Cambodia. The 13th Public and Private Joint Forum in Asian Region. 2022.
4. NRSC (National Road Safety Committee). Summary report on road crashes and casualties in Cambodia. 2017.
5. Miller JA, Hanham RQ. Spatial nonstationarity and the scale of species–environment relationships in the Mojave Desert, California, USA. Int J Geogr Inf Sci. 2011;25(3):423–38.
- View Article
- Google Scholar
6. Anselin L. Spatial econometrics: methods and models. Dordrecht: Springer; 1988.
7. Geurts K, Wets G, Brijs T, Vanhoof K. Identification and ranking of black spots: sensitivity analysis. Transp Res Rec. 2004;1897(1):34–42.
- View Article
- Google Scholar
8. Mandloi D, Gupta R. Evaluation of accident black spots on roads using geographical information systems (GIS). In: Map India Conference. 2003.
9. Yao S, Loo BPY, Yang BZ. Traffic collisions in space: four decades of advancement in applied GIS. Ann GIS. 2015;22(1):1–14.
- View Article
- Google Scholar
10. Gomes MJTL, Cunto F, da Silva AR. Geographically weighted negative binomial regression applied to zonal level safety performance models. Accid Anal Prev. 2017;106:254–61.
- View Article
- Google Scholar
11. Zafri NM, Khan A. A spatial regression modeling framework for examining relationships between the built environment and pedestrian crash occurrences at macroscopic level: a study in a developing country context. Geogr Sustain. 2022;3(4):312–24.
- View Article
- Google Scholar
12. Huang H, Abdel-Aty MA, Darwiche AL. County-level crash risk analysis in Florida: Bayesian spatial modeling. Transp Res Rec. 2010;2148(1):27–37.
- View Article
- Google Scholar
13. Flahaut B. Impact of infrastructure and local environment on road unsafety. Logistic modeling with spatial autocorrelation. Accid Anal Prev. 2004;36(6):1055–66. pmid:15350882
- View Article
- PubMed/NCBI
- Google Scholar
14. Amoh-Gyimah R, Saberi M, Sarvi M. Macroscopic modeling of pedestrian and bicycle crashes: a cross-comparison of estimation methods. Accid Anal Prev. 2016;93:147–59. pmid:27209153
- View Article
- PubMed/NCBI
- Google Scholar
15. Wang Y, Veneziano D, Russell S, Al-Kaisy A. Traffic safety along tourist routes in rural areas. Transp Res Rec. 2016;2568(1):55–63.
- View Article
- Google Scholar
16. Osama A, Sayed T. Macro-spatial approach for evaluating the impact of socio-economics, land use, built environment, and road facility on pedestrian safety. Can J Civ Eng. 2017;44(12):1036–44.
- View Article
- Google Scholar
17. Chimba D, Musinguzi A, Kidando E. Associating pedestrian crashes with demographic and socioeconomic factors. Case Stud Transp Policy. 2018;6(1):11–6.
- View Article
- Google Scholar
18. Apardian RE, Smirnov O. An analysis of pedestrian crashes using a spatial count data model. Pap Reg Sci. 2020;99(5):1317–38.
- View Article
- Google Scholar
19. Carcaillon LI, Salmi LR, Atout-Route Evaluation Group. Evaluation of a program to reduce motor-vehicle collisions among young adults in the county of Landes, France. Accid Anal Prev. 2005;37(6):1049–55. pmid:16036209
- View Article
- PubMed/NCBI
- Google Scholar
20. Lord D, Washington SP, Ivan JN. Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory. Accid Anal Prev. 2005;37(1):35–46. pmid:15607273
- View Article
- PubMed/NCBI
- Google Scholar
21. Haynes R, Lake IR, Kingham S, Sabel CE, Pearce J, Barnett R. The influence of road curvature on fatal crashes in New Zealand. Accid Anal Prev. 2008;40(3):843–50. pmid:18460350
- View Article
- PubMed/NCBI
- Google Scholar
22. Yu R, Xiong Y, Abdel-Aty M. A correlated random parameter approach to investigate the effects of weather conditions on crash risk for a mountainous freeway. Transp Res Part C: Emerg Technol. 2015;50:68–77.
- View Article
- Google Scholar
23. Donnell ET, Mason JM Jr. Predicting the frequency of median barrier crashes on Pennsylvania interstate highways. Accid Anal Prev. 2006;38(3):590–9. pmid:16442487
- View Article
- PubMed/NCBI
- Google Scholar
24. Kim D-G, Lee Y, Washington S, Choi K. Modeling crash outcome probabilities at rural intersections: application of hierarchical binomial logistic models. Accid Anal Prev. 2007;39(1):125–34. pmid:16925978
- View Article
- PubMed/NCBI
- Google Scholar
25. Wang X, Abdel-Aty M. Temporal and spatial analyses of rear-end crashes at signalized intersections. Accid Anal Prev. 2006;38(6):1137–50. pmid:16777040
- View Article
- PubMed/NCBI
- Google Scholar
26. Caliendo C, Guida M, Parisi A. A crash-prediction model for multilane roads. Accid Anal Prev. 2007;39(4):657–70. pmid:17113552
- View Article
- PubMed/NCBI
- Google Scholar
27. Mitra S, Washington S. On the nature of over-dispersion in motor vehicle crash prediction models. Accid Anal Prev. 2007;39(3):459–68. pmid:17161374
- View Article
- PubMed/NCBI
- Google Scholar
28. Theofilatos A, Yannis G. A review of the effect of traffic and weather characteristics on road safety. Accid Anal Prev. 2014;72:244–56. pmid:25086442
- View Article
- PubMed/NCBI
- Google Scholar
29. Wen H, Zhang X, Zeng Q, Sze NN. Bayesian spatial-temporal model for the main and interaction effects of roadway and weather characteristics on freeway crash incidence. Accid Anal Prev. 2019;132:105249. pmid:31415995
- View Article
- PubMed/NCBI
- Google Scholar
30. Shankar V, Mannering F, Barfield W. Effect of roadway geometrics and environmental factors on rural freeway accident frequencies. Accid Anal Prev. 1995;27(3):371–89. pmid:7639921
- View Article
- PubMed/NCBI
- Google Scholar
31. Zheng L, Robinson RM, Khattak A, Wang X. All accidents are not equal: using geographically weighted regressions models to assess and forecast accident impacts. Washington, DC: Transportation Research Board, National Research Council; 2011.
32. Liu J, Das S, Khan MN. Decoding the impacts of contributory factors and addressing social disparities in crash frequency analysis. Accid Anal Prev. 2024;194:107375. pmid:37956504
- View Article
- PubMed/NCBI
- Google Scholar
33. Brunsdon C, Fotheringham AS, Charlton ME. Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr Anal. 1996;28(4):281–98.
- View Article
- Google Scholar
34. Fotheringham AS, Yang W, Kang W. Multiscale geographically weighted regression (MGWR). Ann Am Assoc Geogr. 2017;107(6):1247–65.
- View Article
- Google Scholar
35. Huang Y, Wang X, Patton D. Examining spatial relationships between crashes and the built environment: a geographically weighted regression approach. J Transp Geogr. 2018;69:221–33.
- View Article
- Google Scholar
36. Hezaveh AM, Arvin R, Cherry CR. A geographically weighted regression to estimate the comprehensive cost of traffic crashes at a zonal level. Accid Anal Prev. 2019;131:15–24. pmid:31233992
- View Article
- PubMed/NCBI
- Google Scholar
37. Mathew S, Pulugurtha SS, Duvvuri S. Exploring the effect of road network, demographic, and land use characteristics on teen crash frequency using geographically weighted negative binomial regression. Accid Anal Prev. 2022;168:106615. pmid:35219106
- View Article
- PubMed/NCBI
- Google Scholar
38. Pirdavani A, Bellemans T, Brijs T, Wets G. Application of geographically weighted regression technique in spatial analysis of fatal and injury crashes. J Transp Eng. 2014;140:04014032.
- View Article
- Google Scholar
39. Arvin R, Kamrani M, Khattak AJ. How instantaneous driving behavior contributes to crashes at intersections: extracting useful information from connected vehicle message data. Accid Anal Prev. 2019;127:118–33. pmid:30851563
- View Article
- PubMed/NCBI
- Google Scholar
40. Tang J, Gao F, Liu F, Han C, Lee J. Spatial heterogeneity analysis of macro-level crashes using geographically weighted Poisson quantile regression. Accid Anal Prev. 2020;148:105833. pmid:33120184
- View Article
- PubMed/NCBI
- Google Scholar
41. He Y, Zhao Y, Tsui KL. An adapted geographically weighted LASSO (Ada-GWL) model for predicting subway ridership. Transp. 2021;48(3):1185–216.
- View Article
- Google Scholar
42. Hadayeghi A, Shalaby AS, Persaud BN. Development of planning level transportation safety tools using geographically weighted poisson regression. Accid Anal Prev. 2010;42(2):676–88. pmid:20159094
- View Article
- PubMed/NCBI
- Google Scholar
43. Li Z, Wang W, Liu P, Bigham JM, Ragland DR. Using geographically weighted Poisson regression for county-level crash modeling in California. Saf Sci. 2013;58:89–97.
- View Article
- Google Scholar
44. Lord D, Mannering F. The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transp Res A Policy Pract. 2010;44(5):291–305.
- View Article
- Google Scholar
45. Lord D. Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter. Accid Anal Prev. 2006;38(4):751–66. pmid:16545328
- View Article
- PubMed/NCBI
- Google Scholar
46. Su Z, Hu H, Tigabu M, Wang G, Zeng A, Guo F. Geographically weighted negative binomial regression model predicts wildfire occurrence in the Great Xing’an mountains better than negative binomial model. Forests. 2019;10(5):377.
- View Article
- Google Scholar
47. Chen J, Liu L, Xiao L, Xu C, Long D. Integrative analysis of spatial heterogeneity and overdispersion of crime with a geographically weighted negative binomial model. ISPRS Int J Geo-Inf. 2020;9(1):60.
- View Article
- Google Scholar
48. Fitriani R, Gede Nyoman Mindra Jaya I. Spatial modeling of confirmed COVID-19 pandemic in East Java province by geographically weighted negative binomial regression. Commun Math Biol Neurosci. 2020;2020:1–17.
- View Article
- Google Scholar
49. Oluwajana SD, Park PY, Cavalho T. Macro-level collision prediction using geographically weighted negative binomial regression. J Transp Saf Secur. 2022;14:1085–120.
- View Article
- Google Scholar
50. Lee J, Abdel-Aty M, De Blasiis MR, Wang X, Mattei I. International transferability of macro-level safety performance functions: a case study of the United States and Italy. Transp Saf Environ. 2019;1(1):68–78.
- View Article
- Google Scholar
51. Ziakopoulos A, Yannis G. A review of spatial approaches in road safety. Accid Anal Prev. 2020;135:105323. pmid:31648775
- View Article
- PubMed/NCBI
- Google Scholar
52. Cervero R, Kockelman K. Travel demand and the 3Ds: density, diversity, and design. Transp Res D Transp Environ. 1997;2(3):199–219.
- View Article
- Google Scholar
53. Kim K, Brunner IM, Yamashita EY. Influence of land use, population, employment, and economic activity on accidents. Transp Res Rec. 2006;1953(1):56–64.
- View Article
- Google Scholar
54. Wier M, Weintraub J, Humphreys EH, Seto E, Bhatia R. An area-level model of vehicle-pedestrian injury collisions with implications for land use and transportation planning. Accid Anal Prev. 2009;41(1):137–45. pmid:19114148
- View Article
- PubMed/NCBI
- Google Scholar
55. Ding C, Chen P, Jiao J. Non-linear effects of the built environment on automobile-involved pedestrian crash frequency: a machine learning approach. Accid Anal Prev. 2018;112:116–26. pmid:29329016
- View Article
- PubMed/NCBI
- Google Scholar
56. Tang X, Bi R, Wang Z. Spatial analysis of moving-vehicle crashes and fixed-object crashes based on multi-scale geographically weighted regression. Accid Anal Prev. 2023;189:107123. pmid:37257354
- View Article
- PubMed/NCBI
- Google Scholar
57. Dumbaugh E, Li W. Designing for the safety of pedestrians, cyclists, and motorists in urban environments. J Am Plann Assoc. 2010;77(1):69–88.
- View Article
- Google Scholar
58. VisualCrossing Weather. Weather history data. [cited 2023 May 20]. Available from: https://www.visualcrossing.com/weather-history
59. da Silva AR, Rodrigues TCV. Geographically weighted negative binomial regression—incorporating overdispersion. Stat Comput. 2014;24:769–83.
- View Article
- Google Scholar
60. Bozdogan H. Model Selection and Akaike’s Information Criterion (AIC): The General Theory and its Analytical Extensions. Psychometrika. 1987;52(3):345–70.
- View Article
- Google Scholar
61. Charlton M, Fotheringham S, Brunsdon C. Geographically Weighted Regression: White Paper. National Centre for Geocomputation, National University of Ireland Maynooth; 2009.
62. Wang CH, Chen N. A geographically weighted regression approach to investigating the spatially varied built-environment effects on community opportunity. J Transp Geogr. 2017;62:136–47.
- View Article
- Google Scholar
63. Yang H, Lu X, Cherry C, Liu X, Li Y. Spatial variations in active mode trip volume at intersections: a local analysis utilizing geographically weighted regression. J Transp Geogr. 2017;64:184–94.
- View Article
- Google Scholar
64. Zhou X, Ding X, Yan J, Ji Y. Spatial heterogeneity of urban illegal parking behavior: a geographically weighted Poisson regression approach. J Transp Geogr. 2023;110:103636.
- View Article
- Google Scholar
65. da Silva AR, Rodrigues TCV. A SAS Macro for geographically weighted negative binomial regression. Cary: SAS Institute; 2016 [cited 2023 Dec 20]. Available from: https://support.sas.com/resources/papers/proceedings16/8000-2016.pdf
66. Ladrón de Guevara F, Washington SP, Oh J. Forecasting crashes at the planning level: simultaneous negative binomial crash model applied in Tucson, Arizona. Transp Res Rec. 2004;1897(1):191–9.
- View Article
- Google Scholar
67. Lovegrove GR, Sayed T. Using macrolevel collision prediction models in road safety planning applications. Transp Res Rec. 2006;1950(1):73–82.
- View Article
- Google Scholar
68. Dumbaugh E, Rae R. Safe urban form: revisiting the relationship between community design and traffic safety. J Am Plann Assoc. 2009;75(3):309–29.
- View Article
- Google Scholar
69. Zhang Y, Bigham J, Ragland D, Chen X. Investigating the associations between road network structure and non-motorist accidents. J Transp Geogr. 2015;42:34–47.
- View Article
- Google Scholar
70. Noland RB. Traffic fatalities and injuries: the effect of changes in infrastructure and other trends. Accid Anal Prev. 2003;35(4):599–611. pmid:12729823
- View Article
- PubMed/NCBI
- Google Scholar
71. Pulugurtha SS, Sambhara VR. Pedestrian crash estimation models for signalized intersections. Accid Anal Prev. 2011;43(1):439–46. pmid:21094342
- View Article
- PubMed/NCBI
- Google Scholar
72. Han C, Huang H, Lee J, Wang J. Investigating varying effect of road-level factors on crash frequency across regions: A Bayesian hierarchical random parameter modeling approach. Anal Methods Accid Res. 2018;20:81–91.
- View Article
- Google Scholar
73. Barua S, El-Basyouny K, Islam MT. A Full Bayesian multivariate count data model of collision severity with spatial correlation. Anal Methods Acc Res. 2014;3–4:28–43.
- View Article
- Google Scholar
74. Jeon J, Woo A. The effects of built environments on bicycle accidents around bike-sharing program stations using street view images and deep learning techniques: the moderating role of streetscape features. J Transp Geogr. 2024;121:103992.
- View Article
- Google Scholar
75. Guerra E, Dong X, Kondo M. Do denser neighborhoods have safer streets? Population density and traffic safety in the Philadelphia region. J Plann Educ Res. 2019;39(4):450–63.
- View Article
- Google Scholar
76. Kim K, Yamashita E. Motor vehicle crashes and land use: empirical analysis from Hawaii. Transp Res Rec. 2002;1784(1):73–9.
- View Article
- Google Scholar
77. Ukkusuri S, Miranda-Moreno LF, Ramadurai G, Isa-Tavarez J. The role of built environment on pedestrian crash frequency. Saf Sci. 2012;50(4):1141–51.
- View Article
- Google Scholar
78. Andrey J, Yagar S. A temporal analysis of rain-related crash risk. Accid Anal Prev. 1993;25(4):465–72. pmid:8357460
- View Article
- PubMed/NCBI
- Google Scholar
79. Edwards JB. Weather-related road accidents in England and Wales: a spatial analysis. J Transp Geogr. 1996;4(3):201–12.
- View Article
- Google Scholar
80. Chang L-Y, Chen W-C. Data mining of tree-based models to analyze freeway accident frequency. J Safety Res. 2005;36(4):365–75. pmid:16253276
- View Article
- PubMed/NCBI
- Google Scholar
81. Hermans E, Brijs T, Stiers T, Offermans C. The impact of weather conditions on road safety investigated on an hourly basis. Transp Res Rec. 2006.
- View Article
- Google Scholar
82. Bergel-Hayat R, Depireb A. Climate, road traffic and road risk: an aggregate approach. In: 10th World Conference on Transport Research. Istanbul: Istanbul Technical University; 2004.
83. Keay K, Simmonds I. The association of rainfall and other weather variables with road traffic volume in Melbourne, Australia. Accid Anal Prev. 2005;37(1):109–24. pmid:15607282
- View Article
- PubMed/NCBI
- Google Scholar
84. Li Z, Chen X, Ci Y, Chen C, Zhang G. A hierarchical Bayesian spatiotemporal random parameters approach for alcohol/drug impaired-driving crash frequency analysis. Anal Methods Accident Res. 2019;21:44–61.
- View Article
- Google Scholar
85. Amiri AM, Naderi K, Cooper JF, Nadimi N. Evaluating the impact of socio-economic contributing factors of cities in California on their traffic safety condition. J Transp Health. 2021;20:101010.
- View Article
- Google Scholar
86. Rhee K-A, Kim J-K, Lee Y, Ulfarsson GF. Spatial regression analysis of traffic crashes in Seoul. Accid Anal Prev. 2016;91:190–9. pmid:26994374
- View Article
- PubMed/NCBI
- Google Scholar
87. Liu J, Khattak AJ, Wali B. Do safety performance functions used for predicting crash frequency vary across space? Applying geographically weighted regressions to account for spatial heterogeneity. Accid Anal Prev. 2017;109:132–42. pmid:29065336
- View Article
- PubMed/NCBI
- Google Scholar
88. Clark WA, Avery KL. The effects of data aggregation in statistical analysis. Geogr Anal. 1976;8(4):428–38.
- View Article
- Google Scholar
89. Becker N, Rust HW, Ulbrich U. Weather impacts on various types of road crashes: a quantitative analysis using generalized additive models. Eur Transp Res Rev. 2022;14:37.
- View Article
- Google Scholar

[ref1] 1. World Health Organization (WHO). Global status report on road safety 2023. Geneva: WHO; 2023.

[ref2] 2. Roehler DR, Ear C, Parker EM, Sem P, Ballesteros MF. Fatal motorcycle crashes: a growing public health problem in Cambodia. Int J Inj Contr Saf Promot. 2015;22(2):165–71. pmid:24499413
View Article
PubMed/NCBI
Google Scholar

[3] View Article

[4] PubMed/NCBI

[5] Google Scholar

[ref3] 3. Ministry of Public Works and Transport (MPWT), General Department of Land Transport (GDLT), Land Transport Department (LTD). Traffic safety in Cambodia. The 13th Public and Private Joint Forum in Asian Region. 2022.

[ref4] 4. NRSC (National Road Safety Committee). Summary report on road crashes and casualties in Cambodia. 2017.

[ref5] 5. Miller JA, Hanham RQ. Spatial nonstationarity and the scale of species–environment relationships in the Mojave Desert, California, USA. Int J Geogr Inf Sci. 2011;25(3):423–38.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref6] 6. Anselin L. Spatial econometrics: methods and models. Dordrecht: Springer; 1988.

[ref7] 7. Geurts K, Wets G, Brijs T, Vanhoof K. Identification and ranking of black spots: sensitivity analysis. Transp Res Rec. 2004;1897(1):34–42.
View Article
Google Scholar

[13] View Article

[14] Google Scholar

[ref8] 8. Mandloi D, Gupta R. Evaluation of accident black spots on roads using geographical information systems (GIS). In: Map India Conference. 2003.

[ref9] 9. Yao S, Loo BPY, Yang BZ. Traffic collisions in space: four decades of advancement in applied GIS. Ann GIS. 2015;22(1):1–14.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref10] 10. Gomes MJTL, Cunto F, da Silva AR. Geographically weighted negative binomial regression applied to zonal level safety performance models. Accid Anal Prev. 2017;106:254–61.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref11] 11. Zafri NM, Khan A. A spatial regression modeling framework for examining relationships between the built environment and pedestrian crash occurrences at macroscopic level: a study in a developing country context. Geogr Sustain. 2022;3(4):312–24.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref12] 12. Huang H, Abdel-Aty MA, Darwiche AL. County-level crash risk analysis in Florida: Bayesian spatial modeling. Transp Res Rec. 2010;2148(1):27–37.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref13] 13. Flahaut B. Impact of infrastructure and local environment on road unsafety. Logistic modeling with spatial autocorrelation. Accid Anal Prev. 2004;36(6):1055–66. pmid:15350882
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref14] 14. Amoh-Gyimah R, Saberi M, Sarvi M. Macroscopic modeling of pedestrian and bicycle crashes: a cross-comparison of estimation methods. Accid Anal Prev. 2016;93:147–59. pmid:27209153
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref15] 15. Wang Y, Veneziano D, Russell S, Al-Kaisy A. Traffic safety along tourist routes in rural areas. Transp Res Rec. 2016;2568(1):55–63.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref16] 16. Osama A, Sayed T. Macro-spatial approach for evaluating the impact of socio-economics, land use, built environment, and road facility on pedestrian safety. Can J Civ Eng. 2017;44(12):1036–44.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref17] 17. Chimba D, Musinguzi A, Kidando E. Associating pedestrian crashes with demographic and socioeconomic factors. Case Stud Transp Policy. 2018;6(1):11–6.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref18] 18. Apardian RE, Smirnov O. An analysis of pedestrian crashes using a spatial count data model. Pap Reg Sci. 2020;99(5):1317–38.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref19] 19. Carcaillon LI, Salmi LR, Atout-Route Evaluation Group. Evaluation of a program to reduce motor-vehicle collisions among young adults in the county of Landes, France. Accid Anal Prev. 2005;37(6):1049–55. pmid:16036209
View Article
PubMed/NCBI
Google Scholar

[49] View Article

[50] PubMed/NCBI

[51] Google Scholar

[ref20] 20. Lord D, Washington SP, Ivan JN. Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory. Accid Anal Prev. 2005;37(1):35–46. pmid:15607273
View Article
PubMed/NCBI
Google Scholar

[53] View Article

[54] PubMed/NCBI

[55] Google Scholar

[ref21] 21. Haynes R, Lake IR, Kingham S, Sabel CE, Pearce J, Barnett R. The influence of road curvature on fatal crashes in New Zealand. Accid Anal Prev. 2008;40(3):843–50. pmid:18460350
View Article
PubMed/NCBI
Google Scholar

[57] View Article

[58] PubMed/NCBI

[59] Google Scholar

[ref22] 22. Yu R, Xiong Y, Abdel-Aty M. A correlated random parameter approach to investigate the effects of weather conditions on crash risk for a mountainous freeway. Transp Res Part C: Emerg Technol. 2015;50:68–77.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref23] 23. Donnell ET, Mason JM Jr. Predicting the frequency of median barrier crashes on Pennsylvania interstate highways. Accid Anal Prev. 2006;38(3):590–9. pmid:16442487
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref24] 24. Kim D-G, Lee Y, Washington S, Choi K. Modeling crash outcome probabilities at rural intersections: application of hierarchical binomial logistic models. Accid Anal Prev. 2007;39(1):125–34. pmid:16925978
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref25] 25. Wang X, Abdel-Aty M. Temporal and spatial analyses of rear-end crashes at signalized intersections. Accid Anal Prev. 2006;38(6):1137–50. pmid:16777040
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref26] 26. Caliendo C, Guida M, Parisi A. A crash-prediction model for multilane roads. Accid Anal Prev. 2007;39(4):657–70. pmid:17113552
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref27] 27. Mitra S, Washington S. On the nature of over-dispersion in motor vehicle crash prediction models. Accid Anal Prev. 2007;39(3):459–68. pmid:17161374
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref28] 28. Theofilatos A, Yannis G. A review of the effect of traffic and weather characteristics on road safety. Accid Anal Prev. 2014;72:244–56. pmid:25086442
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref29] 29. Wen H, Zhang X, Zeng Q, Sze NN. Bayesian spatial-temporal model for the main and interaction effects of roadway and weather characteristics on freeway crash incidence. Accid Anal Prev. 2019;132:105249. pmid:31415995
View Article
PubMed/NCBI
Google Scholar

[88] View Article

[89] PubMed/NCBI

[90] Google Scholar

[ref30] 30. Shankar V, Mannering F, Barfield W. Effect of roadway geometrics and environmental factors on rural freeway accident frequencies. Accid Anal Prev. 1995;27(3):371–89. pmid:7639921
View Article
PubMed/NCBI
Google Scholar

[92] View Article

[93] PubMed/NCBI

[94] Google Scholar

[ref31] 31. Zheng L, Robinson RM, Khattak A, Wang X. All accidents are not equal: using geographically weighted regressions models to assess and forecast accident impacts. Washington, DC: Transportation Research Board, National Research Council; 2011.

[ref32] 32. Liu J, Das S, Khan MN. Decoding the impacts of contributory factors and addressing social disparities in crash frequency analysis. Accid Anal Prev. 2024;194:107375. pmid:37956504
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref33] 33. Brunsdon C, Fotheringham AS, Charlton ME. Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr Anal. 1996;28(4):281–98.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref34] 34. Fotheringham AS, Yang W, Kang W. Multiscale geographically weighted regression (MGWR). Ann Am Assoc Geogr. 2017;107(6):1247–65.
View Article
Google Scholar

[104] View Article

[105] Google Scholar

[ref35] 35. Huang Y, Wang X, Patton D. Examining spatial relationships between crashes and the built environment: a geographically weighted regression approach. J Transp Geogr. 2018;69:221–33.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref36] 36. Hezaveh AM, Arvin R, Cherry CR. A geographically weighted regression to estimate the comprehensive cost of traffic crashes at a zonal level. Accid Anal Prev. 2019;131:15–24. pmid:31233992
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref37] 37. Mathew S, Pulugurtha SS, Duvvuri S. Exploring the effect of road network, demographic, and land use characteristics on teen crash frequency using geographically weighted negative binomial regression. Accid Anal Prev. 2022;168:106615. pmid:35219106
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref38] 38. Pirdavani A, Bellemans T, Brijs T, Wets G. Application of geographically weighted regression technique in spatial analysis of fatal and injury crashes. J Transp Eng. 2014;140:04014032.
View Article
Google Scholar

[118] View Article

[119] Google Scholar

[ref39] 39. Arvin R, Kamrani M, Khattak AJ. How instantaneous driving behavior contributes to crashes at intersections: extracting useful information from connected vehicle message data. Accid Anal Prev. 2019;127:118–33. pmid:30851563
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref40] 40. Tang J, Gao F, Liu F, Han C, Lee J. Spatial heterogeneity analysis of macro-level crashes using geographically weighted Poisson quantile regression. Accid Anal Prev. 2020;148:105833. pmid:33120184
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref41] 41. He Y, Zhao Y, Tsui KL. An adapted geographically weighted LASSO (Ada-GWL) model for predicting subway ridership. Transp. 2021;48(3):1185–216.
View Article
Google Scholar

[129] View Article

[130] Google Scholar

[ref42] 42. Hadayeghi A, Shalaby AS, Persaud BN. Development of planning level transportation safety tools using geographically weighted poisson regression. Accid Anal Prev. 2010;42(2):676–88. pmid:20159094
View Article
PubMed/NCBI
Google Scholar

[132] View Article

[133] PubMed/NCBI

[134] Google Scholar

[ref43] 43. Li Z, Wang W, Liu P, Bigham JM, Ragland DR. Using geographically weighted Poisson regression for county-level crash modeling in California. Saf Sci. 2013;58:89–97.
View Article
Google Scholar

[136] View Article

[137] Google Scholar

[ref44] 44. Lord D, Mannering F. The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transp Res A Policy Pract. 2010;44(5):291–305.
View Article
Google Scholar

[139] View Article

[140] Google Scholar

[ref45] 45. Lord D. Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter. Accid Anal Prev. 2006;38(4):751–66. pmid:16545328
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref46] 46. Su Z, Hu H, Tigabu M, Wang G, Zeng A, Guo F. Geographically weighted negative binomial regression model predicts wildfire occurrence in the Great Xing’an mountains better than negative binomial model. Forests. 2019;10(5):377.
View Article
Google Scholar

[146] View Article

[147] Google Scholar

[ref47] 47. Chen J, Liu L, Xiao L, Xu C, Long D. Integrative analysis of spatial heterogeneity and overdispersion of crime with a geographically weighted negative binomial model. ISPRS Int J Geo-Inf. 2020;9(1):60.
View Article
Google Scholar

[149] View Article

[150] Google Scholar

[ref48] 48. Fitriani R, Gede Nyoman Mindra Jaya I. Spatial modeling of confirmed COVID-19 pandemic in East Java province by geographically weighted negative binomial regression. Commun Math Biol Neurosci. 2020;2020:1–17.
View Article
Google Scholar

[152] View Article

[153] Google Scholar

[ref49] 49. Oluwajana SD, Park PY, Cavalho T. Macro-level collision prediction using geographically weighted negative binomial regression. J Transp Saf Secur. 2022;14:1085–120.
View Article
Google Scholar

[155] View Article

[156] Google Scholar

[ref50] 50. Lee J, Abdel-Aty M, De Blasiis MR, Wang X, Mattei I. International transferability of macro-level safety performance functions: a case study of the United States and Italy. Transp Saf Environ. 2019;1(1):68–78.
View Article
Google Scholar

[158] View Article

[159] Google Scholar

[ref51] 51. Ziakopoulos A, Yannis G. A review of spatial approaches in road safety. Accid Anal Prev. 2020;135:105323. pmid:31648775
View Article
PubMed/NCBI
Google Scholar

[161] View Article

[162] PubMed/NCBI

[163] Google Scholar

[ref52] 52. Cervero R, Kockelman K. Travel demand and the 3Ds: density, diversity, and design. Transp Res D Transp Environ. 1997;2(3):199–219.
View Article
Google Scholar

[165] View Article

[166] Google Scholar

[ref53] 53. Kim K, Brunner IM, Yamashita EY. Influence of land use, population, employment, and economic activity on accidents. Transp Res Rec. 2006;1953(1):56–64.
View Article
Google Scholar

[168] View Article

[169] Google Scholar

[ref54] 54. Wier M, Weintraub J, Humphreys EH, Seto E, Bhatia R. An area-level model of vehicle-pedestrian injury collisions with implications for land use and transportation planning. Accid Anal Prev. 2009;41(1):137–45. pmid:19114148
View Article
PubMed/NCBI
Google Scholar

[171] View Article

[172] PubMed/NCBI

[173] Google Scholar

[ref55] 55. Ding C, Chen P, Jiao J. Non-linear effects of the built environment on automobile-involved pedestrian crash frequency: a machine learning approach. Accid Anal Prev. 2018;112:116–26. pmid:29329016
View Article
PubMed/NCBI
Google Scholar

[175] View Article

[176] PubMed/NCBI

[177] Google Scholar

[ref56] 56. Tang X, Bi R, Wang Z. Spatial analysis of moving-vehicle crashes and fixed-object crashes based on multi-scale geographically weighted regression. Accid Anal Prev. 2023;189:107123. pmid:37257354
View Article
PubMed/NCBI
Google Scholar

[179] View Article

[180] PubMed/NCBI

[181] Google Scholar

[ref57] 57. Dumbaugh E, Li W. Designing for the safety of pedestrians, cyclists, and motorists in urban environments. J Am Plann Assoc. 2010;77(1):69–88.
View Article
Google Scholar

[183] View Article

[184] Google Scholar

[ref58] 58. VisualCrossing Weather. Weather history data. [cited 2023 May 20]. Available from: https://www.visualcrossing.com/weather-history

[ref59] 59. da Silva AR, Rodrigues TCV. Geographically weighted negative binomial regression—incorporating overdispersion. Stat Comput. 2014;24:769–83.
View Article
Google Scholar

[187] View Article

[188] Google Scholar

[ref60] 60. Bozdogan H. Model Selection and Akaike’s Information Criterion (AIC): The General Theory and its Analytical Extensions. Psychometrika. 1987;52(3):345–70.
View Article
Google Scholar

[190] View Article

[191] Google Scholar

[ref61] 61. Charlton M, Fotheringham S, Brunsdon C. Geographically Weighted Regression: White Paper. National Centre for Geocomputation, National University of Ireland Maynooth; 2009.

[ref62] 62. Wang CH, Chen N. A geographically weighted regression approach to investigating the spatially varied built-environment effects on community opportunity. J Transp Geogr. 2017;62:136–47.
View Article
Google Scholar

[194] View Article

[195] Google Scholar

[ref63] 63. Yang H, Lu X, Cherry C, Liu X, Li Y. Spatial variations in active mode trip volume at intersections: a local analysis utilizing geographically weighted regression. J Transp Geogr. 2017;64:184–94.
View Article
Google Scholar

[197] View Article

[198] Google Scholar

[ref64] 64. Zhou X, Ding X, Yan J, Ji Y. Spatial heterogeneity of urban illegal parking behavior: a geographically weighted Poisson regression approach. J Transp Geogr. 2023;110:103636.
View Article
Google Scholar

[200] View Article

[201] Google Scholar

[ref65] 65. da Silva AR, Rodrigues TCV. A SAS Macro for geographically weighted negative binomial regression. Cary: SAS Institute; 2016 [cited 2023 Dec 20]. Available from: https://support.sas.com/resources/papers/proceedings16/8000-2016.pdf

[ref66] 66. Ladrón de Guevara F, Washington SP, Oh J. Forecasting crashes at the planning level: simultaneous negative binomial crash model applied in Tucson, Arizona. Transp Res Rec. 2004;1897(1):191–9.
View Article
Google Scholar

[204] View Article

[205] Google Scholar

[ref67] 67. Lovegrove GR, Sayed T. Using macrolevel collision prediction models in road safety planning applications. Transp Res Rec. 2006;1950(1):73–82.
View Article
Google Scholar

[207] View Article

[208] Google Scholar

[ref68] 68. Dumbaugh E, Rae R. Safe urban form: revisiting the relationship between community design and traffic safety. J Am Plann Assoc. 2009;75(3):309–29.
View Article
Google Scholar

[210] View Article

[211] Google Scholar

[ref69] 69. Zhang Y, Bigham J, Ragland D, Chen X. Investigating the associations between road network structure and non-motorist accidents. J Transp Geogr. 2015;42:34–47.
View Article
Google Scholar

[213] View Article

[214] Google Scholar

[ref70] 70. Noland RB. Traffic fatalities and injuries: the effect of changes in infrastructure and other trends. Accid Anal Prev. 2003;35(4):599–611. pmid:12729823
View Article
PubMed/NCBI
Google Scholar

[216] View Article

[217] PubMed/NCBI

[218] Google Scholar

[ref71] 71. Pulugurtha SS, Sambhara VR. Pedestrian crash estimation models for signalized intersections. Accid Anal Prev. 2011;43(1):439–46. pmid:21094342
View Article
PubMed/NCBI
Google Scholar

[220] View Article

[221] PubMed/NCBI

[222] Google Scholar

[ref72] 72. Han C, Huang H, Lee J, Wang J. Investigating varying effect of road-level factors on crash frequency across regions: A Bayesian hierarchical random parameter modeling approach. Anal Methods Accid Res. 2018;20:81–91.
View Article
Google Scholar

[224] View Article

[225] Google Scholar

[ref73] 73. Barua S, El-Basyouny K, Islam MT. A Full Bayesian multivariate count data model of collision severity with spatial correlation. Anal Methods Acc Res. 2014;3–4:28–43.
View Article
Google Scholar

[227] View Article

[228] Google Scholar

[ref74] 74. Jeon J, Woo A. The effects of built environments on bicycle accidents around bike-sharing program stations using street view images and deep learning techniques: the moderating role of streetscape features. J Transp Geogr. 2024;121:103992.
View Article
Google Scholar

[230] View Article

[231] Google Scholar

[ref75] 75. Guerra E, Dong X, Kondo M. Do denser neighborhoods have safer streets? Population density and traffic safety in the Philadelphia region. J Plann Educ Res. 2019;39(4):450–63.
View Article
Google Scholar

[233] View Article

[234] Google Scholar

[ref76] 76. Kim K, Yamashita E. Motor vehicle crashes and land use: empirical analysis from Hawaii. Transp Res Rec. 2002;1784(1):73–9.
View Article
Google Scholar

[236] View Article

[237] Google Scholar

[ref77] 77. Ukkusuri S, Miranda-Moreno LF, Ramadurai G, Isa-Tavarez J. The role of built environment on pedestrian crash frequency. Saf Sci. 2012;50(4):1141–51.
View Article
Google Scholar

[239] View Article

[240] Google Scholar

[ref78] 78. Andrey J, Yagar S. A temporal analysis of rain-related crash risk. Accid Anal Prev. 1993;25(4):465–72. pmid:8357460
View Article
PubMed/NCBI
Google Scholar

[242] View Article

[243] PubMed/NCBI

[244] Google Scholar

[ref79] 79. Edwards JB. Weather-related road accidents in England and Wales: a spatial analysis. J Transp Geogr. 1996;4(3):201–12.
View Article
Google Scholar

[246] View Article

[247] Google Scholar

[ref80] 80. Chang L-Y, Chen W-C. Data mining of tree-based models to analyze freeway accident frequency. J Safety Res. 2005;36(4):365–75. pmid:16253276
View Article
PubMed/NCBI
Google Scholar

[249] View Article

[250] PubMed/NCBI

[251] Google Scholar

[ref81] 81. Hermans E, Brijs T, Stiers T, Offermans C. The impact of weather conditions on road safety investigated on an hourly basis. Transp Res Rec. 2006.
View Article
Google Scholar

[253] View Article

[254] Google Scholar

[ref82] 82. Bergel-Hayat R, Depireb A. Climate, road traffic and road risk: an aggregate approach. In: 10th World Conference on Transport Research. Istanbul: Istanbul Technical University; 2004.

[ref83] 83. Keay K, Simmonds I. The association of rainfall and other weather variables with road traffic volume in Melbourne, Australia. Accid Anal Prev. 2005;37(1):109–24. pmid:15607282
View Article
PubMed/NCBI
Google Scholar

[257] View Article

[258] PubMed/NCBI

[259] Google Scholar

[ref84] 84. Li Z, Chen X, Ci Y, Chen C, Zhang G. A hierarchical Bayesian spatiotemporal random parameters approach for alcohol/drug impaired-driving crash frequency analysis. Anal Methods Accident Res. 2019;21:44–61.
View Article
Google Scholar

[261] View Article

[262] Google Scholar

[ref85] 85. Amiri AM, Naderi K, Cooper JF, Nadimi N. Evaluating the impact of socio-economic contributing factors of cities in California on their traffic safety condition. J Transp Health. 2021;20:101010.
View Article
Google Scholar

[264] View Article

[265] Google Scholar

[ref86] 86. Rhee K-A, Kim J-K, Lee Y, Ulfarsson GF. Spatial regression analysis of traffic crashes in Seoul. Accid Anal Prev. 2016;91:190–9. pmid:26994374
View Article
PubMed/NCBI
Google Scholar

[267] View Article

[268] PubMed/NCBI

[269] Google Scholar

[ref87] 87. Liu J, Khattak AJ, Wali B. Do safety performance functions used for predicting crash frequency vary across space? Applying geographically weighted regressions to account for spatial heterogeneity. Accid Anal Prev. 2017;109:132–42. pmid:29065336
View Article
PubMed/NCBI
Google Scholar

[271] View Article

[272] PubMed/NCBI

[273] Google Scholar

[ref88] 88. Clark WA, Avery KL. The effects of data aggregation in statistical analysis. Geogr Anal. 1976;8(4):428–38.
View Article
Google Scholar

[275] View Article

[276] Google Scholar

[ref89] 89. Becker N, Rust HW, Ulbrich U. Weather impacts on various types of road crashes: a quantitative analysis using generalized additive models. Eur Transp Res Rev. 2022;14:37.
View Article
Google Scholar

[278] View Article

[279] Google Scholar

Figures

Abstract

1. Introduction

2. Literature review

2.1. Research on the statistical analysis of crash frequency

2.2. Research on spatial analysis methodology of crash frequency

3. Methodology

3.1. Data preparation

3.2. The measurement of spatial dependency - Moran’s Index

3.3. Geographically weighted negative binomial regression

3.4. Measurement of goodness of fit

4. Results and discussion

4.1. The overall results of the global model and GWNBR models

4.2. Local estimates

4.2.1. Built environments.

4.2.2. Climate characteristics.

5. Conclusions and policy recommendations

References