Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Dynamic road section risk identification model based on connected in-vehicle data

Abstract

To dynamically identify road section risks, the research team designed an onboard unit to collect various dynamic driving behavior data when the Advanced Driver Assistance Systems (ADAS) are activated. To do this, we divided the roads into three categories (urban road, expressway, freeway) and established separate BN models to analyze the relationship between driving behavior and road section risk. These models were constructed based on natural driving data from 10,000 km, collected from vehicles equipped with ADAS. For road segment division, fixed-length intervals were used for freeways and urban expressways, while segments on urban roads were defined as the stretches between adjacent intersections. Using braking deceleration and time to collision, we identified near-crash events and classified them into high, medium, and low severity levels using the DBSCAN clustering algorithm. These near-crash events were then matched to the corresponding road section, assigning different weights based on their severity levels to evaluate the risk level of each segment. Additionally, driving behavior data, including velocity, lateral acceleration, longitudinal acceleration, yaw rate, accelerator position, steering angle, and steering angle velocity, were matched to the road segments. Finally, using the Netica software, Bayesian network models were constructed separately for urban roads, expressways, and freeways to identify driving risks at road segments. The models exhibited high sensitivity to observed nodes.

1. Introduction

How to identify risks, reduce traffic accidents, and improve traffic safety are key priorities in traffic management. At present, most risk identification models rely on historical accident data for verification. The common approach is to employ cluster analysis in accident data mining [1]. This involves clustering accident black spots based on the accident rate index statistics of each kilometer of road section, thus revealing the distribution patterns of accidents within a specific time frame. For instance, Krsto et al. [2] utilized both subjective methods, such as field research and expert interviews, and objective methods, utilizing historical and spatial distribution data of road accidents, to identify hazardous road locations in Serbia. They compared and analyzed the strengths and weaknesses of each method, concluding that the objective data-based discrimination method yielded greater accuracy. Zahran et al. [3] developed a Spatial Traffic Accident Analysis (STAA) model, which considered accident frequency, severity, and social cost, to identify and rank accident black spots using historical accident data from a road section in Brunei. The results demonstrated that STAA outperformed other methods in terms of accuracy. Ferit et al. [4] applied the relative frequency method to establish the relationship between the number of accidents and the environmental characteristics of a road section based on accident data from an expressway in Turkey. This method proved to possess excellent detection ability and even demonstrated accurate identification for road sections that lacked accident data. However, historical accident data cannot be utilized for real-time risk identification and predicting potential accidents.

As a substitute for actual collisions, near-crash events are increasingly utilized in research related to driving risk. A near-crash event refers to a situation where the target vehicle or any other vehicle, pedestrian, bicycle, or animal needs to rapidly evade, satisfying the criteria of no collision, no premeditation, the need for avoidance, and the need for quick response [5]. These near-crash events provide controlled experimental data and serve as a valuable addition to real crash data in traffic safety research.

Advanced driver assistance systems (ADAS) help drivers with various driving tasks and capture motion parameters and real-time location data during driving. These motion parameters are directly related to traffic safety. By using historical data, a model can be developed to understand how motion parameters relate to the safety risk of a specific road section. This model, combined with real-time motion parameters, allows for real-time assessment of road section safety. Formos et al [6] conducted a study using traffic flow and vehicle-mounted data from the UK’s M1 highway. They developed a deep neural network (DNN) model that analyzed traffic conflicts by combining these data and calculating surrogate safety indicators (SSM). The model achieved 94% accuracy, showing that time-to-collision (TTC) varied with speed, weather, and traffic density. Gao et al [7] used the random forest algorithm and video trajectory data to identify potential collision risks during vehicle travel. Their multimodal deep convolutional neural network (DCNN) outperformed other models, achieving an AUC of 0.81. Xie et al [8] utilized networked driving data from Michigan to propose the time-to-disturbance collision (TTCD) as a method for capturing rear-end collision risks. Chen et al [9] introduced a collaborative driving scheme for Internet of Vehicles (IoV) intersections, demonstrating high accuracy in driver intent recognition, trajectory prediction, and collision probability assessment.

The analysis reveals that near-crash events can be utilized instead of historical accidents to comprehensively evaluate the road risk level in terms of potential accidents. There exists a potential connection between vehicle operating parameters and road safety level. ADAS-equipped vehicles can gather abundant near-crash events and operating parameters, enabling the establishment of a Bayesian network model to establish the relationship between them. Ultimately, by utilizing real-time vehicle operating parameters, the safety level of a road section can be evaluated.

Therefore, considering the current lack of methods that solely use a large amount of vehicle operation data for road risk assessment in existing traffic safety research, this study proposes a method based on connected Advanced Driver Assistance Systems (ADAS) to extract Near-crash events and classify road section risks. Road sections are first divided by type, and their risks are categorized into three severity levels (high, medium, and low) using Near-crash events extracted from connected vehicle driving data. A Bayesian network model is then established with Netica software to identify risks, supported by a road risk identification database constructed by matching driving data to each section. The main contributions of this paper are as follows:

  1. 1) Unlike traditional methods that rely on fixed monitoring devices such as microwave radars or video cameras for localized risk assessment, this study innovatively leverages collision warning information from connected vehicles’ ADAS systems to construct a dynamically updated road segment risk identification database. This approach enables comprehensive and real-time evaluation of traffic operation risks across the entire road network.
  2. 2) A Bayesian-based risk identification network is developed to analyze fundamental driving data such as speed, acceleration, and steering wheel angle, enabling real-time risk identification for each road segment. This method eliminates the need for costly roadside sensing infrastructure and can be seamlessly integrated into navigation apps, offering drivers real-time updates on road network risks and safe route planning suggestions.

2. Method

The research methods of this paper are as follows. Firstly, road sections are categorized based on road type. Then, near-crash events are used to classify the risk of each section into high, medium, and low severity levels as output. Driving data is matched to each road section, creating a road risk identification database. A Bayesian network model is developed using Netica software to identify the risk of road sections, and its feasibility are verified. The research process is illustrated in Fig 1. The data used in this study was collected on public roads and is publicly available [10]. No specific permits or approvals were required from regulatory authorities, as the activities complied with local traffic laws. The data was used solely for research purposes, with no privacy concerns, and no ethical or legal issues arose during the process.

2.1 Road segment division

The data for this study is derived from real-world driving data collected from vehicles equipped with ADAS on urban roads, expressways, and freeways. Due to their variations in road layout, geometric design, and the arrangement of traffic signs and markings, as well as differences in driving behavior and types of risks involved, separate considerations are required for developing accident risk prediction models.

Road sections of different lengths have varying impacts on traffic conditions and subsequently affect the risk division. For instance, shorter road sections enable connected vehicles to more accurately reflect changes in traffic conditions, especially during congestion. However, they are susceptible to errors due to latitude, longitude, and GIS software limitations, resulting in poor stability and accuracy of vehicle data. Conversely, longer road sections provide more accurate and stable connected vehicle data, but only offer an average depiction of traffic conditions without accounting for the diffusion process during congestion. Therefore, the guiding principle for dividing road sections is to use the shortest length that still ensures accuracy and stability of the vehicle data index.

To divide urban expressways and freeways into sections, this study utilized the rule of equal intervals. However, when segmenting urban roads, adjacent intersections were treated as a single segment. This means that the latitude and longitude ( , ) were gathered at the center point of each intersection, and the actual distance between the two intersections determined the length of the road segment. The segments were then sequentially numbered as 1, 2...n, and the four corner points of each segment represented its range, as shown in the formula (1) and (2).

(1)(2)

2.2 Classification method of road section risk level

2.2.1 Extraction of near-crash events.

A near-crash event refers to a quick and dangerous evasive driving maneuver, such as sudden steering, braking, accelerating, or a combination of these actions [11]. Research [10] indicates that near-crash events are the most common type of collision replacement data. However, the definition of near-crash events differs among scholars, as they use different criteria to identify such events. Currently, near-crash events are primarily identified based on vehicle operating parameters and alternative safety indicators. Due to variations in these indicators and their threshold values across different studies, the number of near-crash events obtained from the same natural driving data can vary. Table 1 presents a summary of the methods employed by researchers to identify near-crash events.

As shown in Table 1, Braking deceleration and Time to Collision (TTC) are commonly used predictors for driving risk. In the same lane, the group of front and rear vehicles can be considered as a driving and following group. The rear vehicle’s safe braking deceleration can be calculated based on the preceding vehicle’s emergency braking deceleration. This means that each vehicle’s deceleration reflects the driving risk of that vehicle [18]. TTC is the duration between the moment when two adjacent vehicles do not take any measures and a rear-end collision occurs [19].

In this study, collision time and braking deceleration were chosen as indicators to identify near-crash events. Relevant researchers [2021] have suggested that TTC < 1.5 seconds or deceleration with longitudinal acceleration ax < −3.5 m/s² is an appropriate criterion for event extraction. Therefore, this study adopts a combined threshold of TTC < 1.5 s and ax < −3.5 m/s² as the standard for identifying near-crash events. If the interval between two events was less than 3 seconds, they were merged. Even if only one criterion was met, the event was still considered a near-crash event.

It is worth noting that TTC is commonly used to assess rear-end collision risks. Therefore, many scholars have developed various 2-D TTC models to evaluate multi-angle collision risks during lane-changing [21]. In this study, the TTC values are directly measured using the Mobileye advanced driver-assistance system. This system can simultaneously track up to eight targets ahead and output the TTC value with the highest collision probability during lane-changing [22].

2.2.2 Quantification of road section risk levels.

Near-crash events have varying probabilities of progressing into actual collisions, with higher-risk events being more likely to result in collisions. Therefore, this study utilizes the extracted indicators of near-crash events, specifically braking deceleration and TTC, as parameters for clustering. The DBSCAN database in the Python clustering algorithm is employed to divide each near-crash event into three risk levels: high, medium, and low. These risk levels serve as the foundation for categorizing road sections based on their level of risk.

The more risk events occur on a stretch of road, the higher the level of risk events, the more dangerous the stretch. Based on this, this study calculated the risk score of the road section through the formula. x, y and z are the number of low, medium and high grade risk events respectively, and their weights are 1, 2 and 3 respectively.

The more the occurrence of risk events on a road segment, and the higher the level of those events, the more hazardous the segment becomes. This study calculates the risk score for each road section using the formula(3):

(3)

In the formula, x, y, and z represent the number of low, medium, and high-level risk events, respectively. The weights assigned to these events are 1, 2, and 3, respectively. Based on the calculated risk scores, the K-means clustering method was employed to classify the road segments into three risk levels: high, medium, and low.

2.3 Risk identification using the Bayesian network model

BN, commonly known as Bayesian Belief Network (BBN), possesses strong capabilities in addressing uncertain problems. It excels in learning and reasoning even when faced with limited and uncertain information. Furthermore, BN can effectively incorporate both prior knowledge and newly acquired information, allowing for the expression and fusion of multi-source information within the network. As a result, BN has found applications in various fields [2324], including state detection, fault diagnosis, quality assessment, dynamic analysis, and risk management.

Typically, Bayesian networks are represented using a directed acyclic graph (DAG) structure. This structure encompasses a set of random variables that capture their respective states and their probabilistic causal relationships. It is often denoted as Fig 2 presents commonly used DAG structures.

In Fig 2, represent variables with multiple states. The one-way arrows between the variables illustrate the causal relationships, with the direction indicating the relationship between the parent node and the child node. The child node is situated on the receiving side of the arrow. Four types of relationships exist between nodes, classified as series, dispersed, shrinkage, and mixed types. The mixed type can consist of any combination of the three aforementioned forms. In a DAG, the relationship between any set of child nodes and parent nodes can be determined by referring to the conditional probability table.

In a DAG, each node can be in either a true or false state. The conditional probability of depends on the true or false values of and . Table 2 displays the conditional probability of .

If a parent node in a DAG has no child nodes, then the prior probability of the state of the node can be calculated from the joint probability function of all variables in the DAG, which becomes a marginalized formula, as shown in Equations (4).

(4)

Among them, represents each parent node of node . If there are relevant conditional independence assumptions, the above calculation can be further simplified.

The identification of road segment risk using a Bayesian network involves a structured process consisting of the following key steps:

  • Step 1 Construction of the Directed Acyclic Graph-DAG: The first step is to establish the causal relationships between different nodes, which represent variables such as traffic conditions, driver behavior, and environmental factors. This involves defining the pointing relationships between nodes to construct a DAG, which serves as the foundational framework for the Bayesian network.
  • Step 2 Calculation of Probabilities: Next, the prior probabilities of each node and the conditional probabilities between nodes are calculated. These probabilities are derived from historical data or expert knowledge.
  • Step 3 Learning and Reasoning Process: Based on the prior probabilities of the evidence nodes (observed data) and the conditional probabilities between the evidence nodes and intermediate nodes, the probability values of the target node (road risk) are computed under different risk states. This step enables the network to infer the likelihood of risk based on the given evidence.
  • Step 4 Risk Identification: Finally, leveraging the reasoning capabilities of the Bayesian network, driving data is used to evaluate and identify road risks.

3. Data preparation for risk identification modeling

3.1 Test route

To gather natural driving data from different types of roads, a 105 km mixed test route was chosen. The test route consists of a conventional urban road, two urban expressways, and a freeway. The specific route details can be found in Fig 3.

3.2 Onboard unit and connected driving data

To gather data on driver operating behavior, vehicle motion, driving environment, and surrounding vehicles during real-world driving, the research team developed an onboard unit. This unit incorporates various equipment, including the CAN bus, inertial navigation system, Mobileye, DVR, and other components. Table 3 provides a partial list of the obtained data. For this study, only the vehicle’s longitudinal acceleration from the inertial navigation system and the Time to Collision (TTC) from the Mobileye were utilized as indicators to identify near-crash events. Additionally, inputs to the Bayesian Network (BN) model included speed, lateral acceleration, longitudinal acceleration, yaw angle, accelerator, steering wheel angle, and steering wheel angle velocity.

3.3 Extraction of near-crash events

Based on the near-crash event definition standard described in 2.2, a total of 1,342 near-crash events were collected during the 10,000 km natural driving experiment. Using the Python DBSCAN database, these events were classified into three levels: high, medium, and low, based on the Time to Collision (TTC) and longitudinal acceleration. The clustering algorithm achieved a good result, as indicated by a silhouette coefficient of 0.84. Table 4 presents the statistics of the near-crash events at different risk levels: 205 high-risk events, 367 medium-risk events, and 770 low-risk events, following a pyramid-shaped distribution.

thumbnail
Table 4. Statistics of near-crash events at different levels.

https://doi.org/10.1371/journal.pone.0333623.t004

3.4 Risk classification of road sections

The experimental road was divided into 617 sections, which consisted of 124 urban road sections, 218 freeway sections, and 274 urban expressway sections. The average length of the road sections for each road type can be found in Table 5.

thumbnail
Table 5. The average length of the road sections for each road type.

https://doi.org/10.1371/journal.pone.0333623.t005

During the natural driving experiment, the vehicle recorded real-time latitude and longitude information. The nearest neighbor analysis function in ArcGIS was used to match the risk points to the divided road segments. First, the latitude and longitude of the risk points were converted from the geographic coordinate system to the projected coordinate system using the projection function in ArcGIS. Then, the projected risk points were matched to the route based on their x and y coordinates in the projected system. Examples of matching risk points to road segment information can be observed in Table 6, while the results of the matching process are depicted in Fig 4. Table 7 showcases the number of risk points for each risk level within each road segment and provides the total risk assessment score for each segment.

thumbnail
Table 6. Example of matching risk points to road segment information.

https://doi.org/10.1371/journal.pone.0333623.t006

Using K-means clustering, the road segment scores were classified into three categories: high risk, medium risk, and low risk. The cluster center points for the three risk levels were 1.04, 29.84, and 101.25, respectively. Table 8 displays the number of road sections belonging to each risk level.

thumbnail
Table 8. Number of different types of road sections with different risk levels.

https://doi.org/10.1371/journal.pone.0333623.t008

As shown in Table 8, the proportions of high risk, medium risk, and low risk sections in all three road types exhibit a pyramid distribution. This phenomenon aligns with the safety pyramid theory’s description of traffic event severity distribution [25]. Furthermore, as illustrated by the spatial distribution of risk points in Fig 3 and Fig 4, Freeway and Urban Expressway sections are longer, with risk points concentrated in a few critical locations. In contrast, risk points on Urban Roads are more evenly distributed across segment units, resulting in a higher proportion of medium-risk sections on Urban Roads.

3.5 Driving behavior database

For this research and analysis, the following indicators were selected: velocity, lateral acceleration, longitudinal acceleration, yaw rate, accelerator, steering angle, and steering angular velocity, as shown in Table 9. Real-time data collected during vehicle operation included speed, lateral acceleration, longitudinal acceleration, and yaw angular velocity, which were referred to as vehicle operating parameters. The accelerator, steering wheel angle, and steering angular velocity were associated with driver operating behaviors and referred to as vehicle control parameters. To match the driving data of all drivers with the road segments, latitude and longitude information was used, and the nearest neighbor analysis function in GIS was employed to assign each track point to its corresponding road segment.

Before constructing the BN model, it is important to establish the thresholds for these indicators. In this study, it is assumed that the majority (80%) of drivers’ behavior indicators are at the intermediate level, while a minority (20%) of drivers’ behavior indicators are at either the high or low level. With this assumption, the thresholds for each behavior indicator are determined using the 10th and 90th percentiles, resulting in three levels: High, Medium, and Low. The specific steps for determining these thresholds are as follows:

Step 1: Analyze the data on a certain road section by obtaining the 10% and 90% quantile data of the aforementioned indicators for each driver on a certain road section.

Step 2: Calculate the average value of these quantile values for all drivers on the same road section and use it as the percentile value of the final index for this road section.

Step 3: Take the average value obtained in step 2 on all the same road section types as the basis for the classification of the corresponding risk levels of the aforementioned indicators.

Following these steps, different indicators are divided into their respective high, medium, and low levels, as presented in Table 10.

thumbnail
Table 10. Value ranges corresponding to the high, medium, and low levels of different indicators.

https://doi.org/10.1371/journal.pone.0333623.t010

4. Road risk identification modeling

4.1 Construction of BN model

Feature selection for the BN model was informed by recent developments in learning-based risk prediction models [26], we applied a triple BN model <V, E, ɵ> in this study, constructed based on the adopted indicators. E represents a finite set of edges, which depict one-to-one relationships between nodes. The value of ɵ reflects the degree of causal relationship among each node. V represents the node, with as a non-empty and finite set. denotes the hidden node in the network model, while represents the observation node. The specifics of each node are outlined in Table 11. The final constructed road segment risk BN structure model is illustrated in Fig 5.

thumbnail
Fig 5. Structure of BN model for road segment risk identification.

https://doi.org/10.1371/journal.pone.0333623.g005

4.2 Road risk identification model reasoning

4.2.1 Model parameter establishment.

In the BN models for urban roads, expressways, and freeways, each node was in either a risk (Y) or non-risk (N) state, except for the observation nodes. The road section risk node’s risk state was represented by , the vehicle operating parameters’ risk state was represented by , and the vehicle control parameters’ risk state by . The observation nodes included speed, lateral acceleration, longitudinal acceleration, yaw rate, accelerator, steering wheel angle, and steering wheel angle speed, with their states categorized as Low, Medium, or High. The probability and conditional probability for these states appearing in different road types are shown in Table 12.

thumbnail
Table 12. Occurrence probability and conditional probability of observation nodes in different road types.

https://doi.org/10.1371/journal.pone.0333623.t012

4.2.2 Inference modeling based on Netica software.

We used Netica software to establish risk identification models. In order to do so, we needed to create a suitable BN model and input node data that is relevant to the analysis. This node data can be obtained from conditional probability tables that are based on expert assessments, or acquired through driving experiments with cyclic training. For our study, we chose the latter approach and used the “Equation” function to input the probability data. We then calculated the conditional probability of the hidden node by considering the correlation between different factors. The Fig 6 displays the proposed BN models for the three types of roads.

thumbnail
Fig 6. Three types of BN models for road segment risk.

(a) BN model for risk of urban road sections. (b) BN model for risk of urban expressway sections. (c) BN model for risk of freeway sections].

https://doi.org/10.1371/journal.pone.0333623.g006

4.2.3 Probability division of operational risk levels.

The risk levels were determined by the probability of risk occurrence, thus resulting in high, medium, and low risk. Table 13 illustrates the classification of road section operation risk levels.

thumbnail
Table 13. Classification of road section operation risk levels.

https://doi.org/10.1371/journal.pone.0333623.t013

4.3 Model sensitivity analysis

In the BN model, sensitivity analysis is a valuable tool for identifying the key factors that significantly impact the risk of road sections. Using the Netica software, we conducted sensitivity analysis on all nodes in the urban road sections BN model. The results are presented in Table 14. The analysis revealed that among all the observed nodes, Speed, lateral acceleration, and longitudinal acceleration had the greatest influence on the operational risk of urban road sections. For expressways and freeways, we used the same approach to determine the three observed nodes that have the most significant impact on the risk of road sections.

thumbnail
Table 14. Sensitivity analysis of driving risk on urban road sections.

https://doi.org/10.1371/journal.pone.0333623.t014

Next, we moved forward with altering the parameter value of the three observed nodes that have the highest impact on the risk of road sections. The sensitivity of the models to these changes was assessed by observing how they affected the final risk probability. Sensitivity analysis was performed on all three models.

Assuming the speed, lateral acceleration, and longitudinal acceleration were set at 100% in the low-risk segment, the BN deduction showed a decrease in the operational risk level from 23.4% to 12.2%, as seen in Fig 7(a). Conversely, assuming the same settings in the high-risk segment, the BN deduction indicated an increase in the operational risk level from 23.4% to 69.5%, as depicted in Fig 7(b). By altering the speed, lateral acceleration, and longitudinal acceleration, it was observed that the risk probability of urban road operations experienced significant fluctuations, demonstrating the sensitivity of the proposed model.

thumbnail
Fig 7. BN model when the risk index of urban road sections change.

(a) When the risk index of urban road sections decreased. (b) When the risk index of urban road sections increased.

https://doi.org/10.1371/journal.pone.0333623.g007

Lateral acceleration, speed, and steering angle were found to have the greatest impact on the operational risk of urban expressway sections. Assuming these factors were set at 100% in the low-risk segment, the BN deduction revealed a decrease in the operation risk level from 18.3% to 11.1%, as shown in Fig 8(a). Conversely, assuming the same settings in the high-risk segment, the BN deduction indicated an increase in the operation risk level from 18.3% to 30.2%, as illustrated in Fig 8(b).

thumbnail
Fig 8. BN model when the risk index of urban expressway sections increased.

(a) When the risk index of urban expressway sections decreased. (b) When the risk index of urban expressway sections increased.

https://doi.org/10.1371/journal.pone.0333623.g008

For freeways, longitudinal acceleration, speed, and lateral acceleration were found to have the greatest impact on freeway operation risk. Assuming these factors were set at 100% in the low-risk segment, the BN deduction revealed a decrease in the operation risk level from 14.7% to 9.8%, as shown in Fig 9(a). On the other hand, assuming the same settings in the high-risk segment, the BN deduction indicated an increase in the operation risk level from 14.7% to 25.7%, as illustrated in Fig 9(b).

thumbnail
Fig 9. BN model when the risk index of freeway sections increased.

(a) When the risk index of freeway sections decreased. (b) When the risk index of freeway sections increased.

https://doi.org/10.1371/journal.pone.0333623.g009

The changes in operation risk for the mentioned road sections are summarized in Table 15. It can be observed that as the risk probability of a specific sensitive factor of a road section significantly increased or decreased, the corresponding risk level of that section also underwent significant change. This demonstrates the strong sensitivity of the proposed BN models for urban road sections, urban expressway sections, and freeway sections.

thumbnail
Table 15. Influence of Observed Node Changes on Model Results.

https://doi.org/10.1371/journal.pone.0333623.t015

5 Conclusion

This study utilizes near-crash events to determine risk levels of road sections and establishes a Bayesian network (BN) model to capture the relationship between driving behavior and road section risk levels. The main conclusions and contributions of this study are as follows:

  1. (1) This study proposes a method for extracting near – crash events and identifying road section risks using connected ADAS. Analyzing 10,000 km of natural driving test data, we extracted 205 high-risk, 367 medium-risk, and 770 low-risk events, laying a data-driven foundation for subsequent analyses.
  2. (2) A novel road section risk level classification method is developed. We divided three road types (urban roads, urban expressways, and freeways) into sections. By matching near – crash events of different levels to road sections based on driving-collected longitude and latitude data and calculating risks according to event frequencies, we identified 576 low-risk, 37 medium-risk, and 4 high-risk sections. This method provides a practical approach for quantifying road risks.
  3. (3) Employing Netica software, we established separate BN models for different road types to explore the link between driving behavior and road section risk. These models demonstrated high sensitivity.

This study has several limitations. First, due to the challenges in obtaining accurate accident data, near-crash events were used as a proxy for real crash data, which may not fully capture the complexities of actual collisions. Second, our analysis is based on basic ADAS with risk warning functions. While sufficient for our research, these basic systems may lack the advanced capabilities of more sophisticated ADAS. Recent advancements like Automatic Emergency Braking (AEB), Lane Departure Warning (LDW), Lane Keeping Assistance (LKA), or Blind Spot Warning (BSW) [2728] can provide more information related to potential vehicle collision risks, thus enhancing the accuracy and applicability of our risk assessment model.

Future work will focus on integrating advanced ADAS systems to enhance risk identification precision and analyzing data from moments preceding near-crash events to optimize model performance under complex traffic conditions. Additionally, statistical methods will be explored to quantify structural differences among BN models across road types, investigating the influence of road design versus driving behavior patterns. These efforts aim to improve the robustness and applicability of traffic safety solutions.

References

  1. 1. Bei R, Du Z, Lyu N, Yu L, Yang Y. Exploring the Mechanism for Increased Risk in Freeway Tunnel Approach Zones: A Perspective on Temporal-spatial Evolution of Driving Predictions, Tasks, and Behaviors. Accid Anal Prev. 2025;211:107914. pmid:39787825
  2. 2. Lipovac K, Nešić M, Tešić M, Ross A, Tubić V, Marić B. A comparative analysis of dangerous locations on the public roads in Serbia. Safety Science. 2016;84:190–200.
  3. 3. Zahran E-SMM, Tan SJ, Tan EHA, Mohamad ’Asri Putra NA ’Atiqah B, Yap YH, Abdul Rahman EK. Spatial analysis of road traffic accident hotspots: evaluation and validation of recent approaches using road safety audit. Journal of Transportation Safety & Security. 2019;13(6):575–604.
  4. 4. Yakar F. Identification of Accident-Prone Road Sections by Using Relative Frequency Method. PROMTT. 2015;27(6):539–47.
  5. 5. Wu K-FK, Wang L. Exploring the combined effects of driving situations on freeway rear-end crash risk using naturalistic driving study data. Accid Anal Prev. 2021;150:105866. pmid:33276188
  6. 6. Formosa N, Quddus M, Ison S, Abdel-Aty M, Yuan J. Predicting real-time traffic conflicts using deep learning. Accid Anal Prev. 2020;136:105429. pmid:31931409
  7. 7. Gao Z, Liu Y, Zheng J. Predicting Hazardous Driving Events Using Multi-Modal Deep Learning Based on Video Motion Profile and Kinematics Data. 21st International Conference on Intelligent Transportation Systems (ITSC). Maui, Hawaii, USA. 2018, 3352–57.
  8. 8. Xie K, Yang D, Ozbay K, Yang H. Use of real-world connected vehicle data in identifying high-risk locations based on a new surrogate safety measure. Accid Anal Prev. 2019;125:311–9. pmid:29983165
  9. 9. Chen C, Liu L, Qiu T, Ren Z, Hu J, Ti F. Driver’s Intention Identification and Risk Evaluation at Intersections in the Internet of Vehicles. IEEE Internet Things J. 2018;5(3):1575–87.
  10. 10. Lyu N, Wen J, Duan Z, Wu C. Vehicle Trajectory Prediction and Cut-In Collision Warning Model in a Connected Vehicle Environment. IEEE Trans Intell Transport Syst. 2022;23(2):966–81.
  11. 11. Naik B, Tung L-W, Zhao S, Khattak AJ. Weather impacts on single-vehicle truck crash injury severity. J Safety Res. 2016;58:57–65. pmid:27620935
  12. 12. Guo F, Klauer SG, Hankey JM, Dingus TA. Near Crashes as Crash Surrogate for Naturalistic Driving Studies. Transportation Research Record: Journal of the Transportation Research Board. 2010;2147(1):66–74.
  13. 13. Wang J, Zheng Y, Li X, Yu C, Kodaka K, Li K. Driving risk assessment using near-crash database through data mining of tree-based model. Accid Anal Prev. 2015;84:54–64. pmid:26319604
  14. 14. Naji Hasan, Xue Q, Lyu N, Wu C, Zheng K. Evaluating the Driving Risk of Near-Crash Events Using a Mixed-Ordered Logit Model. Sustainability. 2018;10(8):2868.
  15. 15. Wu J, Xu H, Zhang Y, Sun R. An improved vehicle-pedestrian near-crash identification method with a roadside LiDAR sensor. J Safety Res. 2020;73:211–24. pmid:32563396
  16. 16. Lyu N, Duan Z, Ma C, Wu C. Safety margins – a novel approach from risk homeostasis theory for evaluating the impact of advanced driver assistance systems on driving behavior in near-crash events. Journal of Intelligent Transportation Systems. 2020;25(1):93–106.
  17. 17. Lyu N, Wen J, Duan Z, Wu C. Vehicle Trajectory Prediction and Cut-In Collision Warning Model in a Connected Vehicle Environment. IEEE Trans Intell Transport Syst. 2022;23(2):966–81.
  18. 18. Yan Y, Sheng Y, Yuan H, Liu H. Driving risk evaluation and speed control in passageway areas of freeway. Journal of Traffic and Transportation Engineering. 2011;11(02):90–6.
  19. 19. Sobol I. On sensitivity estimates for nonlinear mathematical models. Mathematical Modeling & Computational Experiment. 1993;1(1):112–8.
  20. 20. Wen J, Lyu N, Zheng L. Exploring safety effects on urban expressway diverging areas: crash risk estimation considering extreme conflict types. Int J Inj Contr Saf Promot. 2025;32(1):25–39. pmid:39676247
  21. 21. Lyu N, Xie T, Wen J, Wang Y. A real-time conflict risk prediction modeling based on causal inference and graph-model: Applied to multi-configuration expressway diversion zones. Transportation Research Part C: Emerging Technologies. 2025;178:105219.
  22. 22. Raphael E, Kiefer R, Reisman P, Hayon G. Development of a Camera-Based Forward Collision Alert System. SAE Int J Passeng Cars – Mech Syst. 2011;4(1):467–78.
  23. 23. Xu BG. Intelligent fault inference for rotating flexible rotors using Bayesian belief network. Expert Systems with Applications. 2012;39(1):816–22.
  24. 24. Zhang Y, Ji Q. Active and dynamic information fusion for multisensor systems with dynamic Bayesian networks. IEEE Trans Syst Man Cybern B Cybern. 2006;36(2):467–72. pmid:16602605
  25. 25. Ali Y, Haque MM, Zheng Z. An Extreme Value Theory approach to estimate crash risk during mandatory lane-changing in a connected environment. Analytic Methods in Accident Research. 2022;33:100193.
  26. 26. Yang L, Zhang J, Lyu N, Zhao Q. Predicting lane change maneuver and associated collision risks based on multi-task learning. Accid Anal Prev. 2025;209:107830. pmid:39500177
  27. 27. Masello L, Castignani G, Sheehan B, Murphy F, McDonnell K. On the road safety benefits of advanced driver assistance systems in different driving contexts. Transportation Research Interdisciplinary Perspectives. 2022;15:100670.
  28. 28. Aleksa M, Schaub A, Erdelean I, Wittmann S, Soteropoulos A, Fürdös A. Impact analysis of Advanced Driver Assistance Systems (ADAS) regarding road safety – computing reduction potentials. Eur Transp Res Rev. 2024;16(1).