Correction
5 Apr 2021: Elhenawy M, Glaser S, Bond A, Rakotonirainy A, Demmel S, et al. (2021) Correction: A framework for testing independence between lane change and cooperative intelligent transportation system. PLOS ONE 16(4): e0249804. https://doi.org/10.1371/journal.pone.0249804 View correction
Figures
Abstract
Cooperative Intelligent Transportation Systems (C-ITS) are being deployed in several cities around the world. We are preparing for the largest Field Operational Test (FOT) in Australia to evaluate C-ITS safety benefits. Two of the safety benefit hypotheses we formulated assume a dependency between lane changes and C-ITS warnings displayed on the Human Machine Interface (HMI) during safety events. Lane change detection is done by processing many predictors from several sensors at the time of the safety event. However, in our planned FOT, the participating vehicles are only equipped with the vehicle C-ITS and the IMU. Therefore, in this paper, we propose a framework to test lane change and C-ITS dependency. In this framework, we train a random forest classifier using data collected from the IMU to detect lane changes. Consequently, the random forest output probabilities of the testing data in case of C-ITS and control are used to construct a 2x2 contingency table. Then we develop a permutation test to calculate the null hypothesis needed to test the independence of the lane change during safety events and the C-ITS.
Citation: Elhenawy M, Glaser S, Bond A, Rakotonirainy A, Demmel S, Masoud M (2020) A framework for testing independence between lane change and cooperative intelligent transportation system. PLoS ONE 15(2): e0229289. https://doi.org/10.1371/journal.pone.0229289
Editor: Chen Lv, Nanyang Technological University, SINGAPORE
Received: October 16, 2019; Accepted: February 3, 2020; Published: February 27, 2020
Copyright: © 2020 Elhenawy et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The pre-processed data in excel sheet is provided and attached with the documents. Also, the raw (the UAH-DriveSet) data is available publically at: http://www.robesafe.uah.es/personal/eduardo.romera/uah-driveset/#download
Funding: The authors would like to acknowledge the support of the iMOVE Cooperative Research Centre (CRC), where this work is funded under grant number 1-002.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The United States was the first country to introduce the concept of intelligence into transportation systems in the 20th century [1]. Nowadays, the research and development of ITS have moved to other countries such as Japan, the European Union, South Korea, Australia, and Qatar. In the 1970s, the United States started the Electronic Route Guidance System (EGRS), which is considered one of the initial ITS applications. Later, in 1991, the Integrated Surface Transportation Efficiency Act (ISTEA) was enacted. The Transportation Equity Act for the 21st Century TEA-21 followed ISTEA.
C-ITS is the technology that enables the components of ITS (road users, vehicles, roadside units) to communicate and cooperate via wireless networks. The interaction between ITS components is based on several standards developed by different organizations such as European Committee for Standardization (CEN) [2], the International Organization for Standardization (ISO) [3], the European Telecommunications Standards Institute (ETST) [4], depending on the jurisdiction. An important step towards C-ITS was the cooperation between governmental transportation departments and vehicle manufacturers around the world to enable vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communications. This enabled C-ITS to build an integrated system of people, roads, and vehicles by applying and integrating communications, computers, and other technologies in the field of transportation. Integrating information and technologies can establish a large, full-functioning, real-time, accurate, efficient, and safe transportation system.
There are many C-ITS projects around the world [5–8] including:
- Cooperative Intelligent Transport Initiative (CITI) in Australia,
- Cooperative Intelligent Transport Systems (C-ITS) Pilot which is the main component of the Cooperative and Automated Vehicle Initiative (CAVI) in Australia[9],
- Cooperative Mobility Pilot on Safety and Sustainability Services for Deployment (COMPASS4D)—a C-ITS deployment in Europe,
- ETC2.0 Project in Japan, and,
- Safe and Intelligent Mobility—Test Field (SIM-TD) in Germany.
Recent research uses lane changes as a safety indicator to evaluate the safety benefits of C-ITS [10]. The challenge in such studies is the limited number of sensing modalities so that the trained machine learning models yield relatively low sensitivity and accuracy. The following paragraphs will show why lane change detection/prediction is an important surrogate measure of safety. We will briefly describe different approaches used for lane change detection/prediction and list the used sensing modalities.
Scene understanding, scene interpretation, and the prediction of the associated driver behaviour are critical tasks for advanced driver assistance systems, semi-autonomous, and fully autonomous driving vehicles. Lane change detection/prediction is one pattern of behaviour which has attracted the attention of researchers because of two reasons. First, it is important for semi-autonomous and fully autonomous driving. For example, lane change prediction of other non-autonomous vehicles is important for risk assessment for safe driving. Second, safely changing lane in traffic requires the driver to look forward and sideways to adjacent lanes. Looking in these two directions makes a lane change a complicated manoeuvre which could contribute to crashes. Fitch and Hankey analysed the 2009 National Automotive Sampling System General Estimates System and found that 7.44 per cent of all car crashes are due to lane changes [11]. The different approaches used for lane change detection/prediction are based on different sensing modalities, including mono and stereo vision, LIght Detection and Ranging (LIDAR), car odometry, inertial measurement unit (IMU), global positioning system (GPS), electrocardiogram (ECG), galvanic skin response (GSR), and respiration rate (RR).
Woo et al. [12] installed sensor system in the subject vehicle to predict the target vehicle trajectory and hence detect the lane change of the target vehicle; the system consisted of a position sensor (RT3003) and six laser scanners (Ibeo LUX). This system enabled the subject vehicle to acquire its position, the position of other vehicles, and their velocity. The authors pre-processed this information to construct the input features vector to the support vector machine (SVM) driving-intention estimation model. The SVM output is used to predict the trajectory of the target using the potential field method. Consequently, a lane change can be detected with high accuracy, on average, 1.74 s before the target vehicle crosses the centre line. Mandalia and Salvucci [13] applied SVM to data collected from an instrumented vehicle to detect lane change intentions. They tried several subsets of features that include lane position, heading, steering angle, and acceleration. Their proposed approach achieved the best recognition result with high accuracy when using lane position features at different longitudinal distances from the driver’s vehicle. Salvucci [14] proposed mind-tracking architecture for detecting driver lane changes. The proposed architecture simulates a set of possible driver intentions to predict their resulting steering-wheel angle and accelerator depression behaviours. Then the observed driver behaviour is matched with the predicted behaviour using a Gaussian kernel to find the most probable driver intention. The proposed architecture requires measuring two visual features in addition to the minimum time headway to either the lead vehicle (in case of lane-keeping) or to the lead vehicle in the destination lane (in case of lane changing). McCall and Trivedi [15] developed a novel “video-based lane estimation and tracking” (VioLET) system. The system is based on steerable filters to detect lane-markings. The proposed system is efficient in detecting circular-reflector markings, solid-line markings, and segmented-line markings in different lighting and road conditions. It is suitable for lane detection in different traffic situations including overpasses and tunnels. Xuan and Coifman [16] used the Differential Global Positioning System (DGPS) data to detect lane change without high-resolution maps. They started by creating a reference trajectory by estimating the median of multiple probe vehicle trajectories through a study corridor. After controlling for mandatory lane change manoeuvres (MLC), the authors created virtual lanes by setting lateral thresholds around the reference trajectory, which were then used for lane change detection. Henning et al. [17] compared selected behavioural and environmental indicators that can be used to predict the intention to change lane. The authors extracted these indicators from drivers, vehicles, and environment data collected in a field study. They reported that the glance to the left outside mirror is a potential indicator to predict the driver intention to change lanes.
Jeong et al. [18] used state of the art deep convolutional neural network to classify the adjacent lanes from two rear cameras. The proposed approach achieved 96.98% classification accuracy which makes it a promising assistance system for lane-change decisions for human drivers and autonomous vehicles. Jang et al. [19] proposed using eye movements and the pupil size of the driver to predict lane change intention. Extracted features from eye moments data are used to train a discriminative classifier to identify the probable lane change. Zheng and Hansen [20] explored the effectiveness of using steering angle and vehicle speed extracted from the CAN-bus for lane change detection. They proposed a machine learning-based segmentation and classification algorithm to detect lane-change and achieved 80.36% classification accuracy for lane-change-left and 83.22% for lane-change-right. Schlechtriemen et al. [21] used data collected by a vehicle equipped with a front-facing stereo camera and several radar sensors to obtain a 360 field of view. The data was used to extract several features, which were modelled using a mixture of Gaussian distributions. Then a naïve Bayesian classifier was used to estimate the probabilities of the left lane change, right lane change, and lane-keeping for new incoming feature vectors. Kasper et al. [22] used an object-oriented Bayesian network for the recognition of lane changes. The authors investigated the usage of lane-related coordinate systems and occupancy schedule grids for all modelled vehicles. Their proposed approach recognised a lane change 0.6s earlier than a standard adaptive cruise control (ACC) system.
There are two papers proposing using physiological signals from the driver to predict their intention to change lane. Those signals are the electrocardiogram (ECG), galvanic skin response (GSR), and respiration rate (RR). In the first paper, Murphey et al. [23] computed a set of biologically meaningful signals using the ECG, GSR and RR signals. Then a subset of the computed biological signals was selected using the Granger causality test. This subset of signals was used as input to a neural network for classification of lane changing and lane-keeping [23]. In the second paper, Gao et al. [24] used these three signals as an input to a proposed novel Group-wise Convolutional Neural Network to predict lane change intention. Experimental results using a data set collected from five different drivers showed that the proposed approach had a promising prediction accuracy.
The present paper is different from earlier research in that it developed a framework consisting of the following:
- Training a lane change/lane-keeping machine learning classifier using a limited number of sensing modalities (i.e. longitudinal acceleration, lateral acceleration, roll, and yaw)
- Overcoming the uncertainty of machine learning classifier if it classifies one data point. This occurs by summing up the output probabilities of the trained classifier of the whole collected data (i.e. all data points) to construct the 2x2 contingency table.
- Estimating the null hypothesis of the test statistic using the proposed permutation test.
This framework enables us to test the independence between the lane change and the C-ITS warning without collecting expensive data modalities such as videos to train accurate machine learning classifier. In other words, the proposed framework allows us to test the hypotheses that assumes C-ITS warnings at safety-critical situation help the drivers changing lane safely compared to the drivers who do not receive C-ITS warnings.
Problem statement and proposed approach
The authors of this paper are members of a larger team responsible for designing the largest on-road trial in Australia of cooperative vehicles and infrastructure. This FOT will be deployed on the public road from 2020 for a year. One goal of this FOT is evaluating the safety benefits of the Cooperative Intelligent Transport Systems (C-ITS). The FOT will test seven use-cases applications that hopefully will make roads safer and hence contribute towards the Vision Zero goal of no road deaths and serious injuries on Australian roads.
This FOT will include a large number of passengers car which are equipped with C-ITS and IMU devices. Moreover, many C-ITS roadside devices will be deployed on arterial roads and motorways. The installed devices will allow real-time vehicle-to-vehicle and vehicle-to-infrastructure communication to exchange information related to the road and vehicles; this combined information will generate relevant safety warning messages for drivers. In this paper, we are interested in two use-case applications, namely the Slow/Stopped Vehicle (SSV) and The Road Works Warning (RWW).
The Slow/Stopped Vehicle (SSV) event provides warning to the Subject Vehicle (SV) (upstream) which is triggered by a slow or a stopped C-ITS enabled Target Vehicle (TV) (downstream). Every cooperative vehicle is consistently broadcasting cooperative awareness message (CAM) [25] to other cooperative vehicles within communication range. These CAM messages include position, heading, speed, acceleration, and other parameters from the GPS chipset of the emitting vehicle. It is the responsibility of the SV to detect the SSV vehicle hazard by processing received CAM messages. The in-vehicle HMI will then display an SSV warning to the driver, as shown in Fig 1.
The Road Works Warning (RWW) provides an advanced warning to the SV approaching a roadworks zone. The RWW DENM (decentralized environmental notification message) is created and broadcasted by C-ITS-F which interfaces to a Transport and Main Roads road works database service. The DENM contains the work zone information such as the speed limit, lane closures, and location. The SV will receive the RWW DENM and decide on the relevance of its location to the work zone and, in particular, the approach zone to the road works. When the location of the roadwork becomes relevant, the HMI will display an RWW warning in addition to the speed limit and lane closure (if any) to the driver, directing them to take suitable action, as shown in Fig 2.
In the non-C-ITS environment, SSV is similar to lane closure due to roadworks because both can cause safety risk and traffic capacity drop. In the case of lane closure due to roadworks, the driver will see the posted warning before the position of the lane closure but this kind of warning does not exist in the SSV case. In the above two cases, in the previous two figures, the event speed is zero; however, the speed of SSV event is not zero. Thus, to our mind, it has less safety risk compared to the zero speed safety events.
In the case of Slow/Stopped Vehicle and lane closure from roadworks, we test the hypotheses that the driver will change lane early before the location of the lane closure. In other words, we need to test the independence of the lane change and the SSV warning as well as the independence of the lane change and the lane closure warning for RWW.
In the remaining of this paper, C-ITS warnings or application will only mean SSV or RWW. Moreover, lane change and early lane change will be used interchangeably. Testing the independence of the lane change and C-ITS warning displayed on the HMI will require constructing a 2x2 contingency table using the IMU data collected from the C-ITS vehicles and non-C-ITS (control) vehicles during the safety events related to a particular C-ITS application. Then using the chi-square statistical test to check the independence of lane changing and C-ITS warning. This approach looks straightforward; however, estimating the elements of the 2x2 contingency table is not trivial because we only have IMU data.
In this paper, given a set of labelled (lane change/lane-keeping) IMU reading, we proposed using the Fast Fourier Transform to estimate power spectrum for each of the IMU signals (Acceleration in Y [Gs], Acceleration in Z [Gs], Roll [degrees], and Yaw [degrees]) within a window of a predefined length. In a real scenario, this window will be started at the onset time of the C-ITS warning. Consequently, we concatenate the power spectrum of all signal of each example to form the corresponding features vector, which is a row in the training matrix. The training data is used to build a Random Forest model to classify the new unseen and unlabelled features vectors. For each unlabelled features vector, the classification output could be one and zero (which we called a “hard” decision), or the probability of each label (which we called a “soft” decision). Finally, we use the output of the classifier to construct the 2x2 contingency matrix and test the C-ITS and lane change and the direction of the association. In the following sections, we will describe in details each of the above steps and show the experimental results.
UAH-DriveSet dataset
In this section, we describe the dataset used to train and test the proposed framework to evaluate the safety benefits in the planned FOT. At the time of writing, we are in the planning phase for the FOT, when we proposed the hypotheses based on the FOT research questions. Moreover, the sensors measures and the models/frameworks to test the hypotheses are identified. Because we proposed a new framework, testing is needed before starting the data collection from the FOT; for that, we used the UAH-DriveSet [26]. The UAH-DriveSet it exhibits the same characteristics as the dataset we will collect for the FOT. This dataset was collected using six different drivers and vehicles. The dataset has more than 500 minutes of naturalistic driving on highways and arterial roads. The data set is collected with the smartphone application DriveSafe, which uses the smartphone’s inertial sensors, GPS, camera, and internet access. The smartphone with the DriveSafe is put up on the centre of the windshield and its rear camera is facing the road. DriveSafe executes its calibration routine at startup in order to make the smartphone perpendicular to the ground such that the Y-axis aligns with the lateral axis of the vehicle and Z-axis with the longitudinal axis of the vehicle. The experimenters used another phone and put it up on the windshield beside the first phone to capture the video of the road during the drive session. For each trip, the available data includes lane changes list, raw accelerometer data at 10 Hz, and the video of the trip, which is reduced to generate the lane changes list. The lane changes list is used as the ground truth labels in our experiment.
In the context of the C-ITS safety benefits evaluation, we are interested only in detecting lane changes versus lane keeping. Thus we consider slow right, fast right, slow left, and fast left lane changes one single class (lane change) and assign them the same label.
Methods
In this section, we will discuss the details of the features extraction and the method used to test the independence between lane changes and the C-ITS, as well as the method used to test the direction of the association between the lane changes and the C-ITS.
Random forest
The random forest (RF) [27] is an ensemble approach that is very efficient in dealing with continuous response and labels. The most significant advantage of the RF approach is that it does not suffer from overfitting because of the Law of Large Numbers, where the error rate of RF will get smaller as more CARTs are added. By adding more CATRs, the error rate will be saturated but will never get larger no matter how many CARTs are added.
The main concept behind RF is building a large group of weak models that will give a resultant strong model. The RF is a large group of un-pruned decision trees with a randomised selection of predictors’ subset at each split. Each un-pruned decision tree is trained using a bootstrap sample from the original dataset. The well-known machine learning technique called Classification And Regression Tree (CART) [28, 29] is one of the common decision trees used in RF. An RF starts with the CART which, in ensemble terms, corresponds to the weak model. CART is a greedy and recursive top-down binary partitioning that divides the feature space into sets of disjoint regions. These regions should be pure with respect to the response variable. Given trained RF classifier, the label assigned to a test example is obtained based on the majority votes from all trees, [30, 31].
Features extraction
We have four channels/sensors. Each one provides us with a time series, namely:
- Acceleration in Y filtered by KF (Gs)
- Acceleration in Z filtered by KF (Gs)
- Roll (degrees)
- Yaw (degrees)
In order to build a data set that has the lane changing examples, for each time series, we take a short window (2w+1) around the event as shown in Fig 3 and extract features from it. The output of the four windows from the time series is concatenated to form an example of a lane change in the dataset. We created several examples from the same lane change event by shifting the window left and right such that the location of the event inside the window would be different. Creating several examples from the same event has two advantages. First, this data set will have the event at different locations within the window and hence the model built using this data will be independent of the exact location of the event within the window. Second, we will overcome the problem of a relatively small number of events in the UAH-DriveSet. We tried two sets of features. The first set is based on calculating the following quantities of each window; note that we applied the measure to both the values inside the window and the difference of the values inside the window (Table 1).
The second set is based on calculating the FFT and then calculating the single side spectrum as the input feature for each time series within the window, as shown in Fig 3.
Chi-square test
As will be shown in the experimental work section, the sensitivity of the RF classifier is not good and our goal is only to detect if the number of lane changes and the usage of the C-ITS are independent. So instead of using zero/one output of the RF classifier (i.e. hard decision), we use the probabilities of lane change and lane-keeping (i.e. soft decision). The lane change and lane-keeping probabilities of the C-ITS and Non-C-ITS examples are used to construct the 2x2 contingency table. In this case, we are violating the assumption of the cells of the table being drawn from one multinomial distribution or two binomial distributions. However, the cell probabilities can be estimated using the sample mean. In the context of our problem, the explanatory variable is the presence (i.e. C-ITS) or absence (i.e. non C-ITS) of the C-ITS and the response is the lane change or lane-keeping. To test the independence of the number of the lane change and the C-ITS we used the Pearson chi-squared statistic
Where
nij is the observed cell (i,j)
μij is the expected frequency of cell (i,j)
Given n1+ of C-ITS examples and n2+ of non C-ITS examples. We do not know how many lane changes n11 or n21 and we want to test the independence of the lane change and the C-ITS.
Recall that the given examples are in the form of raw measurement from the IMU. Moreover, we are given a trained RF classifier which has low sensitivity. In this paper, the raw data is processed as described in the above section to extract the input features to the RF classifier. The output of the RF are the probabilities of the lane change and the lane-keeping given certain data example. For the n1+ C-ITS examples, the output of the RF is n1+ pairs of probabilities. Each pair consists of the probability of lane change and the probability of lane-keeping for certain example in n1+. The summation of the n1+ lane change probabilities are the observed n11 where the summation of the n1+ lane keeping is the observed n12. Similarly for n2+we have n21 and n22 which are the lane change and lane-keeping observed numbers in the case of the non C-ITS.
The above contingency table can be used to estimate three different probabilities: joint, marginal, and conditional. Let πij = P(presence of C−ITS = i, early lane change = j) denotes the probability of observing cell (i,j). These set of probabilities form the joint probability of the explanatory variable and the response and they satisfy ∑i,jπij = 1. We can get the marginal distribution of the lane change by calculating the column total π+1 = π11+π21 and π+2 = π12+π22. Where π+1 and π+2 are the marginal probability of lane change and lane-keeping respectively. Since we observed a sample from the population, so we need to estimate the πij using the sample mean , where N = n+1+n+2 = n1++ n2+
Moreover, we can use Table 2 to estimate a separate probability distribution for lane change at the given level of the explanatory variable (i.e. C-ITS or Non-C-ITS). These probability distributions are the conditional distribution. The lane change and C-ITS are said to be statistically independent if the population conditional distributions of lane changes are identical at each level of C-ITS. In other words, When these two variables are independent, the probability of lane change is the same at C-ITS and Non-C-ITS.
The odds ratio test
The odds of lane change are defined as where π is the probability of lane change. In the context of our 2x2 table, within the first row (C-ITS) the odds of lane change are
and within the second row (non C-ITS) the odds of lane change are
. Then the odds ratio from the two rows is defined as,
. The odds ratio can equal any positive number. The odds ratio θ equals one when the C-ITS and the lane change are independent (πNon−C−ITS = πC−ITS). So that θ = 1 is the baseline for comparison. When θ>1 this means that the probability of lane change (πC−ITS) in the presence of C-ITS is higher than the probability of lane change in the absence of C-ITS (πNon−C−ITS). When θ<1 this means that the probability of lane change (πC−ITS) in the presence of C-ITS is smaller than the probability of lane change in the absence of C-ITS (πNon−C−ITS).
As shown, the odds ratio has the advantage of showing the direction of the association between the C-ITS and the lane change which is important for evaluating the safety benefits of the C-ITS. In this paper, the sample odds ratio equals the ratio of the sample odds in the two rows . As mentioned in the above subsection, the observed cell counts are obtained by summing the output probabilities of the C-ITS and the non C-ITS examples using the RF. Then using the permutation test, we construct the null hypothesis. Consequently, we calculate the p-value of the observed sample odds ratio and drive the conclusion about the direction of the association between the C-ITS and the lane change.
Permutation test
The basic idea of the permutation test is randomly rearranging the observed sample to generate the sampling distribution of the test statistic under the null hypothesis. In the context of this paper, the null hypothesis is the lane change and C-ITS are independent. Under this hypothesis, all possible permutations are equally likely, and the Pearson chi-squared statistic is computed for one thousand of permutation to estimate its sampling distribution under the independence of lane change and C-ITS. Below are the steps to estimate the p-value using the permutation test:
- Given the RF outputs of C-ITS and Non-C-ITS examples (original data), construct the 2x2 contingency table;
- Compute the original test statistics using the original 2x2 contingency table;
- Randomly permute RF outputs, where we randomly move RF outputs among the C-ITS and Non-C-ITS as shown in Fig 4;
- Construct the 2x2 contingency table using the permuted data—notice that columns and rows sums are not changed (i.e. marginals are the same as the original data);
- Compute the test statistics using the 2x2 contingency table of the permuted data;
- Repeat steps 3, 4, and 5 large number of times to get the distribution of the test statistics under the null hypothesis.
Finally, we compare the original test statistic to the estimated null distribution to estimate the p-value as will be explained in the experimental work section.
Experimental work
This section is divided into two subsections. In the first subsection, we describe the RF classifier training. In the second subsection, we use the classification output of all examples in the dataset to test the proposed test at different sample and effect size.
Classification
The time series output of the four selected sensors is used to extract the features needed to train the RF. In this paper, we investigated two sets of features. The first features set is based on the calculation of the summary quantities shown in Table 1. These features are calculated using a window of the time series so it called “time-domain features”. We should highlight that this window includes the lane change event if it is corresponding to a lane change example. The second features set is the single side spectrum of a window of the time series around the lane change event.
The goal of the first set of experiments is to find a good window size (2w+1) for each of the features set. In order to do so, we used RF with 100 trees. We changed w from 2 to 20. This corresponds to a window size from 0.4 sec to 4 sec. We used leave-one-out-cross-validation to divide the data into a training set and a testing set. Where in each fold we keep all the examples belonging to one trip as a test dataset and all other trips are used as the training dataset. The training dataset is used to build the RF model which is tested using the unseen testing dataset. Then the classification outputs for the examples in the testing trip are saved. For each trip, we repeat this sequence of training-testing sequence and save the classification outputs. Finally, the classification outputs of all trips (i.e. the whole dataset) and the ground truth labels are used to calculate the True Positive Rate (TPR) and the False Positive Rate (FPR) where lane change is coded as positive. As shown in the top panel of Fig 5, the best RF using the time domain features is at = 16, w = 14 or w = 18 which is corresponding to 3.2 sec, 3.4 sec, and 3.6 sec respectively.
The bottom panel in Fig 5 shows the best RF using the frequency domain features is at w = 16, w = 14, w = 18, or w = 20 which is corresponding to 3.2 sec, 3.4 sec, 3.6 sec, and 4 sec respectively. It seems the frequency domain features are better than the time domain in terms of sensitivity. In addition, we decided to set w = 15 which equals 3 seconds window. We choose 3 seconds window because it has a relatively good TPR and FPR. In addition, because we are interested in the early lane change a 3 seconds window after the onset of the C-ITS warning is a reasonable choice, especially if we consider that the 85th percentile of the perception reaction time (PRT) of the drivers is in the range of 1.1 to 1.3 seconds [18, 19].
Statistical testing
In this subsection, we show how the power of the adapted test change with the sample and effect size. We defined the effect size as and the sample size is set equal to 4M. The dataset has 3,994 lane changes examples and almost 8,000 lane-keeping examples. In addition to the ground truth labels of these examples, we have the hard decision and soft decision classification output from the RF. We set up the experiment as shown in Table 3. We assume the sample size of the Non-C-ITS is 2M where randomly draw M lane change examples and M lane keeping examples without replacement. In the case of the C-ITS row, we randomly draw M+k lane change examples and M−k lane keeping examples without replacement. This makes the C-ITS examples equal the Non-C-ITS examples equal 2M. Notice that, for the randomly chosen examples, if we summed the ground-truth label, it would give us the numbers shown in Table 3. However, if we summed the hard decision or soft decision outputs of the RF of these examples, they would have different numbers than those shown in Table 3 due to the inaccuracy of the classifier.
The randomly selected subset of examples are used to estimate the p-value of the chi-square test as described in the following steps:
- The constructed 2x2 contingency table from soft decision output of the RF is used to calculate the original chi-square statistic (Statorg);
- Randomly permute the subset of the examples as described in the Methods section;
- Construct the 2x2 contingency table using the permuted data;
- Calculate the and save the chi-square statistic of the permuted data (Statperm);
- Repeat steps 2,3 and 5 B times, where B = 1000;
- Calculate the
where I is the indicator function.
We repeat the sequence of examples of the random draw, 2x2 contingency table construction, permutation test, and p-value calculation 1,000 times. Then we count the number of times we detected significant differences between C-ITS and Non-C-ITS given that there is a difference between the two groups. Besides, we count the number of times, we fail to detect significant differences between C-ITS and Non-C-ITS, given that there is no difference between the two groups. These counts are divided by 1,000 to estimate the pr(reject H0|H1 is true) and pr(accept H0|H0 is true) respectively.
The following figures show the pr(reject H0|H1 is true) and pr(acceptH0|H0 is true) at different M∈{250,500,1000,1500} and different effect size ∈[−0.2,0.2]. In addition, we tried positive and negative k value in order to show that the ground truth odds ratio and the predicted odds ratio are in the same direction, as shown in Fig 6.
The pr(reject H0|H1 is true) and pr(accept H0|H0 is true) at (a) M = 250, (b) M = 500, M = 1000 and M = 1500.
In addition to the pr(reject H0|H1 is true) and pr(accept H0|H0 is true) shown in the above figures, the probability of correct odds ratio are shown as well. The above curve shows only the probability of the ground truth odds ratio and the predicted odds ratio are both greater than one, or less than one without any statistical testing.
We conducted the following steps to test for the odds ratio:
- The constructed 2x2 contingency table from soft decision output of the RF is used to calculate the original statistics
;
- Randomly permute the subset of the examples as described in the methods section.
- Construct the 2x2 contingency table using the permuted data;
- Calculate the test statistics of the 2x2 contingency table of permuted data
and save it.;
- Repeat steps 2,3 and 5 B times, where B = 1000;
- Calculate
where I is the indicator function.
We repeat the sequence of examples of the random draw, 2x2 contingency table construction, permutation test, and p-value calculation 1,000 times. Then we count the number of times the odds ratio is significantly different from one (i.e. C-ITS and lane change are dependent). Additionally we count the number of times we fail to reject the null hypothesis given that there is no difference between the two groups. These counts are divided by 1,000 to estimate the pr(Correct association direction|H1 is true) and pr(Correct association direction|H0 is true) respectively, as shown in Table 4
As shown in Table 4 above, pr(Correct association direction|H1 is true) is high at the smallest sample size and reach one as the sample size gets bigger. In addition, the pr(Correct association direction|H0 is true) is very high for all tested sample sizes at 0.02 effect size.
Conclusion
In this paper, we proposed a framework to test the dependency of lane change and the C-ITS warning in two use-cases, namely: the roadworks warning (RWW) and slow/stopped vehicle (SSV). We used the longitudinal acceleration, lateral acceleration, roll, and yaw reading from the IMU. We extracted features from a short time window of predefined width. The extracted features from the four IMU time series were used to train an RF classifier to classify new unseen feature vectors. In order to evaluate the dependency of the lane change and the C-ITS, we collected data right after the onset of the C-ITS warnings that are related to the above use cases. For each warning data, a window of a three-second of each time series was used to extract the features, which were concatenated to form one input example to the RF. The Non-C-ITS examples were collected the same way but with a turned-off HMI. Finally, the classification probabilities of all C-ITS and Non-C-ITS examples were used to build the 2x2 contingency table and adopted statistical tests were applied. The experimental results showed that using the soft decision as an input to construct the contingency table for the Chi-square test gives better statistical power and a higher probability of correct acceptance of the null hypothesis. In addition, using the soft decision as an input to construct the contingency table for the odds ratio test gives a better probability of correct association direction. Consequently, the proposed framework is suitable to check the lane change and C-ITS dependency.
Acknowledgments
The authors would like to acknowledge the support of the iMOVE Cooperative Research Centre (CRC) for this work.
References
- 1.
S.-h. An, B.-H. Lee, and D.-R. Shin, "A survey of intelligent transportation systems," in 2011 Third International Conference on Computational Intelligence, Communication Systems and Networks, 2011, pp. 332–337: IEEE.
- 2.
European Committee for Standardization. Available: https://www.cen.eu/Pages/default.aspx
- 3.
International Organization for Standardization. Available: https://www.iso.org/home.html
- 4.
The European Telecommunications Standards Institute. Available: https://www.etsi.org/
- 5. Höfs W. et al., "International Deployment of Cooperative Intelligent Transportation Systems− Bilateral Efforts of the European Commission and United States Department of Transportation," 2012.
- 6. Sjoberg K., Andres P., Buburuzan T., and Brakemeier A., "Cooperative Intelligent Transport Systems in Europe: Current Deployment Status and Outlook," IEEE Vehicular Technology Magazine, vol. 12, no. 2, pp. 89–97, 2017.
- 7.
Aniss H., "Overview of an ITS Project: SCOOP@F," Cham, 2016, pp. 131–135: Springer International Publishing.
- 8.
J. Vreeswijk et al., "Compass4D: Cooperative Mobility Pilot on Safety and Sustainability Services for Deployment," in 10th ITS European Congress, Helsinki, 2014.
- 9.
CAVI components. Available: https://www.qld.gov.au/transport/projects/cavi/cavi-components#cits
- 10.
M. Elhenawy, A. Bond, and A. Rakotonirainy "C-ITS Safety Evaluation methodology based on Cooperative Awareness Messages," presented at the 21st IEEE International Conference on Intelligent Transportation Systems, Maui, Hawaii, USA, 2018.
- 11.
G. M. Fitch and J. M. Hankey, "Investigating improper lane changes: driver performance contributing to lane change near-crashes," in Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2012, vol. 56, no. 1, pp. 2231–2235: Sage Publications Sage CA: Los Angeles, CA.
- 12. Woo H. et al., "Lane-change detection based on vehicle-trajectory prediction," IEEE Robotics and Automation Letters, vol. 2, no. 2, pp. 1109–1116, 2017.
- 13.
H. M. Mandalia and M. D. D. Salvucci, "Using support vector machines for lane-change detection," in Proceedings of the human factors and ergonomics society annual meeting, 2005, vol. 49, no. 22, pp. 1965–1969: SAGE Publications Sage CA: Los Angeles, CA.
- 14.
D. D. Salvucci, "Inferring driver intent: A case study in lane-change detection," in Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2004, vol. 48, no. 19, pp. 2228–2231: SAGE Publications Sage CA: Los Angeles, CA.
- 15. McCall J. C. and Trivedi M. M., "Video-based lane estimation and tracking for driver assistance: survey, system, and evaluation," IEEE Transactions on Intelligent Transportation Systems, vol. 7, no. 1, pp. 20–37, 2006.
- 16.
Y. Xuan and B. Coifman, "Lane change manoeuvre detection from probe vehicle DGPS data," in Intelligent Transportation Systems Conference, 2006. ITSC'06. IEEE, 2006, pp. 624–629: IEEE.
- 17.
M. J. Henning, O. Georgeon, T. Wynn, and J. F. Krems, "Modelling driver behaviour in order to infer the intention to change lanes," in Proceedings of European Conference on Human Centred Design for Intelligent Transport Systems, 2008, vol. 113.
- 18.
S.-G. Jeong, J. Kim, S. Kim, and J. Min, "End-to-end learning of image based lane-change decision," in Intelligent Vehicles Symposium (IV), 2017 IEEE, 2017, pp. 1602–1607: IEEE.
- 19.
Y. M. Jang, R. Mallipeddi, and M. Lee, "Driver's lane-change intent identification based on pupillary variation," in 2014 IEEE International Conference on Consumer Electronics (ICCE), 2014, pp. 197–198.
- 20. Zheng Y. and Hansen J. H., "Lane-change detection from steering signal using spectral segmentation and learning-based classification," IEEE Transactions on Intelligent Vehicles, vol. 2, no. 1, pp. 14–24, 2017.
- 21.
J. Schlechtriemen, A. Wedel, J. Hillenbrand, G. Breuel, and K.-D. Kuhnert, "A lane change detection approach using feature ranking with maximized predictive power," in Intelligent Vehicles Symposium Proceedings, 2014 IEEE, 2014, pp. 108–114: IEEE.
- 22. Kasper D. et al., "Object-Oriented Bayesian Networks for Detection of Lane Change Maneuvers," IEEE Intelligent Transportation Systems Magazine, vol. 4, no. 3, pp. 19–31, 2012.
- 23. Murphey Y. l., Kochhar D. S., Watta P., Wang X., and Wang T., "Driver lane change prediction using physiological measures," SAE International Journal of Transportation Safety, vol. 3, no. 2, pp. 118–125, 2015.
- 24. Gao J., Murphey Y. L., and Zhu H., "Multivariate time series prediction of lane changing behaviour using deep neural network," Applied Intelligence, 2018/03/21 2018.
- 25.
E. ETSI, "302 637–2 V1. 3.2 (2014–11)," Intelligent Transport Systems (ITS); Basic Set of Applications; Part 2: Specification of Cooperative Awareness Basic Service
- 26.
E. Romera, L. M. Bergasa, and R. Arroyo, "Need data for driver behaviour analysis? Presenting the public UAH-DriveSet," in 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), 2016, pp. 387–392.
- 27. Breiman L., "Random forests," Machine learning, vol. 45, no. 1, pp. 5–32, 2001.
- 28. Breiman L., Friedman J. H., Olshen R. A., and Stone C. J., "Classification and regression trees", 1984.
- 29.
Elhenawy M., El-Shawarby I., and Rakha H., "Modeling the Perception Reaction Time and Deceleration Level for Different Surface Conditions Using Machine Learning Techniques," Cham, 2017, pp. 131–142.
- 30.
El-Shawarby I., Rakha H., Amer A., and McGhee C., "Characterization of Driver Perception Reaction Time at the Onset of a Yellow Indication," in Advances in Human Aspects of Transportation: Springer, 2017, pp. 371–382.
- 31.
J. Oh et al., "Estimating neuronal variable importance with random forest". In: IEEE Bioengineering Conference, vol. 29, pp. 33–34. IEEE, Los Alamitos, 2003, pp 33–34.