Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Cognitive workload evaluation of landmarks and routes using virtual reality

  • Usman Alhaji Abdurrahman ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Writing – original draft, Writing – review & editing

    usmanalhaji79@gmail.com, aausman18@fudan.edu.cn

    Affiliations School of Information Science and Technology, Fudan University, Yangpu District, Shanghai, China, Department of Computer Science, Yusuf Maitama Sule University, City Campus, Kano, Nigeria

  • Lirong Zheng,

    Roles Formal analysis, Investigation, Project administration, Supervision, Validation, Visualization

    Affiliation School of Information Science and Technology, Fudan University, Yangpu District, Shanghai, China

  • Shih-Ching Yeh

    Roles Conceptualization, Data curation, Methodology, Resources

    Current address: Department of Computer Science and Information Engineering, National Central University, Taoyuan City, Taiwan

    Affiliation School of Information Science and Technology, Fudan University, Yangpu District, Shanghai, China

Abstract

Investigating whether landmarks and routes affect navigational efficiency and learning transfer in traffic is essential. In this study, a virtual reality-based driving system was employed to determine the effects of landmarks and routes on human neurocognitive behavior. The participants made four (4) journeys to predetermined destinations. They were provided with different landmarks and routes to aid in reaching their respective destinations. We considered two (2) groups and conducted two (2) sessions per group in this study. Each group had sufficient and insufficient landmarks. We hypothesized that using insufficient landmarks would elicit an increase in psychophysiological activation, such as increased heart rate, eye gaze, and pupil size, which would cause participants to make more errors. Moreover, easy and difficult routes elicited different cognitive workloads. Thus, a high cognitive load would negatively affect the participants when trying to apply the knowledge acquired at the beginning of the exercise. In addition, the navigational efficiency of routes with sufficient landmarks was remarkably higher than that of routes with insufficient landmarks. We evaluated the effects of landmarks and routes by assessing the recorded information of the drivers’ pupil size, heart rate, and driving performance data. An analytical strategy, several machine learning algorithms, and data fusion methods have been employed to measure the neurocognitive load of each participant for user classification. The results showed that insufficient landmarks and difficult routes increased pupil size and heart rate, which caused the participants to make more errors. The results also indicated that easy routes with sufficient landmarks were deemed more efficient for navigation, where users’ cognitive loads were much lower than those with insufficient landmarks and difficult routes. The high cognitive workload hindered the participants when trying to apply the knowledge acquired at the beginning of the exercise. Meanwhile, the data fusion method achieved higher accuracy than the other classification methods. The results of this study will help improve the use of landmarks and design of driving routes, as well as paving the way to analyze traffic safety using the drivers’ cognition and performance data.

1. Introduction

One of people’s daily routines is navigating from one route to another. This includes moving from home to work, from a particular shop to another, and from offices to visiting friends [1]. While navigating, people tend to make sequential decisions, such as going straight, turning left, or turning right; this is called route knowledge [2]. Apart from route knowledge, other essential information is required for effective navigation. Information such as metrics is acquired from sensory sources, while features (e.g., landmarks) are perceived visually [3]. It was discovered that, in the course of route-finding, people tend to make one error per week on average, with 49% of those errors occurring when people turn in the wrong direction [4]. Metrics and feature information are needed to reduce the number of mistakes during navigation, as they play a vital role in enriching people’s knowledge. Feature information, such as landmarks, affects our ability to navigate effectively, as they provide positional and orientation information. Landmarks reflect spatial information of natural objects, reduce subjects’ cognitive loads, and increase their navigational efficiency [5]. Successful wayfinding requires accurate integration and memory of landmarks in their spatial relations [6].

Thus, landmarks may be categorized into local and global [79]. According to Waller et al. [10], local landmarks can be associative or beacons; associative landmarks indicate a decision point’s position, while beacon landmarks indicate both the position and how one should turn. Previous studies showed that environmental landmarks reduce participants’ mistakes when traversing a route. The landmarks help participants to apply the knowledge acquired while traversing a route from one direction when returning to the other [11].

In the current study, participants used landmarks to locate their respective destinations. Landmarks can be any distinctive point used for navigation, such as buildings, intersections, and streets [6, 12, 13]. In this study, locations such as a basketball court, a McDonald’s, a convenience store, a gas station, a post office, a church, and a Walmart were used as the primary source of landmarks. We will refer to these locations as landmarks in the subsequent sections. Therefore, we hypothesized that sufficient landmarks would reduce the overall number of mistakes that participants make during a navigational exercise.

One of the targets of this study was the measurement of navigational efficiency, which is the time required by the participants to complete an assigned task. In this study, participants were given the task of recognizing routes based on landmarks to reach their destination. The navigational efficiency was determined by analyzing the recorded data of the participants using driving performance metrics. Consequently, we hypothesized that the navigational efficiency of routes with sufficient landmarks would be remarkably higher than that of routes with insufficient landmarks.

Moreover, complex routes affect drivers because they have to process more information, which in turn increases their driving cognitive workload. As defined by Senders [14], the workload is a measure of effort dissipated by a human operator while performing a task, regardless of the performance of the task itself. When workload levels are low, the performance is also low because of inattention and missed information. As the workload increases, the level of performance increases as well up to a maximum level. This maximum performance represents the optimal workload level for a given task. An additional mental workload leads to an abrupt decrease in performance because of the extra amount of information to be processed, resulting in a high cognitive workload [15]. According to Sweller [16], the cognitive workload is the total amount of cognitive resources needed to process information in cognitive activities. The cognitive workload is characterized by psychophysiological changes (e.g., alterations in heart rate and skin conductance, and behavioral approach or avoidance). It involves several subcomponents occurring in frontal subcortical circuits [17, 18]. Therefore, we hypothesized that the easy and difficult routes used in our study would elicit different cognitive workloads. In addition, a high cognitive load would negatively affect the participants applying the knowledge acquired at the beginning of the exercise.

Several studies investigated the measurement of driving behavior using different psychophysiological parameters [1922]. Pupil size is well known to respond rapidly to changes in brightness in the visual field and has been used to measure the cognitive load while performing an assigned task [19, 22]. Research shows that heart rate increases when a participant is subjected to more challenging conditions [2326]. According to Brookhuis et al. [27], an increase in task demand, such as entering a traffic circle, increases the heart rate, which in turn decreases with the demand, e.g., driving on a two-lane highway. An increase in respiratory rate has been often related to an increase in cognitive demand [26, 28, 29]. Another essential metric that has been found to increase as the cognitive workload increases is skin conductance [21]. Electroencephalogram (EEG) signals are sensitive and reliable for cognitive load measurement [30, 31]. Thus, in this study, heart rate and pupil size were used to measure the cognitive workload of the participants during the navigation exercise.

The application of previously acquired knowledge and skills in new learning or problem-solving situations is termed as transfer of learning [32]. According to Perkins et al. [33], transfer of learning is said to occur when learning in one context enhances (positive transfer) or undermines (negative transfer) a related performance in another context. It expresses the ability to transfer what was learned from one context or situation to another [34]. Some previous studies argued that transfer is only possible when the original and the new learning situations are the same or very similar; this is termed as a near-transfer. Other researchers achieved transfer across different learning situations, termed as far-transfer [35]. However, the degree of transfer increases as the similarity of the elements increases. Thus, we used similar elements in our study to simplify the process of learning transfer.

Virtual reality (VR) technology was used to achieve the stated objectives. VR technology can create immersive and realistic interactive environments for behavioral learning. Besides, VR technology provides individualized treatment, accurate control of complex stimuli, and a structured and safe learning environment [36, 37]. Therefore, we employed a VR-based driving system to investigate the effects of landmarks and routes on navigational efficiency and transfer of learning.

The remaining sections of this paper are organized as follows: Section 2 presents the methods employed and the experiments conducted in this work. Section 3 presents the results obtained from the experiments. Section 4 gives a thorough analysis of the presented results. Section 5 concludes the paper.

2. Methods and experiments

2.1 Hypotheses

This study aimed to evaluate the effects of different landmarks and routes on navigational efficiency using a VR-based driving system. Psychophysiological metrics such as heart rate, eye gaze, pupil size, and driving performance features were used to assess the cognitive load experienced by the participants. Thus, we hypothesized the following points:

  1. The use of insufficient landmarks would elicit an increase in psychophysiological activation, such as increased heart rate, eye gaze, and pupil size.
  2. An increase in psychophysiological activation would cause the participants to commit more errors.
  3. The navigational efficiency of routes with sufficient landmarks is remarkably higher than that of routes with insufficient landmarks.
  4. The easy and difficult routes would elicit different cognitive workloads. In addition, a high cognitive load would negatively affect the participants applying the knowledge acquired at the beginning of the exercise.

2.2 Participants

Seventy-nine (79) undergraduates, distributed as 36 males and 43 females, from Fudan University participated in the experiment. The participants age between 18 and 24 years old (Age: M = 19.27; SD = 2.31) and were recruited through in-house advertisement. No significant differences were found in terms of age, education, or gender. The participants were divided into two (2) groups; group X and group Y. Group X had 39 participants (18 males and 21 females), and group Y had 40 participants (18 males and 22 females). All volunteers had normal eyesight and no real-life driving experience. Before conducting the experiment, the participants filled out a consent form, and approval was granted by the university. All participants received a reward after the experiment. The recorded data were analyzed, and they are reported below.

2.3 Experimental materials

The experiments were conducted at Fudan University. The VR driving system used in this study is illustrated in Fig 1. The materials used include Vive Pro Eye for tracking eye data [38] and a Logitech G27 steering-wheel controller for controlling the virtual agent vehicle in the driving environment. An Autodesk Maya [39] and Esri CityEngine [40] were used to design landmarks, traffic lights, routes, intersections, cars, and buildings, while Unity3D [41] was used to develop the game platform. The driving routes consisted of city roadways that comprised long straightaways, several turns, intersections, and landmarks (such as a basketball court, a McDonald’s, a convenience store, a gas station, a post office, a church, and a Walmart). Two difficulty levels were developed for the VR-based driving system, with each level comprising two driving assignments.

thumbnail
Fig 1. VR driving system.

A participant driving in the virtual system (left) and a snapshot containing the ego vehicle, sample of the driving route, intersection, and landmark within the virtual system (right).

https://doi.org/10.1371/journal.pone.0268399.g001

The data capture module recorded the participant’s information while driving. A Vive Pro eye tracker recorded the eye-gaze data (including the pupil size and gaze origin) at 50 Hz. A heartbeat recording device designed in our laboratory was used to track the subject’s heart rate at 500 Hz while driving in the virtual environment.

2.4 Experimental procedure

All participants submitted their informed consent forms before the commencement of the experiments. The experiments were conducted according to relevant guidelines and regulations. As stated earlier, there were two (2) groups. Each group had two (2) sessions in the study: group X1 (sufficient landmarks and easy routes), group X2 (insufficient landmarks and easy routes), group Y1 (sufficient landmarks and difficult routes), and group Y2 (insufficient landmarks and difficult routes). The two sessions per group presented the same difficulty; however, the number of landmarks varied. Group X1 had seven landmarks, group X2 had three landmarks, group Y1 had seven landmarks, and group Y2 had five landmarks. The easy routes had three turns, intersections, and three traffic lights, while the difficult routes had five turns, intersections, and five traffic lights. Each participant completed two (2) sessions of the assigned group (either X1 and X2, or Y1 and Y2).

Baseline data were recorded prior to the experiments. Then, a video tutorial was given to the participants to introduce them to the routes they were going to follow, as well as the landmarks, intersections, and traffic lights they were going to see, to help them complete the given task. This video was played only once, and the participants were asked to memorize all the landmarks along the routes. The staff then set up the eye-tracking and heartbeat sensors on the subject’s body and ensured that all signals were properly recorded and the devices operated perfectly. The subjects were then asked to carry out the pre-selected driving assignment based on the video tutorial they had just watched.

The subjects used the landmarks along the routes to locate the right turns to take them to their various destinations. Thus, whenever a subject made a wrong turn, they were dragged to the starting point to re-watch the video tutorial and start driving all over again. The tutorial sessions were included to assess the system and improve the subject’s driving performance. However, we did not consider the performance of these sessions in data analysis.

2.5 Experimental design and data collection

This study employed a VR-based driving system to determine the effects of landmarks and routes on navigational efficiency and learning transfer. We chose the between-group design of 2 (landmarks: sufficient landmarks and insufficient landmarks) x 2 (routes: difficult routes and easy routes). The participants in each group completed two (2) sessions of the four tests (either X1 and X2, or Y1 and Y2).

All physiological measures such as pupil size and heart rate, and the driving performance data were generated from each session during the experiment. Data for all 79 subjects were extracted for analysis. To enhance the credibility and robustness of the results, all the generated data were preprocessed to remove the undesired parts. The median value method was used to reduce the noise level of the eye gaze data. Pupil size data were extracted from the eye gaze data. We extracted the following ten features from the eye gaze data, motivated by previous studies [19, 20]: fixation rate; blink rate; mean (M) and standard deviation (SD) of blink duration; M and SD of fixation duration; M and SD of pupil size; and M and SD of saccade duration. Similarly, M and SD were also extracted from the heart rate data.

Data related to the participants’ task performance were also generated. The performance features indicate how well a participant completed the assigned task; some examples of these features are the task completion time, wrong turns, and collision counts. The performance features used in this study and their meanings are listed in Table 1. We extracted six features: M and SD of task completion time, M and SD of wrong counts, and M and SD of barricade counts. The extracted data were used as the input vectors for the data fusion method.

2.5.1 Navigational efficiency determination.

One of the primary objectives of this study was to investigate the effects of landmarks and routes on navigational efficiency. After the participants completed their assigned task successfully, the total time spent (task completion time) in completing the task and the number of mistakes (wrong turns) were recorded in terms of driving performance metrics. The recorded data were used to determine the navigational efficiency of the assigned task.

2.5.2 Learning transfer determination.

We also investigated whether landmarks and routes influence learning transfer. This was achieved by evaluating the differences in psychophysiological response patterns associated with driving along the routes (easy and difficult) and landmarks (sufficient and insufficient). All psychophysiological metrics (such as drivers’ pupil size and heart rate) and performance measures were recorded simultaneously throughout the experiment. All these data were used to determine the subjects’ cognitive load, which, by extension, helped assess its effect on learning transfer.

2.6 Data fusion methods

The extracted features from different modalities were input into the classifiers to assess the participants’ cognitive workload. Five (5) popular classifiers (Table 5) were used to classify the cognitive workload in this study: a Support Vector Machine (SVM), an Artificial Neural Network (ANN), Naïve Bayes (NB), K-Nearest Neighbor (KNN), and a Decision Tree (DT). Three-level approaches for data fusion were used, namely feature-level fusion, decision-level fusion, and hybrid-level fusion. The structures of the data fusion methods used to fuse multimodal information are shown in Fig 2.

thumbnail
Fig 2. The structures of the data fusion methods.

A feature-level fusion (a), a decision-level fusion (b), and a hybrid-level fusion (c).

https://doi.org/10.1371/journal.pone.0268399.g002

Fig 2A depicts feature-level fusion. The extracted features from the eye gaze, heart rate, and driving performance were input into the preprocessing module. The features were normalized there, and their dimensions were reduced using principal component analysis. The cognitive load was determined from the preprocessed data.

In decision-level fusion (Fig 2B), the extracted features from each modality were preprocessed separately and then input into different classifiers. These classifiers then output a cognitive load as a sub-decision (Sn).

Finally, the fusion module added up these sub-decisions with weights (w) and computed the final cognitive load (CLfinal) according to the following expression: (1)

Fig 2C depicts the hybrid-level fusion used in this study. It combines both feature-level and decision-level fusions. Heart rate and performance features were preprocessed through hybrid-level fusion to obtain a sub-decision. The remaining features (pupil size and eye gaze) were preprocessed separately, and additional decisions were obtained. The final sub-decision was obtained by adding all the sub-decisions with weights.

3. Results

This study aimed to investigate the effects of landmarks and routes on navigational efficiency and learning transfer using a VR-based driving system. There were 39 data points for group X1 (easy routes and sufficient landmarks) and X2 (easy routes and insufficient landmarks), and 40 data points for group Y1 (difficult routes and sufficient landmarks) and Y2 (difficult routes and insufficient landmarks), resulting in a total of 79 data points. Statistical and data fusion methods were used to analyze the data.

3.1 Statistical method

A t-test method was employed to assess the influence of landmarks and routes on cognitive load. The assessments were performed as follows:

3.1.1 Assessment of spatial cognitive load.

Participants’ cognitive loads were examined using recorded psychophysiological responses and driving performance features. The following responses were obtained.

  1. a. Response of psychophysiological measures with cognitive load

A t-test method was employed for comparison. The heart-rate results show that group X using sufficient landmarks was significantly different from group Y using insufficient landmarks (t = -230.10, p = 1.89E-26). The heart rate of group Y (M = 76.2, SD = 10.1) was significantly higher than that of group X (M = 74.3, SD = 6.3). In addition, the heart rate of group X1 was significantly higher than that of group X2 (t = -147.65, p = 1.34E-14), and the heart rate of group X2 (M = 74.3, SD = 8.1) was significantly higher than that of group X1 (M = 73.2, SD = 4.3). Similarly, the heart rate of group Y1 was significantly different from that of Y2 (t = -158.32, p = 1.45E-16), and the heart rate of group Y2 (M = 76.8, SD = 9.4) was significantly higher than that of group Y1 (M = 75.1, SD = 7.6).

Furthermore, the results of the pupil size show that group X using sufficient landmarks was significantly different from group Y using insufficient landmarks (t = -19.49, p = 4.56E-13). The pupil size of group Y (M = 5.4, SD = 0.71) was significantly larger than that of group X (M = 4.3, SD = 0.64). Additionally, the pupil size of group X1 was significantly larger than that of group X2 (t = -17.20, p = 3.7E-12), and the pupil size of group X2 (M = 4.5, SD = 0.3) was significantly larger than that of group X1 (M = 3.9, SD = 0.7). Similarly, the pupil size of group Y1 was significantly different from that of group Y2 (t = -17.53, p = 4.8E-13), and the pupil size of Y2 (M = 5.4, SD = 0.71) was significantly larger than that of group Y1 (M = 4.9, SD = 0.31).

The results obtained from all the groups were associated with the nature of the routes and insufficient use of landmarks.

  1. b. Response of driving performance features with cognitive load

Similar to psychophysiological measures, the results of driving performance measures (Table 2) show a significant difference between groups X and Y using the t-test method (t = 9.63, p = 2.3E-21). Likewise, the performance measures of group X1 were significantly different from those of group X2 (t = 6.41, p = 1.9E- 18), and the performance measures of group Y1 were significantly different from those of group Y2 (t = 7.32, p = 2.0E-19). As shown in Table 2, the performance features of participants who drove the difficult routes with insufficient landmarks were significantly higher than those who drove easy routes and sufficient landmarks.

Moreover, as hypothesized, participants who drove the difficult routes with insufficient landmarks made more mistakes than their counterparts who drove easy routes (Table 2). Cognitive workload significantly differed between difficult and easy routes regarding performance features (t = 5.54, p = 0.001), leading to a significant interaction between the two groups.

  1. c. Assessment of navigational efficiency

Previous studies focused on the task completion time regarding navigational efficiency and cognitive workload. Task completion time is an essential indicator of the cognitive workload and plays a vital role in practical applications owing to its significant effect on learning transfer. A higher task completion time means that a participant had to use more mental capacity to identify the right way to follow, resulting in a more significant workload.

The navigational efficiency assessment can be obtained by analyzing the task completion time (as shown in Table 3) and observing whether intergroup differences exist owing to different landmarks and routes. The two-sample t-test method was used for intergroup comparison. The results show that the task completion time of group X1 was significantly different from that of group X2 (t = -90.83, p = 5.16E-12), and the task completion time of group X2 using insufficient landmarks (M = 250.34, SD = 42.05) was significantly higher than that of group X1 using sufficient landmarks (M = 203.13, SD = 22.48). Likewise, the task completion time of group Y1 was significantly different from that of group Y2 (t = -97.47, p = 7.10E-13). The task completion time of group Y2 using insufficient landmarks (M = 347.85, SD = 35.13) was significantly higher than that of group Y1 using sufficient landmarks (M = 289.37, SD = 32.54). The relationship between difficulty level and mean of the task completion time can be viewed in Fig 3.

thumbnail
Fig 3. Relationship between difficulty level and mean task completion time.

Participants spent much time completing the task as the task difficulty level increased.

https://doi.org/10.1371/journal.pone.0268399.g003

  1. d. Effects of landmarks and routes on gender

The use of insufficient landmarks and difficult routes influenced the participants according to their gender. The experimental results show that female participants generally drove slower than their male counterparts. This resulted in female participants spending much time to complete the exercise owing to the extra cognitive load. Moreover, the mean change applied to cognitive load and task completion time reveal the degree of these effects. As shown in Table 4, the mean change (ΔM1) in sufficient landmarks for females (ΔM1 = 142.70) was more significant than that for males (ΔM1 = 112.29). This indicates that insufficient landmarks influenced female participants more than their male counterparts, as shown in Fig 4.

thumbnail
Fig 4. Mean task completion time.

Mean sufficient and mean difficulty.

https://doi.org/10.1371/journal.pone.0268399.g004

Additionally, the mean change in difficult routes for males (mean difficulty) (ΔM2 = 135.19) was barely different from that for females (ΔM2 = 133.22). This indicates that the effect of difficult routes on navigational efficiency is similar for both sexes, as shown in Fig 4.

3.2 Data fusion method

3.2.1 Feature-level fusion.

Table 5 lists the single-modality classifiers used in this study. The accuracies of feature-level fusion versus single-modality algorithms are presented in Table 6. The feature-level fusion achieved the highest accuracy, 93.21%, compared to the single-modality algorithms. The best accuracy in heart rate was obtained from the KNN algorithm, while that of eye gaze, pupil size, and performance was obtained from the SVM algorithm.

thumbnail
Table 6. Feature-level fusion vs. single-modality classifiers (%).

https://doi.org/10.1371/journal.pone.0268399.t006

3.2.2 Decision-level fusion and hybrid-level fusion.

All the classifiers for each modality were applied to decision-level fusion to obtain the sub-decision. Additionally, different combinations of weights were tested for every sub-decision. The best accuracy for the decision-level fusion, 91.45%, was obtained from the KNN algorithm.

The accuracy obtained from hybrid-level fusion was 95.34% higher than that of feature-level and decision-level fusions. This accuracy was achieved when pupil-size and eye-gaze features were combined for sub-decision one with the SVM algorithm, heart rate for sub-decision two with the KNN algorithm, and performance for sub-decision three with the ANN algorithm.

4. Discussion

The results just presented show that the use of landmarks (sufficient landmarks, insufficient landmarks) and routes (easy routes, difficult routes) affects the navigational efficiency and transfer of learning; however, the impact of these effects differs. This section thoroughly analyzes the results presented in the previous section.

4.1 Effect of landmarks on driving workloads

We hypothesized that sufficient landmarks would reduce the overall number of mistakes that participants make during navigational exercise. Sufficient landmarks reduced the participants’ number of mistakes from both easy and difficult routes. This can be concluded from the results in Table 2 concerning the response of driving performance features with cognitive load. This finding aligns with a previous study in which participants learned a route in one direction [42]. In addition, according to Ruddle [43], landmarks primarily facilitate traveling between specific places instead of assisting in learning the overall layout of a space. These landmarks provided positional information, which helped address the participants’ most common mistake: going straight instead of turning left or right.

Moreover, we hypothesized that the navigational efficiency of routes with sufficient landmarks would be significantly higher than that of routes with insufficient landmarks. This hypothesis was also supported by the results obtained regarding task completion time for all groups. In addition, these results are supported by previous studies [5, 11, 44]. The results show that the task completion times of groups X2 and Y2 was significantly higher than those of groups X1 and Y1, respectively. This happened because the participants required more cognitive resources (i.e., high cognitive load) in searching for the right routes to follow to reach the destination. Consequently, this negatively affected the participants in applying the knowledge acquired at the beginning of the exercise.

4.2 Effect of workloads on psychophysiological metrics

One of our primary analyses in this study was to evaluate the differences in psychophysiological response regarding navigation on easy routes using sufficient landmarks versus insufficient landmarks, and on difficult routes using sufficient landmarks versus insufficient landmarks. As anticipated, navigation using insufficient landmarks elicited increased psychophysiological activation in terms of, for instance, pupil size and heart rate. An increase in cognitive workload was related to the conditions of a cognitively challenging task (searching for the right turn owing to insufficient landmarks and difficult routes), which led to an increase in the response level.

The results obtained for the pupil size are supported by several findings, as indicated in [21, 45, 46]. According to Kahneman et al. [47], pupillary diameter increases as the amount of information loaded into working memory increases. In an experiment conducted by Querino et al. [45], the pupillary diameter was used to distinguish the cognitive effort between automated and controlled cognitive processing during the so-called five digits test (FDT) as the task progressed. The results show that, compared to a control task, the FDT required higher cognitive effort for each consecutive part, and the first half of every aspect of the FDT induced more size than the second.

Similarly, several previous studies also support our findings [48, 49]. According to Soroosh et al. [49], an increase in perceived cognitive workload appears to increase the sympathetic and parasympathetic components of the autonomic nervous system. Verway et al. [50] tested participants through cognitive tasks with respect to a control task in which no additional cognitive task was carried out while driving. The results show decreased heart rate variability and increased heart rate (decreased IBIs) when performing the cognitive tasks.

4.3 Effect of workloads on driving performance

In this study, we considered and investigated the task completion time, wrong turns, and collision counts for driving performance. These parameters were quantified and measured under different cognitive workloads, and the participants’ driving performance was assessed. Concerning the task completion time, participants in groups X2 and Y2 required more time to reach the destination than their counterparts in groups X1 and Y1. This happened because of the additional time needed by the participants to search for the right direction to reach the destination. This extra time also led to an increase in the cognitive workload of the participants, thereby hindering the transfer of learning. Our results are consistent with those of Fan H. et al. [5]. The navigational task-finishing time of the group using the full-landmark map was significantly higher than that of the group using the key-landmark map.

The wrong turns and collision counts were also higher in groups X2 and Y2 than in groups X1 and Y1 for the same reasons. The cognitively challenging task of searching for the right turn using insufficient landmarks was tedious, especially on difficult routes. This added more information to the participants’ working memory, which required more cognitive resources to process them. This result is in line with previous research conducted by Lyu et al. [51], in which the speed maintenance and lane deviations were significantly different under different levels of cognitive workload.

4.4 Effect of psychophysiological metrics and driving performance on gender

According to Underwood G. et al. [52], the cognitive workload is higher for novice drivers than for skilled drivers, as novice drivers need to pay much attention while driving. Therefore, only novice drivers were recruited for the experiments in this study. As shown in the results, insufficient landmarks and difficult routes presented a gender bias. The female participants paid more attention to the landmarks and drove slower than their male counterparts. Thus, they made more mistakes (higher wrong turns and collision counts) while driving. Moreover, female participants had higher task completion times than their male counterparts [19, 51].

4.5 Multimodal data fusion

For the single-modality classification, the best accuracy in heart rate (90.78%) was obtained from the KNN algorithm, while that of eye gaze (89.73%), pupil size (91.67%), and performance (89.67%) was obtained from the SVM algorithm. These results were consistent with the findings of Alzubi et al. [53]. In their study, an SVM combined with Harris Hawks optimization (HHO) for Android malware detection was proposed. It was observed that the proposed approach outperformed the baseline approaches on most datasets and measures.

It was verified that feature-level fusion outperforms all the single-modality classification algorithms in cognitive workload measurement, as indicated by its best and average accuracy. The best accuracy (93.21%) was obtained using ANN, which agrees with the research conducted by Movassagh et al. [54]. In their research, they revealed that ANN is more convergent with the neural network coefficient than the existing algorithms. Several studies followed a multimodal method to measure the cognitive load [19, 55, 56]. According to Novak et al., measuring the cognitive load with physiological signals and task performance features together can achieve higher accuracy than using physiological signals or task performance features individually [55]. Putze F et al. [57] combined skin conductance, EEG, respiration, and pulse to categorize CL into visual and cognitive tasks through simple majority voting fusion. The results show that decision-level fusion outperformed the single-modality method in one task, while it was surpassed in other tasks. According to Abdurrahman U. A. et al. [19], decision-level fusion outperforms feature-level fusion as well as single-modality methods with an accuracy of 94.67%. In another study conducted by Alzubi et al., a consensus-based combining method (CCM) was proposed and evaluated [58]. In thier study, the effectiveness of CCM was evaluated by comparing its performance with that of existing linear combination methods (majority voting, product, and average methods). The experimental results show that CCM provides a significant improvement in terms of accuracy over the product and average methods, in addition to demonstrating that the CCM’s classification accuracy is better than or comparable to that of majority voting.

Hussain S. et al. [59] combined galvanic skin response (GSR), ECG, Eye, and RESP features from physiological sensors into a classification model to investigate the cognitive load. Participants’ task performance features were applied to different classification models; sub-decisions were then combined using majority voting. The results show that hybrid-level fusion improved the classification accuracy by 6% compared to single classification methods. In another report by Abdurrahman U. A. et al., hybrid-level fusion performed better than feature-level and decision-level fusions with the highest accuracy, i.e., 97.14% [19].

5. Conclusions

This study investigated the effects of landmarks and routes on the navigational efficiency and learning transfer using a VR-based driving system. In addition, the fundamental reasons for these effects were thoroughly investigated. This study primarily aimed to measure both the psychophysiological and performance behaviors of the participants. Cognitive workload was applied using insufficient landmarks and difficult routes. The participants were required to use these landmarks to identify the correct directions to reach their destinations based on the knowledge acquired at the beginning of the exercise. While driving through these routes, driving performance and psychophysiological metrics were recorded simultaneously throughout the experiments. All the stated hypotheses of the study were confirmed by analyzing both the psychophysiological and performance features of the subjects. Human cognition and behavior correlate with psychophysiological characteristics that are strongly related to a constantly changing environment. The results of this study would help road administrators deduce how to correctly plan and accurately upgrade the operating conditions of public roads. It would also help them analyze traffic safety using driver cognition and driving performance data. A limitation of this study is that a driving simulator was used to collect data. However, data collected from the driving simulator can be easily controlled and reproduced, and dangerous driving conditions can be experienced without the risk of physical injury. Nevertheless, in the future, we will consider collecting data using real-world driving experiments and analyze the proposed hypotheses. The total number of participants in the study was 79. More samples may be required to obtain credible and robust results for a study of this nature. In the future, we will consider recruiting more participants to obtain more reliable and convincing results.

Acknowledgments

We wish to extend our sincere gratitude to Fudan University students who participated in the study for their valuable time. Thanks also to Yunying Wong for facilitating the data collection from the participants.

References

  1. 1. Stokes G, Karen L. National travel survey analysis. Transport Studies Unit, School of Geography and the Environment. 2011.
  2. 2. Siegel AW, White SH. The development of spatial representations of large-scale environments. Advances in child development and behavior. 1975; 10:9–55. https://doi.org/10.1016/s0065-2407(08)60007-5 pmid:1101663
  3. 3. Montello DR. A new framework for understanding the acquisition of spatial knowledge in large-scale environments. Spatial and temporal reasoning in geographic information systems. 1998; 143–154.
  4. 4. Williamson J, Barrow C. Errors in everyday routefinding: a classification of types and possible causes. Applied cognitive psychology. 1994; 8(5):513–524. https://doi.org/10.1002/acp.2350080508
  5. 5. Fang H, Xin S, Zhang Y, Wang Z, Zhu J. Assessing the Influence of Landmarks and Paths on the Navigational Efficiency and the Cognitive Load of Indoor Maps. ISPRS International Journal of Geo-Information. 2020; 9(2):82. https://doi.org/10.3390/ijgi9020082
  6. 6. Lee PU, Barbara T. Interplay between visual and spatial: The effect of landmark descriptions on comprehension of route/survey spatial descriptions. Spatial cognition & computation. 2005; 5(2–3):163–185. https://doi.org/10.1207/s15427633scc052&3_4
  7. 7. Gillner S, Weiß AM, Mallot HA. Visual homing in the absence of feature-based landmark information. Cognition. 2008; 109:105–122. https://doi.org/10.1016/j.cognition.2008.07.018 pmid:18804202
  8. 8. Stankiewicz BJ, Kalia AA. Acquisition of structural versus object landmark knowledge. Journal of Experimental Psychology: Human Perception and Performance. 2007; 33:378–390. https://doi.org/10.1037/0096-1523.33.2.378 pmid:17469974
  9. 9. Steck SD, Mallot HA. The role of global and local landmarks in virtual environment navigation. Presence: Tele-operators and Virtual Environments. 2000; 9:69–83. https://doi.org/10.1162/105474600566628
  10. 10. Waller D, Yvonne L. Landmarks as beacons and associative cues: their role in route learning. Memory & Cognition. 2007; 35(5):910–924. https://doi.org/10.3758/bf03193465 pmid:17910176
  11. 11. Ruddle RA, Volkova E, Mohler B, Bülthoff HH. The effect of landmark and body-based sensory information on route knowledge. Memory & cognition. 2011; 39(4):686–699. https://doi.org/10.3758/s13421-010-0054-z pmid:21264583
  12. 12. Denis M. The description of routes: A cognitive approach to the production of spatial discourse. Current psychology of cognition. 1997; 16: 409–458.
  13. 13. Raubal M, Winter S. Enriching wayfinding instructions with local landmarks. International conference on geographic information science, Springer, Berlin, Heidelberg, 2002. https://doi.org/10.1007/3-540-45799-2_17
  14. 14. Senders JW. The estimation of operator workload in complex systems. Systems psychology. 1970; 207–216.
  15. 15. Cafiso S, Cava Grazia La. Driving performance, alignment consistency, and road safety: real-world experiment. Transportation research record. 2009; 2102(1):1–8. https://doi.org/10.3141/2102-01
  16. 16. Sweller J. Instructional design. Australian educational review. 1999. https://doi.org/10.1007/978-3-319-19650-3_2438
  17. 17. Bonelli RM, Cummings JL. Frontal-subcortical circuitry and behavior. Dialogues in clinical neuroscience. 2007; 9(2):141. https://doi.org/10.31887/dcns.2007.9.2/rbonelli pmid:17726913
  18. 18. Ray Rebecca D, Zald David H. Anatomical insights into the interaction of emotion and cognition in the prefrontal cortex. Neuroscience & Biobehavioral Reviews. 2012; 36(1):479–501. https://doi.org/10.1016/j.neubiorev.2011.08.005 pmid:21889953
  19. 19. Abdurrahman UA, Yeh SC, Wong Y, Wei L. Effects of Neuro-Cognitive Load on Learning Transfer Using a Virtual Reality-Based Driving System. Big Data and Cognitive Computing. 2021; 5(4):54. https://doi.org/10.3390/bdcc5040054
  20. 20. Zhang L, Wade J, Bian D, Fan J, Swanson A, Weitlauf A, et al. Cognitive load measurement in a virtual reality-based driving system for autism intervention. IEEE transactions on affective computing. 2017; 8(2):176–189. https://doi.org/10.1109/taffc.2016.2582490 pmid:28966730
  21. 21. Parsons TD, Christopher GC. Interactions between threat and executive control in a virtual reality stroop task. IEEE Transactions on Affective Computing. 2016; 9(1):66–75. https://doi.org/10.1109/taffc.2016.2569086
  22. 22. Pomplun M, Sindhura S. Pupil dilation as an indicator of cognitive workload in human-computer interaction. Proceedings of the International Conference on HCI. 2003.
  23. 23. Jorna PG. Spectral analysis of heart rate and psychological state: A review of its validity as a workload index. Biological psychology. 1992; 34(2–3):237–257. https://doi.org/10.1016/0301-0511(92)90017-o pmid:1467395
  24. 24. Charlton SG, O’Brien TG. eds. Handbook of human factors testing and evaluation. CRC Press; 2019. https://doi.org/10.1201/9781003000815
  25. 25. Lenneman JK, Jonathon RS, Richard WB. Deciphering psychological-physiological mappings while driving and performing a secondary memory task. 2005. https://doi.org/10.17077/drivingassessment.1203
  26. 26. Richard WB, Kimberle AS. Metabolic and cardiorespiratory measures of mental effort: the effects of level of difficulty in a working memory task. International journal of psychophysiology. 1994; 16(1):57–68. https://doi.org/10.1016/0167-8760(94)90042-6 pmid:8206805
  27. 27. Brookhuis KA, De Waard D. Assessment of Drivers’ workload: Performance and Subjective and Physiological Indexes. Stress, workload and fatigue. 2001. https://doi.org/10.1201/b12791-2.5
  28. 28. Mehler B, Reimer B, Coughlin JF, Jeffery AD. Impact of incremental increases in cognitive workload on physiological arousal and performance in young adult drivers. Transportation Research Record. 2009; 2138(1):6–12. https://doi.org/10.3141/2138-02
  29. 29. Brookings JB, Glenn FW, Carolyne RS. Psychophysiological responses to changes in workload during simulated air traffic control. Biological psychology. 1996; 42(3):361–377. https://doi.org/10.1016/0301-0511(95)05167-8 pmid:8652753
  30. 30. Gevins A, Michael ES. Neurophysiological measures of working memory and individual differences in cognitive ability and cognitive style. Cerebral cortex. 2000; 10(9): 829–839. https://doi.org/10.1093/cercor/10.9.829 pmid:10982744
  31. 31. Gevins A, Smith ME, Leong H, McEvoy L, Whitfield S, Du R, et al. Monitoring working memory load during computer-based tasks with EEG pattern recognition methods. Human factors. 1998; 40(1):79–91. https://doi.org/10.1518/001872098779480578 pmid:9579105
  32. 32. Smelser Neil J, Baltes Paul B. International encyclopedia of the social & behavioral sciences, Amsterdam: Elsevier. 2001. https://doi.org/10.1016/b0-08-043076-7/04687-8
  33. 33. Perkins David N, Salomon Gavriel. Transfer of learning, Oxford, England: International encyclopedia of education. 1992; 2:6452–6457.
  34. 34. Martel Michelle M. The Clinical Guide to Assessment and Treatment of Childhood Learning and Attention Problems. Cambridge: Academic Press; 2020. https://doi.org/10.1016/c2017-0-03740-0
  35. 35. Herring JE. Year 12 students’ use of information literacy skills: A constructivist grounded analysis in Practising Information Literacy. Chandos Publishing; 2010;143–165. https://doi.org/10.1016/b978-1-876938-79-6.50007-8
  36. 36. Strickland D. Virtual reality for the treatment of autism. Stud. Health Technol. Inform. 1997; 44:81–86. pmid:10184809
  37. 37. Armougum A, Orriols E, Gaston-Bellegarde A, Joie-La Marle C, Piolino P. Virtual reality: A new method to investigate cognitive load during navigation. Journal of Environmental Psychology. 2019; 65:101338. https://doi.org/10.1016/j.jenvp.2019.101338
  38. 38. VIVE. [Cited 23 February 2020]. In: VIVE [Internet]. Available from: https://www.vive.com/eu/product/vive-pro-eye/.
  39. 39. Autodesk. [Cited 21 December 2020]. In: Autodesk [Internet]. Available from: https://www.autodesk.com/.
  40. 40. A. CityEngine. [Cited 21 December 2020]. In: ArcGIS CityEngine [Internet]. Available from: https://www.esri.com/en-us/arcgis/products/arcgis-cityengine/.
  41. 41. Unity3D. [Accessed 20 December 2020]. In: Unity3D [Internet]. Available from: www.unity3d.com.
  42. 42. Jansen-Osmann P, Fuchs P. Wayfinding behavior and spatial knowledge of adults and children in a virtual environment: The role of landmarks. Experimental Psychology. 2006; 53(3):171–181. https://doi.org/10.1027/1618-3169.53.3.171 pmid:16955726
  43. 43. Ruddle RA. The effect of trails on first-time and subsequent navigation in a virtual environment. EEE Proceedings Virtual Reality, Bonn, Germany. 2005. https://doi.org/10.1109/vr.2005.1492761
  44. 44. Ishikawa T, Montello Daniel R. Spatial knowledge acquisition from direct experience in the environment: Individual differences in the development of metric knowledge and the integration of separately learned places. Cognitive psychology. 2006; 52(2):93–129. https://doi.org/10.1016/j.cogpsych.2005.08.003 pmid:16375882
  45. 45. Querino E, Dos Santos L, Ginani G, Nicolau E, Miranda D, Romano-Silva M, et al. Cognitive effort and pupil dilation in controlled and automatic processes. Translational Neuroscience. 2015; 6(1):168–173. https://doi.org/10.1515/tnsci-2015-0017 pmid:28123801
  46. 46. van der Wel P, Steenbergen HV. Pupil dilation as an index of effort in cognitive control tasks: A review. Psychonomic bulletin & review. 2018; 25(6):2005–2015. https://doi.org/10.3758/s13423-018-1432-y pmid:29435963
  47. 47. Kahneman D, Jackson B. Pupil diameter and load on memory. Science. 1966; 154(3756):1583–1585. https://doi.org/10.1126/science.154.3756.1583 pmid:5924930
  48. 48. David SG, Oladi B, Mee-Yeong P, Yehonatan S. LF power of heart rate variability is not a measure of cardiac sympathetic tone but may be a measure of modulation of cardiac autonomic outflows by baroreflexes. Experimental physiology. 2011; 96:1255–1261. https://doi.org/10.1113/expphysiol.2010.056259 pmid:21890520
  49. 49. Solhjoo S, Haigney MC, McBee E, van Merrienboer JJ, Schuwirth L, Artino AR, et al. Heart rate and heart rate variability correlate with clinical reasoning performance and self-reported measures of cognitive load. Scientific reports. 2019; 9(1):1–9. https://doi.org/10.1038/s41598-019-50280-3 pmid:30626917
  50. 50. Verwey WB, Hans AV. Detecting short periods of elevated workload: A comparison of nine workload assessment techniques. Journal of experimental psychology: Applied. 1996; 2(3):270. https://doi.org/10.1037/1076-898x.2.3.270
  51. 51. Lyu N, Xie L, Wu C, Fu Q, Deng C. Driver’s cognitive workload and driving performance under traffic sign information exposure in complex environments: A case study of the highways in China. International journal of environmental research and public health. 2017; 14(2):203. https://doi.org/10.3390/ijerph14020203 pmid:28218696
  52. 52. Geoffrey U, Peter C, Neil B, Jean U, David C. Visual attention while driving: sequences of eye fixations made by experienced and novice drivers. Ergonomics. 2003; 46(6):629–646. https://doi.org/10.1080/0014013031000090116 pmid:12745692
  53. 53. Alzubi OA, Alzubi JA, Al-Zoubi AM, Hassonah MA, Kose U. An efficient malware detection approach with feature weighting based on Harris Hawks optimization. Cluster Computing. 2021; 8:1–9. https://doi.org/10.1007/s10586-021-03459-1
  54. 54. Movassagh AA, Alzubi JA, Gheisari M, Rahimi M, Mohan S, Abbasi AA, et al. Artificial neural networks training algorithm integrating invasive weed optimization with differential evolutionary model. Journal of Ambient Intelligence and Humanized Computing. 2021; 27:1–9. https://doi.org/10.1007/s12652-020-02623-6
  55. 55. Novak D, Mihelj M, Ziherl J, Olensek A, Munih M. Psychophysiological measurements in a biocooperative feedback loop for upper extremity rehabilitation. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2011; 19(4):400–410. https://doi.org/10.1109/tnsre.2011.2160357 pmid:21708507
  56. 56. Son J, Oh H, Park M. Identification of driver cognitive workload using support vector machines with driving performance, physiology and eye movement in a driving simulator. International Journal of Precision Engineering and Manufacturing. 2013; 14(8):1321–1327. https://doi.org/10.1007/s12541-013-0179-7
  57. 57. Putze F, Jarvis JP, Schultz T. Multimodal recognition of cognitive workload for multitasking in the car. 2010 20th International Conference on Pattern Recognition, Istanbul, Turkey. 2010. https://doi.org/10.1109/icpr.2010.913
  58. 58. Alzubi OA, Alzubi JA, Tedmori S, Rashaideh H, Almomani O. Consensus-based combining method for classifier ensembles. Int. Arab J. Inf. Technol. 2018; 15(1):76–86.
  59. 59. Hussain S, Chen S, Calvo RA, Chen F. Classification of cognitive load from task performance & multichannel physiology during affective changes. Conference on Multimodal Interaction. 2011.