Figures
Abstract
To meet the demand for efficient and safe underground material transportation in the intelligent construction of coal mines, addressing the unstructured environmental characteristics of mine roadways, this study proposes an improved path planning algorithm based on the A* algorithm. It aims to achieve the dual requirements of driving efficiency and operational safety for robots in complex environments. The algorithm adopts a collaborative architecture combining global and local path planning: at the global level, it enhances search efficiency by introducing a bidirectional adaptive search strategy and incorporates terrain risk weights into the cost function, enabling the planned path to effectively avoid high-risk areas and achieve global path optimization; at the local level, it integrates the DWA algorithm to strengthen the robot’s real-time obstacle avoidance capability and ensure operational safety. To validate the algorithm’s effectiveness, path planning experiments were conducted in a simulated environment. The results demonstrate that the proposed algorithm effectively avoids various obstacles, significantly shortens path length and search time, providing a viable solution for path planning and navigation of tracked transport robots in complex roadways.
Citation: Yu T, Wang J, Zhang D, Xiao Y, Zheng C, Kang D, et al. (2026) Research on path planning algorithms for Crawler transport robots in complex tunnels. PLoS One 21(5): e0342122. https://doi.org/10.1371/journal.pone.0342122
Editor: Jiapeng Liu, Qingdao University, CHINA
Received: January 19, 2026; Accepted: March 24, 2026; Published: May 4, 2026
Copyright: © 2026 Yu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Yes - all data are fully available without restriction; All relevant data are within the paper and its Supporting information files.
Funding: The authors would like to thank ‘Natural Science Foundation of Shandong Province (CN) (ZR2023QA035), Technology Innovation Guidance of Shandong Province Plan (CN) (YDZX2024119), and Major Innovation Projects in Shandong Province (CN) (2025CXGC010214)’ for the financial support. In addition, some authors are affiliated with Yankuang Energy Group Co., Ltd, a commercial company. The funder (Yankuang Energy Group Co., Ltd) provided support in the form of salaries for authors TZ Yu, JH Wang, DX Zhang, YM Xiao, CG Zheng, D Kang, Y Jiang, MC Du and K Zhang, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: Some authors (TZ Yu, JH Wang, DX Zhang, YM Xiao, CG Zheng, D Kang, Y Jiang, MC Du and K Zhang) are employed by Yankuang Energy Group Co., Ltd. The remaining authors (X Tong) have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. No authors have any consultancy relationships, patent applications, products in development or marketed products related to the research content of this paper. This does not alter our adherence to PLOS ONE policies on sharing data and materials. All data and materials used in this study are available from the corresponding author on reasonable request, in line with the journal’s sharing requirements.
1. Introduction
With the advancement of science and technology and the continuous progress of intelligent coal mine construction, the manual handling-dominated material transfer method for anchorage materials in the complex working environment of the excavation face clearly fails to meet the requirements of safe, efficient, and intelligent mining. Therefore, exploring the use of equipment-assisted or automated manual handling, and developing a transfer robot suitable for the complex working conditions of the excavation face can effectively improve underground workers’ operational modes, reduce labor intensity, enhance material loading efficiency, and become a key objective in intelligent coal mine construction [1]. However, traditional path planning algorithms face limitations in real-time dynamic obstacle avoidance, environmental adaptability, and global path optimization. Moreover, existing research primarily focuses on global path optimization in static environments, and the collaborative processing mechanism for kinematic constraints under dynamic disturbances and complex terrains remains imperfect. Therefore, achieving safe, efficient, and stable autonomous path planning in an environment where dynamic disturbances and terrain uncertainties coexist has become a crucial technical issue that needs to be addressed for the engineering application of tracked transfer robots.
In the field of automatic navigation, path planning, as the core technology is usually divided into global path planning and local path planning. Global path planning is to plan a global optimal or near optimal path from the starting point to the target point according to the given starting point and target point under the known global map information of the environment. In the global path planning algorithm, the traditional A* algorithm was proposed by Hart et al. [2] in the 1970s, and the heuristic A* algorithm proposed by Jeauneau V et al. [3] calculates the moving cost through the evaluation function and gradually extends to the target point. Wangzhiqiang et al. [4] improved the traditional A* algorithm from two aspects of search neighborhood and search direction, adaptively selected 4-neighborhood or 8-neighborhood search strategy, and developed an adaptive search direction A* algorithm on this basis. Gujianping et al. [5] designed an optimal path search model based on the optimized genetic algorithm. Nam [6] when calculating the cost function value of A* algorithm, the factor of turning angle cost is included. In this way, the planned path will more meet the actual driving demand of heavy transport vehicles with high turning cost. Akshay [7] changed the calculation of the heuristic value of A* algorithm from the initial stage to the collision stage, and the simulation shows that this method can significantly reduce the time required for planning the path. When Liu [8] designed the heuristic function of A* algorithm, he established the risk model of berths and maneuverability restrictions of ships to improve the navigation safety of the path. Sandip et al. [9] proposed the MHA* algorithm with multiple heuristic functions, which is suitable for the situation that a single heuristic function cannot describe the complexity of the problem. The algorithm calls all heuristic functions at the same time to find the optimal solution of the problem. Xie G et al. [10] proposed a new strategy combining Bessel curve and genetic algorithm, adding the limit of path and the risk value of path in the fitness function, and improving the selection operator and mutation operator. This new strategy can provide an effective path planning method to avoid risk points.
Local path planning, also known as real-time path planning or online path planning, is mainly used for path planning in the current local working environment of robots or vehicles in the process of movement. Ataollahi M [11] proposed a repulsion mechanism based on the force field model to avoid obstacles. Sheng z [12] and others combined and improved the BI-RRT* bidirectional exploration method and APF-RRT* expansion guidance, and proposed a bi-directional fast exploration random tree algorithm based on adaptive mechanism and artificial potential field (AB-APF-RRT*), which improved the RRT* sampling and expansion method. Das et al. [13] proposed a new algorithm for multiple robots based on the artificial potential field method. The algorithm can navigate from the source to the target position in a coordinated and formed way at the best path and time, so as to effectively avoid static and dynamic obstacles. Dynamic window approach (DWA) [14] as a commonly used local path planning algorithm, has good performance in dynamic obstacle avoidance. Zhang Rui et al. [15] proposed the RRT*-DWA collaborative algorithm, which uses the optimal random tree path as the tracking benchmark of the dynamic window method. Although it improves the dynamic environment adaptability, there is still the problem of obstacle avoidance response lag in high-density obstacle scenes. Therefore, LiuA et al. [16] proposed a trajectory similarity evaluation function based on the dynamic time warping method to provide better guidance; At the same time, aiming at the problem that DWA has poor adaptability to complex environment due to the fixed weight of evaluation function, a fuzzy controller is designed to enhance the adaptability of DWA algorithm in complex environment. Gong x et al. [17] improved the cost function of DWA algorithm to improve the efficiency of obstacle avoidance because of the problem that DWA algorithm cannot balance security and speed due to fixed parameters in complex environments.
In addition, path planning research has evolved from a focus on single algorithms to the integration of multiple strategies, and from static environmental assumptions to dynamic adaptability. In terms of dynamic adaptability, the research emphasis has shifted from planners with fixed parameters to models that can self-adjust based on environmental states. He Yong et al. [18] adjusted the weights of the objective function according to the distance and density of obstacles relative to the robot. Li Xinying et al. [19] employed the Multi-Objective Particle Swarm Optimization (MOPSO) algorithm to transform the adaptive change of DWA weight coefficients into a multi-objective optimization problem for solution. In terms of interference suppression and robustness enhancement, Zhang M et al. [20] introduced the Discrete-Time Control Barrier Function (DCBF) constraint to enhance the safety of obstacle avoidance for dynamic obstacles. At the algorithm framework level, there have emerged asymptotic optimal replanning algorithms specifically designed for dynamic environments, such as FMTX [21], which achieves efficient interference response by updating the incremental graph mechanism, repairing only the local path affected by environmental changes.
To sum up, considering the characteristics of narrow structure, dense arrangement of equipment, random distribution of temporary obstacles and dynamic changes in geological conditions of coal mine roadway props, the traditional path planning algorithm has obvious limitations in real-time obstacle avoidance ability, dynamic environment adaptability and global optimization, and the existing research mostly focuses on the global path optimization in the static environment, while the collaborative processing mechanism for the unique dynamic disturbance factors of the heading face, such as mobile equipment or personnel inrush, as well as the minimum turning radius and other constraints, has not been improved. To solve the above problems, this paper proposes a bilevel path planning framework combining the improved A* algorithm and the dynamic window method (DWA): firstly, the A* algorithm based on the improved cost function realizes the global optimal path search, secondly, the dynamic window method (DWA) is used for local dynamic obstacle avoidance, and the multi-objective constraint optimization model is established by combining the real-time environmental perception data of inertial navigation and lidar. To verify the engineering applicability of the algorithm, the simulation model of coal mine roadway was built in the gazebo simulation platform in ros1, and the navigation experiment was completed in the simulation environment [22].
2. Research on an improved path planning algorithm based on the A* algorithm
2.1. Bidirectional alternating search strategy
Addressing the computational efficiency bottleneck issue caused by the unidirectional traversal strategy in the traditional A* algorithm, this paper introduces a bidirectional adaptive search strategy [23] for optimization. This approach establishes a bidirectional alternating expansion path exploration mechanism between the starting point and the end point, significantly improving the algorithm’s search efficiency and environmental adaptability while reducing the frequency of redundant node traversal. The specific process of this strategy is as follows: First, allocate and clear the forward search queue (F_OPEN) and the backward search queue (B_OPEN), and reset the node closure set (CLOSE) and the node traceability matrix (PARENT) [24], thereby establishing a basic framework for bidirectional collaborative search. Subsequently, select the node with the minimum evaluation function value from the forward and backward OPEN lists as the current expansion node, denoted as NF and NB respectively, and move them into their corresponding CLOSE lists. During the forward and backward search operations, in the forward search phase, the target is the backward current node NB, and the adjacent nodes of the forward current node NF are expanded, calculating the cost value of each adjacent node. If the node is an obstacle or already in the closure list, it is ignored. The parent node is updated to NF, and the forward open list is refreshed. In the backward search phase, the target is the forward current node NF, and the same process is executed correspondingly, filtering and updating the adjacent nodes of NB. After each expansion, update the new node that meets the conditions as the current node in the corresponding direction. Check whether the forward current node NF and the backward current node NB coincide. If they coincide, it indicates successful convergence of the bidirectional search. Then, through the bidirectional PARENT list, trace back from the convergence point to the starting point and the end point respectively, concatenate to obtain a complete path and output it. If they do not coincide, continue the search. If both the forward and backward OPEN queues are not empty during the search, continue the subsequent search; if any queue is empty, it indicates that the bidirectional search has traversed all possible nodes but failed to converge, and the search is determined to have failed and the program is terminated. Based on the above process, it can effectively avoid the back-and-forth oscillation phenomenon that occurs in unidirectional search under environmental obstacles, significantly reducing the number of expansion nodes. Its specific performance is shown in Fig 1.
In this study, the Euclidean distance function was used as the heuristic function, and the forward heuristic function hF(NF) and the backward heuristic function hB(NB) were defined, where and
are the coordinate values of the forward current search node NF and the backward current search node NB, respectively. The specific expressions are shown in Eqs. (1) and (2):
To verify the efficiency of the bidirectional search strategy, simulation experiments were conducted based on MATLAB 2024b. These experiments tested the performance of the Bi-A* algorithm [25] and the traditional A* algorithm in generating collision-free paths under different workspaces, as shown in Fig 2. This figure includes six sub-figures. The upper row (a)(b)(c) depicts the search simulations of the traditional A* algorithm in three scenarios, while the lower row (d)(e)(f) illustrates the search simulations of the Bi-A* algorithm in corresponding scenarios. In the figure, dark green squares represent the “starting point”, red squares represent the “target point”, light blue squares indicate “searched nodes”, and gray squares represent obstacles in the scene. As can be seen from the figure, the upper half shows the simulation process of the traditional A* algorithm searching for the target point in scenarios involving straight lines, bifurcations, and corners. The lower half presents the search situation of the Bi-A* algorithm in the same scenarios. In the straight-line scenario, the traditional A* algorithm performs a unidirectional search from the starting point to the target point, with the light blue searched nodes covering a longer path and a wider search range. The Bi-A* algorithm starts searching from both the starting point and the target point simultaneously, resulting in a more concentrated coverage of searched nodes, a simpler search path, and a significant reduction in the number of nodes. In the bifurcation scenario, unidirectional search requires traversing more branch nodes, with the light blue area covering a large range and resulting in lower search efficiency. Bidirectional search can reduce the exploration of invalid branches, concentrating the searched nodes on effective paths, significantly reducing the number of search nodes, and improving efficiency. When facing corner terrain, unidirectional search requires gradual path exploration, with the light blue node coverage area being scattered, leading to a redundant search process. Bidirectional search converges faster at corners, concentrating the searched nodes on key paths, resulting in higher search efficiency, further verifying the optimization effect of the bidirectional search mechanism on path search in complex terrain.
The simulation results of traditional A* algorithm and Bi-A* algorithm are shown in Table 1. According to the table, the search time of Bi-A* algorithm is reduced by 10.5–33.3%, and the number of search nodes is reduced by 13.5–37.3%; At the same time, through the analysis of the experimental data, it can be seen that with the expansion of the environment scale, the calculation time of Bi-A* decreases more significantly than that of the traditional A* algorithm.
2.2. Improved cost function
Compared to two-dimensional path planning, when conducting actual path planning, the actual cost must not only consider the X, Y directions but also incorporate terrain risks to ensure that the planning results are more aligned with real-world application scenarios. Therefore, the design of weights is crucial, requiring a balance between reasonably penalizing terrain features and adhering to the motion characteristics of tracked robots.
The actual terrain slope and the basic movement cost (distance or energy consumption) exhibit significant differences in dimension and numerical range, necessitating normalization to eliminate the impact of dimension. Although tracked robots possess a certain degree of climbing capability, it significantly increases energy consumption. Let the slope angle between node n and its parent node be , with a range set between 0°and 30°. The formula for the slope angle is presented in Eq. (3):
In Eq. (3), z(n) and z(nparent) represent the height values of the current node n and its parent node p, respectively. d(nparent, n) denotes the horizontal movement distance (Euclidean distance) between the current node n and its parent node p. The calculation formula is presented in Eq. (4):
In Eq. (4), (x(n), y(n)) and (x(nparent), y(nparent)) represent the coordinate values of the current node n and its parent node p, respectively. To eliminate the issue of dimensional differences, a slope normalization function is designed. The actual slope angle θ(n) is divided by the maximum safe slope angle θmax(n) and normalized to a proportional coefficient within the interval [0, 1]. This coefficient can be directly multiplied by the basic cost (such as distance). Its nonlinear growth characteristic makes the algorithm more sensitive to steep slopes. The specific calculation formula is shown in Eq. (5):
To quantify the energy consumption differences of tracked robots on different slopes, a segmented nonlinear penalty function is adopted to assess the slope risk. This function establishes a medium slope threshold θmiddle: if the current slope angle θ exceeds θmiddle, a high penalty coefficient Phigh is applied, causing the penalty cost to increase rapidly; if the current slope angle θ is less than θmiddle, a low penalty coefficient Plow is used; if θ exceeds the maximum safe slope angle, it is considered an impassable area. The specific calculation formula for the penalty function is shown in Eq. (6):
Considering computational efficiency, and to avoid introducing too many parameters that could affect path planning efficiency, the cost function design relies solely on the slope angle to simplify the algorithm and enhance its practicality. The improved cost evaluation function is presented in Eq. (7):
The cost function gi(n) incorporating slope weights, as defined by Eq. (7), is incorporated into the A* algorithm, forming an improved version of the A* algorithm. This algorithm effectively bypasses high terrain risk areas during the search process and seeks the globally optimal path, thereby enhancing overall search efficiency. The improved forward and backward search cost functions are presented in Eqs. (8) and (9), respectively:
2.3. DWA fusion strategy
Based on the traditional A* algorithm, the algorithm in this study introduces a piecewise slope penalty function into the cost function. By calculating the slope angle and performing normalization processing, it achieves the quantification and avoidance of terrain risks. The traditional DWA algorithm samples in the velocity space (v, ω) to predict multiple trajectories for a future period and selects the optimal trajectory based on the evaluation function. Therefore, to integrate the terrain risk into the DWA algorithm, the evaluation function of DWA is extended by adding a terrain slope cost term. The specific formula is shown in Eq. (10):
In the above formula, Pdist represents the degree of conformity between the local path and the global path, Gdist denotes the closeness between the path point and the target point, Dobs signifies the distance between the current target point and the obstacle, and terrain stands for the terrain slope cost value. α, β, γ, and δ are the weight coefficients for each cost, which can be determined by combining static presetting with dynamic adaptation according to the environmental state. In static or structured scenarios, fixed weights calibrated empirically can be adopted; whereas in complex environments such as mine tunnels, to adapt to sudden terrain changes and dynamic appearance of obstacles, the weights can be fine-tuned during the planning process based on real-time environmental information. For instance, when the robot deviates significantly from the global path, α can be increased to enhance path conformity; as the robot approaches the target location, β can be increased to strengthen target convergence; in areas with dense obstacles, γ can be increased to prioritize obstacle avoidance constraints; and when the terrain slope is relatively rugged, δ can be appropriately increased to make the evaluation function pay more attention to terrain risks, thereby ensuring driving safety.
The specific fusion strategy adopts a hierarchical architecture, where global planning and local planning work in concert: First, the improved A* algorithm is run under the global map to generate the global path Pglobal; subsequently, the local target point is updated in real-time based on the robot’s position. If a dynamic obstacle occurs, making the global path infeasible, re-planning is triggered, and local terrain elevation data is obtained through the global map; under the DWA framework, candidate trajectories are generated by sampling velocity space, and a comprehensive evaluation index is calculated by combining terrain slope cost, and the optimal trajectory is selected for execution; the above process is iterated continuously until the robot reaches the target point.
However, in actual underground environments, robots often exhibit significant drift and sideslip effects during turning maneuvers. These non-ideal motion characteristics may directly affect the trajectory prediction accuracy and driving safety of DWA local planning. Therefore, future research could consider incorporating dynamic parameters such as ground adhesion coefficient and slip angle estimation into the trajectory prediction model, or enhancing trajectory prediction accuracy and control robustness in complex terrains through online state correction.
3. Simulation verification of path planning using improved A* algorithm
3.1. Performance comparison experiment of improved A* algorithm
To further verify the superiority of the algorithm in this study, comparative simulation experiments were conducted on the Bi-A* algorithm and the algorithm in this study under two-dimensional grid maps of three sizes: 10 × 10, 15 × 15, and 20 × 20. The performance of the algorithms was comprehensively evaluated based on four indicators: the number of expanded nodes, path length, number of path nodes, and search time, across maps of different sizes. The planning results are shown in Fig 3.
The data comparison in Table 2 and visualization results indicate that the Bi-A* algorithm, which employs a bidirectional search strategy, can successfully plan a collision-free path. However, the generated path exhibits issues such as numerous redundant nodes, dense turning points, and weak terrain adaptability. Additionally, the total length of the path significantly increases compared to the proposed algorithm. Conversely, the algorithm in this study plans a smoother path with a significantly reduced number of turning points under the same grid scale. At the same time, it can effectively avoid high-cost grid areas, fully demonstrating superior terrain adaptability and path planning quality.
To verify the effectiveness and applicability of the algorithm in this study under sloping scenarios, a comparative experiment for 3D path planning was also designed. An unstructured 3D environment featuring slopes and undulating terrain was constructed in MATLAB 2024b, and the algorithm in this study, Bi-A* algorithm, and traditional A* algorithm was imported. The performance of each algorithm was comprehensively evaluated based on four indicators: the number of expanded nodes, path length, number of path nodes, and search time, across maps of different sizes. The planning results are shown in Fig 4.
The results indicate that the algorithm in this study, while introducing a bidirectional alternating search strategy, also incorporates a terrain risk evaluation mechanism into the cost function. This enables the robot to flexibly adjust its path based on the distribution of cost regions, effectively avoiding high-risk areas while satisfying the maximum safe slope angle constraint, thereby ensuring safe travel. In contrast, the Bi-A* algorithm, although improving search efficiency and reducing the number of expanded nodes due to the bidirectional strategy, does not consider terrain risks and misjudges low-slope areas as obstacles, leading to increased path length and planning time. The traditional A* algorithm, which employs unidirectional search, has a wider search range and covers more nodes, resulting in lower efficiency and longer time consumption in the same scenario.
As can be seen from Table 3, the algorithm in this study reduces search time by 58.4–62.3% compared to the traditional A* algorithm and by approximately 25.5–27.2% compared to the Bi-A* algorithm. In terms of the number of expanded nodes, it reduces it by 61.5–66.8% compared to the traditional A* algorithm and by approximately 35.4–37.4% compared to the Bi-A* algorithm. In summary, based on the experimental comparison data, it is evident that as the map size continues to expand, the algorithm demonstrates more significant advantages in search efficiency and path length.
3.2. Path planning experiment in simulation environment
Taking the scene of main haulage roadway and heading face under the mine as an example, the simulated roadway environment is built in Gazebo at a ratio of 1:5, as shown in Fig 5. Among them, the size characteristics of the roadway built by the simulation platform are as follows: the width of the roadway is 1m and the total length is 6.5m. To facilitate observation, the top features are removed; There are also three representative roadways in the roadway: straight line, fork road and corner. Each roadway is 1m wide and 3m long. Target points are set at the end of each roadway, namely fork road target point a, straight line target point B and corner target point C. In addition, considering that there are all kinds of coal machinery equipment in the roadway under real working conditions, static and dynamic obstacles are arranged in the model to test the path planning ability of the robot in complex environments. Taking the long box as an example, the static obstacles are 1m × 0.25m × 1m in size, and a total of 7 groups are set in S shape; Taking the square box as an example, the dynamic obstacles are 0.25m × 0.25m × 0.25m in size and move continuously in the simulation environment.
Now combined with the improved A* algorithm and cartographer mapping algorithm proposed in this paper, the path planning experiment of tracked transfer vehicle in simulated roadway is completed. The specific workflow of the simulation is as follows: firstly, the tracked transfer vehicle model is built in the gazebo simulation environment and conFigd as a simulation platform to support path planning; Secondly, a two-dimensional map of the environment is established in the simulated roadway, the roadway entrance is set as the original starting point of the path planning, and the 2D NAV goal tool in Rviz (Robot Visualization) is used to set the target destination. The improved A* algorithm is used for global path planning to generate the optimal path from the starting point to the destination; Then, in the process of path planning, DWA algorithm, as a local path planning algorithm, adjusts the generated path in real time according to the obstacles and dynamic changes in the environment, so as to ensure that the transfer vehicle can flexibly deal with obstacles and realize safe obstacle avoidance; Finally, the transfer vehicle successfully arrived at the target point according to the optimized path.
To effectively improve the interference with the planned expected route in the case of personnel emergence or equipment penetration, DWA algorithm can dynamically adjust the planned route in real time. Taking the simulation process as an example, the starting point of the transfer vehicle is set as the origin O, and the target end point is set as the target point C. three obstacles are set on the path respectively, and dynamic obstacles are added at the second obstacle, as shown in Fig 6. Starting from the origin, the transfer vehicle first plans a global path to the target point, then moves along the planned path and detects the information of surrounding obstacles in real time through the DWA algorithm. When it turns around two static obstacles continuously along the preset path to reach the turnout position, it detects and recognizes that the protruding obstacles appear on the initial planned route. Currently, the DWA algorithm is called to adjust the planned path in real time. The transfer vehicle monitors the position relationship between the obstacles in real time and calculates the distance Obs between the transfer vehicle and the obstacles. If Obs is greater than the collision distance doc, the corresponding speed under the simulated path will become the operation speed of the transfer vehicle at the next moment; If Obs is less than the collision distance doc, the transfer vehicle will perform a backward operation to avoid obstacles and readjust its attitude. After the dynamic obstacles pass through, it will continue to move according to the new planned path, and finally reach the target point C, as shown in Fig 7. It is proved that BAS-A* global + DWA local path planning algorithm can also meet the requirements of dynamic obstacle avoidance and new path planning under the condition of considering the penetration of obstacles.
4. Conclusion
To address the efficiency issue of path planning for tracked transfer robots in the complex environment of coal mines, this study proposes a path planning method suitable for underground complex tunnel environments, based on the A* algorithm and incorporating a bidirectional search strategy and a nonlinear slope penalty term. The following conclusions are drawn:
- (1). Compared to the traditional A* algorithm, the algorithm in this study introduces a bidirectional alternating search strategy to enhance the search efficiency of the algorithm. Additionally, a nonlinear slope penalty term is incorporated into the cost function, enabling the robot to effectively avoid high-risk areas without exceeding the safe slope angle, thereby significantly improving the environmental adaptability of the algorithm.
- (2). The results of the path planning experiment demonstrate that the algorithm proposed in this study performs well in terms of indicators such as the number of expanded nodes, path length, number of path nodes, and search time.
However, it is worth noting that in the real underground coal mine environment, perception uncertainty poses a severe challenge to the robustness of algorithms: for instance, point cloud noise and missing points caused by dust and water mist interference in lidar, as well as cumulative errors induced by slipping of odometers on rugged terrain. Specifically, perception errors may lead to distortion or delayed updating of the global map, which in turn may trigger erroneous re-planning of the global path. Meanwhile, misidentification or missed identification of local obstacles directly affects the evaluation and selection of optimal trajectories by the DWA algorithm, potentially resulting in response lag or decision bias in dynamic obstacle avoidance. Therefore, future research needs to focus on enhancing the algorithm’s resistance to perception interference: by fusing information from multiple sensors to improve the reliability of state estimation, or introducing confidence-based environmental representation and planning methods, the algorithm can quantify and handle perception uncertainty, thereby achieving safer and more stable autonomous navigation in complex underground operation scenarios. In summary, the algorithm proposed in this study provides technical support for autonomous material transportation in underground unstructured environments. Future experiments can be conducted in real mine scenarios to optimize the robustness and performance of the algorithm under extremely harsh working conditions.
References
- 1. Wang G, Pang Y, Ren H. Research and development path of smart mine technology system. Metal Mine. 2022;(5):1–9.
- 2. Hart P, Nilsson N, Raphael B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans Syst Sci Cyber. 1968;4(2):100–7.
- 3. Jeauneau V, Jouanneau L, Kotenkoff A. Path planner methods for UAVs in real environment. IFAC-PapersOnLine. 2018;51(22):292–7.
- 4. Wang Z, Hu Z, Yuan K. Path planning for intelligent vehicle based on scene modeling from LiDAR. Journal of Transport Information and Safety. 2020;05:78–87.
- 5. Gu J, Zhang M, Wang M. Improved genetic algorithm-based network game path selection and simulation. Journal of System Simulation. 2016;28(8):1805–11.
- 6. Kang NK, Son HJ, Lee S-H. Modified A-star algorithm for modular plant land transportation. J Mech Sci Technol. 2018;32(12):5563–71.
- 7. Kumar Guruji A, Agarwal H, Parsediya DK. Time-efficient A* Algorithm for Robot Path Planning. Procedia Technology. 2016;23:144–9.
- 8. Liu C, Mao Q, Chu X, Xie S. An Improved A-Star Algorithm Considering Water Current, Traffic Separation and Berthing for Vessel Path Planning. Applied Sciences. 2019;9(6):1057.
- 9. Aine S, Swaminathan S, Narayanan V, Hwang V, Likhachev M. Multi-Heuristic A*. The International Journal of Robotics Research. 2015;35(1–3):224–43.
- 10. Xie G, Fang L, Su X, et al. Research on Risk Avoidance Path Planning for Unmanned Vehicle Based on Genetic Algorithm and Bezier Curve. Drones. 2025:9(2): 126:1–27.
- 11. Ataollahi M, Farrokhi M. Online path planning of cooperative mobile robots in unknown environments using improved Q‐Learning and adaptive artificial potential field. The Journal of Engineering. 2023;2023(2).
- 12. Sheng Z, Song T, Song J, Liu Y, Ren P. Bidirectional rapidly exploring random tree path planning algorithm based on adaptive strategies and artificial potential fields. Engineering Applications of Artificial Intelligence. 2025;148:110393.
- 13. Das MS, Sanyal S, Mandal S. Navigation of Multiple Robots in Formative Manner in an Unknown Environment using Artificial Potential Field Based Path Planning Algorithm. Ain Shams Engineering Journal. 2022;13(5):101675.
- 14. Fox D, Burgard W, Thrun S. The dynamic window approach to collision avoidance. IEEE Robot Automat Mag. 1997;4(1):23–33.
- 15. Zhang R, Zhou L, Liu Z. Dynamic path planning for mobile robot based on RRT* and dynamic window approach. Journal of System Simulation. 2024;36(4):957–68.
- 16. Liu A, Liu C, Li L, Wang R, Lu Z. An improved fuzzy‐controlled local path planning algorithm based on dynamic window approach. Journal of Field Robotics. 2024;42(2):430–54.
- 17. Gong X, Gao Y, Wang F, Zhu D, Zhao W, Wang F, et al. A Local Path Planning Algorithm for Robots Based on Improved DWA. Electronics. 2024;13(15):2965.
- 18. He Y, Hou T, Zeng Z. UAV Path Planning Algorithm Based on Improved A* and Dynamic Window Approach. Mechanical Science and Technology for Aerospace Engineering. 2025;44(09):1668–76.
- 19. Li X, Shan L, Chang L. DWA path planning algorithm based on multi-objective particle swarm optimization in complex environment. Journal of National University of Defense Technology. 2022;44(04):52–9.
- 20. Zhang M, Ma L, Wu Y, Shen K, Sun Y, Leung H. High-Traversability and Precise Navigation for Mobile Robots in Constrained Environments. IEEE Sensors J. 2025;25(12):22815–26.
- 21.
Nia SE. FMTX: An Efficient and Asymptotically Optimal Extension of the Fast Marching Tree for Dynamic Replanning. arXiv preprint arXiv:2509.08521, 2025.
- 22. Wang H, Li C, LianG W. Path planning of wheeled coal mine rescue robot based on improved A* and potential field algorithm. Coal Science and Technology. 2024;52(08):159–70.
- 23. Yu L, Zhang Y, Huang L. Research on neighborhood improvement based on two-way A* path planning algorithm. Journal of Chinese Computer Systems. 2025;46(06):1312–8.
- 24. Guo J, Yu J, Feng C. Global path planning for robots on uneven terrain based on improved A*algorithm. Computer Engineering and Applications. 2025;61(05):309–22.
- 25. Yijun Z, Jiadong X, Chen L. A Fast Bi-Directional A* Algorithm Based on Quad-Tree Decomposition and Hierarchical Map. IEEE Access. 2021;9:102877–85.