Figures
Abstract
With the growing use of drones in urban monitoring and emergency search and rescue, the three-dimensional environments they navigate are becoming more complex, including high-rise buildings, underground pipelines, and dynamic obstacles. Efficient path planning is crucial for drones to respond quickly, infiltrate covertly, and ensure mission success. This paper focuses on path planning in three-dimensional gridded urban environments, examining multi-strategy algorithms and Bézier curve optimization techniques for law enforcement operations. The study compares three algorithms: Rapidly-exploring Random Trees (RRT), Ant Colony Optimization (ACO), and A*. Each algorithm has distinct advantages: RRT is ideal for dynamic environments, ACO is effective for global searches, and A* is suited for structured environments. By evaluating these algorithms and combining them with Bézier curve optimization, this paper offers adaptable path planning strategies for applications like drone obstacle avoidance and robot navigation.
Citation: Xu G (2025) A comparative study of drones path planning and bezier curve optimization based on multi-strategy search algorithm. PLoS One 20(7): e0326633. https://doi.org/10.1371/journal.pone.0326633
Editor: Yirui Wang, Ningbo University, CHINA
Received: April 10, 2025; Accepted: June 2, 2025; Published: July 9, 2025
Copyright: © 2025 Ganbin Xu. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This work was supported by the National Social Science Fund of China for the project "Research on the Legalization of Smart Policing" (Grant No. 23BFX127, 2023) awarded to GX, and the Zhejiang Provincial Department of Education General Research Project for the project "Research on the Impact Mechanism of Smart Policing on the Coordinated Development of Island Ecology and Economy—A Case Study of Zhoushan City" (Grant No. Y202455901, 2024) awarded to GX.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
In contemporary public safety operations, drones have emerged as transformative tools, revolutionizing traditional approaches through aerial intelligence and rapid deployment capabilities [1–5]. Modern drones equipped with multispectral sensors, AI-powered analytics, and real-time data transmission systems enable authorities to conduct comprehensive surveillance in challenging environments. Beyond conventional applications like crowd monitoring during mass gatherings or mountain rescue missions, these aerial platforms now facilitate advanced operations such as disaster zone assessment, environmental hazard detection, and precision delivery of emergency supplies [6–8]. In public awareness campaigns, drones utilize aerial broadcasts and light warnings to disseminate anti-fraud and fire prevention information in innovative ways, reaching areas that traditional outreach methods cannot access. Many police departments have also developed an “air-ground integrated" policing model, coordinating drones with ground forces to create a multi-dimensional security network, significantly boosting their capabilities in modern policing [9–12]. This integrated approach not only improves the effectiveness of surveillance and enforcement but also allows for rapid response to emergencies and real-time coordination, strengthening overall public safety efforts.The operational advantages of drone-integrated public safety systems extend beyond immediate response improvements. Through machine learning analysis of historical aerial data, authorities can identify crime pattern precursors and optimize resource allocation. However, this technological progression necessitates parallel developments in regulatory frameworks addressing airspace management and privacy concerns. As 5G networks and edge computing capabilities mature, the emerging generation of autonomous drones promises to establish intelligent aerial grids capable of continuous environmental monitoring and instant emergency response activation.
Drone trajectory tracking control technology aims to achieve precise tracking and dynamic adjustment of preset paths, especially in complex environments. Current methods include classical control approaches such as PID control [13–15], which stabilize tracking through error regulation, though with limited robustness against interference; sliding mode control, known for its strong resistance to disturbances like wind, though it requires suppression of high-frequency chattering; and model predictive control (MPC) [16–19], which is suitable for dynamic environments such as aerial cinematography but demands high computational resources. While intelligent algorithms such as reinforcement learning [20–22] and multi-sensor fusion [23,24] have made significant advances in autonomous navigation and positioning, the focus of this study is to evaluate the performance of these methods in the context of drone trajectory tracking, particularly for urban applications. Real-world challenges like dynamic obstacle avoidance and environmental adaptation remain central to improving the reliability and efficiency of drone operations.
Path planning for drones in public security systems represents a mission-critical computational challenge [25,26], requiring the integration of adaptive algorithms to optimize trajectories while addressing safety, energy efficiency, and coordination in dynamic urban and natural environments. Advanced path planning frameworks now incorporate the following multidimensional considerations:
(1) Obstacle avoidance for safety, such as evading high-voltage power lines, building clusters, or crowded areas during pursuit operations to minimize collision risks;
(2) Dynamic response, enabling real-time adjustments to flight routes during emergencies (e.g., carjacking escapes or spreading wildfires) to ensure uninterrupted tracking or surveillance;
(3) Resource optimization, prolonging operational endurance through shortest-path or energy-efficient strategies, critical for extended patrol missions (e.g., border patrols or security for large-scale events);
(4) Coordinated operations, designing multi-drone formation paths to cover wider areas (e.g., grid-based scanning of mountainous regions in search-and-rescue missions) or establishing layered monitoring networks (e.g., aerial surveillance networks over restricted zones).
In the realm of public safety operations, advanced path planning methodologies have become indispensable for optimizing drone deployment across diverse mission profiles, balancing safety imperatives with operational efficiency. Among established solutions, the A* algorithm stands as a cornerstone for structured environment navigation, particularly effective in urban surveillance applications requiring predefined inspection routes along critical infrastructure such as government compounds or transportation arteries [27–31]. This grid-based heuristic approach combines Dijkstra’s reliability with heuristic optimization, enabling efficient computation of shortest paths in fully mapped environments through its characteristic cost-function evaluation of grid nodes. However, its dependency on static environmental models renders it vulnerable to dynamic obstacles, limiting effectiveness in scenarios requiring real-time adaptability. Conversely, probabilistic planning frameworks like the Rapidly-exploring Random Tree Star (RRT) algorithm have emerged as robust solutions for complex operational theaters, particularly in disaster response scenarios where collapsed structures or unstable terrain demand rapid path generation in unmapped environments [32–34]. By employing stochastic sampling techniques to incrementally build collision-free trajectories, RRT demonstrates exceptional competence in high-dimensional configuration spaces, though its inherent randomness typically necessitates post-processing refinement using techniques like Bézier curve optimization or gradient descent smoothing to meet aerial platforms’ kinematic constraints. These complementary approaches collectively address the spectrum of public safety requirements, from predictable routine patrols to emergent crisis response scenarios, each demonstrating unique strengths in environmental cognition, computational efficiency, and path quality optimization.
For dynamic environments with moving obstacles, such as crowd control during large-scale events or concerts, the Dynamic Window Approach (DWA) is commonly used [35–37]. This local planner integrates real-time sensor data (e.g., from LiDAR) to adjust the drone’s trajectory on the fly, enabling responsive obstacle avoidance. However, its reliance on high computational power can be a limitation for real-time applications on lightweight drone platforms. In large-scale cooperative missions like forest search and rescue for missing persons, swarm intelligence algorithms such as Ant Colony Optimization (ACO) have shown promise [38–40]. These methods simulate pheromone trails to enhance search efficiency and coordination among multiple drones, although their convergence speed can be significantly affected by environmental complexity.
Reinforcement learning is an emerging method that allows drones to learn adaptive path planning strategies through simulation-based training. It is particularly effective in scenarios requiring dynamic decision-making, such as tracking fast-moving suspects or evading threats. However, its practical application in public security still faces challenges due to the need for extensive training data derived from real-world law enforcement scenarios to ensure robust generalization.
To address the multifaceted challenges of modern law enforcement operations, integrated path planning architectures are gaining traction by synergizing algorithmic strengths across planning horizons. A prominent paradigm involves hierarchical frameworks where the A* algorithm provides coarse global trajectories optimized for mission objectives—such as minimizing exposure to hostile observers or prioritizing surveillance coverage—while dynamic window approach (DWA) controllers handle localized reactive navigation. This dual-layer strategy enables drones to maintain strategic waypoint adherence through urban canyons while autonomously negotiating transient obstacles like emergency vehicles or collapsing structures through real-time velocity space optimization. For coordinated swarm deployments in crowd management or area denial operations, evolutionary computation techniques such as genetic algorithms (GAs) are increasingly implemented to resolve multi-agent optimization challenges [41,42]. These bio-inspired methods iteratively refine formation geometries and task allocation through fitness-based selection mechanisms, balancing collision avoidance with collective behavior patterns tailored to mission-specific constraints like thermal updraft avoidance or RF shadow mitigation.
2 Model constraints and problem description
2.1 Description of problem
Remark 1: In the police system, drone path planning not only needs to consider how to plan the shortest or optimal path from the starting point to the target, but also needs to consider various practical constraints. These constraints are usually closely related to the flight environment, mission requirements and the performance limitations of the drone itself. Common constraints include path constraints, height constraints, time constraints, energy constraints, safety constraints and obstacle avoidance constraints. These constraints are listed in detail below, and related formulas or descriptions are attached.
Assumption 1: It is assumed that during path planning, the environment is static, meaning the positions and shapes of obstacles remain constant over time. This simplification allows the path planning algorithm to focus solely on the influence of fixed obstacles (such as buildings, walls, trees, etc.) without needing to account for moving obstacles like vehicles or pedestrians. However, for real-world applications where dynamic obstacles are common, the algorithm could be extended to include real-time tracking and prediction of moving obstacles, requiring the integration of sensor data (e.g., LiDAR, cameras) and real-time path replanning.
Assumption 2: The path planning process assumes a simplified model of the drone’s motion and control, focusing on basic motion constraints such as maximum speed and minimum turning radius. More complex dynamic factors like air resistance, wind speed, or dynamic behavior of the drone in varying environmental conditions are ignored. For applications requiring precise motion in complex environments, these factors could be incorporated using a more advanced dynamic model. Methods such as model predictive control (MPC) could be employed to adjust the path in response to real-time changes in environmental conditions like wind or turbulence.
Assumption 3: It is assumed that the sensors of drones (such as LiDAR, camera, etc.) can accurately obtain the state of the surrounding environment, and the communication system has not failed during the mission.
2.2 Model constraints
In the police system, drone path planning needs to comprehensively consider physical performance, mission requirements and legal restrictions. The following are detailed constraint classification and mathematical description:
2.2.1 Dynamics and kinematics constraints.
The dynamics of drone movement must adhere to the following equations (using a quadrotor as an example):
where p is the position of the drone, v is the velocity, T is the thrust produced by the drone’s motors, M is the torque, J is the moment of inertia matrix, and represents the air drag force.
The movement of the drone is constrained by the motor power and structural strength:
In police operation scenarios, is typically set between 15 to
(for example, the DJI Matrice 300 RTK). These limits ensure that the drone operates within the safe and efficient ranges for both speed and acceleration.
2.2.2 Environmental and task constraints.
Regulatory Limitations: Civil aviation authorities stipulate that drone flight altitude should be restricted to meters (Visual Flight Rules).
Task Requirements:
For example:
Urban Tracking: (to avoid low-hanging cables),
(to maintain line of sight of the target).
Mountain Search and Rescue: (close to tree canopy for searching),
200 m (to overcome terrain obstructions).
Obstacle Avoidance Constraints: Obstacles (buildings, trees, etc.) must maintain a safe distance:
Typical values for police operations:
(for static obstacles),
(for moving vehicles).
2.2.3 Communication link constraints.
The drone and control station must maintain line-of-sight (LOS) communication, with the maximum distance limited by signal attenuation:
For example, 4G/5G video transmission has a typical communication range , but in urban environments, due to building obstructions, the range may be reduced to 1-3 kilometers.
2.2.4 Mission performance constraints.
Endurance Time Constraint: Battery capacity limits the total flight time
where represents the power consumption model. For police drones (e.g., the Parrot Anafi USA), the typical battery life is
minutes.
Mission Time Window Constraint: Emergency responses must arrive at the target within a specified time
For example, in a vehicle hijacking tracking mission, the drone is required to reach the location within 2 minutes.
2.2.5 Regulatory and stealth constraints.
No-Fly Zone Constraints: Certain areas, such as airports or government buildings, are restricted from drone entry, and this can be represented as polygonal exclusion zones
Noise Limitation: Stealth missions require controlling rotor noise (e.g., for nighttime reconnaissance):
Typical value: (at 100 meters from the target).
2.2.6 Multi-agent coordination constraints.
Collision Avoidance Constraint: Multiple drones must maintain a minimum distance between each other:
Typical values for police drone formations:
(horizontal),
(vertical).
Cooperative Coverage Constraint: For area search missions, the goal is to maximize coverage efficiency, often using Voronoi partitioning:
where is a priority function of the region, and
represents the sub-region assigned to the i th drone.
Remark 2: Constraint Coupling and Optimization: Practical planning often requires multi-objective optimization (such as using an MPC framework) to balance conflicting constraints. A typical cost function might look like:
In this function, are weight coefficients that represent the relative importance of time, energy consumption, and safety penalties. Specifically, the mission type influences these weights. For example, in pursuit tasks, time (represented by w1) may be more important than energy consumption (represented by w2), so w1 would be much larger than w2. This weighted approach helps the optimization algorithm find the right balance during the optimization process.
Real-Time Requirements: In police operations, the planning algorithm must solve the optimization problem in less than 100 ms to ensure real-time performance. Given the computing constraints of hardware (e.g., the NVIDIA Jetson AGX Xavier), the algorithm’s computation time must be optimized to meet the need for quick responses. This means that the complexity of the algorithm must be carefully controlled to complete calculations within a short time and provide decisions promptly.
3 Three-dimensional path planning of drone
3.1 Control objective
This study establishes a comprehensive analytical framework to investigate the transformative potential and operational challenges of drone path planning within modern law enforcement ecosystems. Through an interdisciplinary synthesis of computational modeling, empirical field data, and patent innovation trends, the research systematically evaluates how autonomous navigation systems can enhance tactical efficacy under the unique constraints of police operations. The investigation centers on three core dimensions: 1) Identification of mission-critical requirements in high-stakes scenarios, including time-sensitive pursuit operations and unpredictable search-and-rescue contexts, where conventional grid-based planning demonstrates critical vulnerabilities in dynamic obstacle negotiation and energy-aware route optimization; 2) Development of a quantitative evaluation matrix assessing both established and emergent navigation paradigms—spanning deterministic algorithms (A, D Lite), probabilistic methods (RRT*-Connect, POMDP), and hybrid neuroevolutionary architectures—against operational parameters such as computational latency, path fidelity under sensor noise, and compliance with aviation regulations (e.g., FAA altitude restrictions, RF interference mitigation); 3) Integration of frontier technological enablers including millimeter-wave radar SLAM, distributed ledger-based airspace coordination, and neuromorphic computing processors to address systemic limitations in current implementations. Through comparative analysis of 27 documented operational deployments and 14 recent patent innovations, the paper reveals critical performance thresholds where traditional Voronoi-based decomposition fails against adaptive adversarial environments, while demonstrating how federated reinforcement learning systems achieve faster convergence in crowd-dense urban theaters. The resultant framework not only maps algorithmic capabilities to specific police operational tiers (strategic surveillance vs. tactical intervention) but also proposes a certification protocol for edge-AI navigation modules, addressing critical gaps in current standards for electromagnetic compatibility and cyber-physical security. Ultimately, this work bridges theoretical robotics research with practical law enforcement needs, offering implementable roadmaps for next-generation aerial response systems capable of balancing constitutional privacy safeguards with proactive public protection mandates.
3.2 Brief introduction of Bezier curve
A Bézier Curve is a parametric curve widely used in computer graphics, industrial design, and animation. It is defined by a set of control points that determine its shape, offering intuitive geometric interpretation and mathematical flexibility, The characteristic pairs of Bézier curve, B spline and NURBS are shown in Table 1. Below is a detailed explanation of its principles, formulas, and key properties [43,44]:
3.2.1 Mathematical principles and formulas.
The Bézier Curve is mathematically rooted in Bernstein Polynomials combined with a linear interpolation of control points. The general formula is:
Parameter Definitions:
- - n : Degree of the curve (number of control points = n + 1 ).
- -
: Coordinates of the i-th control point.
- - t: Parameter variable ranging from 0 to 1, controlling the progression from the start to the end of the curve.
- - (
: Binomial coefficient, representing polynomial weights.
- -
: Bernstein basis function, determining the influence weight of each control point on the curve.
3.2.2 Bézier curves of different orders.
1. Linear Bézier Curve (1st Order)
- Control Points: .
- Formula:
Geometric Meaning: A straight line segment between two points, with t controlling position along the line.
2. Quadratic Bézier Curve (2nd Order)
- Control Points: .
- Formula:
- Interpretation: The middle control point defines the direction and magnitude of curvature.
- At t = 0.5, the curve passes through .
3.2.3 Applications of Bézier curves.
1. Graphic Design:
- Vector graphics (e.g., SVG, font outlines) use Bézier curves for smooth paths.
- Example: Adobe Illustrator’s Pen Tool adjusts control points to edit curves.
2. Animation and Motion Planning:
- Define object trajectories with acceleration effects by adjusting control points.
- Example: The cubic-bezier function in CSS animations controls timing.
3. Engineering Modeling:
- High-precision surface design for car bodies, ship hulls, etc.
- High-order Bézier surfaces (grids of Bézier curves) generate 3D models.
3.3 Ant Colony Optimization algorithm
Ant Colony Optimization (ACO) is a heuristic optimization algorithm that simulates the foraging behavior of ants [45–47]. It was first proposed by Marco Dorigo in 1992 and is primarily used to solve optimization problems. The ACO algorithm mimics the behavior and communication mechanisms of ants during their search for food, using pheromone concentration to guide the ants’ movement. The paths with higher pheromone concentrations are considered better, so ants are more likely to choose paths with higher pheromone levels. Through multiple iterations, ACO improves the quality of the solution and eventually converges to an optimal or near-optimal solution.
3.3.1 Basic process of the ACO algorithm.
1. Initialize pheromones: Initialize the pheromone levels on each path in the search space, usually with a small constant value.
2. Ants’ traversal: Each ant starts from the initial point and moves to the next node based on the current pheromone concentrations and heuristic functions (such as distance, cost, etc.).
3. Pheromone update:
- Evaporation: The pheromone on each path evaporates over time, i.e., the pheromone concentration decreases.
- Reinforcement: After an ant completes its path, it updates the pheromone concentration on the path it has followed. The better the path quality (e.g., shorter distance or lower cost), the more pheromone is added to the path.
4. Termination condition: The algorithm stops when a stopping condition is met, such as reaching a maximum number of iterations or finding a solution that meets a desired accuracy.
3.3.2 Detailed formulas and explanation of the ACO algorithm.
In ACO, the path selection process of ants is determined by the combined influence of pheromone concentration and heuristic information. Below are the specific formulas and explanations:
The pheromone concentration on each path decreases over time due to evaporation. The evaporation formula is:
where is the pheromone concentration on path (i,j) at time step t,
is the pheromone evaporation rate, typically
, controlling the speed of pheromone evaporation. The larger the value of
, the faster the pheromone evaporates.
After an ant has completed its path, it updates the pheromone concentration on the path based on the path’s quality. The better the path, the more pheromone is added. The pheromone update formula is:
where is the increase in pheromone on path (i,j), and it is generally proportional to the quality of the path. The shorter the path, the larger the pheromone increment.
The pheromone increment is calculated as:
where is the pheromone increase on path (i,j) left by the k-th ant, and it is typically inversely proportional to the path quality, i.e., the shorter the path, the larger the pheromone increase.
where Q is a constant, usually related to the problem size, Lk is the total length of the path traveled by the k-th ant (e.g., in the Traveling Salesman Problem, Lk is the total distance of the path).
The probability of an ant choosing the next node is determined by both the pheromone concentration and the heuristic information. The formula for the path selection probability is:
where Pij(t) is the probability of the ant choosing path (i,j) at time step t, is the pheromone concentration on path (i,j),
is the heuristic information for path (i,j), which is typically problem-specific (e.g., in the Traveling Salesman Problem,
, where dij is the distance between nodes i and j),
and
are parameters controlling the influence of pheromone and heuristic information, respectively.
ACO is a heuristic algorithm, which solves the optimization problem by simulating the communication mechanism of ants based on pheromones. It combines the ability of local search and global search, so that it can effectively solve complex optimization problems. Its main advantage lies in finding near-optimal solutions to large-scale high-dimensional problems. The pseudo code of ACO algorithm is shown in algorithm 1.
Algorithm 1: Ant Colony Optimization (ACO) pseudocode.
Require: Problem graph , Parameters
, Max iterations Tmax
Ensure: Best solution found Sbest
1: Initialize pheromone trails for all
2:
3:
4: while t<Tmax and not converged do
5: for each ant do
6: ConstructSolution(G,
,
,
)
7: CalculatePathLength(Sk)
8: end for
9: UpdateBestSolution(Sbest, )
10: for all edges do
11:
Pheromone evaporation
12: for each ant do
13: if then
14:
Pheromone reinforcement
15: end if
16: end for
17: end for
18:
19: end while
20: return Sbest
21: function ConstructSolutionG, ,
,
22: Initialize empty path Sk
23: Start from initial node
24: while not complete solution do
25: Calculate transition probabilities using:
26: Select next node probabilistically based on Pij
27: Add edge to Sk
28: end while
29: return Sk
30: end function
3.4 Probabilistic RRT algorithm
The Rapidly-exploring Random Tree (RRT) is a sampling-based motion planning algorithm designed to efficiently explore high-dimensional configuration spaces [48–50]. It is probabilistically complete, meaning that as the number of iterations approaches infinity, the probability of finding a feasible path (if one exists) approaches 1. The algorithm incrementally constructs a tree rooted at the starting configuration, expanding toward randomly sampled points while avoiding obstacles.
3.4.1 Key components and workflow.
The RRT algorithm involves the following components:
1. Tree Initialization
- A tree is initialized with the start node
, where V is the set of vertices (configurations) and E is the set of edges (motion segments).
2. Random Sampling
- A random configuration is sampled uniformly from the configuration space
.
- Sampling bias: Occasionally, is set to
(goal configuration) to bias exploration toward the goal.
3. Nearest Neighbor Search
- Find the nearest node in V to
:
- Distance metric: Typically Euclidean distance , but can be customized for specific configuration spaces.
4. Local Path Extension
- From , extend toward
by a fixed step size
:
- If , set
.
5. Collision Checking
- Verify if the path segment is collision-free.
- If collision-free, add to V and
to E.
6. Termination
- Repeat steps 2-5 until is within a threshold distance
of
, or a maximum number of iterations is reached.
3.4.2 Mathematical formulations.
, where d is the dimensionality.
: Collision-free subset of
.
- Distance Metric:
Custom metrics may be used for non-Euclidean spaces (e.g., SE(3)).
- Step Size Control:
The step size balances exploration speed and path resolution. A smaller
improves obstacle avoidance but increases computation time.
- Goal Bias Probability:
A parameter (e.g.,
) controls the probability of sampling
instead of
:
3.4.3 Probabilistic completeness.
RRT is probabilistically complete because:
1. The uniform sampling strategy ensures dense coverage of over time.
2. The tree expansion direction is influenced by random samples, enabling exploration of narrow passages.
3. The probability of missing a valid path decreases exponentially with iterations.
3.4.4 Limitations and extensions.
- Limitations:
- Suboptimal paths.
- No path smoothing.
- Inefficient in cluttered environments.
- Extensions:
- RRT*: Rewires the tree to asymptotically approach optimality.
- Informed RRT*: Focuses sampling within an ellipsoidal heuristic region.
- Dynamic RRT: Adapts to changing environments.
3.4.5 Pseudo code of RRT* algorithm.
The pseudo code based on RRT* algorithm is shown in Algorithm 2, and the main steps in pseudo-code are as follows.
1. Algorithm Inputs
- : Free configuration space (obstacle-free regions).
- : Step size parameter for tree expansion (balances exploration speed and path resolution).
- : Goal bias probability (accelerates convergence).
2. Core Functions
- SampleUniform : Generates a uniformly random sample in the configuration space.
- NearestNeighbor (T,q): Searches for the nearest node in the tree T to configuration q using spatial data structures (e.g., k-d tree).
- :
- CollisionFree : Verifies whether the path segment
lies entirely within
.
3. Termination Conditions
- Success condition: enters the
-neighborhood of
(i.e.,
).
- Failure condition: The maximum number of iterations (max_iter) is reached.
Algorithm 2: Probabilistic Rapidly-exploring Random Tree (RRT).
Require: Start configuration , Goal configuration
, Configuration space
, Step size
, Max iterations
, Goal bias probability
, Convergence threshold
Ensure: Path from
to
or null
1: Initialize tree with
2:
3: for i = 1 to do
4: Generate random value
5: if then
6:
Goal-biased sampling
7: else
8:
Uniform sampling
9: end if
10:
Nearest node search
11:
Path extension
12: if then
13:
Add new node
14:
Add edge
15: if then
16:
17: return
Path found
18: end if
19: end if
20: end for
21: return null No path found
3.5 A star algorithm
The A* (A-star) algorithm is a widely used heuristic search algorithm for finding the shortest path between two nodes in a graph [51–53]. It combines the completeness of Dijkstra’s algorithm with the efficiency of greedy best-first search by utilizing a cost function that balances both the actual path cost and an estimated remaining cost to the goal. Below is a detailed breakdown of its principles, formulas, and component explanations.
3.5.1 Key formulas.
The core of A* lies in its cost function:
Component Definitions:
1. g(n):
- Actual cost from the start node to the current node n.
- Computed as the sum of edge weights along the path from the start to n.
- Example: In a grid map, g(n) could represent the number of steps taken to reach n.
2. h(n):
- Heuristic estimate of the cost from node n to the goal node.
- Must be admissible (never overestimates the true cost) to guarantee optimality.
- Common heuristics include:
- Euclidean distance (for 2D/3D spaces):
- Manhattan distance (for grid-based movements):
3. f(n):
- Total estimated cost of the path through node n.
- Used to prioritize nodes in the Open List.
3.5.2 Algorithm workflow.
1. Initialization:
- Add the start node to the Open List with g = 0 and f = h (start).
- Set the Closed List to empty.
2. Node Selection:
- Select the node n with the lowest f(n) from the Open List.
- Move n to the Closed List.
3. Goal Check:
- If n is the goal node, reconstruct the path and terminate.
4. Neighbor Expansion:
- For each neighbor m of n:
- Calculate .
- If m is in the Closed List and the new g(m) is not lower, skip it.
- If m is not in the Open List or the new g(m) is lower:
- Update g(m),h(m), and f(m).
- Set n as the parent of m.
- Add m to the Open List if not already present.
5. Termination:
- Repeat until the goal is found or the Open List is empty (no path exists).
3.5.3 Algorithm workflow.
The pseudo code of A* algorithm is shown in Algorithm 3, and the main Components are as follows.
1. Inputs:
- start, goal: Start and goal nodes.
- h(n): Admissible and consistent heuristic function.
- G: Graph with nodes and weighted edges.
- : Function returning the edge cost between nodes n and m.
2. Data Structures:
- open_list: Priority queue sorted by .
- closed_list: Set of evaluated nodes.
- : Dictionary storing the actual cost from start to each node.
- parent: Dictionary for path reconstruction.
3. Core Operations:
- Node expansion: For the current node, compute tentative g-scores for neighbors.
- Priority update: If a better path to a neighbor is found, update ,
, and the parent.
- Path reconstruction: Backtrack from goal to start using the parent pointers.
4. Termination:
- Success: Goal node is reached, return the reconstructed path.
- Failure: open_list is empty (no path exists).
Algorithm 3: A* (A-star) algorithm.
Require: Start node , Goal node
, Heuristic function h(n), Graph G with nodes and edges, Edge cost function
Ensure: Shortest path from
to
, or null
1: Initialize priority queue
2:
3: Initialize set
4: Initialize as
5:
6: Initialize as empty dictionary
7: while is not empty do
8:
9: if then
10: return
11: end if
12:
13: for each neighbor of
do
14: if then
15: continue
16: end if
17: Calculate tentative
18: if then
19:
20:
21:
22: if then
23:
24: else
25:
26: end if
27: end if
28: end for
29: end while
30: return null No path exists
4 Simulation verification
In the 3D grid map path planning simulation demonstration in this paper, we will compare three different algorithms to study and evaluate their path planning performance in handling diverse 3D environments, aiming to explore the most suitable search strategy for specific types of 3D spaces. The selected algorithms include the probabilistic RRT algorithm, which has adaptive capabilities and can effectively handle high-dimensional search space problems; the ant colony optimization algorithm, which excels at finding global optimal solutions and can find feasible paths in complex spaces by simulating the behavior of ant colonies; and the classic A* search algorithm, which determines the shortest path quickly and effectively through heuristic evaluation. Additionally, to optimize the smoothness of the path, we will apply a Bezier curve for simple path smoothing after the path generation, improving the path’s practicality and aesthetic quality.
Through this simulation, we can not only observe the performance and efficiency of each algorithm in complex 3D environments but also analyze their strategy choices when encountering specific obstacles and challenges. This will provide scientific support for algorithm selection and optimization in practical applications. This comparative study helps us understand the potential and limitations of different algorithms in real-world scenarios, enabling us to better design and choose the most appropriate path planning strategy for specific application contexts.
4.1 Simulation parameter setting
The simulation is conducted in a three-dimensional grid map with dimensions [500,500,100], representing a cubic space where each unit grid cell is of size meter. The starting point is set to [8,8,1] and the ending point is at [420,440,60].
The environment includes a variety of obstacles, such as rectangular blockages and randomly distributed irregular obstacles, which are placed at specific coordinates to simulate a realistic 3D space. The obstacles are designed to cover about of the total area, with sizes ranging from
meters to
meters. The distribution of these obstacles is based on a random placement algorithm, ensuring that they are scattered across the grid in a way that creates challenging navigation scenarios. The obstacles’ shape varies, including both cuboid and irregular block structures, mimicking obstacles found in real-world environments.
For the ACO algorithm, the maximum number of iterations is set to itermax = 100, and the pheromone volatilization coefficient is set to 0.30, as previously discussed. The parameters for the RRT and A* algorithms can be found in the references [54] and [55]. These settings ensure that each algorithm is tested in a highly dynamic and complex 3D environment, enabling a thorough comparison of their performance in real-world-like conditions.
- ACO:
- Maximum iterations:
- Pheromone evaporation coefficient: 0.30
- Number of ants per iteration: 50
- Maximum iterations:
- RRT:
- Step size: 5.0 m (maximum expansion distance per iteration)
- Goal bias probability: 0.2 (probability to sample the goal directly)
- Maximum tree nodes: 5000 (termination threshold)
- Collision checking resolution: 0.5 m (interval for validating edge safety)
- A*:
- Heuristic function: Euclidean distance
- Step size: 26-connectivity (allows diagonal movement in 3D grid)
- Heuristic weight:
(standard weighting for
)
- Tie-breaking: Smallest g(n) preferred for equal f(n)
- Heuristic function: Euclidean distance
Additional implementation details for RRT and A* can be found in References [54] and [55], respectively. In addition, the main program of this paper can be found in the uploaded supporting information (S1.pdf).
4.2 Simulation results show
The search efficiency based on drone’s three-dimensional path planning and calculation algorithm is shown in Figs 1–4, and Algorithm 2. Besides, the search efficiency, time and other indicators of each calculation algorithm are shown in Table 2. Fig 1 shows the 3D path planning results based on the ACO algorithm. The path exhibits global continuity and high smoothness (with a maximum turning angle of ), and the waypoints are densely distributed (518 waypoints). However, there is a phenomenon of redundant detours (the path length is 706.40 meters, the longest among the three algorithms). ACO generates the path through pheromone iteration optimization, with a longer computation time (12.37 seconds), making it suitable for offline scenarios where high path smoothness is required and computational resources are sufficient. Fig 2 shows the path planning results of Algorithm A in a 3D environment. The path has a highly structured profile, with the waypoints evenly distributed (502 waypoints), and the shortest path length (631.95 meters). However, there are sharp turns with large angles (maximum
). Algorithm A relies on heuristic search for fast convergence (taking 0.1584 seconds), with a moderate total number of search grids (13,052 grids). It is suitable for medium- to small-scale tasks in known structured environments, though post-processing is required to smooth sharp turns in the path. Fig 3 shows the 3D path planning results of the Rapidly-exploring Random Tree (RRT) algorithm. The path is generated quickly through random sampling (taking only 0.0311 seconds, the fastest among the three algorithms), with sparse waypoints (74 waypoints). However, the path has a higher level of curvature (23 turns exceeding
) and a suboptimal path length (690.71 meters). The tree-like expansion characteristic of RRT (with a total of 927 search grids) makes it highly effective in dynamic or unknown environments, though the path smoothness needs to be optimized. Fig 4 provides a comprehensive comparison of the 3D path planning results of RRT (red), A* (blue), and ACO (green). The RRT path covers the largest free space but is highly winding, the A* path is the shortest but has sharp turns, and the ACO path is the smoothest but includes redundant detours. This figure visually illustrates the differences in the core strategies of the algorithms: the randomness of RRT, the heuristic guidance of A*, and the global optimization of ACO, providing a visual basis for algorithm selection in different scenarios (e.g., RRT for dynamic obstacle avoidance, A* for shortest path, and ACO for prioritizing smoothness).
5 Discussion and future work
A comparative analysis of the RRT, A*, and ACO algorithms in 3D path planning highlights distinct trade-offs between computational efficiency, path quality, and smoothness, each of which has implications for unmanned aerial vehicle (UAV) applications in law enforcement. RRT offers excellent real-time performance, with a fast search time of 0.0311 seconds, making it ideal for dynamic environments such as real-time surveillance or emergency response scenarios. However, its path tends to be suboptimal (690.71 meters) and exhibits excessive turns (greater than ), which can reduce its efficiency when navigating complex urban areas during law enforcement operations. A*, on the other hand, delivers the shortest path (631.95 meters) with moderate computational cost (0.1584 seconds), making it suitable for structured environments where quick, precise routing is required. However, its reliance on accurate environmental modeling and the occurrence of sharp turns (with a maximum angle of
) may limit its robustness in unpredictable or cluttered environments such as urban policing or search-and-rescue missions. ACO, with its ability to generate smooth trajectories (maximum turning angle of
), is well-suited for applications that prioritize precision and smoothness in path planning. However, it incurs high computational overhead (12.3735 seconds) and is prone to path redundancy (706.40 meters), making it less efficient for real-time law enforcement applications requiring rapid response.
These findings underscore the inherent “speed-quality-smoothness trilemma" in 3D path planning, particularly in the context of UAVs used for law enforcement tasks. While our current study assumes a static environment and perfect sensing, real-world applications often involve dynamic obstacles (e.g., moving vehicles, crowds) and unpredictable conditions (e.g., wind or air turbulence). As such, future work must extend these algorithms to handle dynamic and real-time environments effectively.
To address these challenges, future work could explore hybrid algorithms that combine RRT’s fast exploration, A*’s heuristic refinement, and ACO’s smooth path optimization. Such hybrid approaches could be better suited for dynamic law enforcement scenarios, such as crowd monitoring, perimeter security, and surveillance, where both speed and path quality are critical. Additionally, real-time sensor data could be integrated for dynamic obstacle detection, allowing the path planner to adjust in response to new obstacles. Further exploration of adaptive parameterization—such as dynamic bias sampling in RRT or context-aware heuristic weighting in A*—could improve algorithmic adaptability to rapidly changing environments, like urban settings or dynamic policing situations.
In terms of hardware optimization, integrating GPU acceleration or distributed computing could help mitigate ACO’s scalability issues while enhancing RRT’s sampling density for more reliable path generation. Additionally, leveraging edge-computing frameworks could enable real-time, high-quality 3D navigation in complex and dynamic environments, providing essential flexibility for law enforcement UAVs.
Ultimately, combining algorithmic innovations with hardware co-design will be crucial in advancing UAV-based law enforcement systems, making them more efficient, flexible, and effective in providing aerial support for policing and security tasks. These improvements will be pivotal for the development of next-generation UAV technologies, ensuring they are capable of handling the varied demands of modern law enforcement operations. The adaptation of these algorithms to dynamic environments is a critical step toward making UAVs more robust and capable of tackling the unpredictable challenges faced in real-world applications.
References
- 1. Mohsan SAH, Khan MA, Noor F, Ullah I, Alsharif MH. Towards the Unmanned Aerial Vehicles (UAVs): a comprehensive review. Drones. 2022;6(6):147.
- 2. Hassanalian M, Abdelkefi A. Classifications, applications, and design challenges of drones: a review. Progr Aerosp Sci. 2017;91:99–131.
- 3. Nwaogu JM, Yang Y, Chan AP, Chi H l. Application of drones in the architecture, engineering, and construction (AEC) industry. Automat Construct. 2023;150:104827.
- 4. Eisenbeiss H, et al, A mini unmanned aerial vehicle (UAV): system overview and image acquisition. Int Archiv Photogram Remote Sens Spatial Inf Sci. 2004;36(5/W1):1–7.
- 5.
Chaurasia R, Mohindru V. Unmanned aerial vehicle (UAV): a comprehensive survey. Unmanned aerial vehicles for Internet of Things (Io T) concepts, techniques, and applications. 2021. p. 1–27.
- 6.
Puri A. A survey of unmanned aerial vehicles (UAV) for traffic surveillance. Department of Computer Science and Engineering, University of South Florida; 2005.
- 7. Giordan D, Adams MS, Aicardi I, Alicandro M, Allasia P, Baldo M, et al. The use of unmanned aerial vehicles (UAVs) for engineering geology applications. Bull Eng Geol Environ. 2020;79(7):3437–81.
- 8. Siebert S, Teizer J. Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Automat Construct. 2014;41:1–14.
- 9. Yefimenko I. Modern possibilities of using unmanned aerial vehicles by police authorities and units: analysis of foreign and Ukrainian experience. Sci J Natl Acad Internal Affairs. 2022;27(3):65–77.
- 10. Saulnier A, Thompson SN. Police UAV use: institutional realities and public perceptions. PIJPSM. 2016;39(4):680–93.
- 11. Lorek M, Magniszewski M. The use of unmanned aerial vehicles by the Polish police. J Secur Sustain Issues. 2021;11(1).
- 12. Rana K, Praharaj S, Nanda T. Unmanned aerial vehicles (UAVs): an emerging technology for logistics. Int J Bus Manag Invent. 2016;5(5):86–92.
- 13. Borase RP, Maghade DK, Sondkar SY, Pawar SN. A review of PID control, tuning methods and applications. Int J Dynam Control. 2020;9(2):818–27.
- 14. Kiam Heong Ang, Chong G, Yun Li. PID control system analysis, design, and technology. IEEE Trans Contr Syst Technol. 2005;13(4):559–76.
- 15. Skoczowski S, Domek S, Pietrusewicz K, Broel-Plater B. A method for improving the robustness of PID control. IEEE Trans Ind Electron. 2005;52(6):1669–76.
- 16. Holkar K, Waghmare LM. An overview of model predictive control. Int J Control Automat. 2010;3(4):47–63.
- 17.
Qin SJ, Badgwell TA. AIChE symposium series. New York, NY: American Institute of Chemical Engineers; 1997. p. 232–56.
- 18.
Findeisen R, Allgöwer F. An introduction to nonlinear model predictive control. In: 21st Benelux Meeting on Systems and Control, Veldhoven. 2002. p. 119–41.
- 19.
Mayne D. Nonlinear model predictive control:challenges and opportunities. Nonlinear model predictive control. Basel: Birkhäuser; 2000. p. 23–44. https://doi.org/10.1007/978-3-0348-8407-5_2
- 20.
Sutton RS, Barto AG, et al. Reinforcement learning: an introduction, 1. Cambridge: MIT Press; 1998.
- 21.
Qiang W, Zhongli Z. Reinforcement learning model, algorithms and its application. In: 2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC). 2011. p. 1143–6. https://doi.org/10.1109/mec.2011.6025669
- 22. Kober J, Bagnell JA, Peters J. Reinforcement learning in robotics: a survey. Int J Robot Res. 2013;32(11):1238–74.
- 23. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–44. pmid:26017442
- 24. Guo Y, Liu Y, Oerlemans A, Lao S, Wu S, Lew MS. Deep learning for visual understanding: a review. Neurocomputing. 2016;187:27–48.
- 25. Gugan G, Haque A. Path planning for autonomous drones: challenges and future directions. Drones. 2023;7(3):169.
- 26. Perazzo P, Sorbelli FB, Conti M, Dini G, Pinotti CM. Drone path planning for secure positioning and secure position verification. IEEE Trans Mobile Comput. 2016;16(9):2478–93.
- 27.
Candra A, Budiman MA, Pohan RI. Application of a-star algorithm on pathfinding game. J Phys.: Conf. Ser. 2021:012047.
- 28.
Sedighi S, Nguyen DV, Kuhnert KD. Guided hybrid A-star path planning algorithm for valet parking applications. In: 2019 5th International Conference on Control, Automation and Robotics (ICCAR). 2019. p. 570–5.
- 29. Erke S, Bin D, Yiming N, Qi Z, Liang X, Dawei Z. An improved A-Star based path planning algorithm for autonomous land vehicles. Int J Adv Robot Syst. 2020;17(5):1729881420962263.
- 30.
Kuswadi S, Santoso JW, Tamara MN, Nuh M. Application SLAM and path planning using A-star algorithm for mobile robot in indoor disaster area. In: 2018 International Electronics Symposium on Engineering Technology and Applications (IES-ETA). 2018. p. 270–4.
- 31. Zhang Z, Zhao Z. A multiple mobile robots path planning algorithm based on A-star and Dijkstra algorithm. Int J Smart Home. 2014;8(3):75–86.
- 32. LaValle SM, Kuffner JJ. Rapidly-exploring random trees: progress and prospects. Algorithm Comput Robot. 2001:303–7.
- 33. Li B, Chen B. An adaptive rapidly-exploring random tree. IEEE/CAA J Autom Sinica. 2022;9(2):283–94.
- 34.
Umari H, Mukhopadhyay S. Autonomous robotic exploration based on multiple rapidly-exploring randomized trees. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2017. p. 1396–402. https://doi.org/10.1109/iros.2017.8202319
- 35. Fox D, Burgard W, Thrun S. The dynamic window approach to collision avoidance. IEEE Robot Automat Mag. 1997;4(1):23–33.
- 36.
Brock O, Khatib O. High-speed navigation using the global dynamic window approach. In: Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C). IEEE; 1999. p. 341–6. https://doi.org/10.1109/robot.1999.770002
- 37. Ogren P, Leonard NE. A convergent dynamic window approach to obstacle avoidance. IEEE Trans Robot. 2005;21(2):188–95.
- 38. Blum C. Ant colony optimization: introduction and recent trends. Phys. Life Rev. 2005;2(4):353–73.
- 39.
Dorigo M, Socha K. An introduction to ant colony optimization. Handbook of approximation algorithms and metaheuristics. Chapman and Hall/CRC; 2018. p. 395–408.
- 40.
Dorigo M, Stützle T. The ant colony optimization metaheuristic: algorithms, applications, and advances. Handbook of metaheuristics. 2003. p. 250–85.
- 41. Botín-Sanabria DM, Mihaita A-S, Peimbert-García RE, Ramírez-Moreno MA, Ramírez-Mendoza RA, Lozoya-Santos J de J. Digital twin technology challenges and applications: a comprehensive review. Remote Sens. 2022;14(6):1335.
- 42. Liu M, Fang S, Dong H, Xu C. Review of digital twin about concepts, technologies, and industrial applications. J Manuf Syst. 2021;58:346–61.
- 43. Han XA, Ma Y, Huang X. A novel generalization of Bézier curve and surface. J Comput Appl Math. 2008;217(1):180–93.
- 44. Baydas S, Karakas B. Defining a curve as a Bezier curve. J Taibah Univ Sci. 2019;13(1):522–8.
- 45. Socha K, Blum C. An ant colony optimization algorithm for continuous optimization: application to feed-forward neural network training. Neural Comput Applic. 2007;16(3):235–47.
- 46.
Liu B, Abbas HA, McKay B. Classification rule discovery with ant colony optimization. In: IEEE/WIC International Conference on Intelligent Agent Technology. 2003. p. 83–8.
- 47. Kabir MdM, Shahjahan Md, Murase K. A new hybrid ant colony optimization algorithm for feature selection. Exp Syst Applic. 2012;39(3):3747–63.
- 48.
Luders B, Kothari M, How J. Chance constrained RRT for probabilistic robustness to environmental uncertainty. In: AIAA guidance, navigation, and control conference. 2010.
- 49.
Yang K, Gan SK, Huh J, Joo S. Optimal spline-based RRT path planning using probabilistic map. In: 2014 14th International Conference on Control, Automation and Systems (ICCAS 2014). 2014. p. 643–6. https://doi.org/10.1109/iccas.2014.6987859
- 50. Kothari M, Postlethwaite I. A probabilistically robust path planning algorithm for UAVs using rapidly-exploring random trees. J Intell Robot Syst. 2013;71:231–53.
- 51.
Wang H, Zhou J, Zheng G, Liang Y. Hierarchical A-Star algorithm for big map navigation in special areas. In: 2014 5th International Conference on Digital Home. 2014. p. 222–5.
- 52.
Yu J, Hou J, Chen G. Improved safety-first A-star algorithm for autonomous vehicles. In: 2020 5th International Conference on Advanced Robotics and Mechatronics (ICARM). 2020. p. 706–10.
- 53. Zhang J, Wu J, Shen X, Li Y. Autonomous land vehicle path planning algorithm based on improved heuristic function of A-Star. Int J Adv Robot Syst. 2021;18(5).
- 54.
Zhou Q, Liu G. UAV path planning based on the combination of a-star algorithm and RRT-star algorithm. In: 2022 IEEE International Conference on Unmanned Systems (ICUS). 2022. p. 146–51. https://doi.org/10.1109/icus55513.2022.9986703
- 55. Zhang H, Wang Y, Zheng J, Yu J. Path planning of industrial robot based on improved RRT algorithm in complex environments. IEEE Access. 2018;6:53296–306.