Fig 1.
Modeling of a fixed-wing UAV.
Fig 2.
Track followed by the fixed-wing UAV.
Fig 3.
Operation of the airspeed and altitude controller.
Fig 4.
Operation of the heading controller.
Fig 5.
Operation of the attitude controller.
Fig 6.
DDPG algorithm structure.
Fig 7.
TD3 algorithm structure.
Fig 8.
PPO algorithm structure.
Fig 9.
TRPO algorithm structure.
Fig 10.
SAC algorithm structure.
Table 1.
Strengths and limitations of RL algorithms.
Table 2.
Training process for RL agents.
Table 3.
Evaluation criteria for RL agents.
Table 4.
Hyperparameters for all agents.
Fig 11.
Altitude control (left) and training curve (right) for the DDPG agent.
Fig 12.
Heading control (left) and roll control (right) for the DDPG agent.
Fig 13.
Altitude control (left) and training curve (right) for the TRPO agent.
Fig 14.
Heading control (left) and roll control (right) for the TRPO agent.
Fig 15.
Altitude control (left) and training curve (right) for the PPO agent.
Fig 16.
Heading control (left) and roll control (right) for the PPO agent.
Fig 17.
Altitude control (left) and training curve (right) for the TD3 agent.
Fig 18.
Heading control (left) and roll control (right) for the TD3 agent.
Fig 19.
Altitude control (left) and training curve (right) for the SAC agent.
Fig 20.
Heading control (left) and roll control (right) for the SAC agent.
Fig 21.
Altitude control for the PID controller.
Fig 22.
Heading control (left) and roll control (right) for the PID controller.
Fig 23.
Deflection of the control surfaces over time.
Fig 24.
Comparison of RL agent and PID responses for the altitude controller.
Fig 25.
Comparison of RL agent and PID responses for the heading controller.
Fig 26.
Comparison of RL agent and PID responses for the roll controller.
Table 5.
Comparison of RL agents.
Table 6.
Training time, total steps, and step efficiency of RL agents.
Table 7.
Comparison of PID and RL agents.