Fig 1.
RL model.
Fig 2.
Simulation environment.
Table 1.
Setting of 4 kinds of algorithms.
Fig 3.
Optimal path of proposed algorithm.
Fig 4.
Convergence episode of traditional Q-learning.
Fig 5.
Convergence episode of Q-learning(Initialization).
Fig 6.
Convergence episode of Q-learning(Dynamic).
Fig 7.
Convergence episode of improved Q-learning.
Table 2.
Performance comparison of two kinds of algorithms.