An Improved Method towards Multi-UAV Autonomous Navigation Using Deep Reinforcement Learning

Resource Type: Conference
Authors: Wu, Dingwei; Wan, Kaifang; Tang, Jianqiang; Gao, Xiaoguang; Zhai, Yiwei; Qi, Zhaohui
Source: 2022 7th International Conference on Control and Robotics Engineering (ICCRE), pp. 96-101, Apr. 2022
Subject: Robotics and Control Systems
Navigation
Decision making
Reinforcement learning
Stability analysis
Robustness
Convergence
Autonomous robots
reinforcement learning
multi-UAV
autonomous navigation
prioritized experience replay
MADDPG
Online Access

Full Text (IEEE)

Abstract

Autonomous navigation is a key technology for multi-UAV systems, and deep reinforcement learning can give UAVs powerful autonomous decision-making capabilities. To improve the convergence speed and stability of reinforcement learning, this paper proposes PER-MADDPG, a multi-agent deep deterministic policy gradient algorithm based on prioritized experience replay. The algorithm gives higher-priority samples a higher probability of being chosen for the parameter update, which speeds up convergence. In addition, UAV actions are generated using parameter noise, which improves the stability and robustness of the algorithm. Experiments show that PER-MADDPG converges quickly and to good results, and that it has excellent autonomous navigation capabilities.
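
The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which prioritization scheme is used, so the code assumes the common proportional variant, where the sampling probability is p_i^alpha / sum_k p_k^alpha and the priority is the absolute TD error. The names `PrioritizedReplayBuffer` and `perturb_parameters`, the hyperparameters `alpha`, `beta`, `eps`, and `sigma`, and the use of Gaussian noise on the weights are all assumptions made for illustration.

```python
import random


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (assumed variant, not from the paper)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling (0 = uniform)
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions get the current max priority so they are likely
        # to be replayed before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def probabilities(self):
        # P(i) = p_i**alpha / sum_k p_k**alpha
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        return [s / total for s in scaled]

    def sample(self, batch_size, beta=0.4, rng=random):
        probs = self.probabilities()
        n = len(self.data)
        idx = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights offset the bias of non-uniform sampling;
        # they are normalized by the batch maximum so they only scale updates down.
        weights = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idx, [self.data[i] for i in idx], weights

    def update_priorities(self, indices, td_errors):
        # Priority is the absolute TD error plus a small constant.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps


def perturb_parameters(params, sigma, rng=random):
    """Parameter-space noise: perturb the policy weights, not the actions."""
    return [w + rng.gauss(0.0, sigma) for w in params]
```

In a MADDPG-style training loop, each agent would draw a batch with `sample`, scale its critic loss by the returned weights, and then call `update_priorities` with the new TD errors. For exploration, actions would come from an actor whose weights were passed through something like `perturb_parameters`, rather than from adding noise to the actor's output.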
