Manipulators find it challenging to perform multistep tasks such as block stacking. The Universal Option Framework (UOF) achieves a high success rate on multistep tasks; within UOF, the low-level policy is trained by the Deep Deterministic Policy Gradient (DDPG) algorithm combined with Hindsight Experience Replay (HER). However, on complex tasks this approach suffers from a low success rate, long convergence time, and low training efficiency. To address these problems, this article proposes a UOF algorithm with an improved DDPG and a UOF algorithm with an improved HER. The improved DDPG introduces a softmax operator for soft double estimation into the DDPG algorithm; by reducing the estimation bias of DDPG, it raises the success rate on complex tasks and significantly shortens convergence time. The improved HER introduces curiosity and proximity measures so that the failed experiences of each learning stage are used effectively, further alleviating the sparse-reward problem and significantly improving training efficiency and the success rate on complex tasks. Experiments on block-stacking tasks show that the improved methods complete complex tasks effectively.
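The abstract does not give the exact form of the soft double estimation softmax operator. A minimal sketch of one common formulation, in which the critic target is a softmax-weighted average of sampled action-values combined with clipped double estimation, is shown below; the function names, the `beta` temperature parameter, and the use of `min` over two critics are assumptions for illustration, not the paper's definitive method.

```python
import numpy as np

def softmax_value(q_values, beta=1.0):
    """Softmax operator over sampled action-values:
    V = sum_i w_i * Q_i, with w_i = exp(beta * Q_i) / sum_j exp(beta * Q_j).
    beta=0 recovers the mean; large beta approaches the max, so the
    operator interpolates between under- and over-estimation."""
    q = np.asarray(q_values, dtype=float)
    z = beta * q
    z = z - z.max()          # subtract max for numerical stability
    w = np.exp(z)
    w /= w.sum()
    return float((w * q).sum())

def soft_double_target(r, gamma, q1_next, q2_next, beta=1.0):
    """Hypothetical TD target combining the softmax operator with
    clipped double estimation: softmax value under each of two
    critics, then the minimum of the two (a hedged sketch)."""
    v1 = softmax_value(q1_next, beta)
    v2 = softmax_value(q2_next, beta)
    return r + gamma * min(v1, v2)
```

Because the softmax value lies between the mean and the maximum of the sampled action-values, it softens the overestimation that the plain `max` (or a deterministic greedy action) induces in DDPG targets.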
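The curiosity and proximity measures used to select relabeling goals are specific to the paper and are not detailed in the abstract. As background, a minimal sketch of the standard HER relabeling step that those measures would extend is given below, using HER's "final" strategy and the sparse 0/-1 reward common in robotic goal-reaching tasks; the dictionary keys, `reward_fn` signature, and `tol` threshold are illustrative assumptions.

```python
import numpy as np

def sparse_reward(achieved, goal, tol=0.05):
    # Sparse goal-reaching reward: 0 on success, -1 otherwise.
    dist = np.linalg.norm(np.asarray(achieved) - np.asarray(goal))
    return 0.0 if dist < tol else -1.0

def her_relabel(episode, reward_fn):
    """HER 'final' strategy: replace every transition's desired goal
    with the goal actually achieved at the end of the episode and
    recompute the reward, so a failed episode still produces useful
    learning signal instead of uniform -1 rewards."""
    new_goal = episode[-1]["achieved_goal"]
    relabeled = []
    for t in episode:
        r = reward_fn(t["achieved_goal"], new_goal)
        relabeled.append({**t, "desired_goal": new_goal, "reward": r})
    return relabeled
```

Under this relabeling, at least the final transition of every episode receives a success reward, which is what lets HER exploit failed experiences against sparse rewards; the paper's curiosity and proximity criteria refine which achieved states are chosen as substitute goals.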