학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration

Resource Type: Conference
Authors: Miao, Jiayu; Zhou, Tianze; Shao, Kun; Zhou, Ming; Zhang, Weinan; Hao, Jianye; Yu, Yong; Wang, Jun
Source: 2022 International Conference on Robotics and Automation (ICRA) Robotics and Automation (ICRA), 2022 IEEE International Conference on. :7544-7550 May, 2022
Subject: Robotics and Control Systems
Automation
Sociology
Reinforcement learning
Trajectory
Space exploration
Behavioral sciences
Task analysis
Language

Online Access

Full Text (IEEE)

초록

Quality Diversity (QD) algorithms in population-based reinforcement learning aim to optimize agents' returns and diversity among the population simultaneously. It is conducive to solving exploration problems in reinforcement learning and potentially getting multiple good and diverse strategies. However, previous methods typically define behavioral embedding in action space or outcome space, which neglect trajectory characteristics during the execution process. In this paper, we introduce a trajectory embedding model trained by Variational Autoencoder with similarity constraint to characterize trajectory features. Based on that, we propose a hierarchical trajectory-space exploration (HTSE) framework using Determinantal Point Processes (DPP) to generate high-quality and diverse solutions in the selection and mutation process. The experimental results show that our HTSE method effectively completes several simulated tasks, outperforming other Quality-Diversity Reinforcement Learning algorithms.

공지

DAU Library

학술논문

요약정보

Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration

Online Access

초록