학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

An Adversarial Objective for Scalable Exploration

Resource Type: Conference
Authors: Bucher, Bernadette; Schmeckpeper, Karl; Matni, Nikolai; Daniilidis, Kostas
Source: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Intelligent Robots and Systems (IROS), 2021 IEEE/RSJ International Conference on. :2670-2677 Sep, 2021
Subject: Robotics and Control Systems
Computational modeling
Scalability
Pipelines
Predictive models
Hardware
Data models
Task analysis
Language
ISSN: 2153-0866

Online Access

Full Text (IEEE)

초록

Collecting new experience is costly in many robotic tasks, so determining how to efficiently explore in a new environment to learn as much as possible in as few trials as possible is an important problem for robotics. In this paper, we propose a method for exploring for the purpose of learning a dynamics model. Our key idea is to minimize a score given by a discriminator network as an objective for a planner which chooses actions. This discriminator is optimized jointly with a prediction model and enables our active learning approach to sample sequences of observations and actions which result in predictions considered the least realistic by the discriminator. Comparable existing exploration methods cannot operate in many prediction-planning pipelines used in robotic learning without hardware modifications to standard robotics platforms in order to accommodate their large compute requirements, so the primary contribution of our adversarial exploration method is scalability. We demonstrate progressively increased performance of our adversarial exploration approach compared to leading model-based exploration strategies as compute is restricted in simulated environments. We further demonstrate the ability of our adversarial method to scale to a robotic manipulation prediction-planning pipeline where we improve sample efficiency and prediction performance for a domain transfer problem.

공지

DAU Library

학술논문

요약정보

An Adversarial Objective for Scalable Exploration

Online Access

초록