An Adversarial Objective for Scalable Exploration
- Resource Type
- Conference
- Authors
- Bucher, Bernadette; Schmeckpeper, Karl; Matni, Nikolai; Daniilidis, Kostas
- Source
- 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Intelligent Robots and Systems (IROS), 2021 IEEE/RSJ International Conference on. :2670-2677 Sep, 2021
- Subject
- Robotics and Control Systems
Computational modeling
Scalability
Pipelines
Predictive models
Hardware
Data models
Task analysis
- Language
- ISSN
- 2153-0866
Collecting new experience is costly in many robotic tasks, so determining how to efficiently explore in a new environment to learn as much as possible in as few trials as possible is an important problem for robotics. In this paper, we propose a method for exploring for the purpose of learning a dynamics model. Our key idea is to minimize a score given by a discriminator network as an objective for a planner which chooses actions. This discriminator is optimized jointly with a prediction model and enables our active learning approach to sample sequences of observations and actions which result in predictions considered the least realistic by the discriminator. Comparable existing exploration methods cannot operate in many prediction-planning pipelines used in robotic learning without hardware modifications to standard robotics platforms in order to accommodate their large compute requirements, so the primary contribution of our adversarial exploration method is scalability. We demonstrate progressively increased performance of our adversarial exploration approach compared to leading model-based exploration strategies as compute is restricted in simulated environments. We further demonstrate the ability of our adversarial method to scale to a robotic manipulation prediction-planning pipeline where we improve sample efficiency and prediction performance for a domain transfer problem.