Existing imitation learning (IL) methods such as inverse reinforcement learning (IRL) typically have a double-loop training process, alternating between learning a reward function and a policy, and tend to suffer from long training times and high variance. In this work, we identify the benefits of differentiable physics simulators and propose a new IL method, Imitation Learning as State Matching via Differentiable Physics (ILD), which eliminates the double-loop design and achieves significant improvements in final performance, convergence speed, and stability. ILD incorporates the differentiable physics simulator as a physics prior into its computational graph for policy learning. It unrolls the dynamics by sampling actions from a parameterized policy and minimizes the distance between the expert trajectory and the agent trajectory. The gradient is back-propagated into the policy through the temporal physics operators, which improves transferability to unseen environments and yields higher final performance. ILD has a single-loop structure that stabilizes and speeds up training, and it dynamically selects a learning objective for each state during optimization to simplify the complex optimization landscape. Experiments show that ILD outperforms state-of-the-art methods on continuous control tasks in Brax, and that it can be applied to deformable object manipulation tasks and generalizes to unseen configurations. Code is available at https://github.com/sail-sg/ILD.
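The core mechanism described above can be sketched on a toy system. The snippet below is a minimal, self-contained illustration, not the paper's implementation (which uses Brax and neural-network policies): it unrolls a hand-written differentiable point-mass simulator under a two-parameter linear policy, measures the squared distance between agent and expert state trajectories, and back-propagates that loss through the unrolled dynamics into the policy parameters. All names (`rollout`, `loss_and_grad`, the gains `k1`, `k2`) are illustrative assumptions.

```python
# Toy sketch of state matching through a differentiable simulator.
# Dynamics: x' = x + dt*v, v' = v + dt*a, with policy a = -k1*x - k2*v.
# Hypothetical example; not the paper's Brax-based implementation.

DT, T = 0.05, 40  # step size and horizon

def rollout(k1, k2, x0=1.0, v0=0.0):
    """Unroll the dynamics under the policy; return position/velocity lists."""
    xs, vs = [x0], [v0]
    for _ in range(T):
        x, v = xs[-1], vs[-1]
        a = -k1 * x - k2 * v            # policy action
        xs.append(x + DT * v)           # differentiable physics step
        vs.append(v + DT * a)
    return xs, vs

def loss_and_grad(k1, k2, expert_xs):
    """State-matching loss and its gradient, obtained by reverse-mode
    differentiation through the unrolled temporal physics operators."""
    xs, vs = rollout(k1, k2)
    loss = sum((xs[t] - expert_xs[t]) ** 2 for t in range(1, T + 1))
    gk1 = gk2 = gx = gv = 0.0           # gx, gv accumulate dL/dx_t, dL/dv_t
    for t in range(T, 0, -1):
        gx += 2.0 * (xs[t] - expert_xs[t])      # loss term at step t
        x, v = xs[t - 1], vs[t - 1]             # inputs of step t-1 -> t
        gk1 += gv * DT * (-x)                   # d v_t / d k1 = -DT * x
        gk2 += gv * DT * (-v)                   # d v_t / d k2 = -DT * v
        gx, gv = gx - gv * DT * k1, gx * DT + gv * (1.0 - DT * k2)
    return loss, gk1, gk2

# Expert demonstrations come from hidden gains; the learner sees only states.
expert_xs, _ = rollout(4.0, 2.0)
k1 = k2 = 0.0
init_loss = loss_and_grad(k1, k2, expert_xs)[0]
for _ in range(500):                    # single-loop, normalized gradient descent
    loss, g1, g2 = loss_and_grad(k1, k2, expert_xs)
    norm = (g1 * g1 + g2 * g2) ** 0.5 + 1e-12
    k1 -= 0.05 * g1 / norm
    k2 -= 0.05 * g2 / norm
final_loss = loss_and_grad(k1, k2, expert_xs)[0]
```

Note the single-loop structure: there is no inner reward-learning phase, only direct gradient steps on the state-matching objective, which is the structural contrast with double-loop IRL drawn in the abstract.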