We developed a modality-attention motion generation model based on multi-modality prediction. This model provides interpretability about modality usage and demonstrates robustness against disturbances. We used a hierarchical model consisting of low-level recurrent neural networks (RNNs) that process each modality individually and a high-level RNN that integrates the modalities. This integration is achieved by gating the modality features and feeding them to the high-level RNN. We verified the interpretability and robustness on a furniture-part insertion task, which consists of an “approach” phase, in which a wooden dowel is brought close to the hole, and an “insertion” phase. While the proposed model achieves the same task success rate as the conventional model, it reveals that it attends to vision during the “approach” phase and to force during the “insertion” phase, providing interpretability regarding modality use. Furthermore, in contrast to the model without modality attention, whose task success rate drops significantly under disturbance, the proposed model is robust against disturbances to modalities it does not attend to during the task, maintaining a consistently high success rate ($\simeq\! 90\%$).
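The hierarchical gating described above can be sketched as follows; this is a minimal illustration assuming PyTorch, where the module names, dimensions, and the softmax-based gating are our own assumptions rather than the authors' exact implementation.

```python
# Minimal sketch of a hierarchical modality-attention model (assumed design):
# per-modality low-level RNNs, softmax gating, and a high-level RNN.
import torch
import torch.nn as nn

class ModalityAttentionRNN(nn.Module):
    def __init__(self, modality_dims, low_hidden=64, high_hidden=128):
        super().__init__()
        # One low-level RNN per modality (e.g., vision features, force).
        self.low_rnns = nn.ModuleList(
            [nn.LSTMCell(d, low_hidden) for d in modality_dims]
        )
        # Attention head: scores each modality's low-level hidden state.
        self.attn = nn.Linear(low_hidden, 1)
        # High-level RNN integrates the gated modality features.
        self.high_rnn = nn.LSTMCell(low_hidden, high_hidden)

    def forward(self, inputs, low_states, high_state):
        # inputs: list of (batch, modality_dim) tensors, one per modality.
        # low_states: list of (h, c) tuples; high_state: (h, c) tuple.
        new_low, hiddens = [], []
        for x, cell, state in zip(inputs, self.low_rnns, low_states):
            h, c = cell(x, state)
            new_low.append((h, c))
            hiddens.append(h)
        hs = torch.stack(hiddens, dim=1)  # (batch, n_modalities, low_hidden)
        # Softmax over modalities yields interpretable attention weights.
        weights = torch.softmax(self.attn(hs).squeeze(-1), dim=1)
        # Gate: weighted sum of modality features fed to the high-level RNN.
        gated = (weights.unsqueeze(-1) * hs).sum(dim=1)
        high_state = self.high_rnn(gated, high_state)
        return high_state, new_low, weights  # weights expose modality use
```

Inspecting `weights` over time is what would make the modality usage interpretable: under this sketch, the weight on vision should dominate during the “approach” phase and the weight on force during the “insertion” phase.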