학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation

Resource Type: Original Paper
Authors: Tang, Kaifeng; Xu, Chi; Chen, Ming
Source: Multimedia Tools and Applications: An International Journal. 83(17):53043-53063
Subject: Object pose estimation
Object shape estimation
Attention
RGB-D image
Robotic vision
Language: English
ISSN: 1573-7721

Online Access

초록

RGB-D images contain color and geometric information which are complementary for object pose and shape estimation. Normally, dense-fusion scheme is used to fuse the features extracted from the RGB-D channels for pose estimation of instance-level objects. However, for category-level objects, the effectiveness of dense-fusion feature is unfortunately affected by the significant intra-class variations between color and geometry. To address this problem, we propose AttentionFusion, a bi-directional attention-based RGB-D fusion framework for category-level object pose and shape estimation. In this framework, the complex contextual relationship between the color and geometric features is effectively explored by bi-directional cross-attention mechanism on a global scale for feature fusion. Based on the fused feature, 6D pose of the category-level object instance is refined iteratively, and object shape is also estimated precisely. Experimental results show that, the proposed method can achieve state-of-the-art performance for object pose and shape estimation on REAL275 datasets.

공지

DAU Library

학술논문

요약정보

Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation

Online Access

초록