Decoupling Common and Unique Representations for Multimodal Self-supervised Learning

Resource Type: Working Paper
Authors: Wang, Yi; Albrecht, Conrad M; Braham, Nassim Ait Ali; Liu, Chenying; Xiong, Zhitong; Zhu, Xiao Xiang
Subject: Computer Science - Computer Vision and Pattern Recognition

Abstract

The increasing availability of multi-sensor data sparks wide interest in multimodal self-supervised learning. However, most existing approaches learn only common representations across modalities while ignoring intra-modal training and modality-unique representations. We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning. By distinguishing inter- and intra-modal embeddings through multimodal redundancy reduction, DeCUR can integrate complementary information across different modalities. We evaluate DeCUR in three common multimodal scenarios (radar-optical, RGB-elevation, and RGB-depth), and demonstrate its consistent improvement regardless of architectures and for both multimodal and modality-missing settings. With thorough experiments and comprehensive analysis, we hope this work can provide valuable insights and raise more interest in researching the hidden relationships of multimodal representations.
Comment: Accepted to ECCV 2024. 27 pages, 8 figures
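
The abstract does not spell out the loss function, so the following is only a minimal sketch of what "multimodal redundancy reduction" with decoupled common and unique embedding dimensions could look like. It assumes a Barlow Twins-style cross-correlation objective; the function names, the `n_common` split point, and the off-diagonal weight `lam` are hypothetical and are not taken from this record.

```python
import numpy as np


def cross_correlation(z_a, z_b, eps=1e-9):
    """Cross-correlation matrix (D x D) between two embedding batches (N x D)."""
    # Standardize each embedding dimension over the batch.
    z_a = (z_a - z_a.mean(axis=0)) / (z_a.std(axis=0) + eps)
    z_b = (z_b - z_b.mean(axis=0)) / (z_b.std(axis=0) + eps)
    return z_a.T @ z_b / z_a.shape[0]


def off_diagonal_sq_sum(c):
    """Sum of squared off-diagonal entries of a square matrix."""
    return (c ** 2).sum() - (np.diag(c) ** 2).sum()


def intra_modal_loss(c, lam=0.005):
    """Assumed intra-modal term: two views of the SAME modality.

    All dimensions are pulled toward correlation 1 on the diagonal,
    and off-diagonal entries are pushed toward 0.
    """
    on_diag = ((np.diag(c) - 1.0) ** 2).sum()
    return on_diag + lam * off_diagonal_sq_sum(c)


def inter_modal_loss(c, n_common, lam=0.005):
    """Assumed inter-modal term: embeddings from two DIFFERENT modalities.

    The first n_common dimensions are treated as common (diagonal -> 1),
    the remaining ones as modality-unique (diagonal -> 0).
    """
    c_common = c[:n_common, :n_common]
    c_unique = c[n_common:, n_common:]
    loss_common = ((np.diag(c_common) - 1.0) ** 2).sum() \
        + lam * off_diagonal_sq_sum(c_common)
    loss_unique = (np.diag(c_unique) ** 2).sum() \
        + lam * off_diagonal_sq_sum(c_unique)
    return loss_common + loss_unique
```

In this sketch a total objective would sum one inter-modal term and one intra-modal term per modality. How the paper actually weights or combines them is not stated in the abstract.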
