Visual grounding aims to localize objects in images referred to by natural language expressions. The task becomes challenging when the training and testing distributions differ significantly: existing methods tend to overfit the training set, especially in small-sample scenarios. To address this issue, in this letter we present MetaVG, a novel meta-learning-based training framework for visual grounding. MetaVG leverages bi-level optimization to adapt quickly to the target task, thereby alleviating overfitting. To train MetaVG effectively, we propose a training mechanism called Random Uncorrelated Meta-training (RUM), which randomly samples uncorrelated batches as the support and query sets during data separation, and then applies bi-level optimization to train the model directly on visual grounding datasets. Comprehensive experiments on four widely used datasets, as well as in small-sample scenarios, validate the efficacy of MetaVG.
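The sketch below illustrates one possible RUM-style bi-level update, assuming a PyTorch model: two batches are drawn independently at random (hence uncorrelated) to serve as support and query sets, an inner gradient step adapts the model on the support batch, and the outer step back-propagates the query loss through that adaptation. The toy MSE loss, the `rum_meta_step` helper, and its arguments are illustrative assumptions, not the authors' implementation.

```python
import random

import torch
from torch.func import functional_call


def _loss(model, params, batch):
    # Stateless forward pass with an explicit parameter dict, so the same
    # module can be evaluated with either original or adapted weights.
    # MSE stands in for the actual grounding loss (an assumption).
    xs = torch.stack([x for x, _ in batch])
    ys = torch.stack([y for _, y in batch])
    preds = functional_call(model, params, (xs,))
    return torch.nn.functional.mse_loss(preds, ys)


def rum_meta_step(model, data, meta_opt, inner_lr=1e-2, batch_size=8):
    # RUM data separation: two independently sampled, uncorrelated batches
    # act as the support and query sets respectively.
    support = random.sample(data, batch_size)
    query = random.sample(data, batch_size)

    params = dict(model.named_parameters())

    # Inner loop: one adaptation step on the support set. create_graph=True
    # keeps the graph so the outer update can differentiate through it.
    sup_loss = _loss(model, params, support)
    grads = torch.autograd.grad(sup_loss, list(params.values()),
                                create_graph=True)
    adapted = {k: p - inner_lr * g
               for (k, p), g in zip(params.items(), grads)}

    # Outer loop: evaluate the adapted parameters on the query set and
    # back-propagate through both levels to update the original weights.
    qry_loss = _loss(model, adapted, query)
    meta_opt.zero_grad()
    qry_loss.backward()
    meta_opt.step()
    return qry_loss.item()


# Usage on a toy regression dataset standing in for a grounding dataset.
model = torch.nn.Linear(16, 4)
data = [(torch.randn(16), torch.randn(4)) for _ in range(256)]
meta_opt = torch.optim.Adam(model.parameters(), lr=1e-3)
for step in range(100):
    rum_meta_step(model, data, meta_opt)
```

Because the support and query batches are drawn from the same dataset rather than from constructed episodic tasks, this style of meta-training can be applied directly to standard visual grounding data.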