Visual Question Answering with Textual Representations for Images

Resource Type: Conference
Authors: Hirota, Yusuke; Garcia, Noa; Otani, Mayu; Chu, Chenhui; Nakashima, Yuta; Taniguchi, Ittetsu; Onoye, Takao
Source: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 3147-3150, Oct 2021
Subject: Computing and Processing
Visualization
Computer vision
Conferences
Computational modeling
Knowledge discovery
Feature extraction
Object recognition
Language
ISSN: 2473-9944

Abstract

How far can we go with textual representations for understanding pictures? Deep visual features extracted by object recognition models are prevailingly used in multiple tasks, especially in visual question answering (VQA). However, conventional deep visual features may struggle to convey all the details in an image as we humans do. Meanwhile, with recent language models’ progress, descriptive text may be an alternative for addressing this problem. This paper delves into the effectiveness of textual representations for image understanding in the specific context of VQA.
