This paper presents a novel algorithm, the Keyframe Attention Network (KAN), for video captioning, which combines keyframe feature extraction with an attention allocation mechanism. The proposed method first applies a threshold-based keyframe extraction technique to select keyframes from the input video. A keyframe representation module, built on a deep residual network, then extracts essential features from these keyframes. Finally, the extracted feature vectors, together with reference captions, are fed into an attention allocation module that generates descriptive captions. The deep residual network allows the network depth to be increased without suffering from vanishing or exploding gradients. Moreover, the attention module adopts an encoder-decoder structure with additional attention layers, enabling effective attention allocation and yielding more accurate captions.
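The threshold-based keyframe extraction step can be illustrated with a minimal sketch. This is a hypothetical implementation under a common formulation (keep a frame when its mean absolute pixel difference from the last kept frame exceeds a threshold); the paper's exact criterion and threshold value are assumptions here, not taken from the source.

```python
import numpy as np

def extract_keyframes(frames, threshold=0.5):
    """Select keyframe indices from a video tensor of shape (T, H, W).

    A frame is kept when its mean absolute pixel difference from the
    previously kept frame exceeds `threshold`. This is an illustrative
    criterion, not necessarily the one used in the paper.
    """
    keyframes = [0]  # always keep the first frame
    last = frames[0].astype(np.float64)
    for i in range(1, len(frames)):
        cur = frames[i].astype(np.float64)
        if np.mean(np.abs(cur - last)) > threshold:
            keyframes.append(i)
            last = cur
    return keyframes

# toy video: 10 grayscale frames with one abrupt scene change at index 5
video = np.zeros((10, 4, 4))
video[5:] = 1.0
print(extract_keyframes(video, threshold=0.5))  # → [0, 5]
```

In the full pipeline, the frames at the returned indices would then be passed to the residual-network representation module for feature extraction.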