Existing video understanding models typically extract information uniformly across space, which tends to overlook important visual semantics. Moreover, because positive and negative samples cannot be accurately distinguished, visual semantics and predicted content cannot be precisely aligned, leading to inaccurate descriptions. In this paper, we design a method that highlights important features, together with a new semantic alignment loss function, to improve description accuracy. First, image frames are mapped into feature vectors; a selection network learns and selects important features based on inter-frame relationships, and these features are then extracted through a fully connected layer. Second, by training with negative samples, the decoder learns to effectively identify hard samples; on this basis, the new semantic alignment loss function adaptively assigns weights to the loss computed from negative samples, strengthening the semantic relevance between text and images. Experimental results on MSVD, a dataset widely used in this field, show that our method significantly improves the accuracy of video descriptions and outperforms existing models on every metric.
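The adaptive weighting idea can be sketched as a contrastive alignment loss in which harder negatives (captions more similar to the video) receive larger weights in the denominator. This is a minimal illustrative sketch, not the paper's exact formulation: the function name, the softmax-style weighting, and the hyperparameters `tau` and `gamma` are all assumptions introduced here for illustration.

```python
import numpy as np

def adaptive_alignment_loss(video_emb, text_embs, pos_idx, tau=0.07, gamma=1.0):
    """Hypothetical sketch of an adaptively weighted semantic alignment loss.

    video_emb: (d,) video feature vector.
    text_embs: (n, d) candidate caption embeddings; the one at pos_idx is the
    positive (ground-truth) caption, the rest are negatives. Hard negatives,
    i.e. negatives with high similarity to the video, are up-weighted so that
    they contribute more to the contrastive penalty.
    """
    # Cosine similarities between the video and each caption embedding.
    v = video_emb / np.linalg.norm(video_emb)
    t = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    sims = t @ v                                   # shape (n,)

    pos = np.exp(sims[pos_idx] / tau)
    neg_sims = np.delete(sims, pos_idx)

    # Adaptive weights: a softmax over negative similarities (sharpened by
    # gamma), renormalized to mean 1, so hard negatives dominate the loss.
    w = np.exp(gamma * neg_sims)
    w = w / w.sum() * len(neg_sims)

    neg = np.sum(w * np.exp(neg_sims / tau))
    # InfoNCE-style objective: maximize the positive's share of the total.
    return -np.log(pos / (pos + neg))
```

As the positive caption's similarity to the video rises while the negatives stay fixed, the loss decreases, which is the behavior the alignment objective relies on.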