Video anomaly detection is crucial for behavior analysis and has witnessed continuous progress in recent years within the auto-encoder based reconstruction framework. However, abnormal frames may sometimes also be reconstructed well due to the strong representation ability of deep networks, which increases missed detections. To mitigate this issue, existing methods usually adopt a memory bank, which records normal patterns so that abnormal frames are reconstructed toward normal ones and thus receive high reconstruction errors. In this paper, to better exploit the semantic information of normal videos recorded in the memory module, we introduce the Memory-Token Transformer (MTT) to boost the reconstruction performance on normal frames. We assume that the anomalies in a video mainly concentrate on the regions containing people and relevant objects. Therefore, during the decoding stage, we first extract the semantic concepts of a feature map and generate the corresponding semantic tokens. These tokens are then combined with the proposed memory module. Finally, we introduce a transformer to model the complex relationships among different tokens, and we use 3D convolution with pooling operators in our encoder to enhance spatio-temporal feature extraction compared with 2D models. Experimental results on various benchmarks demonstrate the effectiveness of the proposed method.
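The memory-read step mentioned above can be sketched as follows. This is a minimal, generic softmax-addressed memory lookup (a common design in memory-based anomaly detection), written with hypothetical names and NumPy for illustration; the abstract does not specify the paper's exact addressing or update rules.

```python
import numpy as np

def memory_read(tokens, memory):
    """Reconstruct each query token as a convex combination of memory items.

    tokens: (T, D) array of semantic tokens from the decoder.
    memory: (M, D) array of learned normal-pattern prototypes.
    Returns a (T, D) array of memory-reconstructed tokens.
    """
    # Cosine-similarity addressing between tokens and memory items.
    t = tokens / np.linalg.norm(tokens, axis=-1, keepdims=True)
    m = memory / np.linalg.norm(memory, axis=-1, keepdims=True)
    sim = t @ m.T                                   # (T, M) similarities

    # Softmax over memory items yields addressing weights that sum to 1,
    # so each output token lies in the convex hull of normal prototypes.
    e = np.exp(sim - sim.max(axis=-1, keepdims=True))
    w = e / e.sum(axis=-1, keepdims=True)           # (T, M) weights
    return w @ memory                               # (T, D) reconstruction
```

Because the output is constrained to combinations of recorded normal patterns, tokens from abnormal regions are pulled toward normal appearance, which is what produces the large reconstruction error used for detection.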