학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Exploring Video Event Classification: Leveraging Two-Stage Neural Networks and Customized CNN Models with UCF-101 and CCV Datasets

Resource Type: Conference
Authors: Sachdeva, Kumud; Sandhu, Jasminder Kaur; Sahu, Rakesh
Source: 2024 11th International Conference on Computing for Sustainable Global Development (INDIACom) Computing for Sustainable Global Development (INDIACom), 2024 11th International Conference on. :100-105 Feb, 2024
Subject: Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
General Topics for Engineers
Geoscience
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Deep learning
Video on demand
Event detection
Reviews
Computational modeling
Neural networks
Computer architecture
Video Event Detection (VED)
Convolutional Neural Network (CNN)
Deep Neural Network (DNN)
Recurrent Neural Network (RNN)
UCF-101
CCV datasets
Language

Online Access

Full Text (IEEE)

초록

Rapid technological advances have significantly improved our ability to analyse video data. This comprehensive review examines machine learning (ML) models applied to video event detection and classification, including CNNs, deep neural networks (DNNs), and RNNs. When evaluated on benchmark datasets for accuracy, these approaches demonstrate their relative strengths and weaknesses. Researchers have encountered numerous challenges in video event detection, which are addressed throughout the review. However, achieving high detection precision remains challenging due to diverse event types, video quality issues, model over fitting risks, and lack of large labeled training datasets. Background scenes, lighting, and object occlusion further complicate accurate identification. As datasets and computational power grow, video event detection stands to gain significantly. This review assessed action recognition models trained on the UCF-101 and CCV databases. On CCV, a 2-stage neural network achieved 75% accuracy; while a multi-stream deep learning (DL) system obtained 77.5%. For the larger UCF101, 2-stream and RNN architectures realized 92% and 89% accuracy using video-level prediction.

공지

DAU Library

학술논문

요약정보

Exploring Video Event Classification: Leveraging Two-Stage Neural Networks and Customized CNN Models with UCF-101 and CCV Datasets

Online Access

초록