Utility Model for Visual Recognition using Enhanced Long-term Recurrent Convolutional Network
- Resource Type
- Conference
- Authors
- Sowmya, BJ; Swetha, BN; Meeradevi; Kanavalli, Anita; Mishra, Divyansh; Kaushik, Abhishek; Hossale, Prashanth; Patil, Ankit U
- Source
- 2023 International Conference on Recent Advances in Science and Engineering Technology (ICRASET) Recent Advances in Science and Engineering Technology (ICRASET), 2023 International Conference on. :1-5 Nov, 2023
- Subject
- Computing and Processing
Engineering Profession
Photonics and Electrooptics
Robotics and Control Systems
Signal Processing and Analysis
Visualization
Recurrent neural networks
Feature extraction
Solids
Convolutional neural networks
Task analysis
Long short term memory
LSTM
CNN
Machine Learning
Computer Vision
- Language
A new model for video identification and explanation assignments. The model combines the benefits of long short-term memory (LSTM) networks for sequence modeling and convolutional neural networks (CNNs) for extracting features from images. The proposed model is assessed using a dataset of videos and outperforms the state-of-the-art models. The model is suitable for various applications such as video captioning, action recognition, and video retrieval. The suggested model comprises of two primary elements: a long short-term memory (LSTM) and a convolutional neural network (CNN). The CNN extracts feature from each video frame. The LSTM then uses these features to predict the next frame in the video. The model is trained on a datasetof videos that have been labelled with their corresponding actions. The model isevaluated on a test set of videos that have not been seen before. The model outperforms cutting-edge models on the test set. This study showcases the significance of recurrent convolutional structures for tasks related to visual identification and explanation.