Humans communicate using verbal and non-verbal cues. One of the most essential elements that complements the understanding of communication is emotion. Emotion is expressed not only in words, but also in facial expressions, body language, tone of voice, and other signals. We therefore formulate emotion recognition as a multimodal task. Emotions typically unfold in a sequence along with the utterances. In recent years, RNN-based models have proven effective at modeling entire sequences and capturing long-term dependencies; however, they lack the ability to extract local key patterns and position-invariant features. Hence, we adopt a Deep Attentive Residual Disconnected RNN model, which incorporates concepts from both RNNs and CNNs to enhance the ability to capture spatial and temporal features. We train and evaluate our model on the CMU-MOSEI dataset, which comprises language, visual, and acoustic modalities. The results show that the Deep Attentive Residual Disconnected RNN outperforms the baseline. Moreover, the multimodal approach yields more robust recognition than any single modality alone.
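To make the disconnected-RNN idea concrete, the sketch below is a minimal illustration (not the authors' implementation; the module name `DisconnectedGRU` and parameters such as `window_size` are our own assumptions) of how a recurrent encoder can be restricted to a fixed local window ending at each time step, so that each position depends only on its last few inputs, giving CNN-like position-invariant local features while retaining recurrent modeling within the window.

```python
import torch
import torch.nn as nn

class DisconnectedGRU(nn.Module):
    """Run a GRU over a fixed-size window ending at each time step.

    Each output position depends only on the last `window_size` inputs,
    which mimics a CNN's local receptive field (position-invariant local
    patterns) while keeping recurrent modeling inside the window.
    """

    def __init__(self, input_size: int, hidden_size: int, window_size: int = 5):
        super().__init__()
        self.window_size = window_size
        self.gru = nn.GRU(input_size, hidden_size, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, input_size)
        batch, seq_len, _ = x.shape
        # Left-pad so every position has a full window of history.
        pad = x.new_zeros(batch, self.window_size - 1, x.size(-1))
        padded = torch.cat([pad, x], dim=1)
        # Collect the window ending at each position:
        # (batch, seq_len, window_size, input_size)
        windows = padded.unfold(1, self.window_size, 1).permute(0, 1, 3, 2)
        windows = windows.reshape(batch * seq_len, self.window_size, -1)
        # Encode each window independently and keep its final hidden state.
        _, h_n = self.gru(windows)  # h_n: (1, batch*seq_len, hidden_size)
        return h_n.squeeze(0).view(batch, seq_len, -1)

# Example: 2 utterance sequences of length 10 with 300-dim features.
feats = torch.randn(2, 10, 300)
out = DisconnectedGRU(input_size=300, hidden_size=128, window_size=5)(feats)
print(out.shape)  # torch.Size([2, 10, 128])
```

In this sketch the window size controls the trade-off between CNN-like locality (small windows) and standard RNN behavior (a window covering the whole sequence); the full model would additionally stack such layers with residual connections and attention.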