Current deep learning methods for talking face video generation mainly focus on the correlation between lip movements and audio content. Although these methods achieve high generation quality and good audio-visual synchronization, they ignore facial expressions in talking face videos. To address this problem, this paper proposes the Audio to Expression Network (A2ENet), an emotional talking face video generation framework based on the 3D Morphable Model (3DMM), which generates talking face videos with facial expressions in an audio-driven way. First, A2ENet uses two Transformer-based encoders to extract audio features and applies a cross-reconstruction emotion disentanglement method to decompose the audio into a latent space of content information and a latent space of emotion information; a Transformer decoder then integrates these two feature spaces. Next, the proposed method predicts 3D expression coefficients that match the emotion of the audio, and finally a renderer generates the talking face video. Through eye control parameters, A2ENet can control the eye movements of the talking face. A2ENet also associates the initial 3D expression coefficients with specific individuals to preserve the identity information of the reference face. Experimental results show that our method generates talking face videos with appropriate facial expressions, and achieves more accurate lip movements and better video quality.
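The pipeline described above (two encoders, a fusing decoder, and a coefficient head) can be sketched roughly as follows. This is a minimal illustrative sketch, not the authors' implementation: all layer sizes, layer counts, the 80-dim audio feature input, and the 64-dim expression coefficient output are assumptions for illustration only, and the cross-reconstruction disentanglement training objective and the renderer are omitted.

```python
import torch
import torch.nn as nn

class A2ENetSketch(nn.Module):
    """Rough sketch of the A2ENet forward pass described in the abstract.

    Hypothetical sizes: 80-dim audio features in, 64 expression
    coefficients out; the real model's dimensions are not specified here.
    """

    def __init__(self, audio_dim=80, d_model=128, n_exp=64):
        super().__init__()
        self.proj = nn.Linear(audio_dim, d_model)
        # Two Transformer-based encoders: one for content, one for emotion.
        self.content_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True),
            num_layers=2)
        self.emotion_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True),
            num_layers=2)
        # Transformer decoder integrates the two latent feature spaces
        # (content features as the target, emotion features as memory).
        self.fuse = nn.TransformerDecoder(
            nn.TransformerDecoderLayer(d_model, nhead=4, batch_first=True),
            num_layers=2)
        # Head predicting per-frame 3DMM expression coefficients.
        self.head = nn.Linear(d_model, n_exp)

    def forward(self, audio_feats):
        x = self.proj(audio_feats)        # (B, T, d_model)
        content = self.content_enc(x)     # content latent space
        emotion = self.emotion_enc(x)     # emotion latent space
        fused = self.fuse(content, emotion)
        return self.head(fused)           # (B, T, n_exp) expression coefficients

model = A2ENetSketch()
# Batch of 2 clips, 50 frames, 80-dim audio features per frame.
coeffs = model(torch.randn(2, 50, 80))
print(coeffs.shape)  # torch.Size([2, 50, 64])
```

In the full method, the predicted coefficients would drive a 3DMM whose rendered frames form the output video; here the sketch stops at the coefficient prediction stage.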