Imitator: Personalized Speech-driven 3D Facial Animation

12/30/2022
by   Balamurugan Thambiraja, et al.
17

Speech-driven 3D facial animation has been widely explored, with applications in gaming, character animation, virtual reality, and telepresence systems. State-of-the-art methods deform the face topology of the target actor to sync the input audio without considering the identity-specific speaking style and facial idiosyncrasies of the target actor, thus, resulting in unrealistic and inaccurate lip movements. To address this, we present Imitator, a speech-driven facial expression synthesis method, which learns identity-specific details from a short input video and produces novel facial expressions matching the identity-specific speaking style and facial idiosyncrasies of the target actor. Specifically, we train a style-agnostic transformer on a large facial expression dataset which we use as a prior for audio-driven facial expressions. Based on this prior, we optimize for identity-specific speaking style based on a short reference video. To train the prior, we introduce a novel loss function based on detected bilabial consonants to ensure plausible lip closures and consequently improve the realism of the generated expressions. Through detailed experiments and a user study, we show that our approach produces temporally coherent facial expressions from input audio while preserving the speaking style of the target actors.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

research
09/05/2019

Neural Style-Preserving Visual Dubbing

Dubbing is a technique for translating video content from one language t...
research
05/08/2019

Capture, Learning, and Synthesis of 3D Speaking Styles

Audio-driven 3D facial animation has been widely explored, but achieving...
research
05/25/2020

Identity-Preserving Realistic Talking Face Generation

Speech-driven facial animation is useful for a variety of applications s...
research
08/28/2023

ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment

The objective of stylized speech-driven facial animation is to create an...
research
09/20/2023

FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion

Speech-driven 3D facial animation synthesis has been a challenging task ...
research
03/01/2023

Few-shots Portrait Generation with Style Enhancement and Identity Preservation

Nowadays, the wide application of virtual digital human promotes the com...
research
07/18/2023

FACTS: Facial Animation Creation using the Transfer of Styles

The ability to accurately capture and express emotions is a critical asp...

Please sign up or login with your details

Forgot password? Click here to reset