Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis

07/24/2022
by Shuai Shen, et al.

Talking head synthesis is an emerging technology with wide applications in film dubbing, virtual avatars, and online education. Recent NeRF-based methods generate more natural talking videos because they better capture the 3D structure of faces. However, they require a separate model to be trained for each identity on a large dataset. In this paper, we propose Dynamic Facial Radiance Fields (DFRF) for few-shot talking head synthesis, which can rapidly generalize to an unseen identity with little training data. Unlike existing NeRF-based methods, which directly encode the 3D geometry and appearance of a specific person into the network, DFRF conditions the face radiance field on 2D appearance images to learn a face prior. The facial radiance field can thus be flexibly adapted to a new identity with only a few reference images. Additionally, to better model facial deformations, we propose a differentiable face warping module conditioned on audio signals that deforms all reference images to the query space. Extensive experiments show that, given a training clip of only tens of seconds, DFRF can synthesize natural, high-quality audio-driven talking head videos for novel identities within only 40k iterations. We highly recommend that readers view our supplementary video for intuitive comparisons. Code is available at https://sstzal.github.io/DFRF/.
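To make the idea of conditioning on 2D reference images more concrete, the following is a minimal sketch of one plausible reading of the abstract, not the authors' implementation. It projects a 3D query point into a reference image with a pinhole camera, shifts the projected pixel by a 2D offset, and bilinearly samples the image at the shifted location. The function names, the pinhole model, and the `offset` argument are all assumptions made for illustration; in the paper's setting the offset would presumably come from the audio-conditioned warping module, whereas here it is simply passed in by the caller.

```python
import math


def project(point, focal, cx, cy):
    """Pinhole projection of a camera-space 3D point (x, y, z) to pixel coordinates.

    Assumes z > 0 and a single focal length for both axes (illustrative only).
    """
    x, y, z = point
    return (focal * x / z + cx, focal * y / z + cy)


def bilinear_sample(image, u, v):
    """Bilinearly sample a 2D grid (list of rows) at continuous pixel (u, v).

    Coordinates outside the grid are clamped to the border.
    """
    h, w = len(image), len(image[0])
    u = min(max(u, 0.0), w - 1.0)
    v = min(max(v, 0.0), h - 1.0)
    x0, y0 = int(math.floor(u)), int(math.floor(v))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = u - x0, v - y0
    top = image[y0][x0] * (1 - fx) + image[y0][x1] * fx
    bottom = image[y1][x0] * (1 - fx) + image[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


def warped_reference_feature(point, image, focal, cx, cy, offset=(0.0, 0.0)):
    """Sample a reference image at the projection of `point`, shifted by `offset`.

    `offset` is a hypothetical stand-in for the output of a learned,
    audio-conditioned warping module; it is not computed here.
    """
    u, v = project(point, focal, cx, cy)
    return bilinear_sample(image, u + offset[0], v + offset[1])


if __name__ == "__main__":
    reference = [[0.0, 1.0],
                 [2.0, 3.0]]  # toy 2x2 single-channel "image"
    query = (0.0, 0.0, 1.0)   # 3D point on the optical axis
    print(warped_reference_feature(query, reference, 1.0, 0.5, 0.5))
    print(warped_reference_feature(query, reference, 1.0, 0.5, 0.5, offset=(0.5, 0.0)))
```

Because bilinear sampling is piecewise linear in the pixel coordinates, a sampling step of this kind admits gradients with respect to the offset almost everywhere, which is one way a warping module could be trained end to end.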


