Convolution-based and Transformer-based vision backbone networks process...
In this technical report, we briefly introduce our solution for the
Zero...
Image harmonization aims to solve the visual inconsistency problem in
co...
In this paper, we focus on a recently proposed novel task called Audio-V...
Multimodal-driven talking face generation refers to animating a portrait...
Recently, emotional talking face generation has received considerable
at...
Most of the existing blind image Super-Resolution (SR) methods assume th...
2D-based Industrial Anomaly Detection has been widely discussed, however...
Few Shot Instance Segmentation (FSIS) requires models to detect and segm...
In this paper, we focus on exploring effective methods for faster, accur...
Motivated by biological evolution, this paper explains the rationality o...
This paper presents a novel Region-Aware Face Swapping (RAFSwap) network...
Density-based and classification-based methods have ruled unsupervised
a...
In the practical application of restoring low-resolution gray-scale imag...
Inspired by biological evolution, we explain the rationality of Vision
T...
Audio-guided face reenactment aims to generate a photorealistic face tha...
This paper presents a novel end-to-end dynamic time-lapse video generati...
Recent works in the person re-identification task mainly focus on the mo...
Audio-guided face reenactment aims at generating photorealistic faces us...
Unsupervised learning of optical flow, which leverages the supervision f...
Recent works have shown how realistic talking face images can be obtaine...
Recent face reenactment studies have achieved remarkable success either
...