Stable diffusion, a generative model used in text-to-image synthesis,
fr...
Contrastive Language-Image Pre-training (CLIP) has significantly boosted...
Sound Event Detection (SED) aims to predict the temporal boundaries of a...
Recently, text-guided 3D generative methods have made remarkable advance...
Learning from crowds describes that the annotations of training data are...
Modern image retrieval methods typically rely on fine-tuning pre-trained...
Face anti-spoofing (FAS) is an essential mechanism for safeguarding the
...
Transparent object perception is a rapidly developing research problem i...
We propose a new formulation of temporal action detection (TAD) with
den...
3D gaze estimation is most often tackled as learning a direct mapping be...
Domain shift across crowd data severely hinders crowd counting models to...
Unsupervised person re-identification (ReID) aims to train a feature
ext...
Face Restoration (FR) aims to restore High-Quality (HQ) faces from
Low-Q...
The long-tailed distribution is a common phenomenon in the real world.
E...
Perspective distortions and crowd variations make crowd counting a
chall...
Major advancements have been made in the field of object detection and
s...
Face recognition, as one of the most successful applications in artifici...
Gait benchmarks empower the research community to train and evaluate
hig...
Face benchmarks empower the research community to train and evaluate
hig...
Learning discriminative deep feature embeddings by using million-scale
i...
This paper probes intrinsic factors behind typical failure cases (e.g.
s...
Recent deep face hallucination methods show stunning performance in
supe...
During the COVID-19 coronavirus epidemic, almost everyone wears a facial...
According to WHO statistics, there are more than 204,617,027 confirmed
C...
Spatial self-attention layers, in the form of Non-Local blocks, introduc...
Although tremendous strides have been made in uncontrolled face detectio...
Deep neural networks have been the driving force behind the success in
c...
In this paper, we contribute a new million-scale face benchmark containi...
The last few years have witnessed the great success of non-linear genera...
The label noise transition matrix T, reflecting the probabilities that t...
Deep Convolutional Neural Networks (DCNNs) are currently the method of c...
The transition matrix, denoting the transition relationship from
clean l...
Deep Convolutional Neural Networks (DCNNs) is currently the method of ch...
Generating realistic 3D faces is of high importance for computer graphic...
Though tremendous strides have been made in uncontrolled face detection,...
3D Morphable Models (3DMMs) are statistical models that represent facial...
Facial landmark localisation in images captured in-the-wild is an import...
Convolutional neural networks have significantly boosted the performance...
Robust principal component analysis (RPCA) is a powerful method for lear...
Recently proposed robust 3D face alignment methods establish either dens...
We revisit the problem of robust principal component analysis with featu...