Ensuring the reliability of face recognition systems against presentatio...
In this paper, we propose a novel self-supervised motion estimator for
L...
Camera localization is a classical computer vision task that serves vari...
Creating animatable avatars from static scans requires the modeling of
c...
Active learning selects informative samples for annotation within budget...
Self-supervised audio-visual source localization aims to locate sound-so...
Masked Autoencoders learn strong visual representations and achieve
stat...
Recent years witnessed the breakthrough of face recognition with deep
co...
Visual localization is a fundamental task that regresses the 6 Degree Of...
We propose CrossHuman, a novel method that learns cross-guidance from
pa...
Vision-Language Pre-training (VLP) with large-scale image-text pairs has...
Mixed Sample Regularization (MSR), such as MixUp or CutMix, is a powerfu...
Most automatic matting methods try to separate the salient foreground fr...
Personalized image aesthetics assessment (PIAA) is challenging due to it...
Knowledge distillation (KD) shows a bright promise as a powerful
regular...
It is extremely challenging to create an animatable clothed human avatar...
Most existing CNN-based salient object detection methods can identify lo...
Multi-label learning in the presence of missing labels (MLML) is a
chall...
Referring image segmentation aims to segment a referent via a natural
li...
Several video-based 3D pose and shape estimation algorithms have been
pr...
High-precision camera re-localization technology in a pre-established 3D...
Most existing human matting algorithms tried to separate pure human-only...
This paper investigates the feasibility of learning good representation ...
Adversarial examples can deceive a deep neural network (DNN) by signific...
Accurate visual re-localization is very critical to many artificial
inte...
It is a consensus that small models perform quite poorly under the parad...
This paper presents a view-guided solution for the task of point cloud
c...
Inpainting high-resolution images with large holes challenges existing d...
Perceptual Extreme Super-Resolution for single image is extremely diffic...
In this paper, we propose a monocular visual localization pipeline lever...
Vision is often used as a complementary modality for audio speech recogn...
Unconstrained remote gaze estimation remains challenging mostly due to i...
Due to the deteriorated conditions of lack and uneven
lighting, nightti...
Feature pyramid architecture has been broadly adopted in object detectio...
Non-uniform blur, mainly caused by camera shake and motions of multiple
...
Modern machine learning suffers from catastrophic forgetting when learni...
In this paper, we study the problem of object counting with incomplete
a...
We study in this paper how to initialize the parameters of multinomial
l...
In this paper, we address the incremental classifier learning problem, w...
We study in this paper the problem of one-shot face recognition, with th...
In this paper, we design a benchmark task and provide the associated dat...
Bayesian learning is often hampered by large computational expense. As a...