Gestures are non-verbal but important behaviors accompanying people's sp...
In recent years, the neural implicit surface has emerged as a powerful
r...
Dynamic vision sensors or event cameras provide rich complementary
infor...
Enhancing AI systems to perform tasks following human instructions can
s...
We introduce Omni-LOS, a neural computational imaging method for conduct...
Sequential video understanding, as an emerging video understanding task,...
Commonly used backbones for semantic segmentation, such as ResNet and
Sw...
Given only a set of images, neural implicit surface representation has s...
Recent CLIP-guided 3D optimization methods, e.g., DreamFields and
PureCL...
Three-dimensional (3D) ultrasound imaging has been applied for...
Lifelong person re-identification (LReID) is in significant demand for
r...
We present ResNeRF, a novel geometry-guided two-stage framework fo...
Self-supervised monocular depth estimation refers to training a monocula...
Modeling the human body in a canonical space is a common practice for
ca...
We propose united implicit functions (UNIF), a part-based method for clo...
We present a phasorial embedding field PREF as a compact
representation ...
Counting repetitive actions is widely seen in human activities such as
...
Benefiting from the contiguous representation ability, deep implicit
fun...
In this paper, we propose a novel sequence verification task that aims t...
Deep learning-based trajectory prediction methods rely on a large amount o...
With the emergence of service robots and surveillance cameras, dynamic f...
Anomaly detection in medical images refers to the identification of abno...
Transformers have shown preferable performance on many vision tasks. How...
Co-speech gesture generation aims to synthesize a gesture sequence that no...
An Axial Shifted MLP architecture (AS-MLP) is proposed in this paper.
Di...
An LBYL (`Look Before You Leap') Network is proposed for end-to-end trai...
Existing view synthesis methods mainly focus on the perspective images a...
This paper proposes a framework for the interactive video object segment...
Almost all existing amodal segmentation methods make the inferences of
o...
We tackle human image synthesis, including human motion imitation, appea...
Spatial Description Resolution, as a language-guided localization task, ...
Anomaly detection in retinal images refers to the identification of
abnor...
This paper tackles the unsupervised depth estimation task in indoor
envi...
In this paper, we propose a learning-based approach to the task of
autom...
Almost all previous deep learning-based multi-view stereo (MVS) approach...
Choroid is the vascular layer of the eye, which is directly related to t...
With the development of convolutional neural networks, deep learning has ...
We tackle human motion imitation, appearance transfer, and novel vie...
Generative Adversarial Networks (GANs) have the capability of synthesizi...
Recently, there has been growing interest in developing learning-based
m...
Compared with single image based crowd counting, video provides the
spat...
By borrowing the wisdom of humans in gaze following, we propose a two-sta...
In this work, we demonstrate yet another approach to tackle the amodal
s...
Partial domain adaptation aims to transfer knowledge from a label-rich s...
In this paper, we present a novel framework to detect line segments in
m...
Recent progress in visual tracking has greatly improved the tracking
...
Medical image segmentation is an important step in medical image analysi...
Single-image piece-wise planar 3D reconstruction aims to simultaneously
...
A surface light field represents the radiance of rays originating from a...
Diabetic Retinopathy (DR) is a non-negligible eye disease among patients...