We present DySample, an ultra-lightweight and effective dynamic upsample...
We show that crowd counting can be viewed as a decomposable point queryi...
We study the problem of synthesizing a long-term dynamic video from only...
Depth estimation aims to predict dense depth maps. In autonomous driving...
We present the All-Seeing (AS) project: a large-scale data and model for...
Video stabilization refers to the problem of transforming a shaky video ...
Learning-based multi-view stereo (MVS) methods deal with predicting accu...
Video depth estimation aims to infer temporally consistent depth. Some
m...
Conditional spatial queries are recently introduced into DEtection
TRans...
We introduce the notion of point affiliation into feature upsampling. By...
We consider the problem of realistic bokeh rendering from a single
all-i...
We introduce Probabilistic Coordinate Fields (PCFs), a novel
geometric-i...
There is a long-standing problem of repeated patterns in correspondence
...
Class-agnostic counting (CAC) aims to count objects of interest from a q...
All-in-Focus (AIF) photography is expected to be a commercial selling po...
3D interacting hand pose estimation from a single RGB image is a challen...
Real-time eyeblink detection in the wild can widely serve for fatigue
de...
Correspondence pruning aims to search consistent correspondences (inlier...
We present 3D Cinemagraphy, a new technique that marries 2D image animat...
Automatic image cropping algorithms aim to recompose images like human-b...
We study the composition style in deep image matting, a notion that
char...
We study the problem of novel view synthesis of objects from a single im...
We introduce point affiliation into feature upsampling, a notion that
de...
Neural Radiance Field (NeRF) and its variants have exhibited great succe...
Generative adversarial networks (GANs) have been trained to be professio...
Temporal consistency is the key challenge of video depth estimation. Pre...
We consider the problem of task-agnostic feature upsampling in dense
pre...
Learning accurate object detectors often requires large-scale training d...
Partial occlusion effects are a phenomenon that blurry objects near a ca...
We introduce a 3D instance representation, termed instance kernels, wher...
We propose BokehMe, a hybrid bokeh rendering framework that marries a ne...
Class-agnostic counting (CAC) aims to count all instances in a query ima...
This paper reviews the second AIM realistic bokeh effect rendering chall...
This paper focuses on developing efficient and robust evaluation metrics...
We present a simple yet effective method for 3D correspondence grouping....
We formulate counting as a sequential decision problem and present a nov...
Face verification can be regarded as a 2-class fine-grained visual
recog...
Towards 3D object tracking in point clouds, a novel point-to-box network...
To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) ...
In this work, we study how well different type of approaches generalise ...
The local reference frame (LRF) acts as a critical role in 3D local shap...
Visual counting, a task that aims to estimate the number of objects from...
Point cloud analysis is a basic task in 3D computer vision, which attrac...
Matching corresponding features between two images is a fundamental task...
For 3D hand and body pose estimation task in depth image, a novel
anchor...
Visual counting, a task that predicts the number of objects from an
imag...
Correspondence selection aiming at seeking correct feature correspondenc...
This paper presents a simple yet very effective data-driven approach to ...
Feature correspondence selection is pivotal to many feature-matching bas...
Effective and real-time eyeblink detection is of wide-range applications...