We address the task of weakly-supervised few-shot image classification a...
Learning to predict reliable characteristic orientations of 3D point clo...
Recent state-of-the-art methods for HOI detection typically build on
tra...
Scene graph generation aims to construct a semantic graph structure from...
Extracting discriminative local features that are invariant to imaging
v...
Despite recent advances in implicit neural representations (INRs), it re...
Metric learning aims to build a distance metric typically by learning an...
Temporal Action Localization (TAL) methods typically operate on top of
f...
We address the problem of generating a sequence of LEGO brick assembly w...
Despite the extensive adoption of machine learning on the task of visual...
Knowledge distillation is the procedure of transferring "knowledge" from...
We study the problem of learning to assign a characteristic pose, i.e., ...
Human vision possesses a special type of visual processing systems calle...
Although autoregressive models have achieved promising results on image
...
The task of predicting future actions from a video is crucial for a
real...
Assembling parts into an object is a combinatorial problem that arises i...
Establishing correspondences between images remains a challenging task,
...
Randomized smoothing (RS) has been shown to be a fast, scalable techniqu...
We present a novel self-taught framework for unsupervised metric learnin...
Detecting robust keypoints from an image is an integral part of many com...
Group activity recognition is the task of understanding the activity
con...
The inherent challenge of detecting symmetries stems from arbitrary
orie...
We introduce the integrative task of few-shot classification and segment...
For autoregressive (AR) modeling of high-resolution images, vector
quant...
Private inference (PI) enables inference directly on cryptographically s...
Consistency regularization on label predictions becomes a fundamental
te...
The recent success of neural networks enables a better interpretation of...
Semi-supervised domain adaptation (SSDA) is to adapt a learner to a new
...
Convolution has been arguably the most important feature transform for m...
Conditional Generative Adversarial Networks (cGAN) generate realistic im...
Discovering a solution in a combinatorial space is prevalent in many
rea...
The paradigm of differentiable programming has significantly enhanced th...
Despite advances in feature representation, leveraging geometric relatio...
Point cloud registration is the task of estimating the rigid transformat...
In this work, we propose a camera self-calibration algorithm for generic...
The task of reflection symmetry detection remains challenging due to
sig...
We propose to address the problem of few-shot classification by meta-lea...
The emergence of deep learning has been accompanied by privacy concerns
...
Few-shot semantic segmentation aims at learning to segment a target obje...
Despite advances in feature representation, leveraging geometric relatio...
This paper presents a novel method for embedding transfer, a task of
tra...
Spatio-temporal convolution often fails to learn motion dynamics in vide...
Bayesian optimization is a popular method for solving the problem of glo...
We present a novel discriminator for GANs that improves realness and
div...
Mutual learning is an ensemble training strategy to improve generalizati...
Feature representation plays a crucial role in visual correspondence, an...
Motion plays a crucial role in understanding videos and most state-of-th...
Most current action recognition methods heavily rely on appearance
infor...
In this paper, we study two important problems in the automated design o...
Deep neural networks are often highly overparameterized, prohibiting the...