Instance segmentation in 3D is a challenging task due to the lack of
lar...
Diffusion models have achieved remarkable results in image generation, a...
Driven by recent advances AI, we passengers are entering a golden age of...
To facilitate research in the direction of fine-tuning foundation models...
Transformers are powerful visual learners, in large part due to their
co...
Semantic segmentation methods typically perform per-pixel classification...
We study how an autonomous agent learns to perform a task from demonstra...
Autonomous intelligent agents deployed to the real-world need to be robu...
We present a method for the accurate 3D reconstruction of partly-symmetr...
The field of machine learning has achieved striking progress in recent y...
The objectives of this work are cross-modal text-audio and audio-text
re...
In this work we investigate how to achieve equivariance to input
transfo...
We train embodied neural networks to plan and navigate unseen complex 3D...
Can artificial agents learn to assist others in achieving their goals wi...
We propose a method to train deep networks to decompose videos into 3D
g...
We consider the task of retrieving audio using free-form natural languag...
In the quiet backwaters of cs.CV, cs.LG and stat.ML, a cornucopia of new...
We introduce QuerYD, a new large-scale dataset for retrieval and event
l...
Peer review forms the backbone of modern scientific manuscript evaluatio...
Self-supervised learning has advanced rapidly, with several results beat...
We present a new method that learns to segment and cluster images withou...
Adapting deep networks to new concepts from few examples is extremely
ch...
We propose a fast second-order method that can be used as a drop-in
repl...
Learning through experience is time-consuming, inefficient and often bad...
We introduce a new video dataset and benchmark to assess single-object
t...
While the costs of human violence have attracted a great deal of attenti...
Convolutional Neural Networks (CNNs) are extremely efficient, since they...
The problem of arbitrary object tracking has traditionally been tackled ...
One-shot learning is usually tackled by using generative models or
discr...
The core component of most modern trackers is a discriminative classifie...