Knowledge distillation (KD) exploits a large well-trained model (i.e.,
t...
We focus on the task of far-field 3D detection (Far3Det) of objects beyo...
Contemporary autonomous vehicle (AV) benchmarks have advanced techniques...
Lifelong learners must recognize concept vocabularies that evolve over t...
Shoe tread impressions are one of the most common types of evidence left...
In the real open world, data tends to follow long-tailed class distribut...
Real-world machine learning systems need to analyze novel testing data t...
Object detection with multimodal inputs can improve many safety-critical...
Monocular depth prediction is a highly underdetermined problem and recen...
A system capturing the association between video frames and textual quer...
The nematode Caenorhabditis elegans (C. elegans) serves as an important ...
Leveraging synthetically rendered data offers great potential to improve...
Computer Vision applications often require a textual grounding module wi...
We introduce multigrid Predictive Filter Flow (mgPFF), a framework for
u...
We propose a simple, interpretable framework for solving a wide range of...
To achieve parsimonious inference in per-pixel labeling tasks with a lim...
Automated facial expression analysis has a variety of applications in
hu...
Grounding textual phrases in visual content is a meaningful yet challeng...
We introduce a differentiable, end-to-end trainable framework for solvin...
Objects may appear at arbitrary scales in perspective images of a scene,...
Pooling second-order local feature statistics to form a high-dimensional...
Real-world applications could benefit from the ability to automatically
...
The challenge of object categorization in images is largely due to arbit...
We now know that mid-level features can greatly enhance the performance ...
This note presents some representative methods which are based on dictio...