Foundation models have made significant strides in various applications,...
Driven by large-data pre-training, Segment Anything Model (SAM) has been...
Unsupervised Domain Adaptation (UDA) aims to adapt the model trained on ...
Challenging illumination conditions (low light, underexposure and
overex...
Localizing individuals in crowds is more in accordance with the practica...
Recently, most siamese network based trackers locate targets via object
...
The Feature Pyramid Network (FPN) presents a remarkable approach to alle...
The Learnable Tree Filter presents a remarkable approach to model
struct...
Several multi-modality representation learning approaches such as LXMERT...
In this paper, we propose an anchor-free object detector with a fully
di...
This report presents our method which wins the nuScenes3D Detection Chal...
Learning effective fusion of multi-modality features is at the heart of
...