Object-Centric Multiple Object Tracking

09/01/2023
by Zixu Zhao, et al.

Unsupervised object-centric learning methods partition scenes into entities without additional localization information, which makes them excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines. Unfortunately, they lack two key properties: objects are often split into parts, and they are not consistently tracked over time. In fact, state-of-the-art models achieve pixel-level accuracy and temporal consistency by relying on supervised object detection with additional ID labels for association through time. This paper proposes a video object-centric model for MOT. It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module that builds complete object prototypes to handle occlusions. Benefiting from object-centric learning, we require only sparse detection labels (0 ... feature binding. Relying on our self-supervised Expectation-Maximization-inspired loss for object association, our approach requires no ID labels. Our experiments show that the approach significantly narrows the gap between the existing object-centric model and the fully supervised state of the art, and that it outperforms several unsupervised trackers.
