Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking

11/20/2019
by   Eric Crawford, et al.
14

The ability to detect and track objects in the visual world is a crucial skill for any intelligent agent, as it is a necessary precursor to any object-level reasoning process. Moreover, it is important that agents learn to track objects without supervision (i.e. without access to annotated training videos) since this will allow agents to begin operating in new environments with minimal human assistance. The task of learning to discover and track objects in videos, which we call unsupervised object tracking, has grown in prominence in recent years; however, most architectures that address it still struggle to deal with large scenes containing many objects. In the current work, we propose an architecture that scales well to the large-scene, many-object setting by employing spatially invariant computations (convolutions and spatial attention) and representations (a spatially local object specification scheme). In a series of experiments, we demonstrate a number of attractive features of our architecture; most notably, that it outperforms competing methods at tracking objects in cluttered scenes with many objects, and that it can generalize well to videos that are larger and/or contain more objects than videos encountered during training.

READ FULL TEXT

page 13

page 14

research
10/21/2020

RigidFusion: Robot Localisation and Mapping in Environments with Large Dynamic Rigid Objects

This work presents a novel approach to simultaneously track a robot with...
research
08/06/2023

InterTracker: Discovering and Tracking General Objects Interacting with Hands in the Wild

Understanding human interaction with objects is an important research to...
research
05/27/2021

Tracking Without Re-recognition in Humans and Machines

Imagine trying to track one particular fruitfly in a swarm of hundreds. ...
research
09/30/2021

The Challenge of Appearance-Free Object Tracking with Feedforward Neural Networks

Nearly all models for object tracking with artificial neural networks de...
research
03/06/2023

Referring Multi-Object Tracking

Existing referring understanding tasks tend to involve the detection of ...
research
10/11/2012

Unsupervised Detection and Tracking of Arbitrary Objects with Dependent Dirichlet Process Mixtures

This paper proposes a technique for the unsupervised detection and track...
research
11/16/2016

Unsupervised Learning of Important Objects from First-Person Videos

A first-person camera, placed at a person's head, captures, which object...

Please sign up or login with your details

Forgot password? Click here to reset