Tracking Without Re-recognition in Humans and Machines

05/27/2021
by Drew Linsley, et al.

Imagine trying to track one particular fruit fly in a swarm of hundreds. Higher biological visual systems have evolved to track moving objects by relying on both appearance and motion features. We investigate whether state-of-the-art deep neural networks for visual tracking are capable of the same. To this end, we introduce PathTracker, a synthetic visual challenge that asks human observers and machines to track a target object in the midst of identical-looking "distractor" objects. While humans effortlessly learn PathTracker and generalize to systematic variations in task design, state-of-the-art deep networks struggle. To address this limitation, we identify and model circuit mechanisms in biological brains that are implicated in tracking objects based on motion cues. When instantiated as a recurrent network, our circuit model learns to solve PathTracker with a robust visual strategy that rivals human performance and explains a significant proportion of human decision-making on the challenge. We also show that the success of this circuit model extends to object tracking in natural videos. Adding it to a transformer-based architecture for object tracking builds tolerance to visual nuisances that affect object appearance, resulting in new state-of-the-art performance on the large-scale TrackingNet object tracking challenge. Our work highlights the importance of building artificial vision models that can help us better understand human vision and improve computer vision.


