GlobalTrack: A Simple and Strong Baseline for Long-term Tracking

by   Lianghua Huang, et al.

A key capability of a long-term tracker is to search for targets in very large areas (typically the entire image) to handle possible target absences or tracking failures. However, currently there is a lack of such a strong baseline for global instance search. In this work, we aim to bridge this gap. Specifically, we propose GlobalTrack, a pure global instance search based tracker that makes no assumption on the temporal consistency of the target's positions and scales. GlobalTrack is developed based on two-stage object detectors, and it is able to perform full-image and multi-scale search of arbitrary instances with only a single query as the guide. We further propose a cross-query loss to improve the robustness of our approach against distractors. With no online learning, no punishment on position or scale changes, no scale smoothing and no trajectory refinement, our pure global instance search based tracker achieves comparable, sometimes much better performance on four large-scale tracking benchmarks (i.e., 52.1 on TLP, 60.3 compared to state-of-the-art approaches that typically require complex post-processing. More importantly, our tracker runs without cumulative errors, i.e., any type of temporary tracking failures will not affect its performance on future frames, making it ideal for long-term tracking. We hope this work will be a strong baseline for long-term tracking and will stimulate future works in this area. Code is available at


page 1

page 2


'Skimming-Perusal' Tracking: A Framework for Real-Time and Robust Long-term Tracking

Compared with traditional short-term tracking, long-term tracking poses ...

Robust Long-Term Object Tracking via Improved Discriminative Model Prediction

We propose an improved discriminative model prediction method for robust...

Tracking for Half an Hour

Long-term tracking requires extreme stability to the multitude of model ...

Target-Aware Tracking with Long-term Context Attention

Most deep trackers still follow the guidance of the siamese paradigms an...

Global Tracking via Ensemble of Local Trackers

The crux of long-term tracking lies in the difficulty of tracking the ta...

Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy

A practical long-term tracker typically contains three key properties, i...

Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network

More powerful feature representations derived from deep neural networks ...

Please sign up or login with your details

Forgot password? Click here to reset