Target-Aware Tracking with Long-term Context Attention

by Kaijie He, et al.

Most deep trackers still follow the Siamese paradigm and use a template that contains only the target, without any contextual information. This makes it difficult for the tracker to cope with large appearance changes, rapid target movement, and distraction by similar objects. To alleviate this problem, we propose a long-term context attention (LCA) module that performs extensive information fusion on the target and its context from long-term frames, and calculates the target correlation while enhancing target features. The complete contextual information contains the location of the target as well as the state around the target. LCA uses the target state from the previous frame to exclude the interference of similar objects and complex backgrounds, thus accurately locating the target and giving the tracker higher robustness and regression accuracy. By embedding the LCA module in a Transformer, we build a powerful online tracker with a target-aware backbone, termed TATrack. In addition, we propose a dynamic online update algorithm based on the classification confidence of historical information, which adds no extra computational burden. Our tracker achieves state-of-the-art performance on multiple benchmarks, with 71.1% AUC, 89.3% NP, and 73.0% AO on LaSOT, TrackingNet, and GOT-10k, respectively. The code and trained models are available on
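
The abstract does not spell out the update rule, so the following is only a minimal sketch of one way a confidence-based online template update could work, not the authors' algorithm. It assumes the tracker already produces a classification confidence for every frame, so reusing that score costs no extra network pass. The class name `ConfidenceGatedTemplate` and the `interval` and `threshold` parameters are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ConfidenceGatedTemplate:
    """Holds one online template and refreshes it from high-confidence frames.

    Hypothetical illustration only; not taken from the TATrack code.
    """
    template: Any                  # current online template (e.g. an image crop)
    interval: int = 5              # assumed: number of frames per update window
    threshold: float = 0.5         # assumed: minimum confidence to accept an update
    _best_score: float = -1.0
    _best_candidate: Optional[Any] = None
    _frames_seen: int = 0

    def observe(self, candidate: Any, score: float) -> bool:
        """Record one tracked frame; return True if the template was replaced."""
        self._frames_seen += 1
        # Remember the most confident frame seen in the current window.
        if score > self._best_score:
            self._best_score = score
            self._best_candidate = candidate
        if self._frames_seen < self.interval:
            return False
        # End of window: update only if the best frame is confident enough.
        updated = self._best_score >= self.threshold
        if updated:
            self.template = self._best_candidate
        # Start a fresh window either way.
        self._best_score = -1.0
        self._best_candidate = None
        self._frames_seen = 0
        return updated
```

In use, a tracker loop would call `observe(crop, confidence)` once per frame with the confidence it has already computed. Windows in which every frame scores below `threshold` leave the template untouched, which is one way to avoid updating on occluded or distractor frames.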


Multi-Template Temporal Siamese Network for Long-Term Object Tracking

Siamese Networks are one of the most popular visual object tracking methods ...

DAL – A Deep Depth-aware Long-term Tracker

The best RGBD trackers provide high accuracy but are slow to run. On the...

High-Performance Long-Term Tracking with Meta-Updater

Long-term visual tracking has drawn increasing attention because it is m...

Learning regression and verification networks for long-term visual tracking

In the long-term single object tracking task, the target moves out of vi...

GlobalTrack: A Simple and Strong Baseline for Long-term Tracking

A key capability of a long-term tracker is to search for targets in very...

MixFormer: End-to-End Tracking with Iterative Mixed Attention

Tracking often uses a multi-stage pipeline of feature extraction, target...

ConTrack: Contextual Transformer for Device Tracking in X-ray

Device tracking is an important prerequisite for guidance during endovas...
