Related papers: ACTrack: Adding Spatio-Temporal Condition for Visu…

TCTrack: Temporal Contexts for Aerial Tracking

Temporal contexts among consecutive frames are far from being fully utilized in existing visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit temporal contexts for aerial tracking. The temporal…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu

Towards Real-World Visual Tracking with Temporal Contexts

Visual tracking has made significant improvements in the past few decades. Most existing state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking the real-world conditions; 2) adopt the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu

ATSTrack: Enhancing Visual-Language Tracking by Aligning Temporal and Spatial Scales

A main challenge of Visual-Language Tracking (VLT) is the misalignment between visual inputs and language descriptions caused by target movement. Previous trackers have explored many effective feature modification methods to preserve more…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Yihao Zhen, Qiang Wang, Yu Qiao, Liangqiong Qu, Huijie Fan

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking

Vision-language tracking aims to locate the target object in the video sequence using a template patch and a language description provided in the initial frame. To achieve robust tracking, especially in complex long-term scenarios that…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 X. Feng, S. Hu, X. Li, D. Zhang, M. Wu, J. Zhang, X. Chen, K. Huang

AttTrack: Online Deep Attention Transfer for Multi-object Tracking

Multi-object tracking (MOT) is a vital component of intelligent video analytics applications such as surveillance and autonomous driving. The time and storage complexity required to execute deep learning models for visual object tracking…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Keivan Nalaie, Rong Zheng

ProContEXT: Exploring Progressive Context Transformer for Tracking

Existing Visual Object Tracking (VOT) only takes the target area in the first frame as a template. This causes tracking to inevitably fail in fast-changing and crowded scenes, as it cannot account for changes in object appearance between…

Computer Vision and Pattern Recognition · Computer Science 2023-03-31 Jin-Peng Lan, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Bin Luo, Xu Bao, Wangmeng Xiang, Yifeng Geng, Xuansong Xie

TRAT: Tracking by Attention Using Spatio-Temporal Features

Robust object tracking requires knowledge of tracked objects' appearance, motion and their evolution over time. Although motion provides distinctive and complementary information especially for fast moving objects, most of the recent…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Hasan Saribas, Hakan Cevikalp, Okan Köpüklü, Bedirhan Uzun

Temporal Adaptive RGBT Tracking with Modality Prompt

RGBT tracking has been widely used in various fields such as robotics, surveillance processing, and autonomous driving. Existing RGBT trackers fully explore the spatial information between the template and the search region and locate the…

Computer Vision and Pattern Recognition · Computer Science 2024-01-03 Hongyu Wang, Xiaotao Liu, Yifan Li, Meng Sun, Dian Yuan, Jing Liu

Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking

Transformer-based models have improved visual tracking, but most still cannot run in real time on resource-limited devices, especially for unmanned aerial vehicle (UAV) tracking. To achieve a better balance between performance and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-18 You Wu, Yongxin Li, Mengyuan Liu, Xucheng Wang, Xiangyang Yang, Hengzhou Ye, Dan Zeng, Qijun Zhao, Shuiwang Li

ODTrack: Online Dense Temporal Token Learning for Visual Tracking

Online contextual reasoning and association across consecutive video frames are critical to perceive instances in visual tracking. However, most current top-performing trackers persistently lean on sparse temporal relationships between…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Yaozong Zheng, Bineng Zhong, Qihua Liang, Zhiyi Mo, Shengping Zhang, Xianxian Li

BACTrack: Building Appearance Collection for Aerial Tracking

Siamese network-based trackers have shown remarkable success in aerial tracking. Most previous works, however, usually perform template matching only between the initial template and the search region and thus fail to deal with rapidly…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Xincong Liu, Tingfa Xu, Ying Wang, Zhinong Yu, Xiaoying Yuan, Haolin Qin, Jianan Li

Enforcing Template Representability and Temporal Consistency for Adaptive Sparse Tracking

Sparse representation has been widely studied in visual tracking, which has shown promising tracking performance. Despite a lot of progress, the visual tracking problem is still a challenging task due to appearance variations over time. In…

Computer Vision and Pattern Recognition · Computer Science 2016-05-03 Xue Yang, Fei Han, Hua Wang, Hao Zhang

Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers

The rich spatio-temporal information is crucial to capture the complicated target appearance variations in visual tracking. However, most top-performing tracking algorithms rely on many hand-crafted components for spatio-temporal…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Jinxia Xie, Bineng Zhong, Zhiyi Mo, Shengping Zhang, Liangtao Shi, Shuxiang Song, Rongrong Ji

An Efficient Token Compression Framework for Visual Object Tracking

Refining visual representations by eliminating their internal feature-level redundancy is crucial for simultaneously optimizing the performance and computational cost of models in visual tracking. To enhance their performance, many…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Weijing Wu, Qihua Liang, Bineng Zhong, Haiying Xia, Zhiyi Mo, Shuxiang Song

Explicit Visual Prompts for Visual Object Tracking

How to effectively exploit spatio-temporal information is crucial to capture target appearance changes in visual tracking. However, most deep learning-based trackers mainly focus on designing a complicated appearance model or template…

Computer Vision and Pattern Recognition · Computer Science 2024-01-09 Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Li

A Spatio-temporal Track Association Algorithm Based on Marine Vessel Automatic Identification System Data

Tracking multiple moving objects in real time in a dynamic threat environment is an important element of national security and surveillance systems. It helps pinpoint and distinguish potential candidates posing threats from other normal…

Machine Learning · Computer Science 2022-06-27 Imtiaz Ahmed, Mikyoung Jun, Yu Ding

Less is More: Token Context-aware Learning for Object Tracking

Recently, several studies have shown that utilizing contextual information to perceive target states is crucial for object tracking. They typically capture context by incorporating multiple video frames. However, these naive frame-context…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Chenlong Xu, Bineng Zhong, Qihua Liang, Yaozong Zheng, Guorong Li, Shuxiang Song

CXTrack: Improving 3D Point Cloud Tracking with Contextual Information

3D single object tracking plays an essential role in many applications, such as autonomous driving. It remains a challenging problem due to the large appearance variation and the sparsity of points caused by occlusion and limited sensor…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang

Fast Tracking via Spatio-Temporal Context Learning

In this paper, we present a simple yet fast and robust algorithm which exploits the spatio-temporal context for visual tracking. Our approach formulates the spatio-temporal relationships between the object of interest and its local context…

Computer Vision and Pattern Recognition · Computer Science 2013-11-11 Kaihua Zhang, Lei Zhang, Ming-Hsuan Yang, David Zhang

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

Boosting performance of the offline trained siamese trackers is getting harder nowadays since the fixed information of the template cropped from the first frame has been almost thoroughly mined, but they are poorly capable of resisting…

Computer Vision and Pattern Recognition · Computer Science 2021-04-05 Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang