Related papers: Towards Sequence-Level Training for Visual Trackin…

SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks

Siamese network based trackers formulate tracking as convolutional feature cross-correlation between target template and searching region. However, Siamese trackers still have accuracy gap compared with state-of-the-art algorithms and they…

Computer Vision and Pattern Recognition · Computer Science 2019-01-01 Bo Li , Wei Wu , Qiang Wang , Fangyi Zhang , Junliang Xing , Junjie Yan

End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning

We study active object tracking, where a tracker takes visual observations (i.e., frame sequences) as input and produces the corresponding camera control signals as output (e.g., move forward, turn left, etc.). Conventional methods tackle…

Computer Vision and Pattern Recognition · Computer Science 2019-02-14 Wenhan Luo , Peng Sun , Fangwei Zhong , Wei Liu , Tong Zhang , Yizhou Wang

Progressive Scaling Visual Object Tracking

In this work, we propose a progressive scaling training strategy for visual object tracking, systematically analyzing the influence of training data volume, model size, and input resolution on tracking performance. Our empirical study…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Jack Hong , Shilin Yan , Zehao Xiao , Jiayin Cai , Xiaolong Jiang , Yao Hu , Henghui Ding

End-to-end Active Object Tracking via Reinforcement Learning

We study active object tracking, where a tracker takes as input the visual observation (i.e., frame sequence) and produces the camera control signal (e.g., move forward, turn left, etc.). Conventional methods tackle the tracking and the…

Computer Vision and Pattern Recognition · Computer Science 2018-06-04 Wenhan Luo , Peng Sun , Fangwei Zhong , Wei Liu , Tong Zhang , Yizhou Wang

Unsupervised Deep Representation Learning for Real-Time Tracking

The advancement of visual tracking has continuously been brought by deep learning models. Typically, supervised learning is employed to train these models with expensive labeled data. In order to reduce the workload of manual annotations…

Computer Vision and Pattern Recognition · Computer Science 2020-07-24 Ning Wang , Wengang Zhou , Yibing Song , Chao Ma , Wei Liu , Houqiang Li

R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning

Visual single object tracking aims to continuously localize and estimate the scale of a target in subsequent video frames, given only its initial state in the first frame. This task has traditionally been framed as a template matching…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Biao Wang , Wenwen Li , Jiawei Ge

Real-time visual tracking by deep reinforced decision making

One of the major challenges of model-free visual tracking problem has been the difficulty originating from the unpredictable and drastic changes in the appearance of objects we target to track. Existing methods tackle this problem by…

Computer Vision and Pattern Recognition · Computer Science 2018-08-20 Janghoon Choi , Junseok Kwon , Kyoung Mu Lee

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking. Our recurrent convolutional network exploits the history of locations as well as the distinctive visual…

Computer Vision and Pattern Recognition · Computer Science 2016-07-21 Guanghan Ning , Zhi Zhang , Chen Huang , Zhihai He , Xiaobo Ren , Haohong Wang

Deep Reinforcement Learning for Visual Object Tracking in Videos

In this paper we introduce a fully end-to-end approach for visual tracking in videos that learns to predict the bounding box locations of a target object at every frame. An important insight is that the tracking problem can be considered as…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Da Zhang , Hamid Maei , Xin Wang , Yuan-Fang Wang

Target-Aware Deep Tracking

Existing deep trackers mainly use convolutional neural networks pre-trained for generic object recognition task for representations. Despite demonstrated successes for numerous vision tasks, the contributions of using pre-trained deep…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Xin Li , Chao Ma , Baoyuan Wu , Zhenyu He , Ming-Hsuan Yang

Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking

In this paper, we introduce a new sequence-to-sequence learning framework for RGB-based and multi-modal object tracking. First, we present SeqTrack for RGB-based tracking. It casts visual tracking as a sequence generation task, forecasting…

Computer Vision and Pattern Recognition · Computer Science 2024-03-28 Xin Chen , Ben Kang , Jiawen Zhu , Dong Wang , Houwen Peng , Huchuan Lu

Single Object Tracking Research: A Survey

Visual object tracking is an important task in computer vision, which has many real-world applications, e.g., video surveillance, visual navigation. Visual object tracking also has many challenges, e.g., object occlusion and deformation. To…

Computer Vision and Pattern Recognition · Computer Science 2022-04-26 Ruize Han , Wei Feng , Qing Guo , Qinghua Hu

Object-Adaptive LSTM Network for Real-time Visual Tracking with Adversarial Data Augmentation

In recent years, deep learning based visual tracking methods have obtained great success owing to the powerful feature representation ability of Convolutional Neural Networks (CNNs). Among these methods, classification-based tracking…

Computer Vision and Pattern Recognition · Computer Science 2020-02-10 Yihan Du , Yan Yan , Si Chen , Yang Hua

Learning Hierarchical Features for Visual Object Tracking with Recursive Neural Networks

Recently, deep learning has achieved very promising results in visual object tracking. Deep neural networks in existing tracking methods require a lot of training data to learn a large number of parameters. However, training data is not…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Li Wang , Ting Liu , Bing Wang , Xulei Yang , Gang Wang

Deep Meta Learning for Real-Time Target-Aware Visual Tracking

In this paper, we propose a novel on-line visual tracking framework based on the Siamese matching network and meta-learner network, which run at real-time speeds. Conventional deep convolutional feature-based discriminative visual tracking…

Computer Vision and Pattern Recognition · Computer Science 2019-08-19 Janghoon Choi , Junseok Kwon , Kyoung Mu Lee

Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator

In the last decade many different algorithms have been proposed to track a generic object in videos. Their execution on recent large-scale video datasets can produce a great amount of various tracking behaviours. New trends in Reinforcement…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Matteo Dunnhofer , Niki Martinel , Gian Luca Foresti , Christian Micheloni

Progressive Multi-Stage Learning for Discriminative Tracking

Visual tracking is typically solved as a discriminative learning problem that usually requires high-quality samples for online model adaptation. It is a critical and challenging problem to evaluate the training samples collected from…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Weichao Li , Xi Li , Omar Elfarouk Bourahla , Fuxian Huang , Fei Wu , Wei Liu , Zhiheng Wang , Hongmin Liu

An In-Depth Analysis of Visual Tracking with Siamese Neural Networks

This survey presents a deep analysis of the learning and inference capabilities in nine popular trackers. It is neither intended to study the whole literature nor is it an attempt to review all kinds of neural networks proposed for visual…

Computer Vision and Pattern Recognition · Computer Science 2018-08-03 Roman Pflugfelder

Learning the Model Update for Siamese Trackers

Siamese approaches address the visual tracking problem by extracting an appearance template from the current frame, which is used to localize the target in the next frame. In general, this template is linearly combined with the accumulated…

Computer Vision and Pattern Recognition · Computer Science 2019-09-09 Lichao Zhang , Abel Gonzalez-Garcia , Joost van de Weijer , Martin Danelljan , Fahad Shahbaz Khan

Beyond Traditional Single Object Tracking: A Survey

Single object tracking is a vital task of many applications in critical fields. However, it is still considered one of the most challenging vision tasks. In recent years, computer vision, especially object tracking, witnessed the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-20 Omar Abdelaziz , Mohamed Shehata , Mohamed Mohamed