Related papers: Efficient Visual Tracking with Exemplar Transforme…

Efficient Visual Tracking via Hierarchical Cross-Attention Transformer

In recent years, target tracking has made great progress in accuracy. This development is mainly attributed to powerful networks (such as transformers) and additional modules (such as online update and refinement modules). However, less…

Computer Vision and Pattern Recognition · Computer Science 2022-11-01 Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu

Exploring Dynamic Transformer for Efficient Object Tracking

The speed-precision trade-off is a critical problem for visual object tracking which usually requires low latency and deployment on constrained resources. Existing solutions for efficient tracking mainly focus on adopting light-weight…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Jiawen Zhu, Xin Chen, Haiwen Diao, Shuai Li, Jun-Yan He, Chenyang Li, Bin Luo, Dong Wang, Huchuan Lu

An Efficient Token Compression Framework for Visual Object Tracking

Refining visual representations by eliminating their internal feature-level redundancy is crucial for simultaneously optimizing the performance and computational cost of models in visual tracking. To enhance their performance, many…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Weijing Wu, Qihua Liang, Bineng Zhong, Haiying Xia, Zhiyi Mo, Shuxiang Song

Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking

Transformer-based visual trackers have demonstrated significant progress owing to their superior modeling capabilities. However, existing trackers are hampered by low speed, limiting their applicability on devices with limited computational…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Ben Kang, Xin Chen, Dong Wang, Houwen Peng, Huchuan Lu

Efficient Training for Visual Tracking with Deformable Transformer

Recent Transformer-based visual tracking models have showcased superior performance. Nevertheless, prior works have been resource-intensive, requiring prolonged GPU training hours and incurring high GFLOPs during inference due to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-07 Qingmao Wei, Guotian Zeng, Bi Zeng

UETrack: A Unified and Efficient Framework for Single Object Tracking

With growing real-world demands, efficient tracking has received increasing attention. However, most existing methods are limited to RGB inputs and struggle in multi-modal scenarios. Moreover, current multi-modal tracking approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Ben Kang, Jie Zhao, Xin Chen, Wanting Geng, Bin Zhang, Lu Zhang, Dong Wang, Huchuan Lu

LiteTrack: Layer Pruning with Asynchronous Feature Extraction for Lightweight and Efficient Visual Tracking

The recent advancements in transformer-based visual trackers have led to significant progress, attributed to their strong modeling capabilities. However, as performance improves, running latency correspondingly increases, presenting a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-19 Qingmao Wei, Bi Zeng, Jianqi Liu, Li He, Guotian Zeng

Adaptively Bypassing Vision Transformer Blocks for Efficient Visual Tracking

Empowered by transformer-based models, visual tracking has advanced significantly. However, the slow speed of current trackers limits their applicability on devices with constrained computational resources. To address this challenge, we…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Xiangyang Yang, Dan Zeng, Xucheng Wang, You Wu, Hengzhou Ye, Qijun Zhao, Shuiwang Li

Transformer Tracking with Cyclic Shifting Window Attention

Transformer architecture has been showing its great strength in visual object tracking, for its effective attention mechanism. Existing transformer-based approaches adopt the pixel-to-pixel attention strategy on flattened image features and…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Zikai Song, Junqing Yu, Yi-Ping Phoebe Chen, Wei Yang

Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach

Existing tracking algorithms typically rely on low-frame-rate RGB cameras coupled with computationally intensive deep neural network architectures to achieve effective tracking. However, such frame-based methods inherently face challenges…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Shiao Wang, Xiao Wang, Liye Jin, Bo Jiang, Lin Zhu, Lan Chen, Yonghong Tian, Bin Luo

Compact Transformer Tracker with Correlative Masked Modeling

Transformer framework has been showing superior performances in visual object tracking for its great strength in information aggregation across the template and search image with the well-known attention mechanism. Most recent advances…

Computer Vision and Pattern Recognition · Computer Science 2023-01-27 Zikai Song, Run Luo, Junqing Yu, Yi-Ping Phoebe Chen, Wei Yang

Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer

We propose a light-weight and highly efficient Joint Detection and Tracking pipeline for the task of Multi-Object Tracking using a fully-transformer architecture. It is a modified version of TransTrack, which overcomes the computational…

Computer Vision and Pattern Recognition · Computer Science 2022-11-11 Siddharth Sagar Nijhawan, Leo Hoshikawa, Atsushi Irie, Masakazu Yoshimura, Junji Otsuka, Takeshi Ohashi

Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking

Transformer-based visual trackers have demonstrated significant advancements due to their powerful modeling capabilities. However, their practicality is limited on resource-constrained devices because of their slow processing speeds. To…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Ben Kang, Xin Chen, Jie Zhao, Chunjuan Bo, Dong Wang, Huchuan Lu

AiATrack: Attention in Attention for Transformer Visual Tracking

Transformer trackers have achieved impressive advancements recently, where the attention mechanism plays an important role. However, the independent correlation computation in the attention mechanism could result in noisy and ambiguous…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan

Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation and One-Shot Channel Pruning

We propose a novel meta-learning framework for real-time object tracking with efficient model adaptation and channel pruning. Given an object tracker, our framework learns to fine-tune its model parameters in only a few iterations of…

Computer Vision and Pattern Recognition · Computer Science 2019-12-05 Ilchae Jung, Kihyun You, Hyeonwoo Noh, Minsu Cho, Bohyung Han

SparseTT: Visual Tracking with Sparse Transformers

Transformers have been successfully applied to the visual tracking task and significantly promote tracking performance. The self-attention mechanism designed to model long-range dependencies is the key to the success of Transformers.…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Zhihong Fu, Zehua Fu, Qingjie Liu, Wenrui Cai, Yunhong Wang

OneTrack-M: A multitask approach to transformer-based MOT models

Multi-Object Tracking (MOT) is a critical problem in computer vision, essential for understanding how objects move and interact in videos. This field faces significant challenges such as occlusions and complex environmental dynamics,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Luiz C. S. de Araujo, Carlos M. S. Figueiredo

Real-time 3D Single Object Tracking with Transformer

LiDAR-based 3D single object tracking is a challenging issue in robotics and autonomous driving. Currently, existing approaches usually suffer from the problem that objects at long distance often have very sparse or partially-occluded point…

Computer Vision and Pattern Recognition · Computer Science 2022-09-05 Jiayao Shan, Sifan Zhou, Yubo Cui, Zheng Fang

Towards Real-World Visual Tracking with Temporal Contexts

Visual tracking has made significant improvements in the past few decades. Most existing state-of-the-art trackers 1) merely aim for performance in ideal conditions while overlooking the real-world conditions; 2) adopt the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu

Explicit Visual Prompts for Visual Object Tracking

How to effectively exploit spatio-temporal information is crucial to capture target appearance changes in visual tracking. However, most deep learning-based trackers mainly focus on designing a complicated appearance model or template…

Computer Vision and Pattern Recognition · Computer Science 2024-01-09 Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Li