Related papers: Exploring Dynamic Transformer for Efficient Object…

Repurposing Video Diffusion Transformers for Robust Point Tracking

Point tracking aims to localize corresponding points across video frames, serving as a fundamental task for 4D reconstruction, robotics, and video editing. Existing methods commonly rely on shallow convolutional backbones such as ResNet…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Soowon Son, Honggyu An, Chaehyun Kim, Hyunah Ko, Jisu Nam, Dahyun Chung, Siyoon Jin, Jung Yi, Jaewon Min, Junhwa Hur, Seungryong Kim

UETrack: A Unified and Efficient Framework for Single Object Tracking

With growing real-world demands, efficient tracking has received increasing attention. However, most existing methods are limited to RGB inputs and struggle in multi-modal scenarios. Moreover, current multi-modal tracking approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Ben Kang, Jie Zhao, Xin Chen, Wanting Geng, Bin Zhang, Lu Zhang, Dong Wang, Huchuan Lu

Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking

Transformer-based visual trackers have demonstrated significant advancements due to their powerful modeling capabilities. However, their practicality is limited on resource-constrained devices because of their slow processing speeds. To…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Ben Kang, Xin Chen, Jie Zhao, Chunjuan Bo, Dong Wang, Huchuan Lu

Efficient Visual Tracking with Exemplar Transformers

The design of more complex and powerful neural network models has significantly advanced the state-of-the-art in visual object tracking. These advances can be attributed to deeper networks, or the introduction of new building blocks, such…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool

Efficient Training for Visual Tracking with Deformable Transformer

Recent Transformer-based visual tracking models have showcased superior performance. Nevertheless, prior works have been resource-intensive, requiring prolonged GPU training hours and incurring high GFLOPs during inference due to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-07 Qingmao Wei, Guotian Zeng, Bi Zeng

FastTrackTr: Towards Fast Multi-Object Tracking with Transformers

Transformer-based multi-object tracking (MOT) methods have captured the attention of many researchers in recent years. However, these models often suffer from slow inference speeds due to their structure or other issues. To address this…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Pan Liao, Feng Yang, Di Wu, Jinwen Yu, Wenhui Zhao, Dingwen Zhang

TransTrack: Multiple Object Tracking with Transformer

In this work, we propose TransTrack, a simple but efficient scheme to solve the multiple object tracking problems. TransTrack leverages the transformer architecture, which is an attention-based query-key mechanism. It applies object…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Peize Sun, Jinkun Cao, Yi Jiang, Rufeng Zhang, Enze Xie, Zehuan Yuan, Changhu Wang, Ping Luo

CXTrack: Improving 3D Point Cloud Tracking with Contextual Information

3D single object tracking plays an essential role in many applications, such as autonomous driving. It remains a challenging problem due to the large appearance variation and the sparsity of points caused by occlusion and limited sensor…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang

OneTrack-M: A multitask approach to transformer-based MOT models

Multi-Object Tracking (MOT) is a critical problem in computer vision, essential for understanding how objects move and interact in videos. This field faces significant challenges such as occlusions and complex environmental dynamics,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Luiz C. S. de Araujo, Carlos M. S. Figueiredo

LiteTrack: Layer Pruning with Asynchronous Feature Extraction for Lightweight and Efficient Visual Tracking

The recent advancements in transformer-based visual trackers have led to significant progress, attributed to their strong modeling capabilities. However, as performance improves, running latency correspondingly increases, presenting a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-19 Qingmao Wei, Bi Zeng, Jianqi Liu, Li He, Guotian Zeng

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Inference speed and tracking performance are two critical evaluation metrics in the field of visual tracking. However, high-performance trackers often suffer from slow processing speeds, making them impractical for deployment on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Guijie Wang, Tong Lin, Yifan Bai, Anjia Cao, Shiyi Liang, Wangbo Zhao, Xing Wei

Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach

Existing tracking algorithms typically rely on low-frame-rate RGB cameras coupled with computationally intensive deep neural network architectures to achieve effective tracking. However, such frame-based methods inherently face challenges…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Shiao Wang, Xiao Wang, Liye Jin, Bo Jiang, Lin Zhu, Lan Chen, Yonghong Tian, Bin Luo

Adaptively Bypassing Vision Transformer Blocks for Efficient Visual Tracking

Empowered by transformer-based models, visual tracking has advanced significantly. However, the slow speed of current trackers limits their applicability on devices with constrained computational resources. To address this challenge, we…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Xiangyang Yang, Dan Zeng, Xucheng Wang, You Wu, Hengzhou Ye, Qijun Zhao, Shuiwang Li

DroTrack: High-speed Drone-based Object Tracking Under Uncertainty

We present DroTrack, a high-speed visual single-object tracking framework for drone-captured video sequences. Most of the existing object tracking methods are designed to tackle well-known challenges, such as occlusion and cluttered…

Computer Vision and Pattern Recognition · Computer Science 2020-05-05 Ali Hamdi, Flora Salim, Du Yong Kim

NetTrack: Tracking Highly Dynamic Objects with a Net

The complex dynamicity of open-world objects presents non-negligible challenges for multi-object tracking (MOT), often manifested as severe deformations, fast motion, and occlusions. Most methods that solely depend on coarse-grained object…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Guangze Zheng, Shijie Lin, Haobo Zuo, Changhong Fu, Jia Pan

SparseTT: Visual Tracking with Sparse Transformers

Transformers have been successfully applied to the visual tracking task and significantly promote tracking performance. The self-attention mechanism designed to model long-range dependencies is the key to the success of Transformers.…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Zhihong Fu, Zehua Fu, Qingjie Liu, Wenrui Cai, Yunhong Wang

DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos

This paper presents DriveTrack, a new benchmark and data generation framework for long-range keypoint tracking in real-world videos. DriveTrack is motivated by the observation that the accuracy of state-of-the-art trackers depends strongly…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 Arjun Balasingam, Joseph Chandler, Chenning Li, Zhoutong Zhang, Hari Balakrishnan

Real-Time Object Tracking via Meta-Learning: Efficient Model Adaptation and One-Shot Channel Pruning

We propose a novel meta-learning framework for real-time object tracking with efficient model adaptation and channel pruning. Given an object tracker, our framework learns to fine-tune its model parameters in only a few iterations of…

Computer Vision and Pattern Recognition · Computer Science 2019-12-05 Ilchae Jung, Kihyun You, Hyeonwoo Noh, Minsu Cho, Bohyung Han

Learning Policies for Adaptive Tracking with Deep Feature Cascades

Visual object tracking is a fundamental and time-critical vision task. Recent years have seen many shallow tracking methods based on real-time pixel-based correlation filters, as well as deep methods that have top performance but need a…

Computer Vision and Pattern Recognition · Computer Science 2017-09-15 Chen Huang, Simon Lucey, Deva Ramanan

Exploring Reliable Spatiotemporal Dependencies for Efficient Visual Tracking

Recent advances in transformer-based lightweight object tracking have established new standards across benchmarks, leveraging the global receptive field and powerful feature extraction capabilities of attention mechanisms. Despite these…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Junze Shi, Yang Yu, Jian Shi, Haibo Luo