Related papers: Global Tracking Transformers

End-to-End Multi-Object Tracking with Global Response Map

Most existing Multi-Object Tracking (MOT) approaches follow the Tracking-by-Detection paradigm and the data association framework where objects are firstly detected and then associated. Although deep-learning based method can noticeably…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Xingyu Wan , Jiakai Cao , Sanping Zhou , Jinjun Wang

Tracking Transforming Objects: A Benchmark

Tracking transforming objects holds significant importance in various fields due to the dynamic nature of many real-world scenarios. By enabling systems accurately represent transforming objects over time, tracking transforming objects…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 You Wu , Yuelong Wang , Yaxin Liao , Fuliang Wu , Hengzhou Ye , Shuiwang Li

TransTrack: Multiple Object Tracking with Transformer

In this work, we propose TransTrack, a simple but efficient scheme to solve the multiple object tracking problems. TransTrack leverages the transformer architecture, which is an attention-based query-key mechanism. It applies object…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Peize Sun , Jinkun Cao , Yi Jiang , Rufeng Zhang , Enze Xie , Zehuan Yuan , Changhu Wang , Ping Luo

TrackFormer: Multi-Object Tracking with Transformers

The challenging task of multi-object tracking (MOT) requires simultaneous reasoning about track initialization, identity, and spatio-temporal trajectories. We formulate this task as a frame-to-frame set prediction problem and introduce…

Computer Vision and Pattern Recognition · Computer Science 2022-05-02 Tim Meinhardt , Alexander Kirillov , Laura Leal-Taixe , Christoph Feichtenhofer

FastTracker: Real-Time and Accurate Visual Tracking

Conventional multi-object tracking (MOT) systems are predominantly designed for pedestrian tracking and often exhibit limited generalization to other object categories. This paper presents a generalized tracking framework capable of…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Hamidreza Hashempoor , Yu Dong Hwang

TAPTR: Tracking Any Point with Transformers as Detection

In this paper, we propose a simple and strong framework for Tracking Any Point with TRansformers (TAPTR). Based on the observation that point tracking bears a great resemblance to object detection and tracking, we borrow designs from…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Hongyang Li , Hao Zhang , Shilong Liu , Zhaoyang Zeng , Tianhe Ren , Feng Li , Lei Zhang

Learning Global Structure Consistency for Robust Object Tracking

Fast appearance variations and the distractions of similar objects are two of the most challenging problems in visual object tracking. Unlike many existing trackers that focus on modeling only the target, in this work, we consider the…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Bi Li , Chengquan Zhang , Zhibin Hong , Xu Tang , Jingtuo Liu , Junyu Han , Errui Ding , Wenyu Liu

Robust Mobile Object Tracking Based on Multiple Feature Similarity and Trajectory Filtering

This paper presents a new algorithm to track mobile objects in different scene conditions. The main idea of the proposed tracker includes estimation, multi-features similarity measures and trajectory filtering. A feature set (distance,…

Computer Vision and Pattern Recognition · Computer Science 2011-06-15 Duc Phu Chau , François Bremond , Monique Thonnat , Etienne Corvee

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

In video object tracking, there exist rich temporal contexts among successive frames, which have been largely overlooked in existing trackers. In this work, we bridge the individual video frames and explore the temporal contexts across them…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Ning Wang , Wengang Zhou , Jie Wang , Houqaing Li

Beyond SOT: Tracking Multiple Generic Objects at Once

Generic Object Tracking (GOT) is the problem of tracking target objects, specified by bounding boxes in the first frame of a video. While the task has received much attention in the last decades, researchers have almost exclusively focused…

Computer Vision and Pattern Recognition · Computer Science 2024-02-27 Christoph Mayer , Martin Danelljan , Ming-Hsuan Yang , Vittorio Ferrari , Luc Van Gool , Alina Kuznetsova

FastTrackTr:Towards Fast Multi-Object Tracking with Transformers

Transformer-based multi-object tracking (MOT) methods have captured the attention of many researchers in recent years. However, these models often suffer from slow inference speeds due to their structure or other issues. To address this…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Pan Liao , Feng Yang , Di Wu , Jinwen Yu , Wenhui Zhao , Dingwen Zhang

Global Correlation Network: End-to-End Joint Multi-Object Detection and Tracking

Multi-object tracking (MOT) has made great progress in recent years, but there are still some problems. Most MOT algorithms follow tracking-by-detection framework, which separates detection and tracking into two independent parts. Early…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Xuewu Lin , Yu-ang Guo , Jianqiang Wang

A fast multi-object tracking system using an object detector ensemble

Multiple-Object Tracking (MOT) is of crucial importance for applications such as retail video analytics and video surveillance. Object detectors are often the computational bottleneck of modern MOT systems, limiting their use for real-time…

Computer Vision and Pattern Recognition · Computer Science 2019-08-14 Richard Cobos , Jefferson Hernandez , Andres G. Abad

Is a Pure Transformer Effective for Separated and Online Multi-Object Tracking?

Recent advances in Multi-Object Tracking (MOT) have demonstrated significant success in short-term association within the separated tracking-by-detection online paradigm. However, long-term tracking remains challenging. While graph-based…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Chongwei Liu , Haojie Li , Zhihui Wang , Rui Xu

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

Tracking multiple objects in videos relies on modeling the spatial-temporal interactions of the objects. In this paper, we propose a solution named TransMOT, which leverages powerful graph transformers to efficiently model the spatial and…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Peng Chu , Jiang Wang , Quanzeng You , Haibin Ling , Zicheng Liu

MOTR: End-to-End Multiple-Object Tracking with Transformer

Temporal modeling of objects is a key challenge in multiple object tracking (MOT). Existing methods track by associating detections through motion-based and appearance-based similarity heuristics. The post-processing nature of association…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Fangao Zeng , Bin Dong , Yuang Zhang , Tiancai Wang , Xiangyu Zhang , Yichen Wei

Towards Generalizable Multi-Object Tracking

Multi-Object Tracking MOT encompasses various tracking scenarios, each characterized by unique traits. Effective trackers should demonstrate a high degree of generalizability across diverse scenarios. However, existing trackers struggle to…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Zheng Qin , Le Wang , Sanping Zhou , Panpan Fu , Gang Hua , Wei Tang

Generic Multiview Visual Tracking

Recent progresses in visual tracking have greatly improved the tracking performance. However, challenges such as occlusion and view change remain obstacles in real world deployment. A natural solution to these challenges is to use multiple…

Computer Vision and Pattern Recognition · Computer Science 2019-04-05 Minye Wu , Haibin Ling , Ning Bi , Shenghua Gao , Hao Sheng , Jingyi Yu

Learning Spatio-Temporal Transformer for Visual Tracking

In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component. The encoder models the global spatio-temporal feature dependencies between target objects and search regions, while the decoder…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Bin Yan , Houwen Peng , Jianlong Fu , Dong Wang , Huchuan Lu

Transforming Model Prediction for Tracking

Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function. While this inductive bias integrates valuable domain…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Christoph Mayer , Martin Danelljan , Goutam Bhat , Matthieu Paul , Danda Pani Paudel , Fisher Yu , Luc Van Gool