Related papers: TrackFormer: Multi-Object Tracking with Transforme…

Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking

The recent trend in multiple object tracking (MOT) is heading towards leveraging deep learning to boost the tracking performance. In this paper, we propose a novel solution named TransSTAM, which leverages Transformer to effectively model…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Peng Dai , Yiqiang Feng , Renliang Weng , Changshui Zhang

TransTrack: Multiple Object Tracking with Transformer

In this work, we propose TransTrack, a simple but efficient scheme to solve the multiple object tracking problems. TransTrack leverages the transformer architecture, which is an attention-based query-key mechanism. It applies object…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Peize Sun , Jinkun Cao , Yi Jiang , Rufeng Zhang , Enze Xie , Zehuan Yuan , Changhu Wang , Ping Luo

TransFiner: A Full-Scale Refinement Approach for Multiple Object Tracking

Multiple object tracking (MOT) is the task containing detection and association. Plenty of trackers have achieved competitive performance. Unfortunately, for the lack of informative exchange on these subtasks, they are often biased toward…

Computer Vision and Pattern Recognition · Computer Science 2022-09-05 Bin Sun

Transformer Network for Multi-Person Tracking and Re-Identification in Unconstrained Environment

Multi-object tracking (MOT) has profound applications in a variety of fields, including surveillance, sports analytics, self-driving, and cooperative robotics. Despite considerable advancements, existing MOT methodologies tend to falter…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Hamza Mukhtar , Muhammad Usman Ghani Khan

Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations

Transformer networks have been a focus of research in many fields in recent years, being able to surpass the state-of-the-art performance in different computer vision tasks. However, in the task of Multiple Object Tracking (MOT), leveraging…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Amit Galor , Roy Orfaig , Ben-Zion Bobrovsky

TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses

3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots. With the commonly used tracking-by-detection paradigm, 3D MOT has made important progress in recent years. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Xuesong Chen , Shaoshuai Shi , Chao Zhang , Benjin Zhu , Qiang Wang , Ka Chun Cheung , Simon See , Hongsheng Li

MOTR: End-to-End Multiple-Object Tracking with Transformer

Temporal modeling of objects is a key challenge in multiple object tracking (MOT). Existing methods track by associating detections through motion-based and appearance-based similarity heuristics. The post-processing nature of association…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Fangao Zeng , Bin Dong , Yuang Zhang , Tiancai Wang , Xiangyu Zhang , Yichen Wei

Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers

Online Multi-Object Tracking (MOT) from videos is a challenging computer vision task which has been extensively studied for decades. Most of the existing MOT algorithms are based on the Tracking-by-Detection (TBD) paradigm combined with…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Zhen He , Jian Li , Daxue Liu , Hangen He , David Barber

End-to-end Tracking with a Multi-query Transformer

Multiple-object tracking (MOT) is a challenging task that requires simultaneous reasoning about location, appearance, and identity of the objects in the scene over time. Our aim in this paper is to move beyond tracking-by-detection…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Bruno Korbar , Andrew Zisserman

OneTrack-M: A multitask approach to transformer-based MOT models

Multi-Object Tracking (MOT) is a critical problem in computer vision, essential for understanding how objects move and interact in videos. This field faces significant challenges such as occlusions and complex environmental dynamics,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Luiz C. S. de Araujo , Carlos M. S. Figueiredo

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

Tracking multiple objects in videos relies on modeling the spatial-temporal interactions of the objects. In this paper, we propose a solution named TransMOT, which leverages powerful graph transformers to efficiently model the spatial and…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Peng Chu , Jiang Wang , Quanzeng You , Haibin Ling , Zicheng Liu

Unified Transformer Tracker for Object Tracking

As an important area in computer vision, object tracking has formed two separate communities that respectively study Single Object Tracking (SOT) and Multiple Object Tracking (MOT). However, current methods in one tracking scenario are not…

Computer Vision and Pattern Recognition · Computer Science 2022-06-09 Fan Ma , Mike Zheng Shou , Linchao Zhu , Haoqi Fan , Yilei Xu , Yi Yang , Zhicheng Yan

End-to-End Multi-Object Tracking with Global Response Map

Most existing Multi-Object Tracking (MOT) approaches follow the Tracking-by-Detection paradigm and the data association framework where objects are firstly detected and then associated. Although deep-learning based method can noticeably…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Xingyu Wan , Jiakai Cao , Sanping Zhou , Jinjun Wang

TransCenter: Transformers with Dense Representations for Multiple-Object Tracking

Transformers have proven superior performance for a wide variety of tasks since they were introduced. In recent years, they have drawn attention from the vision community in tasks such as image classification and object detection. Despite…

Computer Vision and Pattern Recognition · Computer Science 2022-10-03 Yihong Xu , Yutong Ban , Guillaume Delorme , Chuang Gan , Daniela Rus , Xavier Alameda-Pineda

Exploit the Connectivity: Multi-Object Tracking with TrackletNet

Multi-object tracking (MOT) is an important and practical task related to both surveillance systems and moving camera applications, such as autonomous driving and robotic vision. However, due to unreliable detection, occlusion and fast…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Gaoang Wang , Yizhou Wang , Haotian Zhang , Renshu Gu , Jenq-Neng Hwang

FastTracker: Real-Time and Accurate Visual Tracking

Conventional multi-object tracking (MOT) systems are predominantly designed for pedestrian tracking and often exhibit limited generalization to other object categories. This paper presents a generalized tracking framework capable of…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Hamidreza Hashempoor , Yu Dong Hwang

Transformers for Multi-Object Tracking on Point Clouds

We present TransMOT, a novel transformer-based end-to-end trainable online tracker and detector for point cloud data. The model utilizes a cross- and a self-attention mechanism and is applicable to lidar data in an automotive context, as…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Felicia Ruppel , Florian Faion , Claudius Gläser , Klaus Dietmayer

In Pursuit of Many: A Review of Modern Multiple Object Tracking Systems

Multiple Object Tracking (MOT) is a core capability in modern computer vision, essential to autonomous driving, surveillance, sports analytics, robotics, and biomedical imaging. Persistent identity assignment across frames remains…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Mk Bashar , Samia Islam , Kashifa Kawaakib Hussain , Md. Bakhtiar Hasan , A. B. M. Ashikur Rahman , Md. Hasanul Kabir

MixFormer: End-to-End Tracking with Iterative Mixed Attention

Visual object tracking often employs a multi-stage pipeline of feature extraction, target information integration, and bounding box estimation. To simplify this pipeline and unify the process of feature extraction and target information…

Computer Vision and Pattern Recognition · Computer Science 2023-02-10 Yutao Cui , Cheng Jiang , Gangshan Wu , Limin Wang

SOTFormer: A Minimal Transformer for Unified Object Tracking and Trajectory Prediction

Accurate single-object tracking and short-term motion forecasting remain challenging under occlusion, scale variation, and temporal drift, which disrupt the temporal coherence required for real-time perception. We introduce…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Zhongping Dong , Pengyang Yu , Shuangjian Li , Liming Chen , Mohand Tahar Kechadi