Related papers: MCTR: Multi Camera Tracking Transformer

MOTR: End-to-End Multiple-Object Tracking with Transformer

Temporal modeling of objects is a key challenge in multiple object tracking (MOT). Existing methods track by associating detections through motion-based and appearance-based similarity heuristics. The post-processing nature of association…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Fangao Zeng, Bin Dong, Yuang Zhang, Tiancai Wang, Xiangyu Zhang, Yichen Wei

Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

Tracking a time-varying indefinite number of objects in a video sequence over time remains a challenge despite recent advances in the field. Most existing approaches are not able to properly handle multi-object tracking challenges such as…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Tianyu Zhu, Markus Hiller, Mahsa Ehsanpour, Rongkai Ma, Tom Drummond, Ian Reid, Hamid Rezatofighi

CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection

To achieve accurate 3D object detection at a low cost for autonomous driving, many multi-camera methods have been proposed and solved the occlusion problem of monocular approaches. However, due to the lack of accurate estimated depth,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-06 Ching-Yu Tseng, Yi-Rong Chen, Hsin-Ying Lee, Tsung-Han Wu, Wen-Chin Chen, Winston H. Hsu

MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries

Accurate and consistent 3D tracking from multiple cameras is a key component in a vision-based autonomous driving system. It involves modeling 3D dynamic objects in complex scenes across multiple cameras. This problem is inherently…

Computer Vision and Pattern Recognition · Computer Science 2022-05-03 Tianyuan Zhang, Xuanyao Chen, Yue Wang, Yilun Wang, Hang Zhao

CityTrack: Improving City-Scale Multi-Camera Multi-Target Tracking by Location-Aware Tracking and Box-Grained Matching

Multi-Camera Multi-Target Tracking (MCMT) is a computer vision technique that involves tracking multiple targets simultaneously across multiple cameras. MCMT in urban traffic visual analysis faces great challenges due to the complex and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-07 Jincheng Lu, Xipeng Yang, Jin Ye, Yifu Zhang, Zhikang Zou, Wei Zhang, Xiao Tan

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey

Multi-object tracking (MOT) aims to associate target objects across video frames in order to obtain entire moving trajectories. With the advancement of deep neural networks and the increasing demand for intelligent video analysis, MOT has…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Gaoang Wang, Mingli Song, Jenq-Neng Hwang

End-to-End Object Detection with Transformers

We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko

OneTrack-M: A multitask approach to transformer-based MOT models

Multi-Object Tracking (MOT) is a critical problem in computer vision, essential for understanding how objects move and interact in videos. This field faces significant challenges such as occlusions and complex environmental dynamics,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Luiz C. S. de Araujo, Carlos M. S. Figueiredo

Motion-Aware Transformer for Multi-Object Tracking

Multi-object tracking (MOT) in videos remains challenging due to complex object motions and crowded scenes. Recent DETR-based frameworks offer end-to-end solutions but typically process detection and tracking queries jointly within a single…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Xu Yang, Gady Agam

Towards Effective Multi-Moving-Camera Tracking: A New Dataset and Lightweight Link Model

Ensuring driving safety for autonomous vehicles has become increasingly crucial, highlighting the need for systematic tracking of on-road pedestrians. Most vehicles are equipped with visual sensors, however, the large-scale visual data has…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Yanting Zhang, Shuanghong Wang, Qingxiang Wang, Cairong Yan, Rui Fan

FastTrackTr: Towards Fast Multi-Object Tracking with Transformers

Transformer-based multi-object tracking (MOT) methods have captured the attention of many researchers in recent years. However, these models often suffer from slow inference speeds due to their structure or other issues. To address this…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Pan Liao, Feng Yang, Di Wu, Jinwen Yu, Wenhui Zhao, Dingwen Zhang

MotionTrack: End-to-End Transformer-based Multi-Object Tracing with LiDAR-Camera Fusion

Multiple Object Tracking (MOT) is crucial to autonomous vehicle perception. End-to-end transformer-based algorithms, which detect and track objects simultaneously, show great potential for the MOT task. However, most existing methods focus…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Ce Zhang, Chengjie Zhang, Yiluan Guo, Lingji Chen, Michael Happold

MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

As a video task, Multiple Object Tracking (MOT) is expected to capture temporal information of targets effectively. Unfortunately, most existing methods only explicitly exploit the object features between adjacent frames, while lacking the…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Ruopeng Gao, Limin Wang

End-to-End Multi-Object Tracking with Global Response Map

Most existing Multi-Object Tracking (MOT) approaches follow the Tracking-by-Detection paradigm and the data association framework where objects are firstly detected and then associated. Although deep-learning based method can noticeably…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Xingyu Wan, Jiakai Cao, Sanping Zhou, Jinjun Wang

A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic

Vision sensors are becoming more important in Intelligent Transportation Systems (ITS) for traffic monitoring, management, and optimization as the number of network cameras continues to rise. However, manual object tracking and matching…

Computer Vision and Pattern Recognition · Computer Science 2025-05-02 Muhammad Imran Zaman, Usama Ijaz Bajwa, Gulshan Saleem, Rana Hammad Raza

Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation

Multi-Target Multi-Camera Tracking (MTMC) is an essential computer vision task for automating large-scale surveillance. With camera calibration and depth information, the targets in the scene can be projected into 3D space, offering…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Vu-Minh Le, Thao-Anh Tran, Duc Huy Do, Xuan Canh Do, Huong Ninh, Hai Tran

TransTrack: Multiple Object Tracking with Transformer

In this work, we propose TransTrack, a simple but efficient scheme to solve the multiple object tracking problems. TransTrack leverages the transformer architecture, which is an attention-based query-key mechanism. It applies object…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Peize Sun, Jinkun Cao, Yi Jiang, Rufeng Zhang, Enze Xie, Zehuan Yuan, Changhu Wang, Ping Luo

GMT: Effective Global Framework for Multi-Camera Multi-Target Tracking

Multi-Camera Multi-Target (MCMT) tracking aims to locate and associate the same targets across multiple camera views. Existing methods typically adopt a two-stage framework, involving single-camera tracking followed by inter-camera…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Yihao Zhen, Mingyue Xu, Qiang Wang, Baojie Fan, Jiahua Dong, Tinghui Zhao, Huijie Fan

Contrastive Learning for Multi-Object Tracking with Transformers

The DEtection TRansformer (DETR) opened new possibilities for object detection by modeling it as a translation task: converting image features into object-level representations. Previous works typically add expensive modules to DETR to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-16 Pierre-François De Plaen, Nicola Marinello, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool

Multi-tracklet Tracking for Generic Targets with Adaptive Detection Clustering

Tracking specific targets, such as pedestrians and vehicles, has been the focus of recent vision-based multitarget tracking studies. However, in some real-world scenarios, unseen categories often challenge existing methods due to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Zewei Wu, Longhao Wang, Cui Wang, César Teixeira, Wei Ke, Zhang Xiong