Related papers: Generic Multiview Visual Tracking

GMT: Effective Global Framework for Multi-Camera Multi-Target Tracking

Multi-Camera Multi-Target (MCMT) tracking aims to locate and associate the same targets across multiple camera views. Existing methods typically adopt a two-stage framework, involving single-camera tracking followed by inter-camera…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Yihao Zhen , Mingyue Xu , Qiang Wang , Baojie Fan , Jiahua Dong , Tinghui Zhao , Huijie Fan

Glance-MCMT: A General MCMT Framework with Glance Initialization and Progressive Association

We propose a multi-camera multi-target (MCMT) tracking framework that ensures consistent global identity assignment across views using trajectory and appearance cues. The pipeline starts with BoT-SORT-based single-camera tracking, followed…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Hamidreza Hashempoor

Multi-tracklet Tracking for Generic Targets with Adaptive Detection Clustering

Tracking specific targets, such as pedestrians and vehicles, has been the focus of recent vision-based multitarget tracking studies. However, in some real-world scenarios, unseen categories often challenge existing methods due to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Zewei Wu , Longhao Wang , Cui Wang , César Teixeira , Wei Ke , Zhang Xiong

Global Correlation Network: End-to-End Joint Multi-Object Detection and Tracking

Multi-object tracking (MOT) has made great progress in recent years, but there are still some problems. Most MOT algorithms follow tracking-by-detection framework, which separates detection and tracking into two independent parts. Early…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Xuewu Lin , Yu-ang Guo , Jianqiang Wang

GMT: General Motion Tracking for Humanoid Whole-Body Control

The ability to track general whole-body motions in the real world is a useful way to build general-purpose humanoid robots. However, achieving this can be challenging due to the temporal and kinematic diversity of the motions, the policy's…

Robotics · Computer Science 2025-09-05 Zixuan Chen , Mazeyu Ji , Xuxin Cheng , Xuanbin Peng , Xue Bin Peng , Xiaolong Wang

Globally Optimal Object Tracking with Fully Convolutional Networks

Tracking is one of the most important but still difficult tasks in computer vision and pattern recognition. The main difficulties in the tracking field are appearance variation and occlusion. Most traditional tracking methods set the…

Computer Vision and Pattern Recognition · Computer Science 2016-12-28 Jinho Lee , Brian Kenji Iwana , Shouta Ide , Seiichi Uchida

Generic Vehicle Tracking Framework Capable of Handling Occlusions Based on Modified Mixture Particle Filter

Accurate and robust tracking of surrounding road participants plays an important role in autonomous driving. However, there is usually no prior knowledge of the number of tracking targets due to object emergence, object disappearance and…

Computer Vision and Pattern Recognition · Computer Science 2018-10-03 Jiachen Li , Wei Zhan , Masayoshi Tomizuka

CityTrack: Improving City-Scale Multi-Camera Multi-Target Tracking by Location-Aware Tracking and Box-Grained Matching

Multi-Camera Multi-Target Tracking (MCMT) is a computer vision technique that involves tracking multiple targets simultaneously across multiple cameras. MCMT in urban traffic visual analysis faces great challenges due to the complex and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-07 Jincheng Lu , Xipeng Yang , Jin Ye , Yifu Zhang , Zhikang Zou , Wei Zhang , Xiao Tan

Generative Point Tracking with Flow Matching

Tracking a point through a video can be a challenging task due to uncertainty arising from visual obfuscations, such as appearance changes and occlusions. Although current state-of-the-art discriminative models excel in regressing long-term…

Computer Vision and Pattern Recognition · Computer Science 2025-10-27 Mattie Tesfaldet , Adam W. Harley , Konstantinos G. Derpanis , Derek Nowrouzezahrai , Christopher Pal

Exploit the Connectivity: Multi-Object Tracking with TrackletNet

Multi-object tracking (MOT) is an important and practical task related to both surveillance systems and moving camera applications, such as autonomous driving and robotic vision. However, due to unreliable detection, occlusion and fast…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Gaoang Wang , Yizhou Wang , Haotian Zhang , Renshu Gu , Jenq-Neng Hwang

MCTR: Multi Camera Tracking Transformer

Multi-camera tracking plays a pivotal role in various real-world applications. While end-to-end methods have gained significant interest in single-camera tracking, multi-camera tracking remains predominantly reliant on heuristic techniques.…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Alexandru Niculescu-Mizil , Deep Patel , Iain Melvin

Bringing Generalization to Deep Multi-View Pedestrian Detection

Multi-view Detection (MVD) is highly effective for occlusion reasoning in a crowded environment. While recent works using deep learning have made significant advances in the field, they have overlooked the generalization aspect, which makes…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Jeet Vora , Swetanjal Dutta , Kanishk Jain , Shyamgopal Karthik , Vineet Gandhi

TGCN: Time Domain Graph Convolutional Network for Multiple Objects Tracking

Multiple object tracking is to give each object an id in the video. The difficulty is how to match the predicted objects and detected objects in same frames. Matching features include appearance features, location features, etc. These…

Computer Vision and Pattern Recognition · Computer Science 2021-01-07 Jie Zhang

Occlusion-Robust Online Multi-Object Visual Tracking using a GM-PHD Filter with CNN-Based Re-Identification

We propose a novel online multi-object visual tracker using a Gaussian mixture Probability Hypothesis Density (GM-PHD) filter and deep appearance learning. The GM-PHD filter has a linear complexity with the number of objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-06 Nathanael L. Baisa

FastTracker: Real-Time and Accurate Visual Tracking

Conventional multi-object tracking (MOT) systems are predominantly designed for pedestrian tracking and often exhibit limited generalization to other object categories. This paper presents a generalized tracking framework capable of…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Hamidreza Hashempoor , Yu Dong Hwang

State-aware Re-identification Feature for Multi-target Multi-camera Tracking

Multi-target Multi-camera Tracking (MTMCT) aims to extract the trajectories from videos captured by a set of cameras. Recently, the tracking performance of MTMCT is significantly enhanced with the employment of re-identification (Re-ID)…

Computer Vision and Pattern Recognition · Computer Science 2019-06-05 Peng Li , Jiabin Zhang , Zheng Zhu , Yanwei Li , Lu Jiang , Guan Huang

Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion

Visual object tracking, which is primarily based on visible light image sequences, encounters numerous challenges in complicated scenarios, such as low light conditions, high dynamic ranges, and background clutter. To address these…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Hongze Sun , Rui Liu , Wuque Cai , Jun Wang , Yue Wang , Huajin Tang , Yan Cui , Dezhong Yao , Daqing Guo

Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking

Visual pedestrian tracking represents a promising research field, with extensive applications in intelligent surveillance, behavior analysis, and human-computer interaction. However, real-world applications face significant occlusion…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Zewei Wu , César Teixeira , Wei Ke , Zhang Xiong

One Graph to Track Them All: Dynamic GNNs for Single- and Multi-View Tracking

This work presents a unified, fully differentiable model for multi-people tracking that learns to associate detections into trajectories without relying on pre-computed tracklets. The model builds a dynamic spatiotemporal graph that…

Computer Vision and Pattern Recognition · Computer Science 2026-01-01 Martin Engilberge , Ivan Vrkic , Friedrich Wilke Grosche , Julien Pilet , Engin Turetken , Pascal Fua

GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking

Multi-object tracking (MOT) in monocular videos is fundamentally challenged by occlusions and depth ambiguity, issues that conventional tracking-by-detection (TBD) methods struggle to resolve owing to a lack of geometric awareness. To…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Xudong Han , Pengcheng Fang , Yueying Tian , Jianhui Yu , Xiaohao Cai , Daniel Roggen , Philip Birch