Related papers: MOPT: Multi-Object Panoptic Tracking

Learning Appearance and Motion Cues for Panoptic Tracking

Panoptic tracking enables pixel-level scene interpretation of videos by integrating instance tracking in panoptic segmentation. This provides robots with a spatio-temporal understanding of the environment, an essential attribute for their…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Juana Valeria Hurtado, Sajad Marvi, Rohit Mohan, Abhinav Valada

SAMOFT: Robust Multi-Object Tracking via Region and Flow

Multi-object tracking (MOT) is a fundamental task in computer vision that requires continuously tracking multiple targets while maintaining consistent identities across frames. However, most existing approaches primarily rely on…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Yanchao Wang, Dawei Zhang, Chengzhuan Yang, Wei Liu, Minglu Li, Hua Wang, Zhonglong Zheng, Ming-Hsuan Yang

Know Your Surroundings: Panoramic Multi-Object Tracking by Multimodality Collaboration

In this paper, we focus on the multi-object tracking (MOT) problem of automatic driving and robot navigation. Most existing MOT methods track multiple objects using a singular RGB camera, which are prone to camera field-of-view and suffer…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Yuhang He, Wentao Yu, Jie Han, Xing Wei, Xiaopeng Hong, Yihong Gong

Panoptic Segmentation

We propose and study a task we name panoptic segmentation (PS). Panoptic segmentation unifies the typically distinct tasks of semantic segmentation (assign a class label to each pixel) and instance segmentation (detect and segment each…

Computer Vision and Pattern Recognition · Computer Science 2019-04-12 Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, Piotr Dollár

Amodal Panoptic Segmentation

Humans have the remarkable ability to perceive objects as a whole, even when parts of them are occluded. This ability of amodal perception forms the basis of our perceptual and cognitive understanding of our world. To enable robots to…

Computer Vision and Pattern Recognition · Computer Science 2022-02-24 Rohit Mohan, Abhinav Valada

Spatio-Temporal Point Process for Multiple Object Tracking

Multiple Object Tracking (MOT) focuses on modeling the relationship of detected objects among consecutive frames and merging them into different trajectories. MOT remains a challenging task as noisy and confusing detection results often…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Tao Wang, Kean Chen, Weiyao Lin, John See, Zenghui Zhang, Qian Xu, Xia Jia

ORMOT: A Dataset and Framework for Omnidirectional Referring Multi-Object Tracking

Multi-Object Tracking (MOT) is a fundamental task in computer vision, aiming to track targets across video frames. Existing MOT methods perform well in general visual scenes, but face significant challenges and limitations when extended to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Sijia Chen, Zihan Zhou, Yanqiu Yu, En Yu, Wenbing Tao

Multiple Object Tracking as ID Prediction

Multi-Object Tracking (MOT) has been a long-standing challenge in video understanding. A natural and intuitive approach is to split this task into two parts: object detection and association. Most mainstream methods employ meticulously…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Ruopeng Gao, Ji Qi, Limin Wang

PanopticNDT: Efficient and Robust Panoptic Mapping

As the application scenarios of mobile robots are getting more complex and challenging, scene understanding becomes increasingly crucial. A mobile robot that is supposed to operate autonomously in indoor environments must have precise…

Robotics · Computer Science 2024-07-02 Daniel Seichter, Benedict Stephan, Söhnke Benedikt Fischedick, Steffen Müller, Leonard Rabes, Horst-Michael Gross

OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding

LVLMs have been shown to perform excellently in image-level tasks such as VQA and caption. However, in many instance-level tasks, such as visual grounding and object detection, LVLMs still show performance gaps compared to previous expert…

Computer Vision and Pattern Recognition · Computer Science 2025-11-24 Teng Fu, Mengyang Zhao, Ke Niu, Kaixin Peng, Bin Li

Panoptic Segmentation Meets Remote Sensing

Panoptic segmentation combines instance and semantic predictions, allowing the detection of "things" and "stuff" simultaneously. Effectively approaching panoptic segmentation in remotely sensed data can be auspicious in many challenging…

Computer Vision and Pattern Recognition · Computer Science 2022-05-04 Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Cristiano Rosa e Silva, Anesmar Olino de Albuquerque, Nickolas Castro Santana, Dibio Leandro Borges, Roberto Arnaldo Trancoso Gomes, Renato Fontes Guimarães

STEP: Segmenting and Tracking Every Pixel

The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation. Our work is the first that targets this task in a real-world setting requiring dense interpretation in both spatial…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen

EfficientPS: Efficient Panoptic Segmentation

Understanding the scene in which an autonomous robot operates is critical for its competent functioning. Such scene comprehension necessitates recognizing instances of traffic participants along with general scene semantics which can be…

Computer Vision and Pattern Recognition · Computer Science 2021-11-05 Rohit Mohan, Abhinav Valada

Multi-Object Tracking and Segmentation via Neural Message Passing

Graphs offer a natural way to formulate Multiple Object Tracking (MOT) and Multiple Object Tracking and Segmentation (MOTS) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Guillem Braso, Orcun Cetintas, Laura Leal-Taixe

TrackMPNN: A Message Passing Graph Neural Architecture for Multi-Object Tracking

This study follows many classical approaches to multi-object tracking (MOT) that model the problem using dynamic graphical data structures, and adapts this formulation to make it amenable to modern neural networks. Our main contributions in…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Akshay Rangesh, Pranav Maheshwari, Mez Gebre, Siddhesh Mhatre, Vahid Ramezani, Mohan M. Trivedi

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation

Multiple-object tracking and segmentation (MOTS) is a novel computer vision task that aims to jointly perform multiple object tracking (MOT) and instance segmentation. In this work, we present PointTrack++, an effective on-line framework…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Zhenbo Xu, Wei Zhang, Xiao Tan, Wei Yang, Xiangbo Su, Yuchen Yuan, Hongwu Zhang, Shilei Wen, Errui Ding, Liusheng Huang

EnsembleMOT: A Step towards Ensemble Learning of Multiple Object Tracking

Multiple Object Tracking (MOT) has rapidly progressed in recent years. Existing works tend to design a single tracking algorithm to perform both detection and association. Though ensemble learning has been exploited in many tasks, i.e,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-20 Yunhao Du, Zihang Liu, Fei Su

OVTrack: Open-Vocabulary Multiple Object Tracking

The ability to recognize, localize and track dynamic objects in a scene is fundamental to many real-world applications, such as self-driving and robotic systems. Yet, traditional multiple object tracking (MOT) benchmarks rely only on a few…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu

Transformer-Based Multi-Object Smoothing with Decoupled Data Association and Smoothing

Multi-object tracking (MOT) is the task of estimating the state trajectories of an unknown and time-varying number of objects over a certain time window. Several algorithms have been proposed to tackle the multi-object smoothing task, where…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Juliano Pinto, Georg Hess, Yuxuan Xia, Henk Wymeersch, Lennart Svensson

Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking

Panoptic scene understanding and tracking of dynamic agents are essential for robots and automated vehicles to navigate in urban environments. As LiDARs provide accurate illumination-independent geometric depictions of the scene, performing…

Computer Vision and Pattern Recognition · Computer Science 2021-12-28 Whye Kit Fong, Rohit Mohan, Juana Valeria Hurtado, Lubing Zhou, Holger Caesar, Oscar Beijbom, Abhinav Valada