Related papers: Playing for Benchmarks

Need for Speed: A Benchmark for Higher Frame Rate Object Tracking

In this paper, we propose the first higher frame rate video dataset (called Need for Speed - NfS) and benchmark for visual object tracking. The dataset consists of 100 videos (380K frames) captured with now commonly available higher frame…

Computer Vision and Pattern Recognition · Computer Science 2017-03-23 Hamed Kiani Galoogahi , Ashton Fagg , Chen Huang , Deva Ramanan , Simon Lucey

Planar Object Tracking in the Wild: A Benchmark

Planar object tracking is an actively studied problem in vision-based robotic applications. While several benchmarks have been constructed for evaluating state-of-the-art algorithms, there is a lack of video sequences captured in the wild…

Computer Vision and Pattern Recognition · Computer Science 2018-05-23 Pengpeng Liang , Yifan Wu , Hu Lu , Liming Wang , Chunyuan Liao , Haibin Ling

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Generic motion understanding from video involves not only tracking objects, but also perceiving how their surfaces deform and move. This information is useful to make inferences about 3D shape, physical properties and object interactions.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Carl Doersch , Ankush Gupta , Larisa Markeeva , Adrià Recasens , Lucas Smaira , Yusuf Aytar , João Carreira , Andrew Zisserman , Yi Yang

A performance contextualization approach to validating camera models for robot simulation

The focus of this contribution is on camera simulation as it comes into play in simulating autonomous robots for their virtual prototyping. We propose a camera model validation methodology based on the performance of a perception algorithm…

Robotics · Computer Science 2022-08-02 Asher Elmquist , Radu Serban , Dan Negrut

Predictive Visual Tracking: A New Benchmark and Baseline Approach

As a crucial robotic perception capability, visual tracking has been intensively studied recently. In the real-world scenarios, the onboard processing time of the image streams inevitably leads to a discrepancy between the tracking results…

Computer Vision and Pattern Recognition · Computer Science 2022-11-14 Bowen Li , Yiming Li , Junjie Ye , Changhong Fu , Hang Zhao

Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo

While recent methods for motion and stereo estimation recover an unprecedented amount of details, such highly detailed structures are neither adequately reflected in the data of existing benchmarks nor their evaluation methodology. Hence,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-06 Lukas Mehl , Jenny Schmalfuss , Azin Jahedi , Yaroslava Nalivayko , Andrés Bruhn

ACCIDENT: A Benchmark Dataset for Vehicle Accident Detection from Traffic Surveillance Videos

We introduce ACCIDENT, a benchmark dataset for traffic accident detection in CCTV footage, designed to evaluate models in supervised (IID and OOD) and zero-shot settings, reflecting both data-rich and data-scarce scenarios. The benchmark…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Lukas Picek , Michal Čermák , Marek Hanzl , Vojtěch Čermák

Target-Bench: Can Video World Models Achieve Mapless Path Planning with Semantic Targets?

While recent video world models can generate highly realistic videos, their ability to perform semantic reasoning and planning remains unclear and unquantified. We introduce Target-Bench, the first benchmark that enables comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Dingrui Wang , Zhihao Liang , Hongyuan Ye , Zhexiao Sun , Zhaowei Lu , Yuchen Zhang , Yuyu Zhao , Yuan Gao , Marvin Seegert , Finn Schäfer , Haotong Qin , Wei Li , Luigi Palmieri , Felix Jahncke , Mattia Piccinini , Johannes Betz

DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos

This paper presents DriveTrack, a new benchmark and data generation framework for long-range keypoint tracking in real-world videos. DriveTrack is motivated by the observation that the accuracy of state-of-the-art trackers depends strongly…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 Arjun Balasingam , Joseph Chandler , Chenning Li , Zhoutong Zhang , Hari Balakrishnan

Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework

Automatic video summarization is still an unsolved problem due to several challenges. We take steps towards making automatic video summarization more realistic by addressing them. Firstly, the currently available datasets either have very…

Computer Vision and Pattern Recognition · Computer Science 2020-08-26 Vishal Kaushal , Suraj Kothawade , Rishabh Iyer , Ganesh Ramakrishnan

TAPVid-3D: A Benchmark for Tracking Any Point in 3D

We introduce a new benchmark, TAPVid-3D, for evaluating the task of long-range Tracking Any Point in 3D (TAP-3D). While point tracking in two dimensions (TAP) has many benchmarks measuring performance on real-world videos, such as…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Skanda Koppula , Ignacio Rocco , Yi Yang , Joe Heyward , João Carreira , Andrew Zisserman , Gabriel Brostow , Carl Doersch

360VOT: A New Benchmark Dataset for Omnidirectional Visual Object Tracking

360{\deg} images can provide an omnidirectional field of view which is important for stable and long-term scene perception. In this paper, we explore 360{\deg} images for visual object tracking and perceive new challenges caused by large…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Huajian Huang , Yinzhe Xu , Yingshu Chen , Sai-Kit Yeung

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

We propose a novel multimodal video benchmark - the Perception Test - to evaluate the perception and reasoning skills of pre-trained multimodal models (e.g. Flamingo, SeViLA, or GPT-4). Compared to existing benchmarks that focus on…

Computer Vision and Pattern Recognition · Computer Science 2023-11-01 Viorica Pătrăucean , Lucas Smaira , Ankush Gupta , Adrià Recasens Continente , Larisa Markeeva , Dylan Banarse , Skanda Koppula , Joseph Heyward , Mateusz Malinowski , Yi Yang , Carl Doersch , Tatiana Matejovicova , Yury Sulsky , Antoine Miech , Alex Frechette , Hanna Klimczak , Raphael Koster , Junlin Zhang , Stephanie Winkler , Yusuf Aytar , Simon Osindero , Dima Damen , Andrew Zisserman , João Carreira

Playing for Data: Ground Truth from Computer Games

Recent progress in computer vision has been driven by high-capacity models trained on large datasets. Unfortunately, creating large datasets with pixel-level labels has been extremely costly due to the amount of human effort required. In…

Computer Vision and Pattern Recognition · Computer Science 2016-08-09 Stephan R. Richter , Vibhav Vineet , Stefan Roth , Vladlen Koltun

Benchmarking Unsupervised Object Representations for Video Sequences

Perceiving the world in terms of objects and tracking them through time is a crucial prerequisite for reasoning and scene understanding. Recently, several methods have been proposed for unsupervised learning of object-centric…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Marissa A. Weis , Kashyap Chitta , Yash Sharma , Wieland Brendel , Matthias Bethge , Andreas Geiger , Alexander S. Ecker

Peng Cheng Object Detection Benchmark for Smart City

Object detection is an algorithm that recognizes and locates the objects in the image and has a wide range of applications in the visual understanding of complex urban scenes. Existing object detection benchmarks mainly focus on a single…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Yaowei Wang , Zhouxin Yang , Rui Liu , Deng Li , Yuandu Lai , Leyuan Fang , Yahong Han

Beyond Referring Expressions: Scenario Comprehension Visual Grounding

Existing visual grounding benchmarks primarily evaluate alignment between image regions and literal referring expressions, where models can often succeed by matching a prominent named category. We explore a complementary and more…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Ruozhen He , Nisarg A. Shah , Qihua Dong , Zilin Xiao , Jaywon Koo , Vicente Ordonez

Understanding Real-World Traffic Safety through RoadSafe365 Benchmark

Although recent traffic benchmarks have advanced multimodal data analysis, they generally lack systematic evaluation aligned with official safety standards. To fill this gap, we introduce RoadSafe365, a large-scale vision-language benchmark…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Xinyu Liu , Darryl C. Jacob , Yuxin Liu , Xinsong Du , Muchao Ye , Bolei Zhou , Pan He

Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control

High-fidelity pedestrian tracking in real-life conditions has been an important tool in fundamental crowd dynamics research allowing to quantify statistics of relevant observables including walking velocities, mutual distances and body…

Physics and Society · Physics 2022-02-08 Caspar A. S. Pouw , Joris Willems , Frank van Schadewijk , Jasmin Thurau , Federico Toschi , Alessandro Corbetta

Perceptual Evaluation of Liquid Simulation Methods

This paper proposes a novel framework to evaluate fluid simulation methods based on crowd-sourced user studies in order to robustly gather large numbers of opinions. The key idea for a robust and reliable evaluation is to use a reference…

Graphics · Computer Science 2020-11-23 Kiwon Um , Xiangyu Hu , Nils Thuerey