English
Related papers

Related papers: Playing for Benchmarks

200 papers

In this paper, we propose the first higher frame rate video dataset (called Need for Speed - NfS) and benchmark for visual object tracking. The dataset consists of 100 videos (380K frames) captured with now commonly available higher frame…

Computer Vision and Pattern Recognition · Computer Science 2017-03-23 Hamed Kiani Galoogahi , Ashton Fagg , Chen Huang , Deva Ramanan , Simon Lucey

Planar object tracking is an actively studied problem in vision-based robotic applications. While several benchmarks have been constructed for evaluating state-of-the-art algorithms, there is a lack of video sequences captured in the wild…

Computer Vision and Pattern Recognition · Computer Science 2018-05-23 Pengpeng Liang , Yifan Wu , Hu Lu , Liming Wang , Chunyuan Liao , Haibin Ling

Generic motion understanding from video involves not only tracking objects, but also perceiving how their surfaces deform and move. This information is useful to make inferences about 3D shape, physical properties and object interactions.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Carl Doersch , Ankush Gupta , Larisa Markeeva , Adrià Recasens , Lucas Smaira , Yusuf Aytar , João Carreira , Andrew Zisserman , Yi Yang

The focus of this contribution is on camera simulation as it comes into play in simulating autonomous robots for their virtual prototyping. We propose a camera model validation methodology based on the performance of a perception algorithm…

Robotics · Computer Science 2022-08-02 Asher Elmquist , Radu Serban , Dan Negrut

As a crucial robotic perception capability, visual tracking has been intensively studied recently. In the real-world scenarios, the onboard processing time of the image streams inevitably leads to a discrepancy between the tracking results…

Computer Vision and Pattern Recognition · Computer Science 2022-11-14 Bowen Li , Yiming Li , Junjie Ye , Changhong Fu , Hang Zhao

While recent methods for motion and stereo estimation recover an unprecedented amount of details, such highly detailed structures are neither adequately reflected in the data of existing benchmarks nor their evaluation methodology. Hence,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-06 Lukas Mehl , Jenny Schmalfuss , Azin Jahedi , Yaroslava Nalivayko , Andrés Bruhn

We introduce ACCIDENT, a benchmark dataset for traffic accident detection in CCTV footage, designed to evaluate models in supervised (IID and OOD) and zero-shot settings, reflecting both data-rich and data-scarce scenarios. The benchmark…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Lukas Picek , Michal Čermák , Marek Hanzl , Vojtěch Čermák

While recent video world models can generate highly realistic videos, their ability to perform semantic reasoning and planning remains unclear and unquantified. We introduce Target-Bench, the first benchmark that enables comprehensive…

This paper presents DriveTrack, a new benchmark and data generation framework for long-range keypoint tracking in real-world videos. DriveTrack is motivated by the observation that the accuracy of state-of-the-art trackers depends strongly…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 Arjun Balasingam , Joseph Chandler , Chenning Li , Zhoutong Zhang , Hari Balakrishnan

Automatic video summarization is still an unsolved problem due to several challenges. We take steps towards making automatic video summarization more realistic by addressing them. Firstly, the currently available datasets either have very…

Computer Vision and Pattern Recognition · Computer Science 2020-08-26 Vishal Kaushal , Suraj Kothawade , Rishabh Iyer , Ganesh Ramakrishnan

We introduce a new benchmark, TAPVid-3D, for evaluating the task of long-range Tracking Any Point in 3D (TAP-3D). While point tracking in two dimensions (TAP) has many benchmarks measuring performance on real-world videos, such as…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Skanda Koppula , Ignacio Rocco , Yi Yang , Joe Heyward , João Carreira , Andrew Zisserman , Gabriel Brostow , Carl Doersch

360{\deg} images can provide an omnidirectional field of view which is important for stable and long-term scene perception. In this paper, we explore 360{\deg} images for visual object tracking and perceive new challenges caused by large…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Huajian Huang , Yinzhe Xu , Yingshu Chen , Sai-Kit Yeung

We propose a novel multimodal video benchmark - the Perception Test - to evaluate the perception and reasoning skills of pre-trained multimodal models (e.g. Flamingo, SeViLA, or GPT-4). Compared to existing benchmarks that focus on…

Recent progress in computer vision has been driven by high-capacity models trained on large datasets. Unfortunately, creating large datasets with pixel-level labels has been extremely costly due to the amount of human effort required. In…

Computer Vision and Pattern Recognition · Computer Science 2016-08-09 Stephan R. Richter , Vibhav Vineet , Stefan Roth , Vladlen Koltun

Perceiving the world in terms of objects and tracking them through time is a crucial prerequisite for reasoning and scene understanding. Recently, several methods have been proposed for unsupervised learning of object-centric…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Marissa A. Weis , Kashyap Chitta , Yash Sharma , Wieland Brendel , Matthias Bethge , Andreas Geiger , Alexander S. Ecker

Object detection is an algorithm that recognizes and locates the objects in the image and has a wide range of applications in the visual understanding of complex urban scenes. Existing object detection benchmarks mainly focus on a single…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Yaowei Wang , Zhouxin Yang , Rui Liu , Deng Li , Yuandu Lai , Leyuan Fang , Yahong Han

Existing visual grounding benchmarks primarily evaluate alignment between image regions and literal referring expressions, where models can often succeed by matching a prominent named category. We explore a complementary and more…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Ruozhen He , Nisarg A. Shah , Qihua Dong , Zilin Xiao , Jaywon Koo , Vicente Ordonez

Although recent traffic benchmarks have advanced multimodal data analysis, they generally lack systematic evaluation aligned with official safety standards. To fill this gap, we introduce RoadSafe365, a large-scale vision-language benchmark…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Xinyu Liu , Darryl C. Jacob , Yuxin Liu , Xinsong Du , Muchao Ye , Bolei Zhou , Pan He

High-fidelity pedestrian tracking in real-life conditions has been an important tool in fundamental crowd dynamics research allowing to quantify statistics of relevant observables including walking velocities, mutual distances and body…

This paper proposes a novel framework to evaluate fluid simulation methods based on crowd-sourced user studies in order to robustly gather large numbers of opinions. The key idea for a robust and reliable evaluation is to use a reference…

Graphics · Computer Science 2020-11-23 Kiwon Um , Xiangyu Hu , Nils Thuerey
‹ Prev 1 2 3 10 Next ›