Related papers: Enhancing Video-Based Robot Failure Detection Usin…

TIMID: Time-Dependent Mistake Detection in Videos of Robot Executions

As robotic systems execute increasingly difficult task sequences, so does the number of ways in which they can fail. Video Anomaly Detection (VAD) frameworks typically focus on singular, low-level kinematic or action failures, struggling to…

Robotics · Computer Science 2026-03-11 Nerea Gallego, Fernando Salanova, Claudio Mannarano, Cristian Mahulea, Eduardo Montijano

See Yourself in Others: Attending Multiple Tasks for Own Failure Detection

Autonomous robots deal with unexpected scenarios in real environments. Given input images, various visual perception tasks can be performed, e.g., semantic segmentation, depth estimation and normal estimation. These different tasks provide…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Boyang Sun, Jiaxu Xing, Hermann Blum, Roland Siegwart, Cesar Cadena

How to Utilize Failure Demo Data?: Effective Data Selection for Imitation Learning Using Distribution Differences in Attention Mechanism

Imitation learning for robotic tasks has relied primarily on policies trained only on successful demonstrations, although failures are unavoidable during human data collection. Many existing approaches for exploiting failure data require…

Robotics · Computer Science 2026-05-21 Kana Miyamoto, Kanata Suzuki, Tetsuya Ogata

Improving Robot Success Detection using Static Object Data

We use static object data to improve success detection for stacking objects on and nesting objects in one another. Such actions are necessary for certain robotics tasks, e.g., clearing a dining table or packing a warehouse bin. However,…

Robotics · Computer Science 2019-08-02 Rosario Scalise, Jesse Thomason, Yonatan Bisk, Siddhartha Srinivasa

Failure Prediction from Limited Hardware Demonstrations

Prediction of failures in real-world robotic systems either requires accurate model information or extensive testing. Partial knowledge of the system model makes simulation-based failure prediction unreliable. Moreover, obtaining such…

Robotics · Computer Science 2024-10-15 Anjali Parashar, Kunal Garg, Joseph Zhang, Chuchu Fan

Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies

Recent years have witnessed impressive robotic manipulation systems driven by advances in imitation learning and generative modeling, such as diffusion- and flow-based approaches. As robot policy performance increases, so does the…

Robotics · Computer Science 2025-06-23 Chen Xu, Tony Khuong Nguyen, Emma Dixon, Christopher Rodriguez, Patrick Miller, Robert Lee, Paarth Shah, Rares Ambrus, Haruki Nishimura, Masha Itkina

Weakly-Supervised Completion Moment Detection using Temporal Attention

Monitoring the progression of an action towards completion offers fine grained insight into the actor's behaviour. In this work, we target detecting the completion moment of actions, that is the moment when the action's goal has been…

Computer Vision and Pattern Recognition · Computer Science 2019-10-23 Farnoosh Heidarivincheh, Majid Mirmehdi, Dima Damen

Extending Temporal Data Augmentation for Video Action Recognition

Pixel space augmentation has grown in popularity in many Deep Learning areas, due to its effectiveness, simplicity, and low computational cost. Data augmentation for videos, however, still remains an under-explored research topic, as most…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Artjoms Gorpincenko, Michal Mackiewicz

Efficient Spatial-Temporal Modeling for Real-Time Video Analysis: A Unified Framework for Action Recognition and Object Tracking

Real-time video analysis remains a challenging problem in computer vision, requiring efficient processing of both spatial and temporal information while maintaining computational efficiency. Existing approaches often struggle to balance…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Shahla John

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes

Detecting and recognizing human action in videos with crowded scenes is a challenging problem due to the complex environment and diversity events. Prior works always fail to deal with this problem in two aspects: (1) lacking utilizing…

Computer Vision and Pattern Recognition · Computer Science 2020-10-19 Li Yuan, Yichen Zhou, Shuning Chang, Ziyuan Huang, Yunpeng Chen, Xuecheng Nie, Tao Wang, Jiashi Feng, Shuicheng Yan

Real-Time Detection of Robot Failures Using Gaze Dynamics in Collaborative Tasks

Detecting robot failures during collaborative tasks is crucial for maintaining trust in human-robot interactions. This study investigates user gaze behaviour as an indicator of robot failures, utilising machine learning models to…

Human-Computer Interaction · Computer Science 2025-03-12 Ramtin Tabatabaei, Vassilis Kostakos, Wafa Johal

Improving Zero-Shot Action Recognition using Human Instruction with Text Description

Zero-shot action recognition, which recognizes actions in videos without having received any training examples, is gaining wide attention considering it can save labor costs and training time. Nevertheless, the performance of zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Nan Wu, Hiroshi Kera, Kazuhiko Kawamoto

Enhancing Robot Learning through Learned Human-Attention Feature Maps

Robust and efficient learning remains a challenging problem in robotics, in particular with complex visual inputs. Inspired by human attention mechanism, with which we quickly process complex visual scenes and react to changes in the…

Robotics · Computer Science 2023-08-30 Daniel Scheuchenstuhl, Stefan Ulmer, Felix Resch, Luigi Berducci, Radu Grosu

Robot Action Diagnosis and Experience Correction by Falsifying Parameterised Execution Models

When faced with an execution failure, an intelligent robot should be able to identify the likely reasons for the failure and adapt its execution policy accordingly. This paper addresses the question of how to utilise knowledge about the…

Robotics · Computer Science 2021-05-21 Alex Mitrevski, Paul G. Plöger, Gerhard Lakemeyer

The Effectiveness of Temporal Dependency in Deepfake Video Detection

Deepfakes are a form of synthetic image generation used to generate fake videos of individuals for malicious purposes. The resulting videos may be used to spread misinformation, reduce trust in media, or as a form of blackmail. These…

Computer Vision and Pattern Recognition · Computer Science 2022-05-16 Will Rowan, Nick Pears

A Multimodal Handover Failure Detection Dataset and Baselines

An object handover between a robot and a human is a coordinated action which is prone to failure for reasons such as miscommunication, incorrect actions and unexpected object properties. Existing works on handover failure detection and…

Robotics · Computer Science 2025-08-26 Santosh Thoduka, Nico Hochgeschwender, Juergen Gall, Paul G. Plöger

Learning-Based Safety-Aware Task Scheduling for Efficient Human-Robot Collaboration

Ensuring human safety in collaborative robotics can compromise efficiency because traditional safety measures increase robot cycle time when human interaction is frequent. This paper proposes a safety-aware approach to mitigate efficiency…

Robotics · Computer Science 2025-12-22 M. Faroni, A. Spano, A. M. Zanchettin, P. Rocco

Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition

Deep-Learning-based video recognition has shown promising improvements along with the development of large-scale datasets and spatiotemporal network architectures. In image recognition, learning spatially invariant features is a key factor…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Taeoh Kim, Hyeongmin Lee, MyeongAh Cho, Ho Seong Lee, Dong Heon Cho, Sangyoun Lee

Learning from Demonstration with Failure Awareness for Safe Robot Navigation

Learning from demonstration is widely used for robot navigation, yet it suffers from a fundamental limitation: demonstrations consist predominantly of successful behaviors and provide limited coverage of unsafe states. This limitation leads…

Robotics · Computer Science 2026-04-28 Xianghui Wang, Siwei Cheng, Shanze Wang, Xinming Zhang, Dan Zhang, Wei Zhang

TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking

Multi-object tracking (MOT) in computer vision remains a significant challenge, requiring precise localization and continuous tracking of multiple objects in video sequences. The emergence of data sets that emphasize robust…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Thuc Nguyen-Quang, Minh-Triet Tran