English
Related papers

Related papers: Object-centric Forward Modeling for Model Predicti…

200 papers

Robotic manipulation in complex open-world scenarios requires both reliable physical manipulation skills and effective and generalizable perception. In this paper, we propose a method where general purpose pretrained visual models serve as…

Robotics · Computer Science 2017-09-27 Coline Devin , Pieter Abbeel , Trevor Darrell , Sergey Levine

Predictive world models enable agents to model scene dynamics and reason about the consequences of their actions. Inspired by human perception, object-centric world models capture scene dynamics using object-level representations, which can…

Machine Learning · Computer Science 2026-05-15 Jonathan Spieler , Angel Villar-Corrales , Sven Behnke

The ability to model the underlying dynamics of visual scenes and reason about the future is central to human intelligence. Many attempts have been made to empower intelligent systems with such physical understanding and prediction…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Huilin Xu , Tao Chen , Feng Xu

This paper focuses on building object-centric representations for long-term action anticipation in videos. Our key motivation is that objects provide important cues to recognize and predict human-object interactions, especially when the…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Ce Zhang , Changcheng Fu , Shijie Wang , Nakul Agarwal , Kwonjoon Lee , Chiho Choi , Chen Sun

The goal of object-centric representation learning is to decompose visual scenes into a structured representation that isolates the entities. Recent successes have shown that object-centric representation learning can be scaled to…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Aniket Didolkar , Andrii Zadaianchuk , Anirudh Goyal , Mike Mozer , Yoshua Bengio , Georg Martius , Maximilian Seitzer

Object-centric representation (OCR) has recently become a subject of interest in the computer vision community for learning a structured representation of images and videos. It has been several times presented as a potential way to improve…

Artificial Intelligence · Computer Science 2025-06-25 Alexandre Chapin , Emmanuel Dellandrea , Liming Chen

Understanding the world in terms of objects and the possible interplays with them is an important cognition ability, especially in robotics manipulation, where many tasks require robot-object interactions. However, learning such a…

Robotics · Computer Science 2023-07-10 Stefano Ferraro , Pietro Mazzaglia , Tim Verbelen , Bart Dhoedt

The objective of this work is to learn an object-centric video representation, with the aim of improving transferability to novel tasks, i.e., tasks different from the pre-training task of action classification. To this end, we introduce a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Chuhan Zhang , Ankush Gupta , Andrew Zisserman

Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes.…

Machine Learning · Computer Science 2020-01-01 Karl Schmeckpeper , Annie Xie , Oleh Rybkin , Stephen Tian , Kostas Daniilidis , Sergey Levine , Chelsea Finn

A world model is essential for an agent to predict the future and plan in domains such as autonomous driving and robotics. To achieve this, recent advancements have focused on video generation, which has gained significant attention due to…

Artificial Intelligence · Computer Science 2025-03-13 Youngjoon Jeong , Junha Chun , Soonwoo Cha , Taesup Kim

The robotic manipulation of compliant objects is currently one of the most active problems in robotics due to its potential to automate many important applications. Despite the progress achieved by the robotics community in recent years,…

Robotics · Computer Science 2022-05-23 Jiaming Qi , Dongyu Li , Yufeng Gao , Peng Zhou , David Navarro-Alarcon

Efficient learning from demonstration for long-horizon tasks remains an open challenge in robotics. While significant effort has been directed toward learning trajectories, a recent resurgence of object-centric approaches has demonstrated…

Robotics · Computer Science 2025-12-01 Adrian Röfer , Russell Buchanan , Max Argus , Sethu Vijayakumar , Abhinav Valada

Large offline learning-based models have enabled robots to successfully interact with objects for a wide variety of tasks. However, these models rely on fairly consistent structured environments. For more unstructured environments, an…

Robotics · Computer Science 2023-07-20 Nikhil U. Shinde , Jacob Johnson , Sylvia Herbert , Michael C. Yip

It is doubtful that animals have perfect inverse models of their limbs (e.g., what muscle contraction must be applied to every joint to reach a particular location in space). However, in robot control, moving an arm's end-effector to a…

Robotics · Computer Science 2022-09-19 Justus Huebotter , Serge Thill , Marcel van Gerven , Pablo Lanillos

Agents that understand objects and their interactions can learn policies that are more robust and transferable. However, most object-centric RL methods factor state by individual objects while leaving interactions implicit. We introduce the…

Machine Learning · Computer Science 2025-11-05 Fan Feng , Phillip Lippe , Sara Magliacane

Model-based control is a popular paradigm for robot navigation because it can leverage a known dynamics model to efficiently plan robust robot trajectories. However, it is challenging to use model-based methods in settings where the…

Robotics · Computer Science 2019-07-19 Somil Bansal , Varun Tolani , Saurabh Gupta , Jitendra Malik , Claire Tomlin

Learned visual dynamics models have proven effective for robotic manipulation tasks. Yet, it remains unclear how best to represent scenes involving multi-object interactions. Current methods decompose a scene into discrete objects, but they…

Object-centric processes (a.k.a. Artifact-centric processes) are implementations of a paradigm where an instance of one process is not executed in isolation but interacts with other instances of the same or other processes. Interactions…

Machine Learning · Computer Science 2022-03-08 Riccardo Galanti , Massimiliano de Leoni , Nicolò Navarin , Alan Marazzi

Learning an accurate model of the environment is essential for model-based control tasks. Existing methods in robotic visuomotor control usually learn from data with heavily labelled actions, object entities or locations, which can be…

Robotics · Computer Science 2021-07-27 Haoqi Yuan , Ruihai Wu , Andrew Zhao , Haipeng Zhang , Zihan Ding , Hao Dong

Object-based factorizations provide a useful level of abstraction for interacting with the world. Building explicit object representations, however, often requires supervisory signals that are difficult to obtain in practice. We present a…

Machine Learning · Computer Science 2019-01-08 Michael Janner , Sergey Levine , William T. Freeman , Joshua B. Tenenbaum , Chelsea Finn , Jiajun Wu
‹ Prev 1 2 3 10 Next ›