Related papers: Object-Oriented Dynamics Predictor

Object-Oriented Dynamics Learning through Multi-Level Abstraction

Object-based approaches for learning action-conditioned dynamics has demonstrated promise for generalization and interpretability. However, existing approaches suffer from structural limitations and optimization difficulties for common…

Machine Learning · Computer Science 2019-12-06 Guangxiang Zhu , Jianhao Wang , Zhizhou Ren , Zichuan Lin , Chongjie Zhang

Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions

We propose a novel framework for the task of object-centric video prediction, i.e., extracting the compositional structure of a video sequence, as well as modeling objects dynamics and interactions from visual observations in order to…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Angel Villar-Corrales , Ismail Wahdan , Sven Behnke

Learning Physical Dynamics for Object-centric Visual Prediction

The ability to model the underlying dynamics of visual scenes and reason about the future is central to human intelligence. Many attempts have been made to empower intelligent systems with such physical understanding and prediction…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Huilin Xu , Tao Chen , Feng Xu

Learning Causal Dynamics Models in Object-Oriented Environments

Causal dynamics models (CDMs) have demonstrated significant potential in addressing various challenges in reinforcement learning. To learn CDMs, recent studies have performed causal discovery to capture the causal dependencies among…

Machine Learning · Computer Science 2024-05-22 Zhongwei Yu , Jingqing Ruan , Dengpeng Xing

OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics

Human perception involves decomposing complex multi-object scenes into time-static object appearance (i.e., size, shape, color) and time-varying object motion (i.e., position, velocity, acceleration). For machines to achieve human-like…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Yeon-Ji Song , Jaein Kim , Suhyung Choi , Jin-Hwa Kim , Byoung-Tak Zhang

Reasoning About Physical Interactions with Object-Oriented Prediction and Planning

Object-based factorizations provide a useful level of abstraction for interacting with the world. Building explicit object representations, however, often requires supervisory signals that are difficult to obtain in practice. We present a…

Machine Learning · Computer Science 2019-01-08 Michael Janner , Sergey Levine , William T. Freeman , Joshua B. Tenenbaum , Chelsea Finn , Jiajun Wu

Object-centric Video Prediction without Annotation

In order to interact with the world, agents must be able to predict the results of the world's dynamics. A natural approach to learn about these dynamics is through video prediction, as cameras are ubiquitous and powerful sensors. Direct…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Karl Schmeckpeper , Georgios Georgakis , Kostas Daniilidis

Learning Generalizable Physical Dynamics of 3D Rigid Objects

Humans have a remarkable ability to predict the effect of physical interactions on the dynamics of objects. Endowing machines with this ability would allow important applications in areas like robotics and autonomous vehicles. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2019-01-03 Davis Rempe , Srinath Sridhar , He Wang , Leonidas J. Guibas

ODIP: Towards Automatic Adaptation for Object Detection by Interactive Perception

Object detection plays a deep role in visual systems by identifying instances for downstream algorithms. In industrial scenarios, however, a slight change in manufacturing systems would lead to costly data re-collection and human annotation…

Robotics · Computer Science 2021-08-04 Tung-I Chen , Jen-Wei Wang , Winston H. Hsu

Conditional Object-Centric Learning from Video

Object-centric representations are a promising path toward more systematic generalization by providing flexible abstractions upon which compositional world models can be built. Recent work on simple 2D and 3D datasets has shown that models…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Thomas Kipf , Gamaleldin F. Elsayed , Aravindh Mahendran , Austin Stone , Sara Sabour , Georg Heigold , Rico Jonschkowski , Alexey Dosovitskiy , Klaus Greff

Object-Oriented Transition Modeling with Inductive Logic Programming

Building models of the world from observation, i.e., induction, is one of the major challenges in machine learning. In order to be useful, models need to maintain accuracy when used in novel situations, i.e., generalize. In addition, they…

Machine Learning · Computer Science 2026-02-10 Gabriel Stella , Dmitri Loguinov

Object-Oriented Dynamic Networks

This paper contains description of such knowledge representation model as Object-Oriented Dynamic Network (OODN), which gives us an opportunity to represent knowledge, which can be modified in time, to build new relations between objects…

Artificial Intelligence · Computer Science 2015-10-15 Dmytro Terletskyi , Alexandr Provotar

Learning Physical Constraints with Neural Projections

We propose a new family of neural networks to predict the behaviors of physical systems by learning their underpinning constraints. A neural projection operator lies at the heart of our approach, composed of a lightweight network with an…

Neural and Evolutionary Computing · Computer Science 2020-12-15 Shuqi Yang , Xingzhe He , Bo Zhu

Object-oriented Neural Programming (OONP) for Document Understanding

We propose Object-oriented Neural Programming (OONP), a framework for semantically parsing documents in specific domains. Basically, OONP reads a document and parses it into a predesigned object-oriented data structure (referred to as…

Machine Learning · Computer Science 2018-07-26 Zhengdong Lu , Xianggen Liu , Haotian Cui , Yukun Yan , Daqi Zheng

Deep Object-Centric Representations for Generalizable Robot Learning

Robotic manipulation in complex open-world scenarios requires both reliable physical manipulation skills and effective and generalizable perception. In this paper, we propose a method where general purpose pretrained visual models serve as…

Robotics · Computer Science 2017-09-27 Coline Devin , Pieter Abbeel , Trevor Darrell , Sergey Levine

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers

Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further, ask the following question: "can…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Sanket Gandhi , Atul , Samanyu Mahajan , Vishal Sharma , Rushil Gupta , Arnab Kumar Mondal , Parag Singla

Unsupervised Learning of Latent Physical Properties Using Perception-Prediction Networks

We propose a framework for the completely unsupervised learning of latent object properties from their interactions: the perception-prediction network (PPN). Consisting of a perception module that extracts representations of latent object…

Machine Learning · Computer Science 2018-07-27 David Zheng , Vinson Luo , Jiajun Wu , Joshua B. Tenenbaum

OCTNet: Trajectory Generation in New Environments from Past Experiences

Being able to safely operate for extended periods of time in dynamic environments is a critical capability for autonomous systems. This generally involves the prediction and understanding of motion patterns of dynamic entities, such as…

Robotics · Computer Science 2019-09-26 Weiming Zhi , Tin Lai , Lionel Ott , Gilad Francis , Fabio Ramos

Dyn-O: Building Structured World Models with Object-Centric Representations

World models aim to capture the dynamics of the environment, enabling agents to predict and plan for future states. In most scenarios of interest, the dynamics are highly centered on interactions among objects within the environment. This…

Machine Learning · Computer Science 2025-07-08 Zizhao Wang , Kaixin Wang , Li Zhao , Peter Stone , Jiang Bian

Unsupervised Moving Object Detection via Contextual Information Separation

We propose an adversarial contextual model for detecting moving objects in images. A deep neural network is trained to predict the optical flow in a region using information from everywhere else but that region (context), while another…

Computer Vision and Pattern Recognition · Computer Science 2019-04-16 Yanchao Yang , Antonio Loquercio , Davide Scaramuzza , Stefano Soatto