Related papers: Visual Interaction Networks

Visual Grounding of Learned Physical Models

Humans intuitively recognize objects' physical properties and predict their motion, even when the objects are engaged in complicated interactions. The abilities to perform physical reasoning and to adapt to new environments, while intrinsic…

Machine Learning · Computer Science 2020-06-30 Yunzhu Li , Toru Lin , Kexin Yi , Daniel M. Bear , Daniel L. K. Yamins , Jiajun Wu , Joshua B. Tenenbaum , Antonio Torralba

Learning Physical Dynamics for Object-centric Visual Prediction

The ability to model the underlying dynamics of visual scenes and reason about the future is central to human intelligence. Many attempts have been made to empower intelligent systems with such physical understanding and prediction…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Huilin Xu , Tao Chen , Feng Xu

Neural Allocentric Intuitive Physics Prediction from Real Videos

Humans are able to make rich predictions about the future dynamics of physical objects from a glance. On the other hand, most existing computer vision approaches require strong assumptions about the underlying system, ad-hoc modeling, or…

Neural and Evolutionary Computing · Computer Science 2018-09-19 Zhihua Wang , Stefano Rosa , Yishu Miao , Zihang Lai , Linhai Xie , Andrew Markham , Niki Trigoni

Interaction Networks for Learning about Objects, Relations and Physics

Reasoning about objects, relations, and physics is central to human intelligence, and a key goal of artificial intelligence. Here we introduce the interaction network, a model which can reason about how objects in complex systems interact,…

Artificial Intelligence · Computer Science 2016-12-02 Peter W. Battaglia , Razvan Pascanu , Matthew Lai , Danilo Rezende , Koray Kavukcuoglu

3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes

Given a visual scene, humans have strong intuitions about how a scene can evolve over time under given actions. The intuition, often termed visual intuitive physics, is a critical ability that allows us to make effective plans to manipulate…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Haotian Xue , Antonio Torralba , Joshua B. Tenenbaum , Daniel LK Yamins , Yunzhu Li , Hsiao-Yu Tung

Learning Long-term Visual Dynamics with Region Proposal Interaction Networks

Learning long-term dynamics models is the key to understanding physical common sense. Most existing approaches on learning dynamics from visual input sidestep long-term predictions by resorting to rapid re-planning with short-term models.…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Haozhi Qi , Xiaolong Wang , Deepak Pathak , Yi Ma , Jitendra Malik

Graph networks as learnable physics engines for inference and control

Understanding and interacting with everyday physical scenes requires rich knowledge about the structure of the world, represented either implicitly in a value or policy function, or explicitly in a transition model. Here we introduce a new…

Machine Learning · Computer Science 2018-06-05 Alvaro Sanchez-Gonzalez , Nicolas Heess , Jost Tobias Springenberg , Josh Merel , Martin Riedmiller , Raia Hadsell , Peter Battaglia

Unsupervised Learning for Physical Interaction through Video Prediction

A core challenge for an agent learning to interact with the world is to predict how its actions affect objects in its environment. Many existing methods for learning the dynamics of physical interactions require labeled object information.…

Machine Learning · Computer Science 2016-10-19 Chelsea Finn , Ian Goodfellow , Sergey Levine

Predicting the dynamics of 2d objects with a deep residual network

We investigate how a residual network can learn to predict the dynamics of interacting shapes purely as an image-to-image regression task. With a simple 2d physics simulator, we generate short sequences composed of rectangles put in motion…

Computer Vision and Pattern Recognition · Computer Science 2016-11-28 François Fleuret

Learning Predictive Models From Observation and Interaction

Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes.…

Machine Learning · Computer Science 2020-01-01 Karl Schmeckpeper , Annie Xie , Oleh Rybkin , Stephen Tian , Kostas Daniilidis , Sergey Levine , Chelsea Finn

On the difficulty of learning and predicting the long-term dynamics of bouncing objects

The ability to accurately predict the surrounding environment is a foundational principle of intelligence in biological and artificial agents. In recent years, a variety of approaches have been proposed for learning to predict the physical…

Computer Vision and Pattern Recognition · Computer Science 2019-08-01 Alberto Cenzato , Alberto Testolin , Marco Zorzi

Learning Intuitive Physics with Multimodal Generative Models

Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions. This paper presents a perception framework that fuses visual and…

Machine Learning · Computer Science 2021-01-21 Sahand Rezaei-Shoshtari , Francois Robert Hogan , Michael Jenkin , David Meger , Gregory Dudek

Learning to Identify Physical Parameters from Video Using Differentiable Physics

Video representation learning has recently attracted attention in computer vision due to its applications for activity and scene forecasting or vision-based planning and control. Video prediction models often learn a latent representation…

Computer Vision and Pattern Recognition · Computer Science 2020-09-18 Rama Krishna Kandukuri , Jan Achterhold , Michael Möller , Jörg Stückler

Combining Learned and Analytical Models for Predicting Action Effects from Sensory Data

One of the most basic skills a robot should possess is predicting the effect of physical interactions with objects in the environment. This enables optimal action selection to reach a certain goal state. Traditionally, dynamics are…

Robotics · Computer Science 2020-10-13 Alina Kloss , Stefan Schaal , Jeannette Bohg

Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning

This study presents a dynamic neural network model based on the predictive coding framework for perceiving and predicting the dynamic visuo-proprioceptive patterns. In our previous study [1], we have shown that the deep dynamic neural…

Artificial Intelligence · Computer Science 2017-06-09 Jungsik Hwang , Jinhyung Kim , Ahmadreza Ahmadi , Minkyu Choi , Jun Tani

Learning Visual Dynamics Models of Rigid Objects using Relational Inductive Biases

Endowing robots with human-like physical reasoning abilities remains challenging. We argue that existing methods often disregard spatio-temporal relations and by using Graph Neural Networks (GNNs) that incorporate a relational inductive…

Machine Learning · Computer Science 2019-10-24 Fabio Ferreira , Lin Shao , Tamim Asfour , Jeannette Bohg

A Framework for Multisensory Foresight for Embodied Agents

Predicting future sensory states is crucial for learning agents such as robots, drones, and autonomous vehicles. In this paper, we couple multiple sensory modalities with exploratory actions and propose a predictive neural network…

Robotics · Computer Science 2021-09-17 Xiaohui Chen , Ramtin Hosseini , Karen Panetta , Jivko Sinapov

Towards an Interpretable Latent Space in Structured Models for Video Prediction

We focus on the task of future frame prediction in video governed by underlying physical dynamics. We work with models which are object-centric, i.e., explicitly work with object representations, and propagate a loss in the latent space.…

Machine Learning · Computer Science 2021-07-19 Rushil Gupta , Vishal Sharma , Yash Jain , Yitao Liang , Guy Van den Broeck , Parag Singla

Learning and Anticipating Future Actions During Exploratory Data Analysis

The goal of visual analytics is to create a symbiosis between human and computer by leveraging their unique strengths. While this model has demonstrated immense success, we are yet to realize the full potential of such a human-computer…

Human-Computer Interaction · Computer Science 2018-09-27 Ran Wan , Roman Garnett , Alvitta Ottley

The Power of Next-Frame Prediction for Learning Physical Laws

Next-frame prediction is a useful and powerful method for modelling and understanding the dynamics of video data. Inspired by the empirical success of causal language modelling and next-token prediction in language modelling, we explore the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Thomas Winterbottom , G. Thomas Hudson , Daniel Kluvanec , Dean Slack , Jamie Sterling , Junjie Shentu , Chenghao Xiao , Zheming Zhou , Noura Al Moubayed