Related papers: Visual Robot Task Planning

Learning to Imagine Manipulation Goals for Robot Task Planning

Prospection is an important part of how humans come up with new task plans, but has not been explored in depth in robotics. Predicting multiple task-level is a challenging problem that involves capturing both task semantics and continuous…

Machine Learning · Computer Science 2017-11-13 Chris Paxton , Kapil Katyal , Christian Rupprecht , Raman Arora , Gregory D. Hager

Prospection: Interpretable Plans From Language By Predicting the Future

High-level human instructions often correspond to behaviors with multiple implicit steps. In order for robots to be useful in the real world, they must be able to to reason over both motions and intermediate goals implied by human…

Artificial Intelligence · Computer Science 2019-03-21 Chris Paxton , Yonatan Bisk , Jesse Thomason , Arunkumar Byravan , Dieter Fox

Visual Semantic Planning using Deep Successor Representations

A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Yuke Zhu , Daniel Gordon , Eric Kolve , Dieter Fox , Li Fei-Fei , Abhinav Gupta , Roozbeh Mottaghi , Ali Farhadi

Self-Supervised Visual Planning with Temporal Skip Connections

In order to autonomously learn wide repertoires of complex skills, robots must be able to learn from their own autonomously collected data, without human supervision. One learning signal that is always available for autonomously collected…

Robotics · Computer Science 2017-10-18 Frederik Ebert , Chelsea Finn , Alex X. Lee , Sergey Levine

Learning Robotic Manipulation through Visual Planning and Acting

Planning for robotic manipulation requires reasoning about the changes a robot can affect on objects. When such interactions can be modelled analytically, as in domains with rigid objects, efficient planning algorithms exist. However, in…

Robotics · Computer Science 2019-05-14 Angelina Wang , Thanard Kurutach , Kara Liu , Pieter Abbeel , Aviv Tamar

Visual Perspective Taking for Opponent Behavior Modeling

In order to engage in complex social interaction, humans learn at a young age to infer what others see and cannot see from a different point-of-view, and learn to predict others' plans and behaviors. These abilities have been mostly lacking…

Robotics · Computer Science 2021-05-12 Boyuan Chen , Yuhang Hu , Robert Kwiatkowski , Shuran Song , Hod Lipson

Visual Task Progress Estimation with Appearance Invariant Embeddings for Robot Control and Planning

One of the challenges of full autonomy is to have a robot capable of manipulating its current environment to achieve another environment configuration. This paper is a step towards this challenge, focusing on the visual understanding of the…

Robotics · Computer Science 2020-11-24 Guilherme Maeda , Joni Väätäinen , Hironori Yoshida

Deep Visual Constraints: Neural Implicit Models for Manipulation Planning from Visual Input

Manipulation planning is the problem of finding a sequence of robot configurations that involves interactions with objects in the scene, e.g., grasping and placing an object, or more general tool-use. To achieve such interactions,…

Robotics · Computer Science 2022-08-01 Jung-Su Ha , Danny Driess , Marc Toussaint

Planning for Multi-Object Manipulation with Graph Neural Network Relational Classifiers

Objects rarely sit in isolation in human environments. As such, we'd like our robots to reason about how multiple objects relate to one another and how those relations may change as the robot interacts with the world. To this end, we…

Robotics · Computer Science 2023-03-20 Yixuan Huang , Adam Conkey , Tucker Hermans

Towards Robot Task Planning From Probabilistic Models of Human Skills

We describe an algorithm for motion planning based on expert demonstrations of a skill. In order to teach robots to perform complex object manipulation tasks that can generalize robustly to new environments, we must (1) learn a…

Robotics · Computer Science 2016-02-16 Chris Paxton , Marin Kobilarov , Gregory D. Hager

Learning Representations for Predicting Future Activities

Foreseeing the future is one of the key factors of intelligence. It involves understanding of the past and current environment as well as decent experience of its possible dynamics. In this work, we address future prediction at the abstract…

Computer Vision and Pattern Recognition · Computer Science 2019-05-16 Mohammadreza Zolfaghari , Özgün Çiçek , Syed Mohsin Ali , Farzaneh Mahdisoltani , Can Zhang , Thomas Brox

Learning Sensorimotor Primitives of Sequential Manipulation Tasks from Visual Demonstrations

This work aims to learn how to perform complex robot manipulation tasks that are composed of several, consecutively executed low-level sub-tasks, given as input a few visual demonstrations of the tasks performed by a person. The sub-tasks…

Robotics · Computer Science 2022-03-09 Junchi Liang , Bowen Wen , Kostas Bekris , Abdeslam Boularias

Enhanced Robot Planning and Perception through Environment Prediction

Mobile robots rely on maps to navigate through an environment. In the absence of any map, the robots must build the map online from partial observations as they move in the environment. Traditional methods build a map using only direct…

Robotics · Computer Science 2024-10-14 Vishnu Dutt Sharma

Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning

This study presents a dynamic neural network model based on the predictive coding framework for perceiving and predicting the dynamic visuo-proprioceptive patterns. In our previous study [1], we have shown that the deep dynamic neural…

Artificial Intelligence · Computer Science 2017-06-09 Jungsik Hwang , Jinhyung Kim , Ahmadreza Ahmadi , Minkyu Choi , Jun Tani

Planning Robot Motion using Deep Visual Prediction

In this paper, we introduce a novel framework that can learn to make visual predictions about the motion of a robotic agent from raw video frames. Our proposed motion prediction network (PROM-Net) can learn in a completely unsupervised…

Robotics · Computer Science 2019-06-26 Meenakshi Sarkar , Prabhu Pradhan , Debasish Ghose

Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNN

Trajectory planning in robotics is understood as generating a sequence of joint configurations that will lead a robotic agent, or its manipulator, from an initial state to the desired final state, thus completing a manipulation task while…

Robotics · Computer Science 2025-09-24 Miroslav Cibula , Kristína Malinovská , Matthias Kerzel

Generating Reliable and Efficient Predictions of Human Motion: A Promising Encounter between Physics and Neural Networks

Generating accurate and efficient predictions for the motion of the humans present in the scene is key to the development of effective motion planning algorithms for robots moving in promiscuous areas, where wrong planning decisions could…

Robotics · Computer Science 2022-03-04 Alessandro Antonucci , Gastone Pietro Rosati Papini , Luigi Palopoli , Daniele Fontanelli

Generalizable Task Planning through Representation Pretraining

The ability to plan for multi-step manipulation tasks in unseen situations is crucial for future home robots. But collecting sufficient experience data for end-to-end learning is often infeasible in the real world, as deploying robots in…

Robotics · Computer Science 2022-05-18 Chen Wang , Danfei Xu , Li Fei-Fei

Transformers for One-Shot Visual Imitation

Humans are able to seamlessly visually imitate others, by inferring their intentions and using past experience to achieve the same end goal. In other words, we can parse complex semantic knowledge from raw video and efficiently translate…

Machine Learning · Computer Science 2020-11-12 Sudeep Dasari , Abhinav Gupta

Neural World Models for Computer Vision

Humans navigate in their environment by learning a mental model of the world through passive observation and active interaction. Their world model allows them to anticipate what might happen next and act accordingly with respect to an…

Computer Vision and Pattern Recognition · Computer Science 2023-06-16 Anthony Hu