Related papers: Planning from Pixels using Inverse Dynamics Models

Learning Latent Dynamics for Planning from Pixels

Planning has been very successful for control tasks with known environment dynamics. To leverage planning in unknown environments, the agent needs to learn the dynamics from interactions with the world. However, learning dynamics models…

Machine Learning · Computer Science 2019-06-06 Danijar Hafner , Timothy Lillicrap , Ian Fischer , Ruben Villegas , David Ha , Honglak Lee , James Davidson

Goal-Aware Prediction: Learning to Model What Matters

Learned dynamics models combined with both planning and policy learning algorithms have shown promise in enabling artificial agents to learn to perform many diverse tasks with limited supervision. However, one of the fundamental challenges…

Machine Learning · Computer Science 2020-08-12 Suraj Nair , Silvio Savarese , Chelsea Finn

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as…

Machine Learning · Computer Science 2021-12-03 Nitish Srivastava , Walter Talbott , Martin Bertran Lopez , Shuangfei Zhai , Josh Susskind

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

We consider the problem of learning useful robotic skills from previously collected offline data without access to manually specified rewards or additional online exploration, a setting that is becoming increasingly important for scaling…

Robotics · Computer Science 2021-06-14 Yevgen Chebotar , Karol Hausman , Yao Lu , Ted Xiao , Dmitry Kalashnikov , Jake Varley , Alex Irpan , Benjamin Eysenbach , Ryan Julian , Chelsea Finn , Sergey Levine

Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) enjoys several benefits, such as data-efficiency and planning, by learning a model of the environment's dynamics. However, learning a global model that can generalize across different dynamics is a…

Machine Learning · Computer Science 2020-06-30 Kimin Lee , Younggyo Seo , Seunghyun Lee , Honglak Lee , Jinwoo Shin

Learning Predictive Models From Observation and Interaction

Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes.…

Machine Learning · Computer Science 2020-01-01 Karl Schmeckpeper , Annie Xie , Oleh Rybkin , Stephen Tian , Kostas Daniilidis , Sergey Levine , Chelsea Finn

Visual Imitation Learning of Non-Prehensile Manipulation Tasks with Dynamics-Supervised Models

Unlike quasi-static robotic manipulation tasks like pick-and-place, dynamic tasks such as non-prehensile manipulation pose greater challenges, especially for vision-based control. Successful control requires the extraction of features…

Robotics · Computer Science 2024-10-28 Abdullah Mustafa , Ryo Hanai , Ixchel Ramirez , Floris Erich , Ryoichi Nakajo , Yukiyasu Domae , Tetsuya Ogata

Weakly-supervised Latent Models for Task-specific Visual-Language Control

Autonomous inspection in hazardous environments requires AI agents that can interpret high-level goals and execute precise control. A key capability for such agents is spatial grounding, for example when a drone must center a detected…

Artificial Intelligence · Computer Science 2025-11-25 Xian Yeow Lee , Lasitha Vidyaratne , Gregory Sin , Ahmed Farahat , Chetan Gupta

Contextual Latent World Models for Offline Meta Reinforcement Learning

Offline meta-reinforcement learning seeks to learn policies that generalize across related tasks from fixed datasets. Context-based methods infer a task representation from transition histories, but learning effective task representations…

Machine Learning · Computer Science 2026-03-04 Mohammadreza Nakheai , Aidan Scannell , Kevin Luck , Joni Pajarinen

Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models

A long-standing goal in AI is to develop agents capable of solving diverse tasks across a range of environments, including those never seen during training. Two dominant paradigms address this challenge: (i) reinforcement learning (RL),…

Machine Learning · Computer Science 2025-10-30 Vlad Sobal , Wancong Zhang , Kyunghyun Cho , Randall Balestriero , Tim G. J. Rudner , Yann LeCun

Reward-free World Models for Online Imitation Learning

Imitation learning (IL) enables agents to acquire skills directly from expert demonstrations, providing a compelling alternative to reinforcement learning. However, prior online IL approaches struggle with complex tasks characterized by…

Machine Learning · Computer Science 2025-05-13 Shangzhe Li , Zhiao Huang , Hao Su

PcLast: Discovering Plannable Continuous Latent States

Goal-conditioned planning benefits from learned low-dimensional representations of rich observations. While compact latent representations typically learned from variational autoencoders or inverse dynamics enable goal-conditioned decision…

Machine Learning · Computer Science 2024-06-12 Anurag Koul , Shivakanth Sujit , Shaoru Chen , Ben Evans , Lili Wu , Byron Xu , Rajan Chari , Riashat Islam , Raihan Seraj , Yonathan Efroni , Lekan Molu , Miro Dudik , John Langford , Alex Lamb

Task-Agnostic Dynamics Priors for Deep Reinforcement Learning

While model-based deep reinforcement learning (RL) holds great promise for sample efficiency and generalization, learning an accurate dynamics model is often challenging and requires substantial interaction with the environment. A wide…

Machine Learning · Computer Science 2019-07-12 Yilun Du , Karthik Narasimhan

Learning Latent State Spaces for Planning through Reward Prediction

Model-based reinforcement learning methods typically learn models for high-dimensional state spaces by aiming to reconstruct and predict the original observations. However, drawing inspiration from model-free reinforcement learning, we…

Machine Learning · Computer Science 2019-12-10 Aaron Havens , Yi Ouyang , Prabhat Nagarajan , Yasuhiro Fujita

What Would You Do? Acting by Learning to Predict

We propose to learn tasks directly from visual demonstrations by learning to predict the outcome of human and robot actions on an environment. We enable a robot to physically perform a human demonstrated task without knowledge of the…

Robotics · Computer Science 2017-03-09 Adam Tow , Niko Sünderhauf , Sareh Shirazi , Michael Milford , Jürgen Leitner

Mapless Humanoid Navigation Using Learned Latent Dynamics

In this paper, we propose a novel Deep Reinforcement Learning approach to address the mapless navigation problem, in which the locomotion actions of a humanoid robot are taken online based on the knowledge encoded in learned models.…

Robotics · Computer Science 2021-08-10 Andre Brandenburger , Diego Rodriguez , Sven Behnke

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu , Tianhe Yu , Chelsea Finn , Stefano Ermon

Planning with Goal-Conditioned Policies

Planning methods can solve temporally extended sequential decision making problems by composing simple behaviors. However, planning requires suitable abstractions for the states and transitions, which typically need to be designed by hand.…

Machine Learning · Computer Science 2019-11-20 Soroush Nasiriany , Vitchyr H. Pong , Steven Lin , Sergey Levine

Planning from Observation and Interaction

Observational learning requires an agent to learn to perform a task by referencing only observations of the performed task. This work investigates the equivalent setting in real-world robot learning where access to hand-designed rewards and…

Robotics · Computer Science 2026-03-02 Tyler Han , Siyang Shen , Rohan Baijal , Harine Ravichandiran , Bat Nemekhbold , Kevin Huang , Sanghun Jung , Byron Boots

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

Humans are masters at quickly learning many complex tasks, relying on an approximate understanding of the dynamics of their environments. In much the same way, we would like our learning agents to quickly adapt to new tasks. In this paper,…

Robotics · Computer Science 2019-10-10 Arunkumar Byravan , Jost Tobias Springenberg , Abbas Abdolmaleki , Roland Hafner , Michael Neunert , Thomas Lampe , Noah Siegel , Nicolas Heess , Martin Riedmiller