Related papers: Learning Memory-Dependent Continuous Control from …

Watch, Try, Learn: Meta-Learning from Demonstrations and Reward

Imitation learning allows agents to learn complex behaviors from demonstrations. However, learning a complex vision-based task may require an impractical number of demonstrations. Meta-imitation learning is a promising approach towards…

Machine Learning · Computer Science 2020-02-03 Allan Zhou, Eric Jang, Daniel Kappler, Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning is limited in practice primarily by the number of interactions it requires with virtual environments. This makes the problem challenging because it is implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu, Kexin Liu, Lei Chen

Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities

How to learn an effective reinforcement learning-based model for control tasks from high-level visual observations is a practical and challenging problem. A key to solving this problem is to learn low-dimensional state representations from…

Machine Learning · Computer Science 2022-12-27 Jianda Chen, Sinno Jialin Pan

Residual Reinforcement Learning from Demonstrations

Residual reinforcement learning (RL) has been proposed as a way to solve challenging robotic tasks by adapting control actions from a conventional feedback controller to maximize a reward signal. We extend the residual formulation to learn…

Machine Learning · Computer Science 2021-06-16 Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, Jean Ponce, Cordelia Schmid

Augmented Replay Memory in Reinforcement Learning With Continuous Control

Online reinforcement learning agents are currently able to process an increasing amount of data by converting it into higher-order value functions. This expansion of the information collected from the environment increases the agent's…

Machine Learning · Computer Science 2021-02-04 Mirza Ramicic, Andrea Bonarini

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

In many robotic applications, some aspects of the system dynamics can be modeled accurately while others are difficult to obtain or model. We present a novel reinforcement learning (RL) method for continuous state and action spaces that…

Artificial Intelligence · Computer Science 2017-06-06 Tomoki Nishi, Prashant Doshi, Michael R. James, Danil Prokhorov

Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling

Reinforcement learning algorithms can acquire policies for complex tasks autonomously. However, the number of samples required to learn a diverse set of skills can be prohibitively large. While meta-reinforcement learning methods have…

Machine Learning · Computer Science 2020-06-17 Russell Mendonca, Xinyang Geng, Chelsea Finn, Sergey Levine

Continual Learning Using World Models for Pseudo-Rehearsal

The utility of learning a dynamics/world model of the environment in reinforcement learning has been shown in many ways. When using neural networks, however, these models suffer catastrophic forgetting when learned in a lifelong or…

Machine Learning · Computer Science 2019-06-12 Nicholas Ketz, Soheil Kolouri, Praveen Pilly

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments. The use of demonstrations, on the other hand, enables agents to benefit…

Machine Learning · Computer Science 2023-03-29 Tongzhou Mu, Hao Su

Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations

Currently, deep reinforcement learning (RL) shows impressive results in complex gaming and robotic environments. Often these results are achieved at the expense of huge computational costs and require an incredible number of episodes of…

Machine Learning · Computer Science 2020-06-18 Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov

A Survey of Demonstration Learning

With the fast improvement of machine learning, reinforcement learning (RL) has been used to automate human tasks in different areas. However, training such agents is difficult and restricted to expert users. Moreover, it is mostly limited…

Machine Learning · Computer Science 2023-03-21 André Correia, Luís A. Alexandre

CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations

This paper presents CONTHER, a novel reinforcement learning algorithm designed to efficiently and rapidly train robotic agents for goal-oriented manipulation tasks and obstacle avoidance. The algorithm uses a modified replay buffer inspired…

Robotics · Computer Science 2025-03-21 Maria Makarova, Qian Liu, Dzmitry Tsetserukou

Incorporating Recurrent Reinforcement Learning into Model Predictive Control for Adaptive Control in Autonomous Driving

Model Predictive Control (MPC) is attracting tremendous attention in the autonomous driving task as a powerful control technique. The success of an MPC controller strongly depends on an accurate internal dynamics model. However, the static…

Machine Learning · Computer Science 2023-04-28 Yuan Zhang, Joschka Boedecker, Chuxuan Li, Guyue Zhou

Lifelong Inverse Reinforcement Learning

Methods for learning from demonstration (LfD) have shown success in acquiring behavior policies by imitating a user. However, even for a single task, LfD may require numerous demonstrations. For versatile agents that must learn many tasks…

Machine Learning · Computer Science 2022-07-04 Jorge A. Mendez, Shashank Shivkumar, Eric Eaton

MemER: Scaling Up Memory for Robot Control via Experience Retrieval

Humans routinely rely on memory to perform tasks, yet most robot policies lack this capability; our goal is to endow robot policies with the same ability. Naively conditioning on long observation histories is computationally expensive and…

Robotics · Computer Science 2025-10-24 Ajay Sridhar, Jennifer Pan, Satvik Sharma, Chelsea Finn

Reinforcement Learning Controllers for Soft Robots using Learned Environments

Soft robotic manipulators offer operational advantages due to their compliant and deformable structures. However, their inherently nonlinear dynamics present substantial challenges. Traditional analytical methods often depend on simplifying…

Robotics · Computer Science 2024-10-28 Uljad Berdica, Matthew Jackson, Niccolò Enrico Veronese, Jakob Foerster, Perla Maiolino

Practical Recommendations for Replay-based Continual Learning Methods

Continual Learning requires the model to learn from a stream of dynamic, non-stationary data without forgetting previous knowledge. Several approaches have been developed in the literature to tackle the Continual Learning challenge. Among…

Machine Learning · Computer Science 2022-11-30 Gabriele Merlin, Vincenzo Lomonaco, Andrea Cossu, Antonio Carta, Davide Bacciu

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu, Tianhe Yu, Chelsea Finn, Stefano Ermon

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

On the Possibility of Learning in Reactive Environments with Arbitrary Dependence

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions, i.e. environments more general than (PO)MDPs. The task for an agent is to attain…

Machine Learning · Computer Science 2009-12-30 Daniil Ryabko, Marcus Hutter