Related papers: Learning Control by Iterative Inversion

Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition

The design of a reward function often poses a major practical challenge to real-world applications of reinforcement learning. Approaches such as inverse reinforcement learning attempt to overcome this challenge, but require expert…

Machine Learning · Computer Science 2018-11-14 Justin Fu , Avi Singh , Dibya Ghosh , Larry Yang , Sergey Levine

Iterative Machine Teaching

In this paper, we consider the problem of machine teaching, the inverse problem of machine learning. Different from traditional machine teaching which views the learners as batch algorithms, we study a new paradigm where the learner uses an…

Machine Learning · Statistics 2017-11-21 Weiyang Liu , Bo Dai , Ahmad Humayun , Charlene Tay , Chen Yu , Linda B. Smith , James M. Rehg , Le Song

Inverse Reinforcement Learning from Non-Stationary Learning Agents

In this paper, we study an inverse reinforcement learning problem that involves learning the reward function of a learning agent using trajectory data collected while this agent is learning its optimal policy. To address this problem, we…

Machine Learning · Computer Science 2024-10-21 Kavinayan P. Sivakumar , Yi Shen , Zachary Bell , Scott Nivison , Boyuan Chen , Michael M. Zavlanos

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

The goal of imitation learning is to mimic expert behavior from demonstrations, without access to an explicit reward signal. A popular class of approach infers the (unknown) reward function via inverse reinforcement learning (IRL) followed…

Machine Learning · Computer Science 2022-04-19 Carl Qi , Pieter Abbeel , Aditya Grover

Inverse Policy Evaluation for Value-based Sequential Decision-making

Value-based methods for reinforcement learning lack generally applicable ways to derive behavior from a value function. Many approaches involve approximate value iteration (e.g., $Q$-learning), and acting greedily with respect to the…

Machine Learning · Computer Science 2020-08-27 Alan Chan , Kris de Asis , Richard S. Sutton

Task Parameter Extrapolation via Learning Inverse Tasks from Forward Demonstrations

Generalizing skill policies to novel conditions remains a key challenge in robot learning. Imitation learning methods, while data-efficient, are largely confined to the training region and consistently fail on input data outside it, leading…

Robotics · Computer Science 2026-03-10 Serdar Bahar , Fatih Dogangun , Matteo Saveriano , Yukie Nagai , Emre Ugur

Learn Dynamic-Aware State Embedding for Transfer Learning

Transfer reinforcement learning aims to improve the sample efficiency of solving unseen new tasks by leveraging experiences obtained from previous tasks. We consider the setting where all tasks (MDPs) share the same environment dynamic…

Machine Learning · Computer Science 2021-01-08 Kaige Yang

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling

Imitation Learning offers a promising approach to learn directly from data without requiring explicit models, simulations, or detailed task definitions. During inference, actions are sampled from the learned distribution and executed on the…

Robotics · Computer Science 2025-10-28 Amirreza Razmjoo , Sylvain Calinon , Michael Gienger , Fan Zhang

Continuous Inverse Optimal Control with Locally Optimal Examples

Inverse optimal control, also known as inverse reinforcement learning, is the problem of recovering an unknown reward function in a Markov decision process from expert demonstrations of the optimal policy. We introduce a probabilistic…

Machine Learning · Computer Science 2012-06-22 Sergey Levine , Vladlen Koltun

Versatile Inverse Reinforcement Learning via Cumulative Rewards

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert. Current approaches usually do this with generative and uni-modal models, meaning that they…

Machine Learning · Computer Science 2021-11-16 Niklas Freymuth , Philipp Becker , Gerhard Neumann

Outcome-Driven Reinforcement Learning via Variational Inference

While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the…

Machine Learning · Computer Science 2022-12-29 Tim G. J. Rudner , Vitchyr H. Pong , Rowan McAllister , Yarin Gal , Sergey Levine

Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback

We study the problem of teaching via demonstrations in sequential decision-making tasks. In particular, we focus on the situation when the teacher has no access to the learner's model and policy, and the feedback from the learner is limited…

Machine Learning · Computer Science 2023-09-19 Rustam Zayanov , Francisco S. Melo , Manuel Lopes

Learning Objective Functions Incrementally by Inverse Optimal Control

This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments. By saying incrementally, it means that the collection of trajectory…

Robotics · Computer Science 2022-02-03 Zihao Liang , Wanxin Jin , Shaoshuai Mou

Inverse Reinforcement Learning without Reinforcement Learning

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational…

Machine Learning · Computer Science 2024-01-31 Gokul Swamy , Sanjiban Choudhury , J. Andrew Bagnell , Zhiwei Steven Wu

Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications

Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for policy improvement and generalization. However, despite much recent interest in IRL, little work has been done to understand the minimum set of…

Machine Learning · Computer Science 2019-08-19 Daniel S. Brown , Scott Niekum

Neuroevolution-Based Inverse Reinforcement Learning

The problem of Learning from Demonstration is targeted at learning to perform tasks based on observed examples. One approach to Learning from Demonstration is Inverse Reinforcement Learning, in which actions are observed to infer rewards.…

Neural and Evolutionary Computing · Computer Science 2016-08-11 Karan K. Budhraja , Tim Oates

Reward-Conditioned Policies

Reinforcement learning offers the promise of automating the acquisition of complex behavioral skills. However, compared to commonly used and well-understood supervised learning methods, reinforcement learning algorithms can be brittle,…

Machine Learning · Computer Science 2020-01-01 Aviral Kumar , Xue Bin Peng , Sergey Levine

Training Transition Policies via Distribution Matching for Complex Tasks

Humans decompose novel complex tasks into simpler ones to exploit previously learned skills. Analogously, hierarchical reinforcement learning seeks to leverage lower-level policies for simple tasks to solve complex ones. However, because…

Machine Learning · Computer Science 2022-03-15 Ju-Seung Byun , Andrew Perrault

Adversarial Imitation via Variational Inverse Reinforcement Learning

We consider a problem of learning the reward and policy from expert examples under unknown dynamics. Our proposed method builds on the framework of generative adversarial networks and introduces the empowerment-regularized maximum-entropy…

Machine Learning · Computer Science 2019-02-26 Ahmed H. Qureshi , Byron Boots , Michael C. Yip

Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation

We consider the problem of imitation learning from a finite set of expert trajectories, without access to reinforcement signals. The classical approach of extracting the expert's reward function via inverse reinforcement learning, followed…

Machine Learning · Computer Science 2019-06-10 Ruohan Wang , Carlo Ciliberto , Pierluigi Amadori , Yiannis Demiris