English
Related papers

Related papers: Learning Control by Iterative Inversion

200 papers

The design of a reward function often poses a major practical challenge to real-world applications of reinforcement learning. Approaches such as inverse reinforcement learning attempt to overcome this challenge, but require expert…

Machine Learning · Computer Science 2018-11-14 Justin Fu , Avi Singh , Dibya Ghosh , Larry Yang , Sergey Levine

In this paper, we consider the problem of machine teaching, the inverse problem of machine learning. Different from traditional machine teaching which views the learners as batch algorithms, we study a new paradigm where the learner uses an…

Machine Learning · Statistics 2017-11-21 Weiyang Liu , Bo Dai , Ahmad Humayun , Charlene Tay , Chen Yu , Linda B. Smith , James M. Rehg , Le Song

In this paper, we study an inverse reinforcement learning problem that involves learning the reward function of a learning agent using trajectory data collected while this agent is learning its optimal policy. To address this problem, we…

Machine Learning · Computer Science 2024-10-21 Kavinayan P. Sivakumar , Yi Shen , Zachary Bell , Scott Nivison , Boyuan Chen , Michael M. Zavlanos

The goal of imitation learning is to mimic expert behavior from demonstrations, without access to an explicit reward signal. A popular class of approach infers the (unknown) reward function via inverse reinforcement learning (IRL) followed…

Machine Learning · Computer Science 2022-04-19 Carl Qi , Pieter Abbeel , Aditya Grover

Value-based methods for reinforcement learning lack generally applicable ways to derive behavior from a value function. Many approaches involve approximate value iteration (e.g., $Q$-learning), and acting greedily with respect to the…

Machine Learning · Computer Science 2020-08-27 Alan Chan , Kris de Asis , Richard S. Sutton

Generalizing skill policies to novel conditions remains a key challenge in robot learning. Imitation learning methods, while data-efficient, are largely confined to the training region and consistently fail on input data outside it, leading…

Robotics · Computer Science 2026-03-10 Serdar Bahar , Fatih Dogangun , Matteo Saveriano , Yukie Nagai , Emre Ugur

Transfer reinforcement learning aims to improve the sample efficiency of solving unseen new tasks by leveraging experiences obtained from previous tasks. We consider the setting where all tasks (MDPs) share the same environment dynamic…

Machine Learning · Computer Science 2021-01-08 Kaige Yang

Imitation Learning offers a promising approach to learn directly from data without requiring explicit models, simulations, or detailed task definitions. During inference, actions are sampled from the learned distribution and executed on the…

Robotics · Computer Science 2025-10-28 Amirreza Razmjoo , Sylvain Calinon , Michael Gienger , Fan Zhang

Inverse optimal control, also known as inverse reinforcement learning, is the problem of recovering an unknown reward function in a Markov decision process from expert demonstrations of the optimal policy. We introduce a probabilistic…

Machine Learning · Computer Science 2012-06-22 Sergey Levine , Vladlen Koltun

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert. Current approaches usually do this with generative and uni-modal models, meaning that they…

Machine Learning · Computer Science 2021-11-16 Niklas Freymuth , Philipp Becker , Gerhard Neumann

While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the…

Machine Learning · Computer Science 2022-12-29 Tim G. J. Rudner , Vitchyr H. Pong , Rowan McAllister , Yarin Gal , Sergey Levine

We study the problem of teaching via demonstrations in sequential decision-making tasks. In particular, we focus on the situation when the teacher has no access to the learner's model and policy, and the feedback from the learner is limited…

Machine Learning · Computer Science 2023-09-19 Rustam Zayanov , Francisco S. Melo , Manuel Lopes

This paper proposes an inverse optimal control method which enables a robot to incrementally learn a control objective function from a collection of trajectory segments. By saying incrementally, it means that the collection of trajectory…

Robotics · Computer Science 2022-02-03 Zihao Liang , Wanxin Jin , Shaoshuai Mou

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational…

Machine Learning · Computer Science 2024-01-31 Gokul Swamy , Sanjiban Choudhury , J. Andrew Bagnell , Zhiwei Steven Wu

Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for policy improvement and generalization. However, despite much recent interest in IRL, little work has been done to understand the minimum set of…

Machine Learning · Computer Science 2019-08-19 Daniel S. Brown , Scott Niekum

The problem of Learning from Demonstration is targeted at learning to perform tasks based on observed examples. One approach to Learning from Demonstration is Inverse Reinforcement Learning, in which actions are observed to infer rewards.…

Neural and Evolutionary Computing · Computer Science 2016-08-11 Karan K. Budhraja , Tim Oates

Reinforcement learning offers the promise of automating the acquisition of complex behavioral skills. However, compared to commonly used and well-understood supervised learning methods, reinforcement learning algorithms can be brittle,…

Machine Learning · Computer Science 2020-01-01 Aviral Kumar , Xue Bin Peng , Sergey Levine

Humans decompose novel complex tasks into simpler ones to exploit previously learned skills. Analogously, hierarchical reinforcement learning seeks to leverage lower-level policies for simple tasks to solve complex ones. However, because…

Machine Learning · Computer Science 2022-03-15 Ju-Seung Byun , Andrew Perrault

We consider a problem of learning the reward and policy from expert examples under unknown dynamics. Our proposed method builds on the framework of generative adversarial networks and introduces the empowerment-regularized maximum-entropy…

Machine Learning · Computer Science 2019-02-26 Ahmed H. Qureshi , Byron Boots , Michael C. Yip

We consider the problem of imitation learning from a finite set of expert trajectories, without access to reinforcement signals. The classical approach of extracting the expert's reward function via inverse reinforcement learning, followed…

Machine Learning · Computer Science 2019-06-10 Ruohan Wang , Carlo Ciliberto , Pierluigi Amadori , Yiannis Demiris
‹ Prev 1 2 3 10 Next ›