Related papers: Maximum Entropy Multi-Task Inverse RL

200 papers

Multi-task Inverse Reinforcement Learning (IRL) is the problem of inferring multiple reward functions from expert demonstrations. Prior work, built on Bayesian IRL, is unable to scale to complex environments due to computational…

Machine Learning · Computer Science 2018-07-17 Adam Gleave, Oliver Habryka

We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcement Learning (IRL), which provides a principled method to find a most non-committal reward function consistent with given expert…

Machine Learning · Computer Science 2021-06-08 Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

A popular approach to apprenticeship learning (AL) is to formulate it as an inverse reinforcement learning (IRL) problem. The MaxEnt-IRL algorithm successfully integrates the maximum entropy principle into IRL and unlike its predecessors,…

Machine Learning · Computer Science 2026-04-23 Julien Audiffren, Michal Valko, Alessandro Lazaric, Mohammad Ghavamzadeh

This paper presents a general framework for exploiting the representational capacity of neural networks to approximate complex, nonlinear reward functions in the context of solving the inverse reinforcement learning (IRL) problem. We show…

Machine Learning · Computer Science 2016-03-14 Markus Wulfmeier, Peter Ondruska, Ingmar Posner

Reinforcement learning (RL) algorithms aim to balance exploiting the current best strategy with exploring new options that could lead to higher rewards. Most common RL algorithms use undirected exploration, i.e., select random sequences of…

Machine Learning · Computer Science 2025-08-01 Bhavya Sukhija, Stelian Coros, Andreas Krause, Pieter Abbeel, Carmelo Sferrazza

Various methods for solving the inverse reinforcement learning (IRL) problem have been developed independently in machine learning and economics. In particular, the method of Maximum Causal Entropy IRL is based on the perspective of entropy…

Machine Learning · Computer Science 2021-03-05 Navyata Sanghvi, Shinnosuke Usami, Mohit Sharma, Joachim Groeger, Kris Kitani

In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning (RL) become increasingly powerful at solving the forward problem: given…

Robotics · Computer Science 2020-06-25 Zheng Wu, Liting Sun, Wei Zhan, Chenyu Yang, Masayoshi Tomizuka

The Maximum Entropy Reinforcement Learning (MaxEnt RL) framework is a leading approach for achieving efficient learning and robust performance across many RL tasks. However, MaxEnt methods have also been shown to struggle with…

Machine Learning · Computer Science 2025-06-13 Ruipeng Zhang, Ya-Chien Chang, Sicun Gao

Many potential applications of reinforcement learning (RL) require guarantees that the agent will perform well in the face of disturbances to the dynamics or reward function. In this paper, we prove theoretically that maximum entropy…

Machine Learning · Computer Science 2022-05-06 Benjamin Eysenbach, Sergey Levine

While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on estimating a reward function that best explains an expert agent's policy or demonstrated behavior on a control task, it is often the case that such…

Machine Learning · Computer Science 2020-05-01 Dexter R. R. Scobee, S. Shankar Sastry

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert. Many algorithms for IRL have an inherently nested…

Machine Learning · Computer Science 2022-11-02 Siliang Zeng, Chenliang Li, Alfredo Garcia, Mingyi Hong

We study inverse reinforcement learning (IRL) and imitation learning (IM), the problems of recovering a reward or policy function from an expert's demonstrated trajectories. We propose a new way to improve the learning process by adding a…

Machine Learning · Computer Science 2022-08-23 The Viet Bui, Tien Mai, Patrick Jaillet

Inverse reinforcement learning (IRL) addresses the problem of recovering a task description given a demonstration of the optimal policy used to solve such a task. The optimal policy is usually provided by an expert or teacher, making IRL…

Machine Learning · Computer Science 2012-02-09 Héctor Ratia, Luis Montesano, Ruben Martinez-Cantin

In this work, we propose a novel inverse reinforcement learning (IRL) algorithm for constrained Markov decision process (CMDP) problems. In standard IRL problems, the inverse learner or agent seeks to recover the reward function of the MDP,…

Machine Learning · Computer Science 2024-01-08 Nirjhar Das, Arpan Chattopadhyay

Inverse Reinforcement Learning (IRL) algorithms infer a reward function that explains demonstrations provided by an expert acting in the environment. Maximum Causal Entropy (MCE) IRL is currently the most popular formulation of IRL, with…

Machine Learning · Computer Science 2022-03-23 Adam Gleave, Sam Toyer

Inverse Reinforcement Learning (IRL) is the task of learning a single reward function given a Markov Decision Process (MDP) without defining the reward function, and a set of demonstrations generated by humans/experts. However, in practice,…

Artificial Intelligence · Computer Science 2017-12-18 Siddharthan Rajasekaran, Jinwei Zhang, Jie Fu

We consider the problem of learning from demonstrated trajectories with inverse reinforcement learning (IRL). Motivated by a limitation of the classical maximum entropy model in capturing the structure of the network of states, we propose…

Machine Learning · Computer Science 2020-08-19 Tien Mai, Kennard Chan, Patrick Jaillet

Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL applies strongly convex regularizers to the learner's…

Machine Learning · Computer Science 2020-12-04 Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Experimentally, it has been observed that humans and animals often make decisions that do not maximize their expected utility, but rather choose outcomes randomly, with probability proportional to expected utility. Probability matching, as…

Machine Learning · Computer Science 2019-10-07 Benjamin Eysenbach, Sergey Levine

This paper explores the use of Maximum Causal Entropy Inverse Reinforcement Learning (IRL) within the context of discrete-time stationary Mean-Field Games (MFGs) characterized by finite state spaces and an infinite-horizon,…

Systems and Control · Electrical Eng. & Systems 2025-07-22 Berkay Anahtarci, Can Deha Kariksiz, Naci Saldi