Related papers: Bayesian Robust Optimization for Imitation Learnin…

Policy Gradient Bayesian Robust Optimization for Imitation Learning

The difficulty in specifying rewards for many real-world problems has led to an increased focus on learning rewards from human feedback, such as demonstrations. However, there are often many different reward functions that explain the human…

Machine Learning · Computer Science 2021-06-23 Zaynah Javed , Daniel S. Brown , Satvik Sharma , Jerry Zhu , Ashwin Balakrishna , Marek Petrik , Anca D. Dragan , Ken Goldberg

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

The problem of inverse reinforcement learning (IRL) is relevant to a variety of tasks including value alignment and robot learning from demonstration. Despite significant algorithmic contributions in recent years, IRL remains an ill-posed…

Machine Learning · Computer Science 2020-11-18 Sreejith Balakrishnan , Quoc Phong Nguyen , Bryan Kian Hsiang Low , Harold Soh

Bayesian Inverse Reinforcement Learning for Non-Markovian Rewards

Inverse reinforcement learning (IRL) is the problem of inferring a reward function from expert behavior. There are several approaches to IRL, but most are designed to learn a Markovian reward. However, a reward function might be…

Machine Learning · Computer Science 2024-06-21 Noah Topper , Alvaro Velasquez , George Atia

Scalable Bayesian Inverse Reinforcement Learning

Bayesian inference over the reward presents an ideal solution to the ill-posed nature of the inverse reinforcement learning problem. Unfortunately current methods generally do not scale well beyond the small tabular setting due to the need…

Machine Learning · Computer Science 2021-03-15 Alex J. Chan , Mihaela van der Schaar

Deep Bayesian Reward Learning from Preferences

Bayesian inverse reinforcement learning (IRL) methods are ideal for safe imitation learning, as they allow a learning agent to reason about reward uncertainty and the safety of a learned policy. However, Bayesian IRL is computationally…

Machine Learning · Computer Science 2019-12-11 Daniel S. Brown , Scott Niekum

Imitation Bootstrapped Reinforcement Learning

Despite the considerable potential of reinforcement learning (RL), robotic control tasks predominantly rely on imitation learning (IL) due to its better sample efficiency. However, it is costly to collect comprehensive expert demonstrations…

Machine Learning · Computer Science 2024-05-22 Hengyuan Hu , Suvir Mirchandani , Dorsa Sadigh

A Novel Variational Lower Bound for Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) seeks to learn the reward function from expert trajectories, to understand the task for imitation or collaboration thereby removing the need for manual reward engineering. However, IRL in the context of…

Machine Learning · Computer Science 2023-11-13 Yikang Gui , Prashant Doshi

On the Performance of Maximum Likelihood Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) addresses the problem of recovering a task description given a demonstration of the optimal policy used to solve such a task. The optimal policy is usually provided by an expert or teacher, making IRL…

Machine Learning · Computer Science 2012-02-09 Héctor Ratia , Luis Montesano , Ruben Martinez-Cantin

Energy-Based Imitation Learning

We tackle a common scenario in imitation learning (IL), where agents try to recover the optimal policy from expert demonstrations without further access to the expert or environment reward signals. Except the simple Behavior Cloning (BC)…

Machine Learning · Computer Science 2021-04-16 Minghuan Liu , Tairan He , Minkai Xu , Weinan Zhang

Kernel Density Bayesian Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) methods infer an agent's reward function using demonstrations of expert behavior. A Bayesian IRL approach models a distribution over candidate reward functions, capturing a degree of uncertainty in the…

Machine Learning · Computer Science 2025-07-04 Aishwarya Mandyam , Didong Li , Jiayu Yao , Diana Cai , Andrew Jones , Barbara E. Engelhardt

Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems

This paper presents an inverse reinforcement learning~(IRL) framework for Bayesian stopping time problems. By observing the actions of a Bayesian decision maker, we provide a necessary and sufficient condition to identify if these actions…

Machine Learning · Computer Science 2023-03-29 Kunal Pattanayak , Vikram Krishnamurthy

Off-Policy Adversarial Inverse Reinforcement Learning

Adversarial Imitation Learning (AIL) is a class of algorithms in Reinforcement learning (RL), which tries to imitate an expert without taking any reward from the environment and does not provide expert behavior directly to the policy…

Machine Learning · Computer Science 2020-05-05 Samin Yeasar Arnob

Probability Density Estimation Based Imitation Learning

Imitation Learning (IL) is an effective learning paradigm exploiting the interactions between agents and environments. It does not require explicit reward signals and instead tries to recover desired policies using expert demonstrations. In…

Machine Learning · Computer Science 2021-12-14 Yang Liu , Yongzhe Chang , Shilei Jiang , Xueqian Wang , Bin Liang , Bo Yuan

A Bayesian Approach to Robust Inverse Reinforcement Learning

We consider a Bayesian approach to offline model-based inverse reinforcement learning (IRL). The proposed framework differs from existing offline model-based IRL approaches by performing simultaneous estimation of the expert's reward…

Machine Learning · Computer Science 2024-04-09 Ran Wei , Siliang Zeng , Chenliang Li , Alfredo Garcia , Anthony McDonald , Mingyi Hong

Balance Equation-based Distributionally Robust Offline Imitation Learning

Imitation Learning (IL) has proven highly effective for robotic and control tasks where manually designing reward functions or explicit controllers is infeasible. However, standard IL methods implicitly assume that the environment dynamics…

Machine Learning · Computer Science 2025-11-12 Rishabh Agrawal , Yusuf Alvi , Rahul Jain , Ashutosh Nayyar

Generalized Bayesian deep reinforcement learning

Bayesian reinforcement learning (BRL) is a method that merges principles from Bayesian statistics and reinforcement learning to make optimal decisions in uncertain environments. As a model-based RL method, it has two key components: (1)…

Machine Learning · Statistics 2025-06-03 Shreya Sinha Roy , Richard G. Everitt , Christian P. Robert , Ritabrata Dutta

A Divergence Minimization Perspective on Imitation Learning Methods

In many settings, it is desirable to learn decision-making and control policies through learning or bootstrapping from expert demonstrations. The most common approaches under this Imitation Learning (IL) framework are Behavioural Cloning…

Machine Learning · Computer Science 2019-11-07 Seyed Kamyar Seyed Ghasemipour , Richard Zemel , Shixiang Gu

Coherent Soft Imitation Learning

Imitation learning methods seek to learn from an expert either through behavioral cloning (BC) of the policy or inverse reinforcement learning (IRL) of the reward. Such methods enable agents to learn complex tasks from humans that are…

Machine Learning · Computer Science 2023-12-07 Joe Watson , Sandy H. Huang , Nicolas Heess

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert. Many algorithms for IRL have an inherently nested…

Machine Learning · Computer Science 2022-11-02 Siliang Zeng , Chenliang Li , Alfredo Garcia , Mingyi Hong

PAC Apprenticeship Learning with Bayesian Active Inverse Reinforcement Learning

As AI systems become increasingly autonomous, reliably aligning their decision-making with human preferences is essential. Inverse reinforcement learning (IRL) offers a promising approach to infer preferences from demonstrations. These…

Machine Learning · Computer Science 2025-09-22 Ondrej Bajgar , Dewi S. W. Gould , Jonathon Liu , Alessandro Abate , Konstantinos Gatsis , Michael A. Osborne