Related papers: A New Framework for Query Efficient Active Imitati…

RILe: Reinforced Imitation Learning

Acquiring complex behaviors is essential for artificially intelligent agents, yet learning these behaviors in high-dimensional settings poses a significant challenge due to the vast search space. Traditional reinforcement learning (RL)…

Machine Learning · Computer Science 2025-04-22 Mert Albaba , Sammy Christen , Thomas Langarek , Christoph Gebhardt , Otmar Hilliges , Michael J. Black

Human AI interaction loop training: New approach for interactive reinforcement learning

Reinforcement Learning (RL) in various decision-making tasks of machine learning provides effective results with an agent learning from a stand-alone reward function. However, it presents unique challenges with large amounts of environment…

Machine Learning · Computer Science 2020-03-10 Neda Navidi

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

In inverse reinforcement learning (IRL), an agent seeks to replicate expert demonstrations through interactions with the environment. Traditionally, IRL is treated as an adversarial game, where an adversary searches over reward models, and…

Machine Learning · Computer Science 2025-04-23 Arnav Kumar Jain , Harley Wiltzer , Jesse Farebrother , Irina Rish , Glen Berseth , Sanjiban Choudhury

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Imitation learning is well-suited for robotic tasks where it is difficult to directly program the behavior or specify a cost for optimal control. In this work, we propose a method for learning the reward function (and the corresponding…

Machine Learning · Computer Science 2021-01-01 Tianwei Ni , Harshit Sikchi , Yufei Wang , Tejus Gupta , Lisa Lee , Benjamin Eysenbach

Sample Efficient Imitation Learning via Reward Function Trained in Advance

Imitation learning (IL) is a framework that learns to imitate expert behavior from demonstrations. Recently, IL shows promising results on high dimensional and control tasks. However, IL typically suffers from sample inefficiency in terms…

Machine Learning · Computer Science 2021-11-24 Lihua Zhang

Robust Learning from Observation with Model Misspecification

Imitation learning (IL) is a popular paradigm for training policies in robotic systems when specifying the reward function is difficult. However, despite the success of IL algorithms, they impose the somewhat unrealistic requirement that…

Machine Learning · Computer Science 2022-02-16 Luca Viano , Yu-Ting Huang , Parameswaran Kamalaruban , Craig Innes , Subramanian Ramamoorthy , Adrian Weller

Robust Visual Imitation Learning with Inverse Dynamics Representations

Imitation learning (IL) has achieved considerable success in solving complex sequential decision-making problems. However, current IL methods mainly assume that the environment for learning policies is the same as the environment for…

Machine Learning · Computer Science 2023-10-24 Siyuan Li , Xun Wang , Rongchang Zuo , Kewu Sun , Lingfei Cui , Jishiyu Ding , Peng Liu , Zhe Ma

Probability Density Estimation Based Imitation Learning

Imitation Learning (IL) is an effective learning paradigm exploiting the interactions between agents and environments. It does not require explicit reward signals and instead tries to recover desired policies using expert demonstrations. In…

Machine Learning · Computer Science 2021-12-14 Yang Liu , Yongzhe Chang , Shilei Jiang , Xueqian Wang , Bin Liang , Bo Yuan

Efficient Deep Reinforcement Learning with Imitative Expert Priors for Autonomous Driving

Deep reinforcement learning (DRL) is a promising way to achieve human-like autonomous driving. However, the low sample efficiency and difficulty of designing reward functions for DRL would hinder its applications in practice. In light of…

Robotics · Computer Science 2021-10-29 Zhiyu Huang , Jingda Wu , Chen Lv

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Learning complex policies with Reinforcement Learning (RL) is often hindered by instability and slow convergence, a problem exacerbated by the difficulty of reward engineering. Imitation Learning (IL) from expert demonstrations bypasses…

Machine Learning · Computer Science 2026-05-19 Sayambhu Sen , Shalabh Bhatnagar

Event Extraction with Generative Adversarial Imitation Learning

We propose a new method for event extraction (EE) task based on an imitation learning framework, specifically, inverse reinforcement learning (IRL) via generative adversarial network (GAN). The GAN estimates proper rewards according to the…

Computation and Language · Computer Science 2018-04-24 Tongtao Zhang , Heng Ji

Reward-free World Models for Online Imitation Learning

Imitation learning (IL) enables agents to acquire skills directly from expert demonstrations, providing a compelling alternative to reinforcement learning. However, prior online IL approaches struggle with complex tasks characterized by…

Machine Learning · Computer Science 2025-05-13 Shangzhe Li , Zhiao Huang , Hao Su

A Ranking Game for Imitation Learning

We propose a new framework for imitation learning -- treating imitation as a two-player ranking-based game between a policy and a reward. In this game, the reward agent learns to satisfy pairwise performance rankings between behaviors,…

Machine Learning · Computer Science 2023-01-18 Harshit Sikchi , Akanksha Saran , Wonjoon Goo , Scott Niekum

Reinforced Imitation in Heterogeneous Action Space

Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we consider a challenging setting where an agent and an expert use different actions from each other. We assume…

Machine Learning · Computer Science 2019-08-27 Konrad Zolna , Negar Rostamzadeh , Yoshua Bengio , Sungjin Ahn , Pedro O. Pinheiro

Information Directed Reward Learning for Reinforcement Learning

For many reinforcement learning (RL) applications, specifying a reward is difficult. This paper considers an RL setting where the agent obtains information about the reward only by querying an expert that can, for example, evaluate…

Machine Learning · Computer Science 2022-02-01 David Lindner , Matteo Turchetta , Sebastian Tschiatschek , Kamil Ciosek , Andreas Krause

Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment

Many imitation learning (IL) algorithms use inverse reinforcement learning (IRL) to infer a reward function that aligns with the demonstration. However, the inferred reward functions often fail to capture the underlying task objectives. In…

Machine Learning · Computer Science 2024-11-01 Weichao Zhou , Wenchao Li

Online reinforcement learning with sparse rewards through an active inference capsule

Intelligent agents must pursue their goals in complex environments with partial information and often limited computational capacity. Reinforcement learning methods have achieved great success by creating agents that optimize engineered…

Machine Learning · Computer Science 2021-06-07 Alejandro Daniel Noel , Charel van Hoof , Beren Millidge

Learning to Reach Goals via Iterated Supervised Learning

Current reinforcement learning (RL) algorithms can be brittle and difficult to use, especially when learning goal-reaching behaviors from sparse rewards. Although supervised imitation learning provides a simple and stable alternative, it…

Machine Learning · Computer Science 2020-10-06 Dibya Ghosh , Abhishek Gupta , Ashwin Reddy , Justin Fu , Coline Devin , Benjamin Eysenbach , Sergey Levine

Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization

In this paper, we study the problem of obtaining a control policy that can mimic and then outperform expert demonstrations in Markov decision processes where the reward function is unknown to the learning agent. One main relevant approach…

Machine Learning · Computer Science 2020-09-24 Feng Tao , Yongcan Cao

Off-Policy Adversarial Inverse Reinforcement Learning

Adversarial Imitation Learning (AIL) is a class of algorithms in Reinforcement learning (RL), which tries to imitate an expert without taking any reward from the environment and does not provide expert behavior directly to the policy…

Machine Learning · Computer Science 2020-05-05 Samin Yeasar Arnob