Related papers: Offline Imitation Learning with Model-based Revers…

Offline Imitation Learning with Variational Counterfactual Reasoning

In offline imitation learning (IL), an agent aims to learn an optimal expert behavior policy without additional online environment interactions. However, in many real-world scenarios, such as robotics manipulation, the offline dataset is…

Machine Learning · Computer Science 2024-01-01 Bowei He , Zexu Sun , Jinxin Liu , Shuai Zhang , Xu Chen , Chen Ma

Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

This paper studies offline Imitation Learning (IL) where an agent learns to imitate an expert demonstrator without additional online environment interactions. Instead, the learner is presented with a static offline dataset of…

Machine Learning · Computer Science 2022-02-01 Jonathan D. Chang , Masatoshi Uehara , Dhruv Sreenivas , Rahul Kidambi , Wen Sun

Robust Visual Imitation Learning with Inverse Dynamics Representations

Imitation learning (IL) has achieved considerable success in solving complex sequential decision-making problems. However, current IL methods mainly assume that the environment for learning policies is the same as the environment for…

Machine Learning · Computer Science 2023-10-24 Siyuan Li , Xun Wang , Rongchang Zuo , Kewu Sun , Lingfei Cui , Jishiyu Ding , Peng Liu , Zhe Ma

When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning

Offline inverse reinforcement learning (Offline IRL) aims to recover the structure of rewards and environment dynamics that underlie observed actions in a fixed, finite set of demonstrations from an expert agent. Accurate models of…

Machine Learning · Computer Science 2024-03-01 Siliang Zeng , Chenliang Li , Alfredo Garcia , Mingyi Hong

Robust Learning from Observation with Model Misspecification

Imitation learning (IL) is a popular paradigm for training policies in robotic systems when specifying the reward function is difficult. However, despite the success of IL algorithms, they impose the somewhat unrealistic requirement that…

Machine Learning · Computer Science 2022-02-16 Luca Viano , Yu-Ting Huang , Parameswaran Kamalaruban , Craig Innes , Subramanian Ramamoorthy , Adrian Weller

Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving

Learning-based approaches, such as reinforcement learning (RL) and imitation learning (IL), have indicated superiority over rule-based approaches in complex urban autonomous driving environments, showing great potential to make intelligent…

Robotics · Computer Science 2022-05-31 Haochen Liu , Zhiyu Huang , Jingda Wu , Chen Lv

Instabilities of Offline RL with Pre-Trained Neural Representation

In offline reinforcement learning (RL), we seek to utilize offline data to evaluate (or learn) policies in scenarios where the data are collected from a distribution that substantially differs from that of the target policy to be evaluated.…

Machine Learning · Computer Science 2021-03-09 Ruosong Wang , Yifan Wu , Ruslan Salakhutdinov , Sham M. Kakade

Inverse Reinforcement Learning without Reinforcement Learning

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational…

Machine Learning · Computer Science 2024-01-31 Gokul Swamy , Sanjiban Choudhury , J. Andrew Bagnell , Zhiwei Steven Wu

Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning

Imitation learning has proven effective for training robots to perform complex tasks from expert human demonstrations. However, it remains limited by its reliance on high-quality, task-specific data, restricting adaptability to the diverse…

Robotics · Computer Science 2025-10-28 Kevin Huang , Rosario Scalise , Cleah Winston , Ayush Agrawal , Yunchu Zhang , Rohan Baijal , Markus Grotz , Byron Boots , Benjamin Burchfiel , Masha Itkina , Paarth Shah , Abhishek Gupta

Offline Inverse Reinforcement Learning

The objective of offline RL is to learn optimal policies when a fixed exploratory demonstrations data-set is available and sampling additional observations is impossible (typically if this operation is either costly or rises ethical…

Machine Learning · Computer Science 2021-06-10 Firas Jarboui , Vianney Perchet

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations

We study the problem of offline Imitation Learning (IL) where an agent aims to learn an optimal expert behavior policy without additional online environment interactions. Instead, the agent is provided with a supplementary offline dataset…

Machine Learning · Computer Science 2022-07-21 Haoran Xu , Xianyuan Zhan , Honglei Yin , Huiling Qin

Discriminator-Guided Model-Based Offline Imitation Learning

Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward labels. Existing offline IL methods suffer from severe performance degeneration under limited expert data.…

Machine Learning · Computer Science 2023-01-11 Wenjia Zhang , Haoran Xu , Haoyi Niu , Peng Cheng , Ming Li , Heming Zhang , Guyue Zhou , Xianyuan Zhan

Balance Equation-based Distributionally Robust Offline Imitation Learning

Imitation Learning (IL) has proven highly effective for robotic and control tasks where manually designing reward functions or explicit controllers is infeasible. However, standard IL methods implicitly assume that the environment dynamics…

Machine Learning · Computer Science 2025-11-12 Rishabh Agrawal , Yusuf Alvi , Rahul Jain , Ashutosh Nayyar

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Learning complex policies with Reinforcement Learning (RL) is often hindered by instability and slow convergence, a problem exacerbated by the difficulty of reward engineering. Imitation Learning (IL) from expert demonstrations bypasses…

Machine Learning · Computer Science 2026-05-19 Sayambhu Sen , Shalabh Bhatnagar

SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning

We focus on offline imitation learning (IL), which aims to mimic an expert's behavior using demonstrations without any interaction with the environment. One of the main challenges in offline IL is the limited support of expert…

Machine Learning · Computer Science 2024-10-14 Huy Hoang , Tien Mai , Pradeep Varakantham

Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation

Model-based offline Reinforcement Learning (RL) constructs environment models from offline datasets to perform conservative policy optimization. Existing approaches focus on learning state transitions through ensemble models, rollouting…

Machine Learning · Computer Science 2025-03-27 Hongye Cao , Fan Feng , Jing Huo , Shangdong Yang , Meng Fang , Tianpei Yang , Yang Gao

Offline Reinforcement Learning with Reverse Model-based Imagination

In offline reinforcement learning (offline RL), one of the main challenges is to deal with the distributional shift between the learning policy and the given dataset. To address this problem, recent offline RL methods attempt to introduce…

Machine Learning · Computer Science 2021-11-16 Jianhao Wang , Wenzhe Li , Haozhe Jiang , Guangxiang Zhu , Siyuan Li , Chongjie Zhang

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

In Imitation Learning (IL), utilizing suboptimal and heterogeneous demonstrations presents a substantial challenge due to the varied nature of real-world data. However, standard IL algorithms consider these datasets as homogeneous, thereby…

Machine Learning · Computer Science 2024-12-16 Mark Beliaev , Ramtin Pedarsani

Inverse reinforcement learning by expert imitation for the stochastic linear-quadratic optimal control problem

This article studies inverse reinforcement learning (IRL) for the stochastic linear-quadratic optimal control problem, where two agents are considered. A learner agent does not know the expert agent's performance cost function, but it…

Optimization and Control · Mathematics 2024-05-28 Zhongshi Sun , Guangyan Jia

How to Mitigate the Distribution Shift Problem in Robotics Control: A Robust and Adaptive Approach Based on Offline to Online Imitation Learning

Distribution shift in imitation learning refers to the problem that the agent cannot plan proper actions for a state that has not been visited during the training. This problem can be largely attributed to the inherently narrow state-action…

Robotics · Computer Science 2026-05-26 Hyung-Suk Yoon , Seung-Woo Seo