Related papers: Rethinking ValueDice: Does It Really Improve Perfo…

On Generalization of Adversarial Imitation Learning and Beyond

Despite massive empirical evaluations, one of the fundamental questions in imitation learning is still not fully settled: does AIL (adversarial imitation learning) provably generalize better than BC (behavioral cloning)? We study this open…

Machine Learning · Computer Science 2022-02-14 Tian Xu , Ziniu Li , Yang Yu , Zhi-Quan Luo

Adversarial Imitation Learning via Boosting

Adversarial imitation learning (AIL) has stood out as a dominant framework across various imitation learning (IL) applications, with Discriminator Actor Critic (DAC) (Kostrikov et al.,, 2019) demonstrating the effectiveness of off-policy…

Machine Learning · Computer Science 2024-04-15 Jonathan D. Chang , Dhruv Sreenivas , Yingbing Huang , Kianté Brantley , Wen Sun

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

Many modern methods for imitation learning and inverse reinforcement learning, such as GAIL or AIRL, are based on an adversarial formulation. These methods apply GANs to match the expert's distribution over states and actions with the…

Machine Learning · Computer Science 2020-08-11 Oleg Arenz , Gerhard Neumann

Augmenting GAIL with BC for sample efficient imitation learning

Imitation learning is the problem of recovering an expert policy without access to a reward signal. Behavior cloning and GAIL are two widely used methods for performing imitation learning. Behavior cloning converges in a few iterations but…

Machine Learning · Computer Science 2020-11-11 Rohit Jena , Changliu Liu , Katia Sycara

Support-weighted Adversarial Imitation Learning

Adversarial Imitation Learning (AIL) is a broad family of imitation learning methods designed to mimic expert behaviors from demonstrations. While AIL has shown state-of-the-art performance on imitation learning with only small number of…

Machine Learning · Computer Science 2020-02-21 Ruohan Wang , Carlo Ciliberto , Pierluigi Amadori , Yiannis Demiris

When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?

Offline reinforcement learning (RL) algorithms can acquire effective policies by utilizing previously collected experience, without any online interaction. It is widely understood that offline RL is able to extract good policies even from…

Machine Learning · Computer Science 2022-04-13 Aviral Kumar , Joey Hong , Anikait Singh , Sergey Levine

Model-based Offline Imitation Learning with Non-expert Data

Although Behavioral Cloning (BC) in theory suffers compounding errors, its scalability and simplicity still makes it an attractive imitation learning algorithm. In contrast, imitation approaches with adversarial training typically does not…

Machine Learning · Computer Science 2022-06-14 Jeongwon Park , Lin Yang

Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning

We study online adversarial imitation learning (AIL), where an agent learns from offline expert demonstrations and interacts with the environment online without access to rewards. Despite strong empirical results, the benefits of online…

Machine Learning · Computer Science 2026-02-03 Shangzhe Li , Dongruo Zhou , Weitong Zhang

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations

We study the problem of offline Imitation Learning (IL) where an agent aims to learn an optimal expert behavior policy without additional online environment interactions. Instead, the agent is provided with a supplementary offline dataset…

Machine Learning · Computer Science 2022-07-21 Haoran Xu , Xianyuan Zhan , Honglei Yin , Huiling Qin

Is Value Learning Really the Main Bottleneck in Offline RL?

While imitation learning requires access to high-quality data, offline reinforcement learning (RL) should, in principle, perform similarly or better with substantially lower data quality by using a value function. However, current results…

Machine Learning · Computer Science 2024-10-30 Seohong Park , Kevin Frans , Sergey Levine , Aviral Kumar

A Pragmatic Look at Deep Imitation Learning

The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the development of scalable imitation learning approaches using deep neural networks. Many of the algorithms that followed used a similar…

Machine Learning · Computer Science 2023-09-21 Kai Arulkumaran , Dan Ogawa Lillrank

On Computation and Generalization of Generative Adversarial Imitation Learning

Generative Adversarial Imitation Learning (GAIL) is a powerful and practical approach for learning sequential decision-making policies. Different from Reinforcement Learning (RL), GAIL takes advantage of demonstration data by experts (e.g.,…

Machine Learning · Computer Science 2020-01-14 Minshuo Chen , Yizhou Wang , Tianyi Liu , Zhuoran Yang , Xingguo Li , Zhaoran Wang , Tuo Zhao

Offline Learning from Demonstrations and Unlabeled Experience

Behavior cloning (BC) is often practical for robot learning because it allows a policy to be trained offline without rewards, by supervised learning on expert demonstrations. However, BC does not effectively leverage what we will refer to…

Machine Learning · Computer Science 2020-11-30 Konrad Zolna , Alexander Novikov , Ksenia Konyushkova , Caglar Gulcehre , Ziyu Wang , Yusuf Aytar , Misha Denil , Nando de Freitas , Scott Reed

When does predictive inverse dynamics outperform behavior cloning?

Behavior cloning (BC) is a practical offline imitation learning method, but it often fails when expert demonstrations are limited. Recent works have introduced a class of architectures named predictive inverse dynamics models (PIDM) that…

Machine Learning · Computer Science 2026-01-30 Lukas Schäfer , Pallavi Choudhury , Abdelhak Lemkhenter , Chris Lovett , Somjit Nath , Luis França , Matheus Ribeiro Furtado de Mendonça , Alex Lamb , Riashat Islam , Siddhartha Sen , John Langford , Katja Hofmann , Sergio Valcarcel Macua

Self-Imitation Advantage Learning

Self-imitation learning is a Reinforcement Learning (RL) method that encourages actions whose returns were higher than expected, which helps in hard exploration and sparse reward problems. It was shown to improve the performance of…

Machine Learning · Computer Science 2020-12-23 Johan Ferret , Olivier Pietquin , Matthieu Geist

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

Adversarial imitation learning (AIL), a prominent approach in imitation learning, has achieved significant practical success powered by neural network approximation. However, existing theoretical analyses of AIL are primarily confined to…

Machine Learning · Computer Science 2026-05-05 Tian Xu , Zhilong Zhang , Zexuan Chen , Ruishuo Chen , Yihao Sun , Yang Yu

From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving

Learning robust driving policies from large-scale, real-world datasets is a central challenge in autonomous driving, as online data collection is often unsafe and impractical. While Behavioral Cloning (BC) offers a straightforward approach…

Machine Learning · Computer Science 2025-08-28 Antonio Guillen-Perez

SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards

Learning to imitate expert behavior from demonstrations can be challenging, especially in environments with high-dimensional, continuous observations and unknown dynamics. Supervised learning methods based on behavioral cloning (BC) suffer…

Machine Learning · Computer Science 2019-09-27 Siddharth Reddy , Anca D. Dragan , Sergey Levine

Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning

Imitation learning (IL) is a paradigm for learning sequential decision making policies from experts, leveraging offline demonstrations, interactive annotations, or both. Recent advances show that when annotation cost is tallied per…

Machine Learning · Statistics 2026-01-14 Yichen Li , Chicheng Zhang

Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline…

Robotics · Computer Science 2023-09-22 Qiang Wang , Robert McCarthy , David Cordova Bulens , Francisco Roldan Sanchez , Kevin McGuinness , Noel E. O'Connor , Stephen J. Redmond