English
Related papers

Related papers: Rethinking ValueDice: Does It Really Improve Perfo…

200 papers

Despite massive empirical evaluations, one of the fundamental questions in imitation learning is still not fully settled: does AIL (adversarial imitation learning) provably generalize better than BC (behavioral cloning)? We study this open…

Machine Learning · Computer Science 2022-02-14 Tian Xu , Ziniu Li , Yang Yu , Zhi-Quan Luo

Adversarial imitation learning (AIL) has stood out as a dominant framework across various imitation learning (IL) applications, with Discriminator Actor Critic (DAC) (Kostrikov et al.,, 2019) demonstrating the effectiveness of off-policy…

Machine Learning · Computer Science 2024-04-15 Jonathan D. Chang , Dhruv Sreenivas , Yingbing Huang , Kianté Brantley , Wen Sun

Many modern methods for imitation learning and inverse reinforcement learning, such as GAIL or AIRL, are based on an adversarial formulation. These methods apply GANs to match the expert's distribution over states and actions with the…

Machine Learning · Computer Science 2020-08-11 Oleg Arenz , Gerhard Neumann

Imitation learning is the problem of recovering an expert policy without access to a reward signal. Behavior cloning and GAIL are two widely used methods for performing imitation learning. Behavior cloning converges in a few iterations but…

Machine Learning · Computer Science 2020-11-11 Rohit Jena , Changliu Liu , Katia Sycara

Adversarial Imitation Learning (AIL) is a broad family of imitation learning methods designed to mimic expert behaviors from demonstrations. While AIL has shown state-of-the-art performance on imitation learning with only small number of…

Machine Learning · Computer Science 2020-02-21 Ruohan Wang , Carlo Ciliberto , Pierluigi Amadori , Yiannis Demiris

Offline reinforcement learning (RL) algorithms can acquire effective policies by utilizing previously collected experience, without any online interaction. It is widely understood that offline RL is able to extract good policies even from…

Machine Learning · Computer Science 2022-04-13 Aviral Kumar , Joey Hong , Anikait Singh , Sergey Levine

Although Behavioral Cloning (BC) in theory suffers compounding errors, its scalability and simplicity still makes it an attractive imitation learning algorithm. In contrast, imitation approaches with adversarial training typically does not…

Machine Learning · Computer Science 2022-06-14 Jeongwon Park , Lin Yang

We study online adversarial imitation learning (AIL), where an agent learns from offline expert demonstrations and interacts with the environment online without access to rewards. Despite strong empirical results, the benefits of online…

Machine Learning · Computer Science 2026-02-03 Shangzhe Li , Dongruo Zhou , Weitong Zhang

We study the problem of offline Imitation Learning (IL) where an agent aims to learn an optimal expert behavior policy without additional online environment interactions. Instead, the agent is provided with a supplementary offline dataset…

Machine Learning · Computer Science 2022-07-21 Haoran Xu , Xianyuan Zhan , Honglei Yin , Huiling Qin

While imitation learning requires access to high-quality data, offline reinforcement learning (RL) should, in principle, perform similarly or better with substantially lower data quality by using a value function. However, current results…

Machine Learning · Computer Science 2024-10-30 Seohong Park , Kevin Frans , Sergey Levine , Aviral Kumar

The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the development of scalable imitation learning approaches using deep neural networks. Many of the algorithms that followed used a similar…

Machine Learning · Computer Science 2023-09-21 Kai Arulkumaran , Dan Ogawa Lillrank

Generative Adversarial Imitation Learning (GAIL) is a powerful and practical approach for learning sequential decision-making policies. Different from Reinforcement Learning (RL), GAIL takes advantage of demonstration data by experts (e.g.,…

Machine Learning · Computer Science 2020-01-14 Minshuo Chen , Yizhou Wang , Tianyi Liu , Zhuoran Yang , Xingguo Li , Zhaoran Wang , Tuo Zhao

Behavior cloning (BC) is often practical for robot learning because it allows a policy to be trained offline without rewards, by supervised learning on expert demonstrations. However, BC does not effectively leverage what we will refer to…

Behavior cloning (BC) is a practical offline imitation learning method, but it often fails when expert demonstrations are limited. Recent works have introduced a class of architectures named predictive inverse dynamics models (PIDM) that…

Self-imitation learning is a Reinforcement Learning (RL) method that encourages actions whose returns were higher than expected, which helps in hard exploration and sparse reward problems. It was shown to improve the performance of…

Machine Learning · Computer Science 2020-12-23 Johan Ferret , Olivier Pietquin , Matthieu Geist

Adversarial imitation learning (AIL), a prominent approach in imitation learning, has achieved significant practical success powered by neural network approximation. However, existing theoretical analyses of AIL are primarily confined to…

Machine Learning · Computer Science 2026-05-05 Tian Xu , Zhilong Zhang , Zexuan Chen , Ruishuo Chen , Yihao Sun , Yang Yu

Learning robust driving policies from large-scale, real-world datasets is a central challenge in autonomous driving, as online data collection is often unsafe and impractical. While Behavioral Cloning (BC) offers a straightforward approach…

Machine Learning · Computer Science 2025-08-28 Antonio Guillen-Perez

Learning to imitate expert behavior from demonstrations can be challenging, especially in environments with high-dimensional, continuous observations and unknown dynamics. Supervised learning methods based on behavioral cloning (BC) suffer…

Machine Learning · Computer Science 2019-09-27 Siddharth Reddy , Anca D. Dragan , Sergey Levine

Imitation learning (IL) is a paradigm for learning sequential decision making policies from experts, leveraging offline demonstrations, interactive annotations, or both. Recent advances show that when annotation cost is tallied per…

Machine Learning · Statistics 2026-01-14 Yichen Li , Chicheng Zhang

This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline…

‹ Prev 1 2 3 10 Next ›