Related papers: DiffAIL: Diffusion Adversarial Imitation Learning

Diffusion-Reward Adversarial Imitation Learning

Imitation learning aims to learn a policy from observing expert demonstrations without access to reward signals from environments. Generative adversarial imitation learning (GAIL) formulates imitation learning as adversarial learning,…

Machine Learning · Computer Science 2024-11-27 Chun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, Yu-Chiang Frank Wang, Min-Hung Chen, Shao-Hua Sun

SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

Adversarial Imitation Learning (AIL) is a dominant framework in imitation learning that infers rewards from expert demonstrations to guide policy optimization. Although providing more expert demonstrations typically leads to improved…

Machine Learning · Computer Science 2026-04-30 Pengcheng Li, Qiang Fang, Tong Zhao, Yixing Lan, Xin Xu

ADAIL: Adaptive Adversarial Imitation Learning

We present the ADaptive Adversarial Imitation Learning (ADAIL) algorithm for learning adaptive policies that can be transferred between environments of varying dynamics, by imitating a small number of demonstrations collected from a single…

Machine Learning · Computer Science 2020-08-31 Yiren Lu, Jonathan Tompson

Off-Policy Adversarial Inverse Reinforcement Learning

Adversarial Imitation Learning (AIL) is a class of algorithms in Reinforcement learning (RL), which tries to imitate an expert without taking any reward from the environment and does not provide expert behavior directly to the policy…

Machine Learning · Computer Science 2020-05-05 Samin Yeasar Arnob

Support-weighted Adversarial Imitation Learning

Adversarial Imitation Learning (AIL) is a broad family of imitation learning methods designed to mimic expert behaviors from demonstrations. While AIL has shown state-of-the-art performance on imitation learning with only small number of…

Machine Learning · Computer Science 2020-02-21 Ruohan Wang, Carlo Ciliberto, Pierluigi Amadori, Yiannis Demiris

On Discovering Algorithms for Adversarial Imitation Learning

Adversarial Imitation Learning (AIL) methods, while effective in settings with limited expert demonstrations, are often considered unstable. These approaches typically decompose into two components: Density Ratio (DR) estimation…

Artificial Intelligence · Computer Science 2026-02-27 Shashank Reddy Chirra, Jayden Teoh, Praveen Paruchuri, Pradeep Varakantham

Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble

Recovering reward function from expert demonstrations is a fundamental problem in reinforcement learning. The recovered reward function captures the motivation of the expert. Agents can imitate experts by following these reward functions in…

Machine Learning · Computer Science 2024-06-27 Fan-Ming Luo, Xingchen Cao, Rong-Jun Qin, Yang Yu

Discriminator-Guided Model-Based Offline Imitation Learning

Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward labels. Existing offline IL methods suffer from severe performance degeneration under limited expert data.…

Machine Learning · Computer Science 2023-01-11 Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan

Diffusing States and Matching Scores: A New Framework for Imitation Learning

Adversarial Imitation Learning is traditionally framed as a two-player zero-sum game between a learner and an adversarially chosen cost function, and can therefore be thought of as the sequential generalization of a Generative Adversarial…

Machine Learning · Computer Science 2025-03-04 Runzhe Wu, Yiding Chen, Gokul Swamy, Kianté Brantley, Wen Sun

Policy Contrastive Imitation Learning

Adversarial imitation learning (AIL) is a popular method that has recently achieved much success. However, the performance of AIL is still unsatisfactory on the more challenging tasks. We find that one of the major reasons is due to the low…

Machine Learning · Computer Science 2023-07-07 Jialei Huang, Zhaoheng Yin, Yingdong Hu, Yang Gao

$f$-GAIL: Learning $f$-Divergence for Generative Adversarial Imitation Learning

Imitation learning (IL) aims to learn a policy from expert demonstrations that minimizes the discrepancy between the learner and expert behaviors. Various imitation learning algorithms have been proposed with different pre-determined…

Machine Learning · Computer Science 2020-11-20 Xin Zhang, Yanhua Li, Ziming Zhang, Zhi-Li Zhang

DDIL: Diversity Enhancing Diffusion Distillation With Imitation Learning

Diffusion models excel at generative modeling (e.g., text-to-image) but sampling requires multiple denoising network passes, limiting practicality. Efforts such as progressive distillation or consistency distillation have shown promise by…

Machine Learning · Computer Science 2025-04-01 Risheek Garrepalli, Shweta Mahajan, Munawar Hayat, Fatih Porikli

Auto-Encoding Adversarial Imitation Learning

Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) sheds light on automatic policy…

Machine Learning · Computer Science 2024-02-05 Kaifeng Zhang, Rui Zhao, Ziming Zhang, Yang Gao

Hindsight Generative Adversarial Imitation Learning

Compared to reinforcement learning, imitation learning (IL) is a powerful paradigm for training agents to learn control policies efficiently from expert demonstrations. However, in most cases, obtaining demonstration data is costly and…

Machine Learning · Computer Science 2019-03-20 Naijun Liu, Tao Lu, Yinghao Cai, Boyao Li, Shuo Wang

Action-Constrained Imitation Learning

Policy learning under action constraints plays a central role in ensuring safe behaviors in various robot control and resource allocation applications. In this paper, we study a new problem setting termed Action-Constrained Imitation…

Robotics · Computer Science 2025-08-21 Chia-Han Yeh, Tse-Sheng Nan, Risto Vuorio, Wei Hung, Hung-Yen Wu, Shao-Hua Sun, Ping-Chun Hsieh

Diffusion Imitation from Observation

Learning from observation (LfO) aims to imitate experts by learning from state-only demonstrations without requiring action labels. Existing adversarial imitation learning approaches learn a generator agent policy to produce state…

Machine Learning · Computer Science 2024-10-10 Bo-Ruei Huang, Chun-Kai Yang, Chun-Mao Lai, Dai-Jie Wu, Shao-Hua Sun

Latent Wasserstein Adversarial Imitation Learning

Imitation Learning (IL) enables agents to mimic expert behavior by learning from demonstrations. However, traditional IL methods require large amounts of medium-to-high-quality demonstrations as well as actions of expert demonstrations,…

Machine Learning · Computer Science 2026-03-06 Siqi Yang, Kai Yan, Alexander G. Schwing, Yu-Xiong Wang

Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator

Imitation learning is often used in addition to reinforcement learning in environments where reward design is difficult or where the reward is sparse, but it is difficult to be able to imitate well in unknown states from a small amount of…

Machine Learning · Computer Science 2024-01-31 Ryoma Furuyama, Daiki Kuyoshi, Satoshi Yamane

On Computation and Generalization of Generative Adversarial Imitation Learning

Generative Adversarial Imitation Learning (GAIL) is a powerful and practical approach for learning sequential decision-making policies. Different from Reinforcement Learning (RL), GAIL takes advantage of demonstration data by experts (e.g.,…

Machine Learning · Computer Science 2020-01-14 Minshuo Chen, Yizhou Wang, Tianyi Liu, Zhuoran Yang, Xingguo Li, Zhaoran Wang, Tuo Zhao

Adversarial Imitation Learning from Incomplete Demonstrations

Imitation learning targets deriving a mapping from states to actions, a.k.a. policy, from expert demonstrations. Existing methods for imitation learning typically require any actions in the demonstrations to be fully available, which is…

Machine Learning · Computer Science 2019-06-25 Mingfei Sun, Xiaojuan Ma