Related papers: Self-Imitation Learning

Self-Imitation Advantage Learning

Self-imitation learning is a Reinforcement Learning (RL) method that encourages actions whose returns were higher than expected, which helps in hard exploration and sparse reward problems. It was shown to improve the performance of…

Machine Learning · Computer Science 2020-12-23 Johan Ferret , Olivier Pietquin , Matthieu Geist

Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games

Recent research on vulnerabilities of deep reinforcement learning (RL) has shown that adversarial policies adopted by an adversary agent can influence a target RL agent (victim agent) to perform poorly in a multi-agent environment. In…

Machine Learning · Computer Science 2022-11-01 The Viet Bui , Tien Mai , Thanh H. Nguyen

Match or Replay: Self Imitating Proximal Policy Optimization

Reinforcement Learning (RL) agents often struggle with inefficient exploration, particularly in environments with sparse rewards. Traditional exploration strategies can lead to slow learning and suboptimal performance because agents fail to…

Machine Learning · Computer Science 2026-03-31 Gaurav Chaudhary , Laxmidhar Behera , Washim Uddin Mondal

Generative Adversarial Self-Imitation Learning

This paper explores a simple regularizer for reinforcement learning by proposing Generative Adversarial Self-Imitation Learning (GASIL), which encourages the agent to imitate past good trajectories via generative adversarial imitation…

Machine Learning · Computer Science 2018-12-04 Yijie Guo , Junhyuk Oh , Satinder Singh , Honglak Lee

Unlocking the Potential of Soft Actor-Critic for Imitation Learning

Learning-based methods have enabled robots to acquire bio-inspired movements with increasing levels of naturalness and adaptability. Among these, Imitation Learning (IL) has proven effective in transferring complex motion patterns from…

Robotics · Computer Science 2025-09-30 Nayari Marie Lessa , Melya Boukheddimi , Frank Kirchner

A Pragmatic Look at Deep Imitation Learning

The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the development of scalable imitation learning approaches using deep neural networks. Many of the algorithms that followed used a similar…

Machine Learning · Computer Science 2023-09-21 Kai Arulkumaran , Dan Ogawa Lillrank

Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration

Imitation learning is a central problem in reinforcement learning where the goal is to learn a policy that mimics the expert's behavior. In practice, it is often challenging to learn the expert policy from a limited number of demonstrations…

Machine Learning · Computer Science 2025-06-26 Heyang Zhao , Xingrui Yu , David M. Bossens , Ivor W. Tsang , Quanquan Gu

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

Learning complex policies with Reinforcement Learning (RL) is often hindered by instability and slow convergence, a problem exacerbated by the difficulty of reward engineering. Imitation Learning (IL) from expert demonstrations bypasses…

Machine Learning · Computer Science 2026-05-19 Sayambhu Sen , Shalabh Bhatnagar

IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic

This paper introduces the SOAR framework for imitation learning. SOAR is an algorithmic template that learns a policy from expert demonstrations with a primal dual style algorithm that alternates cost and policy updates. Within the policy…

Machine Learning · Computer Science 2025-06-02 Stefano Viel , Luca Viano , Volkan Cevher

Explaining Fast Improvement in Online Imitation Learning

Online imitation learning (IL) is an algorithmic framework that leverages interactions with expert policies for efficient policy optimization. Here policies are optimized by performing online learning on a sequence of loss functions that…

Machine Learning · Computer Science 2021-02-23 Xinyan Yan , Byron Boots , Ching-An Cheng

Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning

Offline multi-agent reinforcement learning (MARL) aims to learn effective multi-agent policies from pre-collected datasets, which is an important step toward the deployment of multi-agent systems in real-world applications. However, in…

Machine Learning · Computer Science 2023-03-02 Qi Tian , Kun Kuang , Furui Liu , Baoxiang Wang

Learning a Behavioral Repertoire from Demonstrations

Imitation Learning (IL) is a machine learning approach to learn a policy from a dataset of demonstrations. IL can be useful to kick-start learning before applying reinforcement learning (RL) but it can also be useful on its own, e.g. to…

Machine Learning · Computer Science 2019-07-09 Niels Justesen , Miguel Gonzalez Duque , Daniel Cabarcas Jaramillo , Jean-Baptiste Mouret , Sebastian Risi

Off-Policy Adversarial Inverse Reinforcement Learning

Adversarial Imitation Learning (AIL) is a class of algorithms in Reinforcement learning (RL), which tries to imitate an expert without taking any reward from the environment and does not provide expert behavior directly to the policy…

Machine Learning · Computer Science 2020-05-05 Samin Yeasar Arnob

Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past

Soft Actor-Critic (SAC) is an off-policy actor-critic deep reinforcement learning (DRL) algorithm based on maximum entropy reinforcement learning. By combining off-policy updates with an actor-critic formulation, SAC achieves…

Machine Learning · Computer Science 2019-06-11 Che Wang , Keith Ross

Self-Imitation Learning from Demonstrations

Despite the numerous breakthroughs achieved with Reinforcement Learning (RL), solving environments with sparse rewards remains a challenging task that requires sophisticated exploration. Learning from Demonstrations (LfD) remedies this…

Machine Learning · Computer Science 2022-03-22 Georgiy Pshikhachev , Dmitry Ivanov , Vladimir Egorov , Aleksei Shpilman

Policy Compatible Skill Incremental Learning via Lazy Learning Interface

Skill Incremental Learning (SIL) is the process by which an embodied agent expands and refines its skill set over time by leveraging experience gained through interaction with its environment or by the integration of additional data. SIL…

Machine Learning · Computer Science 2026-01-15 Daehee Lee , Dongsu Lee , TaeYoon Kwack , Wonje Choi , Honguk Woo

Self-Imitation Learning by Planning

Imitation learning (IL) enables robots to acquire skills quickly by transferring expert knowledge, which is widely adopted in reinforcement learning (RL) to initialize exploration. However, in long-horizon motion planning tasks, a…

Robotics · Computer Science 2021-03-30 Sha Luo , Hamidreza Kasaei , Lambert Schomaker

Improving On-policy Learning with Statistical Reward Accumulation

Deep reinforcement learning has obtained significant breakthroughs in recent years. Most methods in deep-RL achieve good results via the maximization of the reward signal provided by the environment, typically in the form of discounted…

Machine Learning · Computer Science 2018-09-10 Yubin Deng , Ke Yu , Dahua Lin , Xiaoou Tang , Chen Change Loy

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Imitation learning (IL) enables agents to mimic expert behaviors. Most previous IL techniques focus on precisely imitating one policy through mass demonstrations. However, in many applications, what humans require is the ability to perform…

Machine Learning · Computer Science 2023-10-10 Xiong-Hui Chen , Junyin Ye , Hang Zhao , Yi-Chen Li , Haoran Shi , Yu-Yan Xu , Zhihao Ye , Si-Hang Yang , Anqi Huang , Kai Xu , Zongzhang Zhang , Yang Yu

Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations

Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations. However, IL is susceptible to limitations imposed by noisy demonstrations from non-expert behaviors,…

Machine Learning · Computer Science 2023-10-25 Ye Yuan , Xin Li , Yong Heng , Leiji Zhang , MingZhong Wang