Related papers: Learning Equational Theorem Proving

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

Model-free deep reinforcement learning (RL) has demonstrated its superiority on many complex sequential decision-making problems. However, heavy dependence on dense rewards and high sample-complexity impedes the wide adoption of these…

Machine Learning · Computer Science 2020-04-02 Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

Support-weighted Adversarial Imitation Learning

Adversarial Imitation Learning (AIL) is a broad family of imitation learning methods designed to mimic expert behaviors from demonstrations. While AIL has shown state-of-the-art performance on imitation learning with only a small number of…

Machine Learning · Computer Science 2020-02-21 Ruohan Wang, Carlo Ciliberto, Pierluigi Amadori, Yiannis Demiris

Imitation Learning from Imperfect Demonstration

Imitation learning (IL) aims to learn an optimal policy from demonstrations. However, such demonstrations are often imperfect since collecting optimal ones is costly. To effectively learn from imperfect demonstrations, we propose a novel…

Machine Learning · Computer Science 2019-01-31 Yueh-Hua Wu, Nontawat Charoenphakdee, Han Bao, Voot Tangkaratt, Masashi Sugiyama

SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards

Learning to imitate expert behavior from demonstrations can be challenging, especially in environments with high-dimensional, continuous observations and unknown dynamics. Supervised learning methods based on behavioral cloning (BC) suffer…

Machine Learning · Computer Science 2019-09-27 Siddharth Reddy, Anca D. Dragan, Sergey Levine

Provably Efficient Adversarial Imitation Learning with Unknown Transitions

Imitation learning (IL) has proven to be an effective method for learning good policies from expert demonstrations. Adversarial imitation learning (AIL), a subset of IL methods, is particularly promising, but its theoretical foundation in…

Machine Learning · Computer Science 2023-06-14 Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo

Energy-Based Imitation Learning

We tackle a common scenario in imitation learning (IL), where agents try to recover the optimal policy from expert demonstrations without further access to the expert or environment reward signals. Except the simple Behavior Cloning (BC)…

Machine Learning · Computer Science 2021-04-16 Minghuan Liu, Tairan He, Minkai Xu, Weinan Zhang

Self-Imitation Advantage Learning

Self-imitation learning is a Reinforcement Learning (RL) method that encourages actions whose returns were higher than expected, which helps in hard exploration and sparse reward problems. It was shown to improve the performance of…

Machine Learning · Computer Science 2020-12-23 Johan Ferret, Olivier Pietquin, Matthieu Geist

Semi-Supervised One-Shot Imitation Learning

One-shot Imitation Learning (OSIL) aims to imbue AI agents with the ability to learn a new task from a single demonstration. To supervise the learning, OSIL typically requires a prohibitively large number of paired expert demonstrations --…

Machine Learning · Computer Science 2024-08-13 Philipp Wu, Kourosh Hakhamaneshi, Yuqing Du, Igor Mordatch, Aravind Rajeswaran, Pieter Abbeel

Single-Reset Divide & Conquer Imitation Learning

Demonstrations are commonly used to speed up the learning process of Deep Reinforcement Learning algorithms. To cope with the difficulty of accessing multiple demonstrations, some algorithms have been developed to learn from a single…

Robotics · Computer Science 2024-02-15 Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert

Self-evolved Imitation Learning in Simulated World

Imitation learning has been a trend recently, yet training a generalist agent across multiple tasks still requires large-scale expert demonstrations, which are costly and labor-intensive to collect. To address the challenge of limited…

Robotics · Computer Science 2025-09-25 Yifan Ye, Jun Cen, Jing Chen, Zhihe Lu

Imitation Bootstrapped Reinforcement Learning

Despite the considerable potential of reinforcement learning (RL), robotic control tasks predominantly rely on imitation learning (IL) due to its better sample efficiency. However, it is costly to collect comprehensive expert demonstrations…

Machine Learning · Computer Science 2024-05-22 Hengyuan Hu, Suvir Mirchandani, Dorsa Sadigh

Off-Policy Adversarial Inverse Reinforcement Learning

Adversarial Imitation Learning (AIL) is a class of algorithms in Reinforcement learning (RL), which tries to imitate an expert without taking any reward from the environment and does not provide expert behavior directly to the policy…

Machine Learning · Computer Science 2020-05-05 Samin Yeasar Arnob

Auto-Encoding Adversarial Imitation Learning

Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) sheds light on automatic policy…

Machine Learning · Computer Science 2024-02-05 Kaifeng Zhang, Rui Zhao, Ziming Zhang, Yang Gao

Inverse Reinforcement Learning without Reinforcement Learning

Inverse Reinforcement Learning (IRL) is a powerful set of techniques for imitation learning that aims to learn a reward function that rationalizes expert demonstrations. Unfortunately, traditional IRL methods suffer from a computational…

Machine Learning · Computer Science 2024-01-31 Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

Provably Efficient Imitation Learning from Observation Alone

We study Imitation Learning (IL) from Observations alone (ILFO) in large-scale MDPs. While most IL algorithms rely on an expert to directly provide actions to the learner, in this setting the expert only supplies sequences of observations.…

Machine Learning · Computer Science 2019-06-12 Wen Sun, Anirudh Vemula, Byron Boots, J. Andrew Bagnell

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

Adversarial imitation learning (AIL), a prominent approach in imitation learning, has achieved significant practical success powered by neural network approximation. However, existing theoretical analyses of AIL are primarily confined to…

Machine Learning · Computer Science 2026-05-05 Tian Xu, Zhilong Zhang, Zexuan Chen, Ruishuo Chen, Yihao Sun, Yang Yu

Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation

As a prominent category of imitation learning methods, adversarial imitation learning (AIL) has garnered significant practical success powered by neural network approximation. However, existing theoretical studies on AIL are primarily…

Machine Learning · Computer Science 2024-11-04 Tian Xu, Zhilong Zhang, Ruishuo Chen, Yihao Sun, Yang Yu

A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment…

Artificial Intelligence · Computer Science 2020-09-17 Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

Bayesian Robust Optimization for Imitation Learning

One of the main challenges in imitation learning is determining what action an agent should take when outside the state distribution of the demonstrations. Inverse reinforcement learning (IRL) can enable generalization to new states by…

Machine Learning · Computer Science 2024-03-04 Daniel S. Brown, Scott Niekum, Marek Petrik

DeeSIL: Deep-Shallow Incremental Learning

Incremental Learning (IL) is an interesting AI problem when the algorithm is assumed to work on a budget. This is especially true when IL is modeled using a deep learning approach, where two complex challenges arise due to limited memory,…

Computer Vision and Pattern Recognition · Computer Science 2018-08-21 Eden Belouadah, Adrian Popescu