Related papers: Accelerating Imitation Learning with Predictive Mo…

Explaining Fast Improvement in Online Imitation Learning

Online imitation learning (IL) is an algorithmic framework that leverages interactions with expert policies for efficient policy optimization. Here policies are optimized by performing online learning on a sequence of loss functions that…

Machine Learning · Computer Science 2021-02-23 Xinyan Yan , Byron Boots , Ching-An Cheng

Sample Efficient Imitation Learning via Reward Function Trained in Advance

Imitation learning (IL) is a framework that learns to imitate expert behavior from demonstrations. Recently, IL shows promising results on high dimensional and control tasks. However, IL typically suffers from sample inefficiency in terms…

Machine Learning · Computer Science 2021-11-24 Lihua Zhang

On Efficient Online Imitation Learning via Classification

Imitation learning (IL) is a general learning paradigm for tackling sequential decision-making problems. Interactive imitation learning, where learners can interactively query for expert demonstrations, has been shown to achieve provably…

Machine Learning · Computer Science 2022-09-27 Yichen Li , Chicheng Zhang

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

Provably Efficient Model-based Policy Adaptation

The high sample complexity of reinforcement learning challenges its use in practice. A promising approach is to quickly adapt pre-trained policies to new environments. Existing methods for this policy adaptation problem typically rely on…

Machine Learning · Computer Science 2020-06-16 Yuda Song , Aditi Mavalankar , Wen Sun , Sicun Gao

A Model-Based Approach to Imitation Learning through Multi-Step Predictions

Imitation learning is a widely used approach for training agents to replicate expert behavior in complex decision-making tasks. However, existing methods often struggle with compounding errors and limited generalization, due to the inherent…

Machine Learning · Computer Science 2025-04-21 Haldun Balim , Yang Hu , Yuyang Zhang , Na Li

Fast Policy Learning through Imitation and Reinforcement

Imitation learning (IL) consists of a set of tools that leverage expert demonstrations to quickly learn policies. However, if the expert is suboptimal, IL can yield policies with inferior performance compared to reinforcement learning (RL).…

Machine Learning · Computer Science 2018-05-29 Ching-An Cheng , Xinyan Yan , Nolan Wagener , Byron Boots

Smooth Imitation Learning for Online Sequence Prediction

We study the problem of smooth imitation learning for online sequence prediction, where the goal is to train a policy that can smoothly imitate demonstrated behavior in a dynamic and continuous environment in response to online, sequential…

Machine Learning · Computer Science 2016-06-06 Hoang M. Le , Andrew Kang , Yisong Yue , Peter Carr

Blending Imitation and Reinforcement Learning for Robust Policy Improvement

While reinforcement learning (RL) has shown promising performance, its sample complexity continues to be a substantial hurdle, restricting its broader application across a variety of domains. Imitation learning (IL) utilizes oracles to…

Machine Learning · Computer Science 2025-08-12 Xuefeng Liu , Takuma Yoneda , Rick L. Stevens , Matthew R. Walter , Yuxin Chen

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Imitation learning (IL) is a simple and powerful way to use high-quality human driving data, which can be collected at scale, to produce human-like behavior. However, policies based on imitation learning alone often fail to sufficiently…

Artificial Intelligence · Computer Science 2023-08-14 Yiren Lu , Justin Fu , George Tucker , Xinlei Pan , Eli Bronstein , Rebecca Roelofs , Benjamin Sapp , Brandyn White , Aleksandra Faust , Shimon Whiteson , Dragomir Anguelov , Sergey Levine

FM-IRL: Flow-Matching for Reward Modeling and Policy Regularization in Reinforcement Learning

Flow Matching (FM) has shown remarkable ability in modeling complex distributions and achieves strong performance in offline imitation learning for cloning expert behaviors. However, despite its behavioral cloning expressiveness, FM-based…

Machine Learning · Computer Science 2025-10-14 Zhenglin Wan , Jingxuan Wu , Xingrui Yu , Chubin Zhang , Mingcong Lei , Bo An , Ivor Tsang

Watch and Match: Supercharging Imitation with Regularized Optimal Transport

Imitation learning holds tremendous promise in learning policies efficiently for complex decision making problems. Current state-of-the-art algorithms often use inverse reinforcement learning (IRL), where given a set of expert…

Robotics · Computer Science 2023-02-22 Siddhant Haldar , Vaibhav Mathur , Denis Yarats , Lerrel Pinto

Deep Imitative Models for Flexible Inference, Planning, and Control

Imitation Learning (IL) is an appealing approach to learn desirable autonomous behavior. However, directing IL to achieve arbitrary goals is difficult. In contrast, planning-based algorithms use dynamics models and reward functions to…

Machine Learning · Computer Science 2019-10-02 Nicholas Rhinehart , Rowan McAllister , Sergey Levine

Efficient Offline Reinforcement Learning: First Imitate, then Improve

Supervised imitation-based approaches are often favored over off-policy reinforcement learning approaches for learning policies offline, since their straightforward optimization objective makes them computationally efficient and stable to…

Machine Learning · Computer Science 2025-12-30 Adam Jelley , Trevor McInroe , Sam Devlin , Amos Storkey

Generative predecessor models for sample-efficient imitation learning

We propose Generative Predecessor Models for Imitation Learning (GPRIL), a novel imitation learning algorithm that matches the state-action distribution to the distribution observed in expert demonstrations, using generative models to…

Machine Learning · Computer Science 2019-04-03 Yannick Schroecker , Mel Vecerik , Jonathan Scholz

Online Adaptation for Enhancing Imitation Learning Policies

Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such…

Artificial Intelligence · Computer Science 2024-06-10 Federico Malato , Ville Hautamaki

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

The goal of imitation learning is to mimic expert behavior from demonstrations, without access to an explicit reward signal. A popular class of approach infers the (unknown) reward function via inverse reinforcement learning (IRL) followed…

Machine Learning · Computer Science 2022-04-19 Carl Qi , Pieter Abbeel , Aditya Grover

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Imitation learning is well-suited for robotic tasks where it is difficult to directly program the behavior or specify a cost for optimal control. In this work, we propose a method for learning the reward function (and the corresponding…

Machine Learning · Computer Science 2021-01-01 Tianwei Ni , Harshit Sikchi , Yufei Wang , Tejus Gupta , Lisa Lee , Benjamin Eysenbach

Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms

We study interactive imitation learning, where a learner interactively queries a demonstrating expert for action annotations, aiming to learn a policy that has performance competitive with the expert, using as few annotations as possible.…

Machine Learning · Computer Science 2024-07-18 Yichen Li , Chicheng Zhang

SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning

Training large language models with reinforcement learning (RL) against verifiable rewards significantly enhances their reasoning abilities, yet remains computationally expensive due to inefficient uniform prompt sampling. We introduce…

Machine Learning · Computer Science 2026-03-06 Ruiqi Zhang , Daman Arora , Song Mei , Andrea Zanette