Related papers: Compressed imitation learning

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

Reinforced Imitation in Heterogeneous Action Space

Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we consider a challenging setting where an agent and an expert use different actions from each other. We assume…

Machine Learning · Computer Science 2019-08-27 Konrad Zolna , Negar Rostamzadeh , Yoshua Bengio , Sungjin Ahn , Pedro O. Pinheiro

Coherent Soft Imitation Learning

Imitation learning methods seek to learn from an expert either through behavioral cloning (BC) of the policy or inverse reinforcement learning (IRL) of the reward. Such methods enable agents to learn complex tasks from humans that are…

Machine Learning · Computer Science 2023-12-07 Joe Watson , Sandy H. Huang , Nicolas Heess

Learning safe, constrained policies via imitation learning: Connection to Probabilistic Inference and a Naive Algorithm

This article introduces an imitation learning method for learning maximum entropy policies that comply with constraints demonstrated by expert trajectories executing a task. The formulation of the method takes advantage of results…

Machine Learning · Computer Science 2025-07-10 George Papadopoulos , George A. Vouros

Imitating Past Successes can be Very Suboptimal

Prior work has proposed a simple strategy for reinforcement learning (RL): label experience with the outcomes achieved in that experience, and then imitate the relabeled experience. These outcome-conditioned imitation learning methods are…

Machine Learning · Computer Science 2023-02-21 Benjamin Eysenbach , Soumith Udatha , Sergey Levine , Ruslan Salakhutdinov

Smooth Imitation Learning for Online Sequence Prediction

We study the problem of smooth imitation learning for online sequence prediction, where the goal is to train a policy that can smoothly imitate demonstrated behavior in a dynamic and continuous environment in response to online, sequential…

Machine Learning · Computer Science 2016-06-06 Hoang M. Le , Andrew Kang , Yisong Yue , Peter Carr

CoSam: An Efficient Collaborative Adaptive Sampler for Recommendation

Sampling strategies have been widely applied in many recommendation systems to accelerate model learning from implicit feedback data. A typical strategy is to draw negative instances with uniform distribution, which however will severely…

Information Retrieval · Computer Science 2020-11-17 Jiawei Chen , Chengquan Jiang , Can Wang , Sheng Zhou , Yan Feng , Chun Chen , Martin Ester , Xiangnan He

Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization

This work studies a Reinforcement Learning (RL) problem in which we are given a set of trajectories collected with K baseline policies. Each of these policies can be quite suboptimal in isolation, and have strong performance in…

Machine Learning · Computer Science 2024-03-29 Teodor V. Marinov , Alekh Agarwal , Mircea Trofin

Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning

The goal of imitation learning is to mimic expert behavior from demonstrations, without access to an explicit reward signal. A popular class of approach infers the (unknown) reward function via inverse reinforcement learning (IRL) followed…

Machine Learning · Computer Science 2022-04-19 Carl Qi , Pieter Abbeel , Aditya Grover

Learning Data Triage: Linear Decoding Works for Compressive MRI

The standard approach to compressive sampling considers recovering an unknown deterministic signal with certain known structure, and designing the sub-sampling pattern and recovery algorithm based on the known structure. This approach…

Information Theory · Computer Science 2016-02-03 Yen-Huan Li , Volkan Cevher

Learning the Sampling Pattern for MRI

The discovery of the theory of compressed sensing brought the realisation that many inverse problems can be solved even when measurements are "incomplete". This is particularly interesting in magnetic resonance imaging (MRI), where long…

Image and Video Processing · Electrical Eng. & Systems 2020-06-23 Ferdia Sherry , Martin Benning , Juan Carlos De los Reyes , Martin J. Graves , Georg Maierhofer , Guy Williams , Carola-Bibiane Schönlieb , Matthias J. Ehrhardt

Imitation Learning by Reinforcement Learning

Imitation learning algorithms learn a policy from demonstrations of expert behavior. We show that, for deterministic experts, imitation learning can be done by reduction to reinforcement learning with a stationary reward. Our theoretical…

Machine Learning · Statistics 2022-03-16 Kamil Ciosek

Statistical learning theory and Occam's razor: The core argument

Statistical learning theory is often associated with the principle of Occam's razor, which recommends a simplicity preference in inductive inference. This paper distills the core argument for simplicity obtainable from statistical learning…

Machine Learning · Computer Science 2024-12-02 Tom F. Sterkenburg

Learning Sampling and Model-Based Signal Recovery for Compressed Sensing MRI

Compressed sensing (CS) MRI relies on adequate undersampling of the k-space to accelerate the acquisition without compromising image quality. Consequently, the design of optimal sampling patterns for these k-space coefficients has received…

Image and Video Processing · Electrical Eng. & Systems 2021-01-26 Iris A. M. Huijben , Bastiaan S. Veeling , Ruud J. G. van Sloun

Sample-efficient Adversarial Imitation Learning

Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous…

Machine Learning · Computer Science 2024-01-24 Dahuin Jung , Hyungyu Lee , Sungroh Yoon

On the Sample Efficiency of Inverse Dynamics Models for Semi-Supervised Imitation Learning

Semi-supervised imitation learning (SSIL) consists in learning a policy from a small dataset of action-labeled trajectories and a much larger dataset of action-free trajectories. Some SSIL methods learn an inverse dynamics model (IDM) to…

Machine Learning · Computer Science 2026-02-04 Sacha Morin , Moonsub Byeon , Alexia Jolicoeur-Martineau , Sébastien Lachapelle

"Compressed" Compressed Sensing

The field of compressed sensing has shown that a sparse but otherwise arbitrary vector can be recovered exactly from a small number of randomly constructed linear projections (or samples). The question addressed in this paper is whether an…

Information Theory · Computer Science 2010-01-26 Galen Reeves , Michael Gastpar

Active Imitation Learning via Reduction to I.I.D. Active Learning

In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in…

Machine Learning · Computer Science 2012-10-19 Kshitij Judah , Alan Fern , Thomas G. Dietterich

Adapted variable density subsampling for compressed sensing

Recent results in compressed sensing showed that the optimal subsampling strategy should take into account the sparsity pattern of the signal at hand. This oracle-like knowledge, even though desirable, nevertheless remains elusive in most…

Information Theory · Computer Science 2023-06-28 Simon Ruetz

Dynamics-aware Embeddings

In this paper we consider self-supervised representation learning to improve sample efficiency in reinforcement learning (RL). We propose a forward prediction objective for simultaneously learning embeddings of states and action sequences.…

Machine Learning · Computer Science 2020-01-15 William Whitney , Rajat Agarwal , Kyunghyun Cho , Abhinav Gupta