Related papers: Data augmentation for efficient learning from para…

A Sensitivity-based Data Augmentation Framework for Model Predictive Control Policy Approximation

Approximating model predictive control (MPC) policy using expert-based supervised learning techniques requires labeled training data sets sampled from the MPC policy. This is typically obtained by sampling the feasible state-space and…

Optimization and Control · Mathematics 2022-03-16 Dinesh Krishnamoorthy

An Improved Data Augmentation Scheme for Model Predictive Control Policy Approximation

This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible…

Systems and Control · Electrical Eng. & Systems 2023-06-22 Dinesh Krishnamoorthy

A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets

Recent Offline Reinforcement Learning methods have succeeded in learning high-performance policies from fixed datasets of experience. A particularly effective approach learns to first identify and then mimic optimal decision-making…

Machine Learning · Computer Science 2023-12-12 Jake Grigsby , Yanjun Qi

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning

Offline reinforcement learning (RL) enables learning effective policies from fixed datasets without any environment interaction. Existing methods typically employ policy constraints to mitigate the distribution shift encountered during…

Machine Learning · Computer Science 2026-04-30 Tan Jing , Xiaorui Li , Chao Yao , Xiaojuan Ban , Yuetong Fang , Renjing Xu , Zhaolin Yuan

Diffusion Model-Augmented Behavioral Cloning

Imitation learning addresses the challenge of learning by observing an expert's demonstrations without access to reward signals from environments. Most existing imitation learning methods that do not require interacting with environments…

Machine Learning · Computer Science 2024-06-04 Shang-Fu Chen , Hsiang-Chun Wang , Ming-Hao Hsu , Chun-Mao Lai , Shao-Hua Sun

Online Adaptation for Enhancing Imitation Learning Policies

Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such…

Artificial Intelligence · Computer Science 2024-06-10 Federico Malato , Ville Hautamaki

Augmented Behavioral Cloning from Observation

Imitation from observation is a computational technique that teaches an agent on how to mimic the behavior of an expert by observing only the sequence of states from the expert demonstrations. Recent approaches learn the inverse dynamics of…

Artificial Intelligence · Computer Science 2020-04-29 Juarez Monteiro , Nathan Gavenski , Roger Granada , Felipe Meneguzzi , Rodrigo Barros

AutoAugment: Learning Augmentation Policies from Data

Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Ekin D. Cubuk , Barret Zoph , Dandelion Mane , Vijay Vasudevan , Quoc V. Le

MetaAugment: Sample-Aware Data Augmentation Policy Learning

Automated data augmentation has shown superior performance in image recognition. Existing works search for dataset-level augmentation policies without considering individual sample variations, which are likely to be sub-optimal. On the…

Machine Learning · Computer Science 2020-12-23 Fengwei Zhou , Jiawei Li , Chuanlong Xie , Fei Chen , Lanqing Hong , Rui Sun , Zhenguo Li

ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors

Learning generalizable and robust behavior cloning policies requires large volumes of high-quality robotics data. While human demonstrations (e.g., through teleoperation) serve as the standard source for expert behaviors, acquiring such…

Robotics · Computer Science 2026-04-22 Zifan Xu , Ran Gong , Maria Vittoria Minniti , Ahmet Salih Gundogdu , Eric Rosen , Kausik Sivakumar , Riedana Yan , Zixing Wang , Di Deng , Peter Stone , Xiaohan Zhang , Karl Schmeckpeper

A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations

This paper investigates how to incorporate expert observations (without explicit information on expert actions) into a deep reinforcement learning setting to improve sample efficiency. First, we formulate an augmented policy loss combining…

Machine Learning · Computer Science 2025-02-28 Erhan Can Ozcan , Vittorio Giammarino , James Queeney , Ioannis Ch. Paschalidis

Feature Augmentations for High-Dimensional Learning

High-dimensional measurements are often correlated which motivates their approximation by factor models. This holds also true when features are engineered via low-dimensional interactions or kernel tricks. This often results in over…

Applications · Statistics 2025-09-03 Xiaonan Zhu , Bingyan Wang , Jianqing Fan

Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation

Behavioural cloning has been extensively used to train agents and is recognized as a fast and solid approach to teach general behaviours based on expert trajectories. Such method follows the supervised learning paradigm and it strongly…

Artificial Intelligence · Computer Science 2022-01-20 Federico Malato , Joona Jehkonen , Ville Hautamäki

SapAugment: Learning A Sample Adaptive Policy for Data Augmentation

Data augmentation methods usually apply the same augmentation (or a mix of them) to all the training samples. For example, to perturb data with noise, the noise is sampled from a Normal distribution with a fixed standard deviation, for all…

Machine Learning · Computer Science 2021-02-16 Ting-Yao Hu , Ashish Shrivastava , Jen-Hao Rick Chang , Hema Koppula , Stefan Braun , Kyuyeon Hwang , Ozlem Kalinli , Oncel Tuzel

Adversarial Policy Optimization in Deep Reinforcement Learning

The policy represented by the deep neural network can overfit the spurious features in observations, which hamper a reinforcement learning agent from learning effective policy. This issue becomes severe in high-dimensional state, where the…

Machine Learning · Computer Science 2023-05-01 Md Masudur Rahman , Yexiang Xue

Adaptive Augmentation Policy Optimization with LLM Feedback

Data augmentation is a critical component of deep learning pipelines, enhancing model generalization by increasing dataset diversity. Traditional augmentation strategies rely on manually designed transformations, stochastic sampling, or…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Ant Duru , Alptekin Temizel

CAAP: Class-Dependent Automatic Data Augmentation Based On Adaptive Policies For Time Series

Data Augmentation is a common technique used to enhance the performance of deep learning models by expanding the training dataset. Automatic Data Augmentation (ADA) methods are getting popular because of their capacity to generate policies…

Machine Learning · Computer Science 2024-04-02 Tien-Yu Chang , Hao Dai , Vincent S. Tseng

Efficient Augmentation via Data Subsampling

Data augmentation is commonly used to encode invariances in learning methods. However, this process is often performed in an inefficient manner, as artificial examples are created by applying a number of transformations to all points in the…

Machine Learning · Computer Science 2019-03-04 Michael Kuchnik , Virginia Smith

Preventing Imitation Learning with Adversarial Policy Ensembles

Imitation learning can reproduce policies by observing experts, which poses a problem regarding policy privacy. Policies, such as human, or policies on deployed robots, can all be cloned without consent from the owners. How can we protect…

Machine Learning · Computer Science 2020-08-04 Albert Zhan , Stas Tiomkin , Pieter Abbeel