English
Related papers

Related papers: Data augmentation for efficient learning from para…

200 papers

Approximating model predictive control (MPC) policy using expert-based supervised learning techniques requires labeled training data sets sampled from the MPC policy. This is typically obtained by sampling the feasible state-space and…

Optimization and Control · Mathematics 2022-03-16 Dinesh Krishnamoorthy

This paper considers the problem of data generation for MPC policy approximation. Learning an approximate MPC policy from expert demonstrations requires a large data set consisting of optimal state-action pairs, sampled across the feasible…

Systems and Control · Electrical Eng. & Systems 2023-06-22 Dinesh Krishnamoorthy

Recent Offline Reinforcement Learning methods have succeeded in learning high-performance policies from fixed datasets of experience. A particularly effective approach learns to first identify and then mimic optimal decision-making…

Machine Learning · Computer Science 2023-12-12 Jake Grigsby , Yanjun Qi

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

Offline reinforcement learning (RL) enables learning effective policies from fixed datasets without any environment interaction. Existing methods typically employ policy constraints to mitigate the distribution shift encountered during…

Machine Learning · Computer Science 2026-04-30 Tan Jing , Xiaorui Li , Chao Yao , Xiaojuan Ban , Yuetong Fang , Renjing Xu , Zhaolin Yuan

Imitation learning addresses the challenge of learning by observing an expert's demonstrations without access to reward signals from environments. Most existing imitation learning methods that do not require interacting with environments…

Machine Learning · Computer Science 2024-06-04 Shang-Fu Chen , Hsiang-Chun Wang , Ming-Hao Hsu , Chun-Mao Lai , Shao-Hua Sun

Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such…

Artificial Intelligence · Computer Science 2024-06-10 Federico Malato , Ville Hautamaki

Imitation from observation is a computational technique that teaches an agent on how to mimic the behavior of an expert by observing only the sequence of states from the expert demonstrations. Recent approaches learn the inverse dynamics of…

Artificial Intelligence · Computer Science 2020-04-29 Juarez Monteiro , Nathan Gavenski , Roger Granada , Felipe Meneguzzi , Rodrigo Barros

Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Ekin D. Cubuk , Barret Zoph , Dandelion Mane , Vijay Vasudevan , Quoc V. Le

Automated data augmentation has shown superior performance in image recognition. Existing works search for dataset-level augmentation policies without considering individual sample variations, which are likely to be sub-optimal. On the…

Machine Learning · Computer Science 2020-12-23 Fengwei Zhou , Jiawei Li , Chuanlong Xie , Fei Chen , Lanqing Hong , Rui Sun , Zhenguo Li

Learning generalizable and robust behavior cloning policies requires large volumes of high-quality robotics data. While human demonstrations (e.g., through teleoperation) serve as the standard source for expert behaviors, acquiring such…

This paper investigates how to incorporate expert observations (without explicit information on expert actions) into a deep reinforcement learning setting to improve sample efficiency. First, we formulate an augmented policy loss combining…

Machine Learning · Computer Science 2025-02-28 Erhan Can Ozcan , Vittorio Giammarino , James Queeney , Ioannis Ch. Paschalidis

High-dimensional measurements are often correlated which motivates their approximation by factor models. This holds also true when features are engineered via low-dimensional interactions or kernel tricks. This often results in over…

Applications · Statistics 2025-09-03 Xiaonan Zhu , Bingyan Wang , Jianqing Fan

Behavioural cloning has been extensively used to train agents and is recognized as a fast and solid approach to teach general behaviours based on expert trajectories. Such method follows the supervised learning paradigm and it strongly…

Artificial Intelligence · Computer Science 2022-01-20 Federico Malato , Joona Jehkonen , Ville Hautamäki

Data augmentation methods usually apply the same augmentation (or a mix of them) to all the training samples. For example, to perturb data with noise, the noise is sampled from a Normal distribution with a fixed standard deviation, for all…

The policy represented by the deep neural network can overfit the spurious features in observations, which hamper a reinforcement learning agent from learning effective policy. This issue becomes severe in high-dimensional state, where the…

Machine Learning · Computer Science 2023-05-01 Md Masudur Rahman , Yexiang Xue

Data augmentation is a critical component of deep learning pipelines, enhancing model generalization by increasing dataset diversity. Traditional augmentation strategies rely on manually designed transformations, stochastic sampling, or…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Ant Duru , Alptekin Temizel

Data Augmentation is a common technique used to enhance the performance of deep learning models by expanding the training dataset. Automatic Data Augmentation (ADA) methods are getting popular because of their capacity to generate policies…

Machine Learning · Computer Science 2024-04-02 Tien-Yu Chang , Hao Dai , Vincent S. Tseng

Data augmentation is commonly used to encode invariances in learning methods. However, this process is often performed in an inefficient manner, as artificial examples are created by applying a number of transformations to all points in the…

Machine Learning · Computer Science 2019-03-04 Michael Kuchnik , Virginia Smith

Imitation learning can reproduce policies by observing experts, which poses a problem regarding policy privacy. Policies, such as human, or policies on deployed robots, can all be cloned without consent from the owners. How can we protect…

Machine Learning · Computer Science 2020-08-04 Albert Zhan , Stas Tiomkin , Pieter Abbeel
‹ Prev 1 2 3 10 Next ›