Related papers: Curriculum generation using Autoencoder based cont…

200 papers

Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize…

Machine Learning · Computer Science 2019-12-30 Rinu Boney, Norman Di Palo, Mathias Berglund, Alexander Ilin, Juho Kannala, Antti Rasmus, Harri Valpola

Curriculum learning in reinforcement learning is used to shape exploration by presenting the agent with increasingly complex tasks. The idea of curriculum learning has been largely applied in both animal training and pedagogy. In…

Machine Learning · Computer Science 2019-06-14 Francesco Foglino, Christiano Coletto Christakou, Matteo Leonetti

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically…

Machine Learning · Computer Science 2020-10-26 Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives. For reinforcement learning (RL), curricula are especially interesting, as the…

Machine Learning · Computer Science 2021-09-03 Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Neural combinatorial optimization (NCO) aims at designing problem-independent and efficient neural network-based strategies for solving combinatorial problems. The field recently experienced growth by successfully adapting architectures…

Machine Learning · Computer Science 2020-11-13 Michal Lisicki, Arash Afkanpour, Graham W. Taylor

Many challenging reinforcement learning (RL) problems require designing a distribution of tasks that can be applied to train effective policies. This distribution of tasks can be specified by the curriculum. A curriculum is meant to improve…

Machine Learning · Computer Science 2023-01-03 Maria Nesterova, Alexey Skrynnik, Aleksandr Panov

Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). However, neither RLHF nor DPO take into account the fact that learning certain…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe, Mubarak Shah

Continual learning aims to learn multiple tasks sequentially while preserving prior knowledge, but faces the challenge of catastrophic forgetting when adapting to new tasks. Recently, approaches leveraging pre-trained models have gained…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Quan Cheng, Yuanyu Wan, Lingyu Wu, Chenping Hou, Lijun Zhang

Curriculum learning is a training strategy that sorts the training examples by some measure of their difficulty and gradually exposes them to the learner to improve the network performance. Motivated by our insights from implicit curriculum…

Machine Learning · Computer Science 2021-07-28 Vinu Sankar Sadasivan, Anirban Dasgupta

This paper addresses the challenges of training end-to-end autonomous driving agents using Reinforcement Learning (RL). RL agents are typically trained in a fixed set of scenarios and nominal behavior of surrounding road users in…

Robotics · Computer Science 2026-03-06 Ahmed Abouelazm, Tim Weinstein, Tim Joseph, Philip Schörner, J. Marius Zöllner

Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). In this paper, we propose a novel and enhanced version of DPO based on curriculum…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe, Mubarak Shah

We consider the problem of teaching via demonstrations in sequential decision-making settings. In particular, we study how to design a personalized curriculum over demonstrations to speed up the learner's convergence. We provide a unified…

Machine Learning · Computer Science 2021-12-17 Gaurav Yengera, Rati Devidze, Parameswaran Kamalaruban, Adish Singla

Learning multiple tasks sequentially without forgetting previous knowledge, called Continual Learning (CL), remains a long-standing challenge for neural networks. Most existing methods rely on additional network capacity or data replay. In…

Machine Learning · Computer Science 2022-02-01 Hao Liu, Huaping Liu

Leveraging planning during learning and decision-making is central to the long-term development of intelligent agents. Recent works have successfully combined tree-based search methods and self-play learning mechanisms to this end. However,…

Artificial Intelligence · Computer Science 2024-11-01 Matthew V Macfarlane, Edan Toledo, Donal Byrne, Paul Duckworth, Alexandre Laterre

Self-paced reinforcement learning (RL) aims to improve the data efficiency of learning by automatically creating sequences, namely curricula, of probability distributions over contexts. However, existing techniques for self-paced RL fail in…

Machine Learning · Computer Science 2023-05-29 Cevahir Koprulu, Ufuk Topcu

We use Bayesian optimization to learn curricula for word representation learning, optimizing performance on downstream tasks that depend on the learned representations as features. The curricula are modeled by a linear ranking function…

Computation and Language · Computer Science 2016-06-22 Yulia Tsvetkov, Manaal Faruqui, Wang Ling, Brian MacWhinney, Chris Dyer

In this paper, we investigate the framework of Online Convex Optimization (OCO) for online learning. OCO offers a very powerful online learning framework for many applications. In this context, we study a specific framework of OCO called…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan, Jianjun Yuan, Pramod P. Khargonekar

A new method for the unsupervised learning of sparse representations using autoencoders is proposed and implemented by ordering the output of the hidden units by their activation value and progressively reconstructing the input in this…

Machine Learning · Computer Science 2016-05-09 Paul Bertens

In this work, we take a representation learning perspective on hierarchical reinforcement learning, where the problem of learning lower layers in a hierarchy is transformed into the problem of learning trajectory-level generative models. We…

Machine Learning · Computer Science 2018-06-08 John D. Co-Reyes, YuXuan Liu, Abhishek Gupta, Benjamin Eysenbach, Pieter Abbeel, Sergey Levine

Personalized recommendations are one of the most widely deployed machine learning (ML) workloads serviced from cloud datacenters. As such, architectural solutions for high-performance recommendation inference have recently been the target of…

Hardware Architecture · Computer Science 2020-10-27 Youngeun Kwon, Yunjae Lee, Minsoo Rhu