Related papers: Curriculum generation using Autoencoder based cont…

Regularizing Trajectory Optimization with Denoising Autoencoders

Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize…

Machine Learning · Computer Science 2019-12-30 Rinu Boney , Norman Di Palo , Mathias Berglund , Alexander Ilin , Juho Kannala , Antti Rasmus , Harri Valpola

An Optimization Framework for Task Sequencing in Curriculum Learning

Curriculum learning in reinforcement learning is used to shape exploration by presenting the agent with increasingly complex tasks. The idea of curriculum learning has been largely applied in both animal training and pedagogy. In…

Machine Learning · Computer Science 2019-06-14 Francesco Foglino , Christiano Coletto Christakou , Matteo Leonetti

Self-Paced Deep Reinforcement Learning

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically…

Machine Learning · Computer Science 2020-10-26 Pascal Klink , Carlo D'Eramo , Jan Peters , Joni Pajarinen

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives. For reinforcement learning (RL), curricula are especially interesting, as the…

Machine Learning · Computer Science 2021-09-03 Pascal Klink , Hany Abdulsamad , Boris Belousov , Carlo D'Eramo , Jan Peters , Joni Pajarinen

Evaluating Curriculum Learning Strategies in Neural Combinatorial Optimization

Neural combinatorial optimization (NCO) aims at designing problem-independent and efficient neural network-based strategies for solving combinatorial problems. The field recently experienced growth by successfully adapting architectures…

Machine Learning · Computer Science 2020-11-13 Michal Lisicki , Arash Afkanpour , Graham W. Taylor

Reinforcement Learning with Success Induced Task Prioritization

Many challenging reinforcement learning (RL) problems require designing a distribution of tasks that can be applied to train effective policies. This distribution of tasks can be specified by the curriculum. A curriculum is meant to improve…

Machine Learning · Computer Science 2023-01-03 Maria Nesterova , Alexey Skrynnik , Aleksandr Panov

Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation

Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). However, neither RLHF nor DPO take into account the fact that learning certain…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Nicu Sebe , Mubarak Shah

Continuous Subspace Optimization for Continual Learning

Continual learning aims to learn multiple tasks sequentially while preserving prior knowledge, but faces the challenge of catastrophic forgetting when adapting to new tasks. Recently, approaches leveraging pre-trained models have gained…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Quan Cheng , Yuanyu Wan , Lingyu Wu , Chenping Hou , Lijun Zhang

Statistical Measures For Defining Curriculum Scoring Function

Curriculum learning is a training strategy that sorts the training examples by some measure of their difficulty and gradually exposes them to the learner to improve the network performance. Motivated by our insights from implicit curriculum…

Machine Learning · Computer Science 2021-07-28 Vinu Sankar Sadasivan , Anirban Dasgupta

Automatic Curriculum Learning for Driving Scenarios: Towards Robust and Efficient Reinforcement Learning

This paper addresses the challenges of training end-to-end autonomous driving agents using Reinforcement Learning (RL). RL agents are typically trained in a fixed set of scenarios and nominal behavior of surrounding road users in…

Robotics · Computer Science 2026-03-06 Ahmed Abouelazm , Tim Weinstein , Tim Joseph , Philip Schörner , J. Marius Zöllner

Curriculum Direct Preference Optimization for Diffusion and Consistency Models

Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). In this paper, we propose a novel and enhanced version of DPO based on curriculum…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Nicu Sebe , Mubarak Shah

Curriculum Design for Teaching via Demonstrations: Theory and Applications

We consider the problem of teaching via demonstrations in sequential decision-making settings. In particular, we study how to design a personalized curriculum over demonstrations to speed up the learner's convergence. We provide a unified…

Machine Learning · Computer Science 2021-12-17 Gaurav Yengera , Rati Devidze , Parameswaran Kamalaruban , Adish Singla

Continual Learning with Recursive Gradient Optimization

Learning multiple tasks sequentially without forgetting previous knowledge, called Continual Learning(CL), remains a long-standing challenge for neural networks. Most existing methods rely on additional network capacity or data replay. In…

Machine Learning · Computer Science 2022-02-01 Hao Liu , Huaping Liu

SPO: Sequential Monte Carlo Policy Optimisation

Leveraging planning during learning and decision-making is central to the long-term development of intelligent agents. Recent works have successfully combined tree-based search methods and self-play learning mechanisms to this end. However,…

Artificial Intelligence · Computer Science 2024-11-01 Matthew V Macfarlane , Edan Toledo , Donal Byrne , Paul Duckworth , Alexandre Laterre

Reward-Machine-Guided, Self-Paced Reinforcement Learning

Self-paced reinforcement learning (RL) aims to improve the data efficiency of learning by automatically creating sequences, namely curricula, of probability distributions over contexts. However, existing techniques for self-paced RL fail in…

Machine Learning · Computer Science 2023-05-29 Cevahir Koprulu , Ufuk Topcu

Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning

We use Bayesian optimization to learn curricula for word representation learning, optimizing performance on downstream tasks that depend on the learned representations as features. The curricula are modeled by a linear ranking function…

Computation and Language · Computer Science 2016-06-22 Yulia Tsvetkov , Manaal Faruqui , Wang Ling , Brian MacWhinney , Chris Dyer

Online Convex Optimization with Long Term Constraints for Predictable Sequences

In this paper, we investigate the framework of Online Convex Optimization (OCO) for online learning. OCO offers a very powerful online learning framework for many applications. In this context, we study a specific framework of OCO called…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan , Jianjun Yuan , Pramod P. Khargonekar

Rank Ordered Autoencoders

A new method for the unsupervised learning of sparse representations using autoencoders is proposed and implemented by ordering the output of the hidden units by their activation value and progressively reconstructing the input in this…

Machine Learning · Computer Science 2016-05-09 Paul Bertens

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings

In this work, we take a representation learning perspective on hierarchical reinforcement learning, where the problem of learning lower layers in a hierarchy is transformed into the problem of learning trajectory-level generative models. We…

Machine Learning · Computer Science 2018-06-08 John D. Co-Reyes , YuXuan Liu , Abhishek Gupta , Benjamin Eysenbach , Pieter Abbeel , Sergey Levine

Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training

Personalized recommendations are one of the most widely deployed machine learning (ML) workload serviced from cloud datacenters. As such, architectural solutions for high-performance recommendation inference have recently been the target of…

Hardware Architecture · Computer Science 2020-10-27 Youngeun Kwon , Yunjae Lee , Minsoo Rhu