Related papers: Discovering Reinforcement Learning Algorithms

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

Deep reinforcement learning includes a broad family of algorithms that parameterise an internal representation, such as a value function or policy, by a deep neural network. Each algorithm optimises its parameters with respect to an…

Machine Learning · Computer Science 2020-07-17 Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver

Evolved Policy Gradients

We propose a metalearning approach for learning gradient-based reinforcement learning (RL) algorithms. The idea is to evolve a differentiable loss function, such that an agent, which optimizes its policy to minimize this loss, will achieve…

Machine Learning · Computer Science 2018-05-01 Rein Houthooft, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, Pieter Abbeel

Reinforcement Learning

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making…

Machine Learning · Computer Science 2020-06-16 Olivier Buffet, Olivier Pietquin, Paul Weng

A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges

Reinforcement Learning (RL) has emerged as a powerful paradigm in Artificial Intelligence (AI), enabling agents to learn optimal behaviors through interactions with their environments. Drawing from the foundations of trial and error, RL…

Artificial Intelligence · Computer Science 2025-02-04 Majid Ghasemi, Amir Hossein Moosavi, Dariush Ebrahimi

Meta-Gradient Reinforcement Learning

The goal of reinforcement learning algorithms is to estimate and/or optimise the value function. However, unlike supervised learning, no teacher or oracle is available to provide the true value function. Instead, the majority of…

Machine Learning · Computer Science 2018-05-25 Zhongwen Xu, Hado van Hasselt, David Silver

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing…

Machine Learning · Statistics 2023-01-06 Chengchun Shi, Zhengling Qi, Jianing Wang, Fan Zhou

Evolving Reinforcement Learning Algorithms

We propose a method for meta-learning reinforcement learning algorithms by searching over the space of computational graphs which compute the loss function for a value-based model-free RL agent to optimize. The learned algorithms are…

Machine Learning · Computer Science 2022-11-11 John D. Co-Reyes, Yingjie Miao, Daiyi Peng, Esteban Real, Sergey Levine, Quoc V. Le, Honglak Lee, Aleksandra Faust

Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models

Reinforcement learning algorithms are defined by their learning update rules, which are typically hand-designed and fixed. We present an evolutionary framework for discovering reinforcement learning algorithms by searching directly over…

Machine Learning · Computer Science 2026-03-31 Alkis Sygkounas, Amy Loutfi, Andreas Persson

Human-Inspired Multi-Level Reinforcement Learning

Reinforcement learning (RL), a common tool in decision making, learns control policies from various experiences based on the associated cumulative return/rewards without treating them differently. Humans, on the contrary, often learn to…

Machine Learning · Computer Science 2025-11-25 Mingkang Wu, Devin White, Vernon Lawhern, Nicholas R. Waytowich, Yongcan Cao

Reinforcement Learning with Lookahead Information

We study reinforcement learning (RL) problems in which agents observe the reward or transition realizations at their current state before deciding which action to take. Such observations are available in many applications, including…

Machine Learning · Computer Science 2024-10-22 Nadav Merlis

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

Deep reinforcement learning algorithms require large amounts of experience to learn an individual task. While in principle meta-reinforcement learning (meta-RL) algorithms enable agents to learn new skills from small amounts of experience,…

Machine Learning · Computer Science 2019-03-21 Kate Rakelly, Aurick Zhou, Deirdre Quillen, Chelsea Finn, Sergey Levine

Guided Meta-Policy Search

Reinforcement learning (RL) algorithms have demonstrated promising results on complex tasks, yet often require impractical numbers of samples since they learn from scratch. Meta-RL aims to address this challenge by leveraging experience…

Machine Learning · Computer Science 2020-10-28 Russell Mendonca, Abhishek Gupta, Rosen Kralev, Pieter Abbeel, Sergey Levine, Chelsea Finn

A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments

Reinforcement learning (RL) algorithms find applications in inventory control, recommender systems, vehicular traffic management, cloud computing and robotics. The real-world complications of many tasks arising in these domains makes them…

Machine Learning · Computer Science 2021-06-03 Sindhu Padakandla

Deep Reinforcement Learning using Genetic Algorithm for Parameter Optimization

Reinforcement learning (RL) enables agents to take decision based on a reward function. However, in the process of learning, the choice of values for learning algorithm parameters can significantly impact the overall learning process. In…

Neural and Evolutionary Computing · Computer Science 2019-05-13 Adarsh Sehgal, Hung Manh La, Sushil J. Louis, Hai Nguyen

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation

Reinforcement learning (RL) algorithms aim to learn optimal decisions in unknown environments through experience of taking actions and observing the rewards gained. In some cases, the environment is not influenced by the actions of the RL…

Machine Learning · Computer Science 2022-06-02 Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

Humans are masters at quickly learning many complex tasks, relying on an approximate understanding of the dynamics of their environments. In much the same way, we would like our learning agents to quickly adapt to new tasks. In this paper,…

Robotics · Computer Science 2019-10-10 Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

Reinforcement learning (RL) shows great potential in sequential decision-making. At present, mainstream RL algorithms are data-driven, which usually yield better asymptotic performance but much slower convergence compared with model-driven…

Machine Learning · Computer Science 2024-02-27 Yang Guan, Jingliang Duan, Shengbo Eben Li, Jie Li, Jianyu Chen, Bo Cheng

Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning

Reinforcement learning (RL) -- algorithms that teach artificial agents to interact with environments by maximising reward signals -- has achieved significant success in recent years. These successes have been facilitated by advances in…

Machine Learning · Computer Science 2025-04-03 Llewyn Salt, Marcus Gallagher

Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design

The past decade has seen vast progress in deep reinforcement learning (RL) on the back of algorithms manually designed by human researchers. Recently, it has been shown that it is possible to meta-learn update rules, with the hope of…

Machine Learning · Computer Science 2023-10-05 Matthew Thomas Jackson, Minqi Jiang, Jack Parker-Holder, Risto Vuorio, Chris Lu, Gregory Farquhar, Shimon Whiteson, Jakob Nicolaus Foerster

Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs

Reinforcement learning (RL) faces challenges in evaluating policy trajectories within intricate game tasks due to the difficulty in designing comprehensive and precise reward functions. This inherent difficulty curtails the broader…

Artificial Intelligence · Computer Science 2024-07-02 Zichao Shen, Tianchen Zhu, Qingyun Sun, Shiqi Gao, Jianxin Li