Related papers: Universal Reinforcement Learning

Active Measure Reinforcement Learning for Observation Cost Minimization

Standard reinforcement learning (RL) algorithms assume that the observation of the next state comes instantaneously and at no cost. In a wide variety of sequential decision making tasks ranging from medical treatment to scientific…

Artificial Intelligence · Computer Science 2020-05-27 Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn

Reinforcement Learning in Economics and Finance

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

Theoretical Economics · Economics 2020-03-24 Arthur Charpentier, Romuald Elie, Carl Remlinger

Reinforcement Learning with an Abrupt Model Change

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

Systems and Control · Electrical Eng. & Systems 2023-04-25 Wuxia Chen, Taposh Banerjee, Jemin George, Carl Busart

Universal Policies to Learn Them All

We explore a collaborative and cooperative multi-agent reinforcement learning setting where a team of reinforcement learning agents attempt to solve a single cooperative task in a multi-scenario setting. We propose a novel multi-agent…

Multiagent Systems · Computer Science 2019-08-27 Hassam Ullah Sheikh, Ladislau Bölöni

Reinforcement Learning through Active Inference

The central tenet of reinforcement learning (RL) is that agents seek to maximize the sum of cumulative rewards. In contrast, active inference, an emerging framework within cognitive and computational neuroscience, proposes that agents act…

Machine Learning · Computer Science 2020-03-02 Alexander Tschantz, Beren Millidge, Anil K. Seth, Christopher L. Buckley

Reinforcement learning for quantum processes with memory

In reinforcement learning, an agent interacts sequentially with an environment to maximize a reward, receiving only partial, probabilistic feedback. This creates a fundamental exploration-exploitation trade-off: the agent must explore to…

Quantum Physics · Physics 2026-03-27 Josep Lumbreras, Ruo Cheng Huang, Yanglin Hu, Marco Fanizza, Mile Gu

Universal Reinforcement Learning Algorithms: Survey and Experiments

Many state-of-the-art reinforcement learning (RL) algorithms typically assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms…

Artificial Intelligence · Computer Science 2017-05-31 John Aslanides, Jan Leike, Marcus Hutter

Active Reinforcement Learning over MDPs

The past decade has seen the rapid development of Reinforcement Learning, which acquires impressive performance with numerous training resources. However, one of the greatest challenges in RL is generalization efficiency (i.e.,…

Machine Learning · Computer Science 2021-08-18 Qi Yang, Peng Yang, Ke Tang

Active learning algorithm through the lens of rejection arguments

Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining…

Statistics Theory · Mathematics 2022-09-01 Christophe Denis, Mohamed Hebiri, Boris Ndjia Njike, Xavier Siebert

A Short Survey on Probabilistic Reinforcement Learning

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation

We study lifelong reinforcement learning (RL) in a regret minimization setting of linear contextual Markov decision process (MDP), where the agent needs to learn a multi-task policy while solving a streaming sequence of tasks. We propose an…

Machine Learning · Computer Science 2022-06-02 Sanae Amani, Lin F. Yang, Ching-An Cheng

Learning Algorithms for Intelligent Agents and Mechanisms

In this thesis, we research learning algorithms for optimal decision making in two different contexts, Reinforcement Learning in Part I and Auction Design in Part II. Reinforcement learning (RL) is an area of machine learning that is…

Machine Learning · Computer Science 2022-10-07 Jad Rahme

Quantum Algorithms for Reinforcement Learning with a Generative Model

Reinforcement learning studies how an agent should interact with an environment to maximize its cumulative reward. A standard way to study this question abstractly is to ask how many samples an agent needs from the environment to learn an…

Quantum Physics · Physics 2021-12-21 Daochen Wang, Aarthi Sundaram, Robin Kothari, Ashish Kapoor, Martin Roetteler

Learning Robust and Adaptive Real-World Continuous Control Using Simulation and Transfer Learning

We use model-free reinforcement learning, extensive simulation, and transfer learning to develop a continuous control algorithm that has good zero-shot performance in a real physical environment. We train a simulated agent to act optimally…

Artificial Intelligence · Computer Science 2018-03-09 M Ferguson, K. H. Law

Online and Offline Reinforcement Learning by Planning with a Learned Model

Learning efficiently from small amounts of data has long been the focus of model-based reinforcement learning, both for the online case when interacting with the environment and the offline case when learning from a fixed dataset. However,…

Machine Learning · Computer Science 2021-04-14 Julian Schrittwieser, Thomas Hubert, Amol Mandhane, Mohammadamin Barekatain, Ioannis Antonoglou, David Silver

Risk-sensitive Actor-free Policy via Convex Optimization

Traditional reinforcement learning methods optimize agents without considering safety, potentially resulting in unintended consequences. In this paper, we propose an optimal actor-free policy that optimizes a risk-sensitive criterion based…

Machine Learning · Computer Science 2023-07-04 Ruoqi Zhang, Jens Sjölund

Reinforcement Learning

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making…

Machine Learning · Computer Science 2020-06-16 Olivier Buffet, Olivier Pietquin, Paul Weng

Training Agents using Upside-Down Reinforcement Learning

We develop Upside-Down Reinforcement Learning (UDRL), a method for learning to act using only supervised learning techniques. Unlike traditional algorithms, UDRL does not use reward prediction or search for an optimal policy. Instead, it…

Machine Learning · Computer Science 2021-09-07 Rupesh Kumar Srivastava, Pranav Shyam, Filipe Mutz, Wojciech Jaśkowski, Jürgen Schmidhuber

Active Reinforcement Learning for Robust Building Control

Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training…

Machine Learning · Computer Science 2023-12-19 Doseok Jang, Larry Yan, Lucas Spangher, Costas Spanos

Contrastive Active Inference

Active inference is a unifying theory for perception and action resting upon the idea that the brain maintains an internal model of the world by minimizing free energy. From a behavioral perspective, active inference agents can be seen as…

Machine Learning · Computer Science 2024-01-17 Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt