中文
相关论文

相关论文: Universal Reinforcement Learning

200 篇论文

Standard reinforcement learning (RL) algorithms assume that the observation of the next state comes instantaneously and at no cost. In a wide variety of sequential decision making tasks ranging from medical treatment to scientific…

人工智能 · 计算机科学 2020-05-27 Colin Bellinger , Rory Coles , Mark Crowley , Isaac Tamblyn

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

理论经济学 · 经济学 2020-03-24 Arthur Charpentier , Romuald Elie , Carl Remlinger

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

系统与控制 · 电气工程与系统科学 2023-04-25 Wuxia Chen , Taposh Banerjee , Jemin George , Carl Busart

We explore a collaborative and cooperative multi-agent reinforcement learning setting where a team of reinforcement learning agents attempt to solve a single cooperative task in a multi-scenario setting. We propose a novel multi-agent…

多智能体系统 · 计算机科学 2019-08-27 Hassam Ullah Sheikh , Ladislau Bölöni

The central tenet of reinforcement learning (RL) is that agents seek to maximize the sum of cumulative rewards. In contrast, active inference, an emerging framework within cognitive and computational neuroscience, proposes that agents act…

机器学习 · 计算机科学 2020-03-02 Alexander Tschantz , Beren Millidge , Anil K. Seth , Christopher L. Buckley

In reinforcement learning, an agent interacts sequentially with an environment to maximize a reward, receiving only partial, probabilistic feedback. This creates a fundamental exploration-exploitation trade-off: the agent must explore to…

量子物理 · 物理学 2026-03-27 Josep Lumbreras , Ruo Cheng Huang , Yanglin Hu , Marco Fanizza , Mile Gu

Many state-of-the-art reinforcement learning (RL) algorithms typically assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms…

人工智能 · 计算机科学 2017-05-31 John Aslanides , Jan Leike , Marcus Hutter

The past decade has seen the rapid development of Reinforcement Learning, which acquires impressive performance with numerous training resources. However, one of the greatest challenges in RL is generalization efficiency (i.e.,…

机器学习 · 计算机科学 2021-08-18 Qi Yang , Peng Yang , Ke Tang

Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining…

统计理论 · 数学 2022-09-01 Christophe Denis , Mohamed Hebiri , Boris Ndjia Njike , Xavier Siebert

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

机器学习 · 计算机科学 2019-01-23 Reazul Hasan Russel

We study lifelong reinforcement learning (RL) in a regret minimization setting of linear contextual Markov decision process (MDP), where the agent needs to learn a multi-task policy while solving a streaming sequence of tasks. We propose an…

机器学习 · 计算机科学 2022-06-02 Sanae Amani , Lin F. Yang , Ching-An Cheng

In this thesis, we research learning algorithms for optimal decision making in two different contexts, Reinforcement Learning in Part I and Auction Design in Part II. Reinforcement learning (RL) is an area of machine learning that is…

机器学习 · 计算机科学 2022-10-07 Jad Rahme

Reinforcement learning studies how an agent should interact with an environment to maximize its cumulative reward. A standard way to study this question abstractly is to ask how many samples an agent needs from the environment to learn an…

量子物理 · 物理学 2021-12-21 Daochen Wang , Aarthi Sundaram , Robin Kothari , Ashish Kapoor , Martin Roetteler

We use model-free reinforcement learning, extensive simulation, and transfer learning to develop a continuous control algorithm that has good zero-shot performance in a real physical environment. We train a simulated agent to act optimally…

人工智能 · 计算机科学 2018-03-09 M Ferguson , K. H. Law

Learning efficiently from small amounts of data has long been the focus of model-based reinforcement learning, both for the online case when interacting with the environment and the offline case when learning from a fixed dataset. However,…

Traditional reinforcement learning methods optimize agents without considering safety, potentially resulting in unintended consequences. In this paper, we propose an optimal actor-free policy that optimizes a risk-sensitive criterion based…

机器学习 · 计算机科学 2023-07-04 Ruoqi Zhang , Jens Sjölund

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making…

机器学习 · 计算机科学 2020-06-16 Olivier Buffet , Olivier Pietquin , Paul Weng

We develop Upside-Down Reinforcement Learning (UDRL), a method for learning to act using only supervised learning techniques. Unlike traditional algorithms, UDRL does not use reward prediction or search for an optimal policy. Instead, it…

机器学习 · 计算机科学 2021-09-07 Rupesh Kumar Srivastava , Pranav Shyam , Filipe Mutz , Wojciech Jaśkowski , Jürgen Schmidhuber

Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training…

机器学习 · 计算机科学 2023-12-19 Doseok Jang , Larry Yan , Lucas Spangher , Costas Spanos

Active inference is a unifying theory for perception and action resting upon the idea that the brain maintains an internal model of the world by minimizing free energy. From a behavioral perspective, active inference agents can be seen as…

机器学习 · 计算机科学 2024-01-17 Pietro Mazzaglia , Tim Verbelen , Bart Dhoedt
‹ 上一页 1 2 3 10 下一页 ›