English
Related papers

Related papers: A Function Approximation Method for Model-based Hi…

200 papers

This paper introduces a new method for inverse reinforcement learning in large-scale and high-dimensional state spaces. To avoid solving the computationally expensive reinforcement learning problems in reward learning, we propose a function…

Machine Learning · Computer Science 2017-08-15 Kun Li , Joel W. Burdick

This paper develops an inverse reinforcement learning algorithm aimed at recovering a reward function from the observed actions of an agent. We introduce a strategy to flexibly handle different types of actions with two approximations of…

Machine Learning · Computer Science 2017-07-26 Kun Li , Yanan Sui , Joel W. Burdick

We consider the problem of reinforcement learning using function approximation, where the approximating basis can change dynamically while interacting with the environment. A motivation for such an approach is maximizing the value function…

Machine Learning · Computer Science 2010-05-04 Dotan Di Castro , Shie Mannor

One of the most natural approaches to reinforcement learning (RL) with function approximation is value iteration, which inductively generates approximations to the optimal value function by solving a sequence of regression problems. To…

Machine Learning · Computer Science 2024-06-19 Noah Golowich , Ankur Moitra

This paper develops an online inverse reinforcement learning algorithm aimed at efficiently recovering a reward function from ongoing observations of an agent's actions. To reduce the computation time and storage space in reward estimation,…

Robotics · Computer Science 2017-08-01 Kun Li , Joel W. Burdick

Reinforcement learning (RL) problems are fundamental in online decision-making and have been instrumental in finding an optimal policy for Markov decision processes (MDPs). Function approximations are usually deployed to handle large or…

Machine Learning · Computer Science 2025-05-20 Jiashuo Jiang , Yiming Zong , Yinyu Ye

A learning based method for obtaining feedback laws for nonlinear optimal control problems is proposed. The learning problem is posed such that the open loop value function is its optimal solution. This infinite dimensional, function space,…

Optimization and Control · Mathematics 2022-10-26 Karl Kunisch , Donato Vásquez-Varas , Daniel Walter

Value function approximation is important in modern reinforcement learning (RL) problems especially when the state space is (infinitely) large. Despite the importance and wide applicability of value function approximation, its theoretical…

Machine Learning · Computer Science 2023-02-24 Hanlin Zhu , Ruosong Wang , Jason D. Lee

Reinforcement learning algorithms in multi-agent systems deliver highly resilient and adaptable solutions for common problems in telecommunications,aerospace, and industrial robotics. However, achieving an optimal global goal remains a…

Multiagent Systems · Computer Science 2021-05-18 Changgang Zheng , Shufan Yang , Juan Parra-Ullauri , Antonio Garcia-Dominguez , Nelly Bencomo

Inverse reinforcement learning (IRL) is the problem of finding a reward function that generates a given optimal policy for a given Markov Decision Process. This paper looks at an algorithmic-independent geometric analysis of the IRL problem…

Machine Learning · Computer Science 2021-02-19 Abi Komanduru , Jean Honorio

This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for control-affine continuous-time nonlinear systems with unknown drift dynamics. Model-based reinforcement learning is used to…

Systems and Control · Computer Science 2017-07-25 Rushikesh Kamalapurkar , Lindsey Andrews , Patrick Walters , Warren E. Dixon

We propose and analyze a reinforcement learning principle that approximates the Bellman equations by enforcing their validity only along an user-defined space of test functions. Focusing on applications to model-free offline RL with…

Machine Learning · Computer Science 2022-10-13 Andrea Zanette , Martin J. Wainwright

While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the…

Machine Learning · Computer Science 2022-12-29 Tim G. J. Rudner , Vitchyr H. Pong , Rowan McAllister , Yarin Gal , Sergey Levine

This paper studies systematic exploration for reinforcement learning with rich observations and function approximation. We introduce a new model called contextual decision processes, that unifies and generalizes most prior settings. Our…

Machine Learning · Computer Science 2016-12-02 Nan Jiang , Akshay Krishnamurthy , Alekh Agarwal , John Langford , Robert E. Schapire

The shortcomings of maximum likelihood estimation in the context of model-based reinforcement learning have been highlighted by an increasing number of papers. When the model class is misspecified or has a limited representational capacity,…

Machine Learning · Computer Science 2021-06-08 Evgenii Nikishin , Romina Abachi , Rishabh Agarwal , Pierre-Luc Bacon

Reinforcement learning (RL) algorithms typically deal with maximizing the expected cumulative return (discounted or undiscounted, finite or infinite horizon). However, several crucial applications in the real world, such as drug discovery,…

Function approximation has been an indispensable component in modern reinforcement learning algorithms designed to tackle problems with large state spaces in high dimensions. This paper reviews recent results on error analysis for these…

Machine Learning · Computer Science 2024-02-27 Jihao Long , Jiequn Han

In many real-world settings, reinforcement learning systems suffer performance degradation when the environment encountered at deployment differs from that observed during training. Distributionally robust reinforcement learning (DR-RL)…

Machine Learning · Computer Science 2026-03-05 Debamita Ghosh , George K. Atia , Yue Wang

Commonly in reinforcement learning (RL), rewards are discounted over time using an exponential function to model time preference, thereby bounding the expected long-term reward. In contrast, in economics and psychology, it has been shown…

Machine Learning · Computer Science 2022-12-08 Matthias Schultheis , Constantin A. Rothkopf , Heinz Koeppl

Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives…

Machine Learning · Computer Science 2022-02-17 Jihao Long , Jiequn Han , Weinan E
‹ Prev 1 2 3 10 Next ›