English
Related papers

Related papers: Low-rank State-action Value-function Approximation

200 papers

Value-function (VF) approximation is a central problem in Reinforcement Learning (RL). Classical non-parametric VF estimation suffers from the curse of dimensionality. As a result, parsimonious parametric models have been adopted to…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Santiago Paternain , Antonio G. Marques

Estimating a policy that maps states to actions is a central problem in reinforcement learning. Traditionally, policies are inferred from the so called value functions (VFs), but exact VF computation suffers from the curse of…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Antonio G. Marques

The goal of reinforcement learning is estimating a policy that maps states to actions and maximizes the cumulative reward of a Markov Decision Process (MDP). This is oftentimes achieved by estimating first the optimal (reward) value…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Antonio G. Marques

Value-based methods constitute a fundamental methodology in planning and deep reinforcement learning (RL). In this paper, we propose to exploit the underlying structures of the state-action value function, i.e., Q function, for both…

Machine Learning · Computer Science 2020-07-07 Yuzhe Yang , Guo Zhang , Zhi Xu , Dina Katabi

We propose a novel value function approximation technique for Markov decision processes. We consider the problem of compactly representing the state-action value function using a low-rank and sparse matrix model. The problem is to decompose…

Machine Learning · Computer Science 2015-09-02 Hao Yi Ong

Value estimation is one key problem in Reinforcement Learning. Albeit many successes have been achieved by Deep Reinforcement Learning (DRL) in different fields, the underlying structure and learning dynamics of value function, especially…

Machine Learning · Computer Science 2021-11-22 Tong Sang , Hongyao Tang , Jianye Hao , Yan Zheng , Zhaopeng Meng

When using reinforcement learning (RL) algorithms it is common, given a large state space, to introduce some form of approximation architecture for the value function (VF). The exact form of this architecture can have a significant effect…

Machine Learning · Computer Science 2019-02-19 Edward Barker , Charl Ras

Value factorization is a popular paradigm for designing scalable multi-agent reinforcement learning algorithms. However, current factorization methods make choices without full justification that may limit their performance. For example,…

Artificial Intelligence · Computer Science 2024-09-11 Enrico Marchesini , Andrea Baisero , Rupali Bhati , Christopher Amato

We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value…

Low-rank representation learning has emerged as a powerful tool for recovering missing values in power load data due to its ability to exploit the inherent low-dimensional structures of spatiotemporal measurements. Among various techniques,…

Machine Learning · Computer Science 2025-06-24 Yan Xia , Hao Feng , Hongwei Sun , Junjie Wang , Qicong Hu

Low-rank approximation of a matrix by means of structured random sampling has been consistently efficient in its extensive empirical studies around the globe, but adequate formal support for this empirical phenomenon has been missing so…

Numerical Analysis · Mathematics 2016-07-21 Victor Pan , John Svadlenka , Liang Zhao

Low-rank matrix approximation is one of the central concepts in machine learning, with applications in dimension reduction, de-noising, multivariate statistical methodology, and many more. A recent extension to LRMA is called low-rank…

Machine Learning · Statistics 2021-09-24 Elena Tuzhilina , Trevor Hastie

Value function factorization via centralized training and decentralized execution is promising for solving cooperative multi-agent reinforcement tasks. One of the approaches in this area, QMIX, has become state-of-the-art and achieved the…

Multiagent Systems · Computer Science 2023-07-27 Hanhan Zhou , Tian Lan , Vaneet Aggarwal

Reinforcement learning (RL) aims to estimate the action to take given a (time-varying) state, with the goal of maximizing a cumulative reward function. Predominantly, there are two families of algorithms to solve RL problems: value-based…

Machine Learning · Computer Science 2025-01-10 Sergio Rozada , Hoi-To Wai , Antonio G. Marques

When using reinforcement learning (RL) algorithms to evaluate a policy it is common, given a large state space, to introduce some form of approximation architecture for the value function (VF). The exact form of this architecture can have a…

Artificial Intelligence · Computer Science 2017-03-06 Edward W. Barker , Charl J. Ras

We provide performance guarantees for a variant of simulation-based policy iteration for controlling Markov decision processes that involves the use of stochastic approximation algorithms along with state-of-the-art techniques that are…

Machine Learning · Computer Science 2022-10-17 Anna Winnicki , R. Srikant

We present on-line policy gradient algorithms for computing the locally optimal policy of a constrained, average cost, finite state Markov Decision Process. The stochastic approximation algorithms require estimation of the gradient of the…

Optimization and Control · Mathematics 2018-12-18 Vikram Krishnamurthy , Felisa Vazquez Abad

We consider the problem of recovering low-rank matrices from random rank-one measurements, which spans numerous applications including covariance sketching, phase retrieval, quantum state tomography, and learning shallow polynomial neural…

Information Theory · Computer Science 2018-12-04 Yuanxin Li , Cong Ma , Yuxin Chen , Yuejie Chi

Tensor methods are among the most prominent tools for the numerical solution of high-dimensional problems where functions of multiple variables have to be approximated. These methods exploit the tensor structure of function spaces and apply…

Numerical Analysis · Mathematics 2021-02-01 Anthony Nouy

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff , Nishanth Anand , Lucas Caccia , Doina Precup , Joelle Pineau
‹ Prev 1 2 3 10 Next ›