Related papers: Low-rank State-action Value-function Approximation

Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning

Value-function (VF) approximation is a central problem in Reinforcement Learning (RL). Classical non-parametric VF estimation suffers from the curse of dimensionality. As a result, parsimonious parametric models have been adopted to…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Santiago Paternain , Antonio G. Marques

Matrix Low-Rank Approximation For Policy Gradient Methods

Estimating a policy that maps states to actions is a central problem in reinforcement learning. Traditionally, policies are inferred from the so called value functions (VFs), but exact VF computation suffers from the curse of…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Antonio G. Marques

Tensor Low-rank Approximation of Finite-horizon Value Functions

The goal of reinforcement learning is estimating a policy that maps states to actions and maximizes the cumulative reward of a Markov Decision Process (MDP). This is oftentimes achieved by estimating first the optimal (reward) value…

Machine Learning · Computer Science 2024-05-29 Sergio Rozada , Antonio G. Marques

Harnessing Structures for Value-Based Planning and Reinforcement Learning

Value-based methods constitute a fundamental methodology in planning and deep reinforcement learning (RL). In this paper, we propose to exploit the underlying structures of the state-action value function, i.e., Q function, for both…

Machine Learning · Computer Science 2020-07-07 Yuzhe Yang , Guo Zhang , Zhi Xu , Dina Katabi

Value function approximation via low-rank models

We propose a novel value function approximation technique for Markov decision processes. We consider the problem of compactly representing the state-action value function using a low-rank and sparse matrix model. The problem is to decompose…

Machine Learning · Computer Science 2015-09-02 Hao Yi Ong

Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning

Value estimation is one key problem in Reinforcement Learning. Albeit many successes have been achieved by Deep Reinforcement Learning (DRL) in different fields, the underlying structure and learning dynamics of value function, especially…

Machine Learning · Computer Science 2021-11-22 Tong Sang , Hongyao Tang , Jianye Hao , Yan Zheng , Zhaopeng Meng

Unsupervised Basis Function Adaptation for Reinforcement Learning

When using reinforcement learning (RL) algorithms it is common, given a large state space, to introduce some form of approximation architecture for the value function (VF). The exact form of this architecture can have a significant effect…

Machine Learning · Computer Science 2019-02-19 Edward Barker , Charl Ras

On Stateful Value Factorization in Multi-Agent Reinforcement Learning

Value factorization is a popular paradigm for designing scalable multi-agent reinforcement learning algorithms. However, current factorization methods make choices without full justification that may limit their performance. For example,…

Artificial Intelligence · Computer Science 2024-09-11 Enrico Marchesini , Andrea Baisero , Rupali Bhati , Christopher Amato

A Geometric Perspective on Optimal Representations for Reinforcement Learning

We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value…

Machine Learning · Computer Science 2019-06-27 Marc G. Bellemare , Will Dabney , Robert Dadashi , Adrien Ali Taiga , Pablo Samuel Castro , Nicolas Le Roux , Dale Schuurmans , Tor Lattimore , Clare Lyle

Fast and Accurate Power Load Data Completion via Regularization-optimized Low-Rank Factorization

Low-rank representation learning has emerged as a powerful tool for recovering missing values in power load data due to its ability to exploit the inherent low-dimensional structures of spatiotemporal measurements. Among various techniques,…

Machine Learning · Computer Science 2025-06-24 Yan Xia , Hao Feng , Hongwei Sun , Junjie Wang , Qicong Hu

Fast Derandomized Low-rank Approximation and Extensions

Low-rank approximation of a matrix by means of structured random sampling has been consistently efficient in its extensive empirical studies around the globe, but adequate formal support for this empirical phenomenon has been missing so…

Numerical Analysis · Mathematics 2016-07-21 Victor Pan , John Svadlenka , Liang Zhao

Weighted Low Rank Matrix Approximation and Acceleration

Low-rank matrix approximation is one of the central concepts in machine learning, with applications in dimension reduction, de-noising, multivariate statistical methodology, and many more. A recent extension to LRMA is called low-rank…

Machine Learning · Statistics 2021-09-24 Elena Tuzhilina , Trevor Hastie

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

Value function factorization via centralized training and decentralized execution is promising for solving cooperative multi-agent reinforcement tasks. One of the approaches in this area, QMIX, has become state-of-the-art and achieved the…

Multiagent Systems · Computer Science 2023-07-27 Hanhan Zhou , Tian Lan , Vaneet Aggarwal

Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning

Reinforcement learning (RL) aims to estimate the action to take given a (time-varying) state, with the goal of maximizing a cumulative reward function. Predominantly, there are two families of algorithms to solve RL problems: value-based…

Machine Learning · Computer Science 2025-01-10 Sergio Rozada , Hoi-To Wai , Antonio G. Marques

Unsupervised Basis Function Adaptation for Reinforcement Learning

When using reinforcement learning (RL) algorithms to evaluate a policy it is common, given a large state space, to introduce some form of approximation architecture for the value function (VF). The exact form of this architecture can have a…

Artificial Intelligence · Computer Science 2017-03-06 Edward W. Barker , Charl J. Ras

Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation

We provide performance guarantees for a variant of simulation-based policy iteration for controlling Markov decision processes that involves the use of stochastic approximation algorithms along with state-of-the-art techniques that are…

Machine Learning · Computer Science 2022-10-17 Anna Winnicki , R. Srikant

Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives

We present on-line policy gradient algorithms for computing the locally optimal policy of a constrained, average cost, finite state Markov Decision Process. The stochastic approximation algorithms require estimation of the gradient of the…

Optimization and Control · Mathematics 2018-12-18 Vikram Krishnamurthy , Felisa Vazquez Abad

Nonconvex Matrix Factorization from Rank-One Measurements

We consider the problem of recovering low-rank matrices from random rank-one measurements, which spans numerous applications including covariance sketching, phase retrieval, quantum state tomography, and learning shallow polynomial neural…

Information Theory · Computer Science 2018-12-04 Yuanxin Li , Cong Ma , Yuxin Chen , Yuejie Chi

Low-rank methods for high-dimensional approximation and model order reduction

Tensor methods are among the most prominent tools for the numerical solution of high-dimensional problems where functions of multiple variables have to be approximated. These methods exploit the tensor structure of function spaces and apply…

Numerical Analysis · Mathematics 2021-02-01 Anthony Nouy

Recurrent Value Functions

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff , Nishanth Anand , Lucas Caccia , Doina Precup , Joelle Pineau