Related papers: Learning Efficient Representations for Reinforceme…

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

To overcome the curses of dimensionality and modeling of Dynamic Programming (DP) methods to solve Markov Decision Process (MDP) problems, Reinforcement Learning (RL) methods are adopted in practice. Contrary to traditional RL algorithms…

Machine Learning · Computer Science 2021-08-24 Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar

Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions

Designing sample-efficient and computationally feasible reinforcement learning (RL) algorithms is particularly challenging in environments with large or infinite state and action spaces. In this paper, we advance this effort by presenting…

Machine Learning · Computer Science 2024-10-04 Zakaria Mhammedi

Reinforcement Learning in Rich-Observation MDPs using Spectral Methods

Reinforcement learning (RL) in Markov decision processes (MDPs) with large state spaces is a challenging problem. The performance of standard RL algorithms degrades drastically with the dimensionality of state space. However, in practice,…

Artificial Intelligence · Computer Science 2018-06-21 Kamyar Azizzadenesheli, Alessandro Lazaric, Animashree Anandkumar

Reinforcement Learning algorithms for regret minimization in structured Markov Decision Processes

A recent goal in the Reinforcement Learning (RL) framework is to choose a sequence of actions or a policy to maximize the reward collected or minimize the regret incurred in a finite time horizon. For several RL problems in operation…

Machine Learning · Computer Science 2016-08-18 K J Prabuchandran, Tejas Bodas, Theja Tulabandhula

Bridging State and History Representations: Understanding Self-Predictive RL

Representations are at the core of all deep reinforcement learning (RL) methods for both Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs). Many representation learning methods and theoretical…

Machine Learning · Computer Science 2024-04-23 Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi, Michel Ma, Clement Gehring, Aditya Mahajan, Pierre-Luc Bacon

Representation Learning in Low-rank Slate-based Recommender Systems

Reinforcement learning (RL) in recommendation systems offers the potential to optimize recommendations for long-term user engagement. However, the environment often involves large state and action spaces, which makes it hard to efficiently…

Information Retrieval · Computer Science 2023-09-20 Yijia Dai, Wen Sun

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves the overall learning efficiency. This paper studies MTRL for…

Machine Learning · Computer Science 2026-04-07 Yaoze Guo, Shana Moothedath

Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

The success of deep reinforcement learning (DRL) lies in its ability to learn a representation that is well-suited for the exploration and exploitation task. To understand how the choice of representation can improve the efficiency of…

Machine Learning · Computer Science 2024-02-15 Weitong Zhang, Jiafan He, Dongruo Zhou, Amy Zhang, Quanquan Gu

Recursive Reinforcement Learning

Recursion is the fundamental paradigm to finitely describe potentially infinite objects. As state-of-the-art reinforcement learning (RL) algorithms cannot directly reason about recursion, they must rely on the practitioner's ingenuity in…

Machine Learning · Computer Science 2022-06-24 Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Making Linear MDPs Practical via Contrastive Representation Learning

It is common to address the curse of dimensionality in Markov decision processes (MDPs) by exploiting low-rank representations. This motivates much of the recent theoretical study on linear MDPs. However, most approaches require a given…

Machine Learning · Computer Science 2022-12-09 Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai

Adaptive Resolving Methods for Reinforcement Learning with Function Approximations

Reinforcement learning (RL) problems are fundamental in online decision-making and have been instrumental in finding an optimal policy for Markov decision processes (MDPs). Function approximations are usually deployed to handle large or…

Machine Learning · Computer Science 2025-05-20 Jiashuo Jiang, Yiming Zong, Yinyu Ye

Exploiting Multiple Abstractions in Episodic RL via Reward Shaping

One major limitation to the applicability of Reinforcement Learning (RL) to many practical domains is the large number of samples required to learn an optimal policy. To address this problem and improve learning efficiency, we consider a…

Machine Learning · Computer Science 2023-08-07 Roberto Cipollone, Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi

Breaking the Computational Barrier: Provably Efficient Actor-Critic for Low-Rank MDPs

Reinforcement learning (RL) is a fundamental framework for sequential decision-making, in which an agent learns an optimal policy through interactions with an unknown environment. In settings with function approximation, many existing RL…

Machine Learning · Computer Science 2026-05-05 Ruiquan Huang, Donghao Li, Yingbin Liang, Jing Yang

Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

Reinforcement Learning (RL) encompasses diverse paradigms, including model-based RL, policy-based RL, and value-based RL, each tailored to approximate the model, optimal policy, and optimal value function, respectively. This work…

Machine Learning · Computer Science 2024-12-10 Guhao Feng, Han Zhong

Representational efficiency outweighs action efficiency in human program induction

The importance of hierarchically structured representations for tractable planning has long been acknowledged. However, the questions of how people discover such abstractions and how to define a set of optimal abstractions remain open. This…

Artificial Intelligence · Computer Science 2018-07-20 Sophia Sanborn, David D. Bourgin, Michael Chang, Thomas L. Griffiths

Sample Complexity of Robust Reinforcement Learning with a Generative Model

The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are robust against the parameter uncertainties due to the mismatches between the simulator model and real-world settings. An RMDP problem is…

Machine Learning · Computer Science 2022-05-17 Kishan Panaganti, Dileep Kalathil

Model-based Reinforcement Learning: A Survey

Sequential decision making, commonly formalized as Markov Decision Process (MDP) optimization, is an important challenge in artificial intelligence. Two key approaches to this problem are reinforcement learning (RL) and planning. This paper…

Machine Learning · Computer Science 2022-04-01 Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

Spectral Representation-based Reinforcement Learning

In real-world applications with large state and action spaces, reinforcement learning (RL) typically employs function approximations to represent core components like the policies, value functions, and dynamics models. Although powerful…

Machine Learning · Computer Science 2026-01-29 Chenxiao Gao, Haotian Sun, Na Li, Dale Schuurmans, Bo Dai

On learning history based policies for controlling Markov decision processes

Reinforcement learning (RL) folklore suggests that history-based function approximation methods, such as recurrent neural nets or history-based state abstraction, perform better than their memory-less counterparts, due to the fact that function…

Machine Learning · Computer Science 2022-11-08 Gandharv Patil, Aditya Mahajan, Doina Precup

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Reinforcement Learning (RL) algorithms are known to suffer from the curse of dimensionality, which refers to the fact that large-scale problems often lead to exponentially high sample complexity. A common solution is to use deep neural…

Machine Learning · Computer Science 2024-11-13 Chenbei Lu, Laixi Shi, Zaiwei Chen, Chenye Wu, Adam Wierman