Related papers: Periodic Regularized Q-Learning

Regularized Q-learning

Q-learning is widely used algorithm in reinforcement learning community. Under the lookup table setting, its convergence is well established. However, its behavior is known to be unstable with the linear function approximation case. This…

Machine Learning · Computer Science 2025-02-11 Han-Dong Lim , Donghwan Lee

Regularized Q-Learning with Linear Function Approximation

Regularized Markov Decision Processes serve as models of sequential decision making under uncertainty wherein the decision maker has limited information processing capacity and/or aversion to model ambiguity. With functional approximation,…

Artificial Intelligence · Computer Science 2025-02-11 Jiachen Xi , Alfredo Garcia , Petar Momcilovic

Periodic Q-Learning

The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning…

Machine Learning · Computer Science 2020-02-25 Donghwan Lee , Niao He

Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning

The $Q$-learning algorithm is a simple and widely-used stochastic approximation scheme for reinforcement learning, but the basic protocol can exhibit instability in conjunction with function approximation. Such instability can be observed…

Machine Learning · Computer Science 2022-06-03 Andrea Zanette , Martin J. Wainwright

Q-Learning in Regularized Mean-field Games

In this paper, we introduce a regularized mean-field game and study learning of this game under an infinite-horizon discounted reward function. Regularization is introduced by adding a strongly concave regularization function to the…

Optimization and Control · Mathematics 2022-11-11 Berkay Anahtarci , Can Deha Kariksiz , Naci Saldi

Convergence of regularized agent-state-based Q-learning in POMDPs

In this paper, we present a framework to understand the convergence of commonly used Q-learning reinforcement learning algorithms in practice. Two salient features of such algorithms are: (i)~the Q-table is recursively updated using an…

Machine Learning · Computer Science 2025-09-04 Amit Sinha , Matthieu Geist , Aditya Mahajan

Zap Q-Learning With Nonlinear Function Approximation

Zap Q-learning is a recent class of reinforcement learning algorithms, motivated primarily as a means to accelerate convergence. Stability theory has been absent outside of two restrictive classes: the tabular setting, and optimal stopping.…

Machine Learning · Computer Science 2020-07-17 Shuhang Chen , Adithya M. Devraj , Fan Lu , Ana Bušić , Sean P. Meyn

Regularized Residual Quantization: a multi-layer sparse dictionary learning approach

The Residual Quantization (RQ) framework is revisited where the quantization distortion is being successively reduced in multi-layers. Inspired by the reverse-water-filling paradigm in rate-distortion theory, an efficient regularization on…

Machine Learning · Computer Science 2017-05-02 Sohrab Ferdowsi , Slava Voloshynovskiy , Dimche Kostadinov

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

Reinforcement learning (RL) post-training is crucial for LLM alignment and reasoning, but existing policy-based methods, such as PPO and DPO, can fall short of fixing shortcuts inherited from pre-training. In this work, we introduce…

Machine Learning · Computer Science 2025-10-21 Jin Peng Zhou , Kaiwen Wang , Jonathan Chang , Zhaolin Gao , Nathan Kallus , Kilian Q. Weinberger , Kianté Brantley , Wen Sun

Convergence Results For Q-Learning With Experience Replay

A commonly used heuristic in RL is experience replay (e.g.~\citet{lin1993reinforcement, mnih2015human}), in which a learner stores and re-uses past trajectories as if they were sampled online. In this work, we initiate a rigorous study of…

Machine Learning · Computer Science 2021-12-09 Liran Szlak , Ohad Shamir

On the Convergence of Approximate and Regularized Policy Iteration Schemes

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks. The regularized formulation modifies the standard RL…

Machine Learning · Statistics 2019-10-15 Elena Smirnova , Elvis Dohmatob

Convergence of Distributionally Robust Q-Learning with Linear Function Approximation

Distributionally robust reinforcement learning (DRRL) focuses on designing policies that achieve good performance under model uncertainties. The goal is to maximize the worst-case long-term discounted reward, where the data for RL comes…

Machine Learning · Computer Science 2026-03-17 Saptarshi Mandal , Yashaswini Murthy , R. Srikant

Value Penalized Q-Learning for Recommender Systems

Scaling reinforcement learning (RL) to recommender systems (RS) is promising since maximizing the expected cumulative rewards for RL agents meets the objective of RS, i.e., improving customers' long-term satisfaction. A key approach to this…

Machine Learning · Computer Science 2022-09-27 Chengqian Gao , Ke Xu , Kuangqi Zhou , Lanqing Li , Xueqian Wang , Bo Yuan , Peilin Zhao

Replicable Reinforcement Learning with Linear Function Approximation

Replication of experimental results has been a challenge faced by many scientific disciplines, including the field of machine learning. Recent work on the theory of machine learning has formalized replicability as the demand that an…

Machine Learning · Computer Science 2026-04-15 Eric Eaton , Marcel Hussing , Michael Kearns , Aaron Roth , Sikata Bela Sengupta , Jessica Sorrell

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Reinforcement learning (RL) has achieved phenomenal success in various domains. However, its data-driven nature also introduces new vulnerabilities that can be exploited by malicious opponents. Recent work shows that a well-trained RL agent…

Machine Learning · Computer Science 2024-03-08 Xiaolin Sun , Zizhan Zheng

Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization

This paper presents a review of the field of reinforcement learning (RL), with a focus on providing a comprehensive overview of the key concepts, techniques, and algorithms for beginners. RL has a unique setting, jargon, and mathematics…

Machine Learning · Computer Science 2023-04-04 Mohamed-Amine Chadi , Hajar Mousannif

CPR: Classifier-Projection Regularization for Continual Learning

We propose a general, yet simple patch that can be applied to existing regularization-based continual learning methods called classifier-projection regularization (CPR). Inspired by both recent results on neural networks with wide local…

Machine Learning · Computer Science 2021-04-20 Sungmin Cha , Hsiang Hsu , Taebaek Hwang , Flavio P. Calmon , Taesup Moon

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies to optimize agent operation over time. RL algorithms have shown promising results in solving complicated problems in recent years. However, their application on…

Machine Learning · Computer Science 2021-09-29 Hamed Khorasgani , Haiyan Wang , Chetan Gupta , Susumu Serita

Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization

Reinforcement learning (RL) methods with a high replay ratio (RR) and regularization have gained interest due to their superior sample efficiency. However, these methods have mainly been developed for dense-reward tasks. In this paper, we…

Machine Learning · Computer Science 2023-12-12 Takuya Hiraoka

Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning

$Q$-learning with function approximation is one of the most empirically successful while theoretically mysterious reinforcement learning (RL) algorithms, and was identified in Sutton (1999) as one of the most important theoretical open…

Machine Learning · Computer Science 2022-05-04 Zaiwei Chen , John Paul Clarke , Siva Theja Maguluri