Related papers: Finite-Sample Analysis for SARSA with Linear Funct…

Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

Stochastic approximation (SA) is a key method used in statistical learning. Recently, its non-asymptotic convergence analysis has been considered in many papers. However, most of the prior analyses are made under restrictive assumptions…

Machine Learning · Statistics 2019-06-18 Belhal Karimi , Blazej Miasojedow , Eric Moulines , Hoi-To Wai

On the Convergence of SARSA with Linear Function Approximation

SARSA, a classical on-policy control algorithm for reinforcement learning, is known to chatter when combined with linear function approximation: SARSA does not diverge but oscillates in a bounded region. However, little is known about how…

Machine Learning · Computer Science 2023-05-16 Shangtong Zhang , Remi Tachet , Romain Laroche

Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

Motivated by applications in reinforcement learning (RL), we study a nonlinear stochastic approximation (SA) algorithm under Markovian noise, and establish its finite-sample convergence bounds under various stepsizes. Specifically, we show…

Optimization and Control · Mathematics 2022-01-27 Zaiwei Chen , Sheng Zhang , Thinh T. Doan , John-Paul Clarke , Siva Theja Maguluri

Convergence of SARSA with linear function approximation: The random horizon case

The reinforcement learning algorithm SARSA combined with linear function approximation has been shown to converge for infinite horizon discounted Markov decision problems (MDPs). In this paper, we investigate the convergence of the…

Machine Learning · Computer Science 2023-06-08 Lina Palmborg

Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation

We provide performance guarantees for a variant of simulation-based policy iteration for controlling Markov decision processes that involves the use of stochastic approximation algorithms along with state-of-the-art techniques that are…

Machine Learning · Computer Science 2022-10-17 Anna Winnicki , R. Srikant

A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation

Motivated by the widespread use of temporal-difference (TD-) and Q-learning algorithms in reinforcement learning, this paper studies a class of biased stochastic approximation (SA) procedures under a mild "ergodic-like" assumption on the…

Machine Learning · Statistics 2020-09-02 Gang Wang , Bingcong Li , Georgios B. Giannakis

Stochastic Approximation with Markov Noise: Analysis and applications in reinforcement learning

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by "controlled" Markov noise. In particular, the faster and slower recursions have non-additive controlled Markov noise…

Machine Learning · Computer Science 2020-12-03 Prasenjit Karmakar

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Stochastic Approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with…

Optimization and Control · Mathematics 2023-07-18 Aymeric Dieuleveut , Gersende Fort , Eric Moulines , Hoi-To Wai

Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes

Stochastic Approximation (SA) is a popular approach for solving fixed-point equations where the information is corrupted by noise. In this paper, we consider an SA involving a contraction mapping with respect to an arbitrary norm, and show…

Machine Learning · Computer Science 2021-07-01 Zaiwei Chen , Siva Theja Maguluri , Sanjay Shakkottai , Karthikeyan Shanmugam

Stochastic Approximation for Risk-aware Markov Decision Processes

We develop a stochastic approximation-type algorithm to solve finite state/action, infinite-horizon, risk-aware Markov decision processes. Our algorithm has two loops. The inner loop computes the risk by solving a stochastic saddle-point…

Optimization and Control · Mathematics 2019-12-05 Wenjie Huang , William B. Haskell

Reinforcement Learning: Stochastic Approximation Algorithms for Markov Decision Processes

This article presents a short and concise description of stochastic approximation algorithms in reinforcement learning of Markov decision processes. The algorithms can also be used as a suboptimal method for partially observed Markov…

Optimization and Control · Mathematics 2015-12-25 Vikram Krishnamurthy

Cyclic Stochastic Optimization: Generalizations, Convergence, and Applications in Multi-Agent Systems

Stochastic approximation (SA) is a powerful class of iterative algorithms for nonlinear root-finding that can be used for minimizing a loss function, $L(\boldsymbol{\theta})$, with respect to a parameter vector $\boldsymbol{\theta}$, when…

Optimization and Control · Mathematics 2017-07-24 Karla Hernández Cuevas

Linear Stochastic Approximation Algorithms and Group Consensus over Random Signed Networks: A Technical Report with All Proofs

This paper studies linear stochastic approximation (SA) algorithms and their application to multi-agent systems in engineering and sociology. As main contribution, we provide necessary and sufficient conditions for convergence of linear SA…

Optimization and Control · Mathematics 2018-09-07 Ge Chen , Xiaoming Duan , Wenjun Mei , Francesco Bullo

Reinforcement Learning with Function Approximation for Non-Markov Processes

We study reinforcement learning methods with linear function approximation under non-Markov state and cost processes. We first consider the policy evaluation method and show that the algorithm converges under suitable ergodicity conditions…

Machine Learning · Computer Science 2026-01-05 Ali Devran Kara

Statistical Adaptive Stochastic Gradient Methods

We propose a statistical adaptive procedure called SALSA for automatically scheduling the learning rate (step size) in stochastic gradient methods. SALSA first uses a smoothed stochastic line-search procedure to gradually increase the…

Machine Learning · Statistics 2020-02-26 Pengchuan Zhang , Hunter Lang , Qiang Liu , Lin Xiao

State Dependent Performative Prediction with Stochastic Approximation

This paper studies the performative prediction problem which optimizes a stochastic loss function with data distribution that depends on the decision variable. We consider a setting where the agent(s) provides samples adapted to the…

Optimization and Control · Mathematics 2021-10-05 Qiang Li , Hoi-To Wai

Feature Adaptation for Sparse Linear Regression

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner , Frederic Koehler , Raghu Meka , Dhruv Rohatgi

Asynchronous Stochastic Approximation with Applications to Average-Reward Reinforcement Learning

This paper investigates the stability and convergence properties of asynchronous stochastic approximation (SA) algorithms, with a focus on extensions relevant to average-reward reinforcement learning. We first extend a stability proof…

Machine Learning · Computer Science 2025-12-10 Huizhen Yu , Yi Wan , Richard S. Sutton

Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time

In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint. Despite the necessary attention of the scientific community, considering stochastic stopping time, the…

Machine Learning · Computer Science 2024-03-26 Abhijit Mazumdar , Rafal Wisniewski , Manuela L. Bujorianu

SALSA: Sequential Approximate Leverage-Score Algorithm with Application in Analyzing Big Time Series Data

We develop a new efficient sequential approximate leverage score algorithm, SALSA, using methods from randomized numerical linear algebra (RandNLA) for large matrices. We demonstrate that, with high probability, the accuracy of SALSA's…

Machine Learning · Statistics 2024-01-02 Ali Eshragh , Luke Yerbury , Asef Nazari , Fred Roosta , Michael W. Mahoney