Related papers: Accelerated Reinforcement Learning

200 papers

Various acceleration approaches for Policy Gradient (PG) have been analyzed within the realm of Reinforcement Learning (RL). However, the theoretical understanding of the widely used momentum-based acceleration method on PG remains largely…

Machine Learning · Computer Science 2024-06-07 Yen-Ju Chen, Nai-Chieh Huang, Ching-Pei Lee, Ping-Chun Hsieh

Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search. Actor-critic algorithms combine the merits of both approaches by alternating between steps…

Machine Learning · Computer Science 2023-01-31 Harshat Kumar, Alec Koppel, Alejandro Ribeiro

In this paper, we generalize the well-known Nesterov's accelerated gradient (AG) method, originally designed for convex smooth optimization, to solve nonconvex and possibly stochastic optimization problems. We demonstrate that by properly…

Optimization and Control · Mathematics 2013-10-15 Saeed Ghadimi, Guanghui Lan

Following the seminal work of Nesterov, accelerated optimization methods have been used to powerfully boost the performance of first-order, gradient-based parameter estimation in scenarios where second-order optimization strategies are…

Numerical Analysis · Computer Science 2017-11-28 Anthony Yezzi, Ganesh Sundaramoorthi

We propose a framework to use Nesterov's accelerated method for constrained convex optimization problems. Our approach consists of first reformulating the original problem as an unconstrained optimization problem using a continuously…

Optimization and Control · Mathematics 2021-03-12 Priyank Srivastava, Jorge Cortes

We present a totally asynchronous algorithm for convex optimization that is based on a novel generalization of Nesterov's accelerated gradient method. This algorithm is developed for fast convergence under "total asynchrony," i.e., allowing…

Optimization and Control · Mathematics 2024-06-17 Ellie Pond, April Sebok, Zachary Bell, Matthew Hale

Nesterov's accelerated gradient (AG) is a popular technique to optimize objective functions comprising two components: a convex loss and a penalty function. While AG methods perform well for convex penalties, such as the LASSO, convergence…

Optimization and Control · Mathematics 2024-01-04 Kai Yang, Masoud Asgharian, Sahir Bhatnagar

We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective…

Optimization and Control · Mathematics 2015-10-27 Hongzhou Lin, Julien Mairal, Zaid Harchaoui

We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective…

Machine Learning · Statistics 2018-06-20 Hongzhou Lin, Julien Mairal, Zaid Harchaoui

We present a coupled system of ODEs which, when discretized with a constant time step/learning rate, recovers Nesterov's accelerated gradient descent algorithm. The same ODEs, when discretized with a decreasing learning rate, lead to novel…

Optimization and Control · Mathematics 2020-09-02 Maxime Laborde, Adam M. Oberman

Convergence analysis of accelerated first-order methods for convex optimization problems is presented from the point of view of ordinary differential equation solvers. A new dynamical system, called Nesterov accelerated gradient flow, has…

Optimization and Control · Mathematics 2022-03-01 Hao Luo, Long Chen

We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying MDPs controlled by the underlying…

Optimization and Control · Mathematics 2024-08-27 Sihan Zeng, Thinh T. Doan, Justin Romberg

We work towards a unifying paradigm for accelerating policy optimization methods in reinforcement learning (RL) by integrating foresight in the policy improvement step via optimistic and adaptive updates. Leveraging the connection between…

Machine Learning · Computer Science 2023-09-07 Veronica Chelu, Tom Zahavy, Arthur Guez, Doina Precup, Sebastian Flennerhag

Discrete optimization belongs to the set of $\mathcal{NP}$-hard problems, spanning fields such as mixed-integer programming and combinatorial optimization. A current standard approach to solving convex discrete optimization problems is the…

Machine Learning · Computer Science 2024-02-28 Kyle Mana, Fernando Acero, Stephen Mak, Parisa Zehtabi, Michael Cashmore, Daniele Magazzeni, Manuela Veloso

Randomized-subspace methods reduce the cost of first-order optimization by using only low-dimensional projected-gradient information, a feature that is attractive in forward-mode automatic differentiation and communication-limited settings.…

Optimization and Control · Mathematics 2026-05-04 Gaku Omiya, Pierre-Louis Poirion, Akiko Takeda

Various distributed gradient descent algorithms for multi-agent optimization have incorporated the Nesterov accelerated gradient method, where the use of momentum enhances convergence rates. These algorithms have found broad applications in…

Systems and Control · Electrical Eng. & Systems 2026-04-21 Zihao Ren, Lei Wang, Guodong Shi

We derive a second-order ordinary differential equation (ODE) which is the limit of Nesterov's accelerated gradient method. This ODE exhibits approximate equivalence to Nesterov's scheme and thus can serve as a tool for analysis. We show…

Machine Learning · Statistics 2015-10-29 Weijie Su, Stephen Boyd, Emmanuel J. Candes

In this work, we consider policy-based methods for solving the reinforcement learning problem, and establish the sample complexity guarantees. A policy-based algorithm typically consists of an actor and a critic. We consider using various…

Machine Learning · Computer Science 2023-01-16 Zaiwei Chen, Siva Theja Maguluri

We present a unifying framework for adapting the update direction in gradient-based iterative optimization methods. As natural special cases we re-derive classical momentum and Nesterov's accelerated gradient method, lending a new intuitive…

Machine Learning · Statistics 2016-07-12 Aleksandar Botev, Guy Lever, David Barber

In cooperative stochastic games multiple agents work towards learning joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are often imposed on the actions that can…

Multiagent Systems · Computer Science 2020-07-14 Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran K. J., Shalabh Bhatnagar