English
Related papers

Related papers: A New Optimal Stepsize For Approximate Dynamic Pro…

200 papers

Approximate dynamic programming is a popular method for solving large Markov decision processes. This paper describes a new class of approximate dynamic programming (ADP) methods- distributionally robust ADP-that address the curse of…

Machine Learning · Statistics 2012-05-22 Marek Petrik

This study is aimed at answering the famous question of how the approximation errors at each iteration of Approximate Dynamic Programming (ADP) affect the quality of the final results considering the fact that errors at each iteration…

Systems and Control · Computer Science 2015-05-18 Ali Heydari

Equipping approximate dynamic programming (ADP) with inputconstraints has a tremendous significance. This enables ADP to be applied tothe systems with actuator limitations, which is quite common for dynamicalsystems. In a conventional…

Optimization and Control · Mathematics 2018-05-24 Xuefeng Bao , Zhi-Hong Mao , Nitin Sharma

The solutions to many sequential decision-making problems are characterized by dynamic programming and Bellman's principle of optimality. However, due to the inherent complexity of solving Bellman's equation exactly, there has been…

Systems and Control · Electrical Eng. & Systems 2026-03-24 Bowen Li , Edwin K. P. Chong , Ali Pezeshki

Reinforcement learning based adaptive/approximate dynamic programming (ADP) is a powerful technique to determine an approximate optimal controller for a dynamical system. These methods bypass the need to analytically solve the nonlinear…

Optimization and Control · Mathematics 2018-05-24 Xuefeng Bao , Zhi-Hong Mao , Nitin Sharma

Approximate Dynamic Programming (ADP) is a methodology to solve multi-stage stochastic optimization problems in multi-dimensional discrete or continuous spaces. ADP approximates the optimal value function by adaptively sampling both action…

Optimization and Control · Mathematics 2021-07-02 Vijay Kumar , Mort Webster

Enforcing state and input constraints during reinforcement learning (RL) in continuous state spaces is an open but crucial problem which remains a roadblock to using RL in safety-critical applications. This paper leverages invariant sets to…

Systems and Control · Electrical Eng. & Systems 2019-06-28 Ankush Chakrabarty , Rien Quirynen , Claus Danielson , Weinan Gao

This paper studies the robust optimal control design for uncertain nonlinear systems from a perspective of robust adaptive dynamic programming (robust-ADP). The objective is to fill up a gap in the past literature of ADP where dynamic…

Dynamical Systems · Mathematics 2013-03-12 Yu Jiang , Zhong-Ping Jiang

Trajectory following is one of the complicated control problems when its dynamics are nonlinear, stochastic and include a large number of parameters. The problem has significant difficulties including a large number of trials required for…

Robotics · Computer Science 2019-02-14 Ali Lenjani

For solving pseudo-convex global optimization problems, we present a novel fully adaptive steepest descent method (or ASDM) without any hard-to-estimate parameters. For the step-size regulation in an $\varepsilon$-normalized direction, we…

Optimization and Control · Mathematics 2021-08-12 Z. R. Gabidullina

We study decentralized optimization where multiple agents minimize the average of their (strongly) convex, smooth losses over a communication graph. Convergence of the existing decentralized methods generally hinges on an apriori, proper…

Optimization and Control · Mathematics 2025-08-01 Ilya Kuruzov , Xiaokai Chen , Gesualdo Scutari , Alexander Gasnikov

We consider stochastic approximation with block-coordinate stepsizes and propose adaptive stepsize rules that aim to minimize the expected distance from the next iterate to an (unknown) target point. These stepsize rules employ online…

Optimization and Control · Mathematics 2025-12-09 Tao Jiang , Lin Xiao

An algorithm of searching a zero of an unknown undimensional function is considered, measured at a point x with some error. The step sizes are random positive values and are calculated according to the rule: if two consecutive iterations…

Statistics Theory · Mathematics 2007-06-13 Alexander Plakhov , Pedro Cruz

We introduce a novel dynamic learning-rate scheduling scheme grounded in theory with the goal of simplifying the manual and time-consuming tuning of schedules in practice. Our approach is based on estimating the locally-optimal stepsize,…

Machine Learning · Computer Science 2023-11-27 Gilad Yehudai , Alon Cohen , Amit Daniely , Yoel Drori , Tomer Koren , Mariano Schain

In this paper, near optimal tracking of a class of nonlinear systems is addressed. Adaptive (approximate) dynamic programming approach is used to calculate the optimal control in closed form. ADP (Adaptive (approximate) dynamic programming)…

Optimization and Control · Mathematics 2021-09-22 Farshid Asadi , Ali Heydari

Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state…

Optimization and Control · Mathematics 2015-09-03 Daniel R. Jiang , Warren B. Powell

We address the challenge of optimizing meta-parameters (hyperparameters) in machine learning, a key factor for efficient training and high model performance. Rather than relying on expensive meta-parameter search methods, we introduce…

Machine Learning · Computer Science 2025-07-10 Arsalan Sharifnassab , Saber Salehkaleybar , Richard Sutton

While many theoretical works concerning Adaptive Dynamic Programming (ADP) have been proposed, application results are scarce. Therefore, we design an ADP-based optimal trajectory tracking controller and apply it to a large-scale…

Systems and Control · Electrical Eng. & Systems 2021-01-26 Florian Köpf , Sean Kille , Jairo Inga , Sören Hohmann

We propose an adaptive step-size rule for decentralized optimization. Choosing a step-size that balances convergence and stability is challenging. This is amplified in the decentralized setting as agents observe only local (possibly…

Optimization and Control · Mathematics 2026-02-17 Aaron Fainman , Stefan Vlaski

An algorithm is presented for momentum gradient descent optimization based on the first-order differential equation of the Newtonian dynamics. The fictitious mass is introduced to the dynamics of momentum for regularizing the adaptive…

Machine Learning · Computer Science 2018-05-15 Zhidong Han
‹ Prev 1 2 3 10 Next ›