Related papers: A General Framework for Bounding Approximate Dynam…

Guaranteed Bounds for General Approximate Dynamic Programming

In this paper, we will develop a systematic approach to deriving guaranteed bounds for approximate dynamic programming (ADP) schemes in optimal control problems. Our approach is inspired by our recent results on bounding the performance of…

Optimization and Control · Mathematics 2014-03-31 Yajing Liu , Edwin K. P. Chong , Ali Pezeshki , Bill Moran

Performance Guarantees for Data-Driven Sequential Decision-Making

The solutions to many sequential decision-making problems are characterized by dynamic programming and Bellman's principle of optimality. However, due to the inherent complexity of solving Bellman's equation exactly, there has been…

Systems and Control · Electrical Eng. & Systems 2026-03-24 Bowen Li , Edwin K. P. Chong , Ali Pezeshki

Approximate Dynamic Programming By Minimizing Distributionally Robust Bounds

Approximate dynamic programming is a popular method for solving large Markov decision processes. This paper describes a new class of approximate dynamic programming (ADP) methods- distributionally robust ADP-that address the curse of…

Machine Learning · Statistics 2012-05-22 Marek Petrik

A Supplementary Condition for the Convergence of the Control Policy during Adaptive Dynamic Programming

Reinforcement learning based adaptive/approximate dynamic programming (ADP) is a powerful technique to determine an approximate optimal controller for a dynamical system. These methods bypass the need to analytically solve the nonlinear…

Optimization and Control · Mathematics 2018-05-24 Xuefeng Bao , Zhi-Hong Mao , Nitin Sharma

An Approximate Dynamic Programming Approach to Adversarial Online Learning

We describe an approximate dynamic programming (ADP) approach to compute approximations of the optimal strategies and of the minimal losses that can be guaranteed in discounted repeated games with vector-valued losses. Such games…

Computer Science and Game Theory · Computer Science 2020-10-27 Vijay Kamble , Patrick Loiseau , Jean Walrand

An Alternating Approach to Approximate Dynamic Programming

In this paper, we give a new approximate dynamic programming (ADP) method to solve large-scale Markov decision programming (MDP) problem. In comparison with many classic ADP methods which have large number of constraints, we formulate an…

Optimization and Control · Mathematics 2025-07-15 Di Zhang

A New Optimal Stepsize For Approximate Dynamic Programming

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many…

Optimization and Control · Mathematics 2014-07-15 Ilya O. Ryzhov , Peter I. Frazier , Warren B. Powell

A Theoretical Difficulty in Approximate Dynamic Programming with Input Constraints

Equipping approximate dynamic programming (ADP) with inputconstraints has a tremendous significance. This enables ADP to be applied tothe systems with actuator limitations, which is quite common for dynamicalsystems. In a conventional…

Optimization and Control · Mathematics 2018-05-24 Xuefeng Bao , Zhi-Hong Mao , Nitin Sharma

Robust Adaptive Dynamic Programming for Optimal Nonlinear Control Design

This paper studies the robust optimal control design for uncertain nonlinear systems from a perspective of robust adaptive dynamic programming (robust-ADP). The objective is to fill up a gap in the past literature of ADP where dynamic…

Dynamical Systems · Mathematics 2013-03-12 Yu Jiang , Zhong-Ping Jiang

Importance Sampling based Exploration in Q Learning

Approximate Dynamic Programming (ADP) is a methodology to solve multi-stage stochastic optimization problems in multi-dimensional discrete or continuous spaces. ADP approximates the optimal value function by adaptively sampling both action…

Optimization and Control · Mathematics 2021-07-02 Vijay Kumar , Mort Webster

Approximate Dynamic Programming for Real-time Dispatching and Relocation of Emergency Service Engineers

Quick response times are paramount for minimizing downtime in spare parts networks for capital goods, such as medical and manufacturing equipment. To guarantee that the maintenance is performed in a timely fashion, strategic management of…

Optimization and Control · Mathematics 2019-10-04 Dmitrii Usanov , Anna Pechina , Peter van de Ven , Rob van der Mei

Block Decomposable Methods for Large-Scale Optimization Problems

This dissertation explores block decomposable methods for large-scale optimization problems. It focuses on alternating direction method of multipliers (ADMM) schemes and block coordinate descent (BCD) methods. Specifically, it introduces a…

Optimization and Control · Mathematics 2026-01-15 Leandro Farias Maia

Theoretical and Numerical Analysis of Approximate Dynamic Programming with Approximation Errors

This study is aimed at answering the famous question of how the approximation errors at each iteration of Approximate Dynamic Programming (ADP) affect the quality of the final results considering the fact that errors at each iteration…

Systems and Control · Computer Science 2015-05-18 Ali Heydari

Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems

This paper presents a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB)…

Dynamical Systems · Mathematics 2017-01-11 Yu Jiang , Zhong-Ping Jiang

Approximate Dynamic Programming via a Smoothed Linear Program

We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP have typically relied on a natural `projection' of a…

Optimization and Control · Mathematics 2009-10-05 V. V. Desai , V. F. Farias , C. C. Moallemi

Adaptive Stochastic Alternating Direction Method of Multipliers

The Alternating Direction Method of Multipliers (ADMM) has been studied for years. The traditional ADMM algorithm needs to compute, at each iteration, an (empirical) expected loss function on all training examples, resulting in a…

Machine Learning · Statistics 2014-06-10 Peilin Zhao , Jinwei Yang , Tong Zhang , Ping Li

Approximate Dynamic Programming based on Projection onto the (min,+) subsemimodule

We develop a new Approximate Dynamic Programming (ADP) method for infinite horizon discounted reward Markov Decision Processes (MDP) based on projection onto a subsemimodule. We approximate the value function in terms of a $(\min,+)$ linear…

Systems and Control · Computer Science 2014-03-18 Chandrashekar Lakshminarayanan , Shalabh Bhatnagar

An Approximate Dynamic Programming Algorithm for Monotone Value Functions

Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state…

Optimization and Control · Mathematics 2015-09-03 Daniel R. Jiang , Warren B. Powell

Near optimal tracking control of a class of nonlinear systems and an experimental comparison

In this paper, near optimal tracking of a class of nonlinear systems is addressed. Adaptive (approximate) dynamic programming approach is used to calculate the optimal control in closed form. ADP (Adaptive (approximate) dynamic programming)…

Optimization and Control · Mathematics 2021-09-22 Farshid Asadi , Ali Heydari

Approximate Dynamic Programming For Linear Systems with State and Input Constraints

Enforcing state and input constraints during reinforcement learning (RL) in continuous state spaces is an open but crucial problem which remains a roadblock to using RL in safety-critical applications. This paper leverages invariant sets to…

Systems and Control · Electrical Eng. & Systems 2019-06-28 Ankush Chakrabarty , Rien Quirynen , Claus Danielson , Weinan Gao