Related papers: Accelerated Point-wise Maximum Approach to Approxi…

200 papers

In this paper, a convex optimization-based method is proposed for numerically solving dynamic programs in continuous state and action spaces. The key idea is to approximate the output of the Bellman operator at a particular state by the…

Optimization and Control · Mathematics 2020-10-23 Insoon Yang

We describe an approximate dynamic programming method for stochastic control problems on infinite state and input spaces. The optimal value function is approximated by a linear combination of basis functions with coefficients as decision…

Optimization and Control · Mathematics 2012-12-07 Tyler H. Summers, Konstantin Kunz, Nikolaos Kariotoglou, Maryam Kamgarpour, Sean Summers, John Lygeros

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value…

Optimization and Control · Mathematics 2020-05-25 Denis Lebedev, Paul Goulart, Kostas Margellos

We consider stochastic dynamic programming problems with high-dimensional, discrete state-spaces and finite, discrete-time horizons that prohibit direct computation of the value function from a given Bellman equation for all states and time…

Optimization and Control · Mathematics 2020-06-05 Denis Lebedev, Paul Goulart, Kostas Margellos

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bilinear programming formulation of value function approximation,…

Artificial Intelligence · Computer Science 2010-06-15 Marek Petrik, Shlomo Zilberstein

We consider least squares approximation of a function of one variable by a continuous, piecewise-linear approximand that has a small number of breakpoints. This problem was notably considered by Bellman who proposed an approximate algorithm…

Optimization and Control · Mathematics 2018-06-29 Olof Troeng, Mattias Fält

In this paper, we present a discretization algorithm for the finite-horizon, risk-constrained dynamic programming algorithm in [Chow_Pavone_13]. Although, from a theoretical standpoint, Bellman's recursion provides a systematic way to find optimal…

Optimization and Control · Mathematics 2015-01-12 Yin-Lam Chow, Marco Pavone

This paper considers an optimization problem for a dynamical system whose evolution depends on a collection of binary decision variables. We develop scalable approximation algorithms with provable suboptimality bounds to provide…

Optimization and Control · Mathematics 2016-10-31 Insoon Yang, Samuel A. Burden, Ram Rajagopal, S. Shankar Sastry, Claire J. Tomlin

The solutions to many sequential decision-making problems are characterized by dynamic programming and Bellman's principle of optimality. However, due to the inherent complexity of solving Bellman's equation exactly, there has been…

Systems and Control · Electrical Eng. & Systems 2026-03-24 Bowen Li, Edwin K. P. Chong, Ali Pezeshki

This paper considers non-smooth optimization problems where we seek to minimize the pointwise maximum of a continuously parameterized family of functions. Since the objective function is given as the solution to a maximization problem,…

Optimization and Control · Mathematics 2026-01-12 Dimitris Boskos, Jorge Cortés, Sonia Martínez

One often encounters the curse of dimensionality in the application of dynamic programming to determine optimal policies for controlled Markov chains. In this paper, we provide a method to construct sub-optimal policies along with a bound…

Systems and Control · Computer Science 2011-08-17 Myoungkuk Park, Krishnamoorthy Kalyanam, Swaroop Darbha, Phil Chandler, Meir Pachter

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington, Paul N. Beuchat, John Lygeros

We study the dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems to show that the underlying Bellman operator has a…

Optimization and Control · Mathematics 2019-10-28 Denis Lebedev, Paul Goulart, Kostas Margellos

A very simple example of an algorithmic problem solvable by dynamic programming is to maximize, over sets $A$ in $\{1,2,\dots,n\}$, the objective function $|A| - \sum_i \xi_i 1(i \in A, i+1 \in A)$ for given $\xi_i > 0$. This problem, with random…

Probability · Mathematics 2007-10-04 David J. Aldous, Charles Bordenave, Marc Lelarge

Approximate dynamic programming is a popular method for solving large Markov decision processes. This paper describes a new class of approximate dynamic programming (ADP) methods, distributionally robust ADP, that address the curse of…

Machine Learning · Statistics 2012-05-22 Marek Petrik

In this paper, we propose a proximal gradient method and an accelerated proximal gradient method for solving composite optimization problems, where the objective function is the sum of a smooth and a convex, possibly nonsmooth, function. We…

Optimization and Control · Mathematics 2025-07-22 Raghu Bollapragada, Shagun Gupta

Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state…

Optimization and Control · Mathematics 2015-09-03 Daniel R. Jiang, Warren B. Powell

Bilevel optimization has been developed for many machine learning tasks with large-scale and high-dimensional data. This paper considers a constrained bilevel optimization problem, where the lower-level optimization problem is convex with…

Machine Learning · Computer Science 2023-08-22 Siyuan Xu, Minghui Zhu

We develop a new Approximate Dynamic Programming (ADP) method for infinite horizon discounted reward Markov Decision Processes (MDP) based on projection onto a subsemimodule. We approximate the value function in terms of a $(\min,+)$ linear…

Systems and Control · Computer Science 2014-03-18 Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

We propose a method of approximating multivariate Gaussian probabilities using dynamic programming. We show that solving the optimization problem associated with a class of discrete-time finite horizon Markov decision processes with…

Optimization and Control · Mathematics 2018-02-08 Morgan Jones, Matthew M. Peet