Related papers: Linear Programming for Large-Scale Markov Decision…

Large-Scale Markov Decision Problems via the Linear Programming Dual

We consider the problem of controlling a fully specified Markov decision process (MDP), also known as the planning problem, when the state space is very large and calculating the optimal policy is intractable. Instead, we pursue the more…

Optimization and Control · Mathematics 2019-01-09 Yasin Abbasi-Yadkori, Peter L. Bartlett, Xi Chen, Alan Malek

On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs

We consider the linear programming approach for constrained and unconstrained Markov decision processes (MDPs) under the long-run average cost criterion, where the class of MDPs in our study have Borel state spaces and discrete countable…

Optimization and Control · Mathematics 2021-04-20 Huizhen Yu

Finite-Horizon Markov Decision Processes with State Constraints

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (minimize…

Optimization and Control · Mathematics 2015-07-08 Mahmoud El Chamie, Behcet Acikmese

Efficient Planning in Large MDPs with Weak Linear Function Approximation

Large-scale Markov decision processes (MDPs) require planning algorithms with runtime independent of the number of states of the MDP. We consider the planning problem in MDPs using linear value function approximation with only weak…

Machine Learning · Computer Science 2020-07-14 Roshan Shariff, Csaba Szepesvári

Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes

In this paper, we consider the finite-state approximation of a discrete-time constrained Markov decision process (MDP) under the discounted and average cost criteria. Using the linear programming formulation of the constrained discounted…

Optimization and Control · Mathematics 2018-07-10 Naci Saldi

A Primal-Dual Approach to Constrained Markov Decision Processes

In many operations management problems, we need to make decisions sequentially to minimize the cost while satisfying certain constraints. One modeling approach to study such problems is the constrained Markov decision process (CMDP). When…

Optimization and Control · Mathematics 2021-01-27 Yi Chen, Jing Dong, Zhaoran Wang

On Sample Complexity of Projection-Free Primal-Dual Methods for Learning Mixture Policies in Markov Decision Processes

We study the problem of learning a policy for an infinite-horizon, discounted-cost Markov decision process (MDP) with a large number of states. We compute the actions of a policy that is nearly as good as a policy chosen by a suitable oracle…

Machine Learning · Computer Science 2019-09-02 Masoud Badiei Khuzani, Varun Vasudevan, Hongyi Ren, Lei Xing

Linear Programming for Decision Processes with Partial Information

Markov Decision Processes (MDPs) are stochastic optimization problems that model situations where a decision maker controls a system based on its state. Partially observed Markov decision processes (POMDPs) are generalizations of MDPs where…

Optimization and Control · Mathematics 2019-03-26 Victor Cohen, Axel Parmentier

Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties

We consider a broad class of dynamic programming (DP) problems that involve a partially linear structure and some positivity properties in their system equation and cost function. We address deterministic and stochastic problems, possibly…

Optimization and Control · Mathematics 2026-04-21 Yuchao Li, Dimitri Bertsekas

Modeling and Optimizing Resource Allocation Decisions through Multi-model Markov Decision Processes with Capacity Constraints

This paper proposes a new formulation for the dynamic resource allocation problem, which converts the traditional MDP model with known parameters and no capacity constraints to a new model with uncertain parameters and a resource capacity…

Optimization and Control · Mathematics 2020-11-10 Onur Demiray, Evrim Didem Güneş, Lerzan Örmeci

Coordinated Multi-Agent Patrolling with State-Dependent Cost Rates: Asymptotically Optimal Policies for Large-Scale Systems

We study a large-scale patrol problem with state-dependent costs and multi-agent coordination. We consider heterogeneous agents, rather general reward functions, and the capabilities of tracking agents' trajectories. Given the complexity and…

Optimization and Control · Mathematics 2024-12-12 Jing Fu, Zengfu Wang, Jie Chen

Bounding Procedures for Stochastic Dynamic Programs with Application to the Perimeter Patrol Problem

One often encounters the curse of dimensionality in the application of dynamic programming to determine optimal policies for controlled Markov chains. In this paper, we provide a method to construct sub-optimal policies along with a bound…

Systems and Control · Computer Science 2011-08-17 Myoungkuk Park, Krishnamoorthy Kalyanam, Swaroop Darbha, Phil Chandler, Meir Pachter

Sufficient Markov Decision Processes with Alternating Deep Neural Networks

Advances in mobile computing technologies have made it possible to monitor and apply data-driven interventions across complex systems in real time. Markov decision processes (MDPs) are the primary model for sequential decision problems with…

Methodology · Statistics 2018-03-20 Longshaokan Wang, Eric B. Laber, Katie Witkiewitz

Linear Dynamics meets Linear MDPs: Closed-Form Optimal Policies via Reinforcement Learning

Many applications -- including power systems, robotics, and economics -- involve a dynamical system interacting with a stochastic and hard-to-model environment. We adopt a reinforcement learning approach to control such systems.…

Optimization and Control · Mathematics 2025-08-26 Abed AlRahman Al Makdah, Oliver Kosut, Lalitha Sankar, Shaofeng Zou

Randomized Linear Programming Solves the Discounted Markov Decision Problem In Nearly-Linear (Sometimes Sublinear) Running Time

We propose a novel randomized linear programming algorithm for approximating the optimal policy of the discounted Markov decision problem. By leveraging the value-policy duality and binary-tree data structures, the algorithm adaptively…

Optimization and Control · Mathematics 2019-06-04 Mengdi Wang

On the Minimum Pair Approach for Average-Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs

We consider average-cost Markov decision processes (MDPs) with Borel state spaces, countable, discrete action spaces, and strictly unbounded one-stage costs. For the minimum pair approach, we introduce a new majorization condition on the…

Optimization and Control · Mathematics 2020-05-06 Huizhen Yu

Stochastic dominance-constrained Markov decision processes

We are interested in risk constraints for infinite horizon discrete time Markov decision processes (MDPs). Starting with average reward MDPs, we show that increasing concave stochastic dominance constraints on the empirical distribution of…

Optimization and Control · Mathematics 2012-06-21 William B. Haskell, Rahul Jain

Computing monotone policies for Markov decision processes: a nearly-isotonic penalty approach

This paper discusses algorithms for solving Markov decision processes (MDPs) that have monotone optimal policies. We propose a two-stage alternating convex optimization scheme that can accelerate the search for an optimal policy by…

Systems and Control · Computer Science 2017-04-04 Robert Mattila, Cristian R. Rojas, Vikram Krishnamurthy, Bo Wahlberg

Robust Combination of Local Controllers

Planning problems are hard; motion planning, for example, is PSPACE-hard. Such problems are even more difficult in the presence of uncertainty. Although Markov Decision Processes (MDPs) provide a formal framework for such problems, finding…

Artificial Intelligence · Computer Science 2013-01-14 Carlos E. Guestrin, Dirk Ormoneit

On Minimizing Total Discounted Cost in MDPs Subject to Reachability Constraints

We study the synthesis of a policy in a Markov decision process (MDP) following which an agent reaches a target state in the MDP while minimizing its total discounted cost. The problem combines a reachability criterion with a discounted…

Optimization and Control · Mathematics 2021-03-18 Yagiz Savas, Christos K. Verginis, Michael Hibbard, Ufuk Topcu