Related papers: Optimal Immunization Policy Using Dynamic Programm…

Dynamic programming with incomplete information to overcome navigational uncertainty in a nautical environment

Using a novel toy nautical navigation environment, we show that dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is known. By incorporating uncertainty into our…

Optimization and Control · Mathematics 2022-07-20 Chris Beeler , Xinkai Li , Colin Bellinger , Mark Crowley , Maia Fraser , Isaac Tamblyn

Optimal Decision Making Under Strategic Behavior

We are witnessing an increasing use of data-driven predictive models to inform decisions. As decisions have implications for individuals and society, there is increasing pressure on decision makers to be transparent about their decision…

Machine Learning · Computer Science 2024-02-26 Stratis Tsirtsis , Behzad Tabibian , Moein Khajehnejad , Adish Singla , Bernhard Schölkopf , Manuel Gomez-Rodriguez

Efficient Dynamic Allocation Policy for Robust Ranking and Selection under Stochastic Control Framework

This research considers the ranking and selection with input uncertainty. The objective is to maximize the posterior probability of correctly selecting the best alternative under a fixed simulation budget, where each alternative is measured…

Optimization and Control · Mathematics 2023-05-15 Hui Xiao , Zhihong Wei

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker

Risk-Averse Planning Under Uncertainty

We consider the problem of designing policies for partially observable Markov decision processes (POMDPs) with dynamic coherent risk objectives. Synthesizing risk-averse optimal policies for POMDPs requires infinite memory and thus…

Robotics · Computer Science 2019-09-30 Mohamadreza Ahmadi , Masahiro Ono , Michel D. Ingham , Richard M. Murray , Aaron D. Ames

Dynamic Programming: From Local Optimality to Global Optimality

In the theory of dynamic programming, an optimal policy is a policy whose lifetime value dominates that of all other policies from every possible initial condition in the state space. This raises a natural question: when does optimality…

Optimization and Control · Mathematics 2025-05-13 John Stachurski , Jingni Yang , Ziyue Yang

An Anytime Algorithm for Decision Making under Uncertainty

We present an anytime algorithm which computes policies for decision problems represented as multi-stage influence diagrams. Our algorithm constructs policies incrementally, starting from a policy which makes no use of the available…

Artificial Intelligence · Computer Science 2013-02-01 Michael C. Horsch , David L. Poole

A Dynamic Programming Algorithm for Finding an Optimal Sequence of Informative Measurements

An informative measurement is the most efficient way to gain information about an unknown state. We present a first-principles derivation of a general-purpose dynamic programming algorithm that returns an optimal sequence of informative…

Machine Learning · Computer Science 2023-02-01 Peter N. Loxley , Ka-Wai Cheung

A Survey of Contextual Optimization Methods for Decision Making under Uncertainty

Recently there has been a surge of interest in operations research (OR) and the machine learning (ML) community in combining prediction algorithms and optimization techniques to solve decision-making problems in the face of uncertainty.…

Optimization and Control · Mathematics 2025-11-11 Utsav Sadana , Abhilash Chenreddy , Erick Delage , Alexandre Forel , Emma Frejinger , Thibaut Vidal

Certifiably Robust Policies for Uncertain Parametric Environments

We present a data-driven approach for producing policies that are provably robust across unknown stochastic environments. Existing approaches can learn models of a single environment as an interval Markov decision processes (IMDP) and…

Machine Learning · Computer Science 2025-03-25 Yannik Schnitzer , Alessandro Abate , David Parker

Optimal discharge of patients from intensive care via a data-driven policy learning framework

Clinical decision support tools rooted in machine learning and optimization can provide significant value to healthcare providers, including through better management of intensive care units. In particular, it is important that the patient…

Machine Learning · Computer Science 2021-12-20 Fernando Lejarza , Jacob Calvert , Misty M Attwood , Daniel Evans , Qingqing Mao

Planning Multiple Epidemic Interventions with Reinforcement Learning

Combating an epidemic entails finding a plan that describes when and how to apply different interventions, such as mask-wearing mandates, vaccinations, school or workplace closures. An optimal plan will curb an epidemic with minimal loss of…

Machine Learning · Computer Science 2023-06-08 Anh Mai , Nikunj Gupta , Azza Abouzied , Dennis Shasha

Planning with Partially Observable Markov Decision Processes: Advances in Exact Solution Method

There is much interest in using partially observable Markov decision processes (POMDPs) as a formal model for planning in stochastic domains. This paper is concerned with finding optimal policies for POMDPs. We propose several improvements…

Artificial Intelligence · Computer Science 2013-02-01 Nevin Lianwen Zhang , Stephen S. Lee

Uncertainty quantification and multi-stage variable selection for personalized treatment regimes

A dynamic treatment regime is a sequence of medical decisions that adapts to the evolving clinical status of a patient over time. To facilitate personalized care, it is crucial to assess the probability of each available treatment option…

Methodology · Statistics 2024-11-05 Jiefeng Bi , Matteo Borrotti , Bernardo Nipoti

Dynamic Vaccine Prioritization via Non-Markovian Final-state Optimization

Effective vaccine prioritization is critical for epidemic control, yet real outbreaks exhibit memory effects that inflate state space and make long-term prediction and optimization challenging. As a result, many strategies are tuned to…

Biological Physics · Physics 2025-11-11 Mi Feng , Liang Tian , Changsong Zhou

Personalized Dynamic Treatment Regimes in Continuous Time: A Bayesian Approach for Optimizing Clinical Decisions with Timing

Accurate models of clinical actions and their impacts on disease progression are critical for estimating personalized optimal dynamic treatment regimes (DTRs) in medical/health research, especially in managing chronic conditions.…

Methodology · Statistics 2021-02-19 William Hua , Hongyuan Mei , Sarah Zohar , Magali Giral , Yanxun Xu

Weathering Ongoing Uncertainty: Learning and Planning in a Time-Varying Partially Observable Environment

Optimal decision-making presents a significant challenge for autonomous systems operating in uncertain, stochastic and time-varying environments. Environmental variability over time can significantly impact the system's optimal decision…

Robotics · Computer Science 2024-03-11 Gokul Puthumanaillam , Xiangyu Liu , Negar Mehr , Melkior Ornik

AI-Driven Optimization under Uncertainty for Mineral Processing Operations

The global capacity for mineral processing must expand rapidly to meet the demand for critical minerals, which are essential for building the clean energy technologies necessary to mitigate climate change. However, the efficiency of mineral…

Systems and Control · Electrical Eng. & Systems 2026-05-15 William Xu , Amir Eskanlou , Mansur Arief , David Zhen Yin , Jef K. Caers

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this…

Systems and Control · Computer Science 2018-10-02 Bo Wu , Mohamadreza Ahmadi , Suda Bharadwaj , Ufuk Topcu

Dynamic Vaccination Game in a Heterogeneous Mixing Population

Opposition to vaccination has long been a non-negligible public health phenomenon resulted from people's varied perceptions toward vaccination (e.g., vaccine-phobia). This paper investigates the voluntary vaccination behavior of a…

Physics and Society · Physics 2019-09-04 Liqun Lu , Yanfeng Ouyang