Related papers: Sequential decision problems, dependent types and …

On the correctness of monadic backward induction

In control theory, to solve a finite-horizon sequential decision problem (SDP) commonly means to find a list of decision rules that result in an optimal expected total reward (or cost) when taking a given number of decision steps. SDPs are…

Logic in Computer Science · Computer Science 2023-06-22 Nuria Brede , Nicola Botta

A Category-Theoretic Framework for Dependent Effect Systems

Graded monads refine traditional monads using effect annotations in order to describe quantitatively the computational effects that a program can generate. They have been successfully applied to a variety of formal systems for reasoning…

Logic in Computer Science · Computer Science 2026-01-22 Satoshi Kura , Marco Gaboardi , Taro Sekiyama , Hiroshi Unno

Value and Policy Iteration in Optimal Control and Adaptive Dynamic Programming

In this paper, we consider discrete-time infinite horizon problems of optimal control to a terminal set of states. These are the problems that are often taken as the starting point for adaptive dynamic programming. Under very general…

Systems and Control · Computer Science 2015-10-05 Dimitri P. Bertsekas

Convergence Analysis of Policy Iteration

Adaptive optimal control of nonlinear dynamic systems with deterministic and known dynamics under a known undiscounted infinite-horizon cost function is investigated. Policy iteration scheme initiated using a stabilizing initial control is…

Systems and Control · Computer Science 2015-05-21 Ali Heydari

Meta-Learning surrogate models for sequential decision making

We introduce a unified probabilistic framework for solving sequential decision making problems ranging from Bayesian optimisation to contextual bandits and reinforcement learning. This is accomplished by a probabilistic model-based approach…

Machine Learning · Statistics 2019-06-13 Alexandre Galashov , Jonathan Schwarz , Hyunjik Kim , Marta Garnelo , David Saxton , Pushmeet Kohli , S. M. Ali Eslami , Yee Whye Teh

Constrained Online Decision-Making: A Unified Framework

Contextual online decision-making problems with constraints appear in a wide range of real-world applications, such as adaptive experimental design under safety constraints, personalized recommendation with resource limits, and dynamic…

Machine Learning · Statistics 2025-05-23 Haichen Hu , David Simchi-Levi , Navid Azizan

A Practical Formalization of Monadic Equational Reasoning in Dependent-type Theory

One can perform equational reasoning about computational effects with a purely functional programming language thanks to monads. Even though equational reasoning for effectful programs is desirable, it is not yet mainstream. This is partly…

Logic in Computer Science · Computer Science 2025-01-15 Reynald Affeldt , Jacques Garrigue , Takafumi Saikawa

Beyond Bellman: High-Order Generator Regression for Continuous-Time Policy Evaluation

We study finite-horizon continuous-time policy evaluation from discrete closed-loop trajectories under time-inhomogeneous dynamics. The target value surface solves a backward parabolic equation, but the Bellman baseline obtained from…

Machine Learning · Statistics 2026-05-11 Yaowei Zheng , Richong Zhang , Shenxi Wu , Shirui Bian , Haosong Zhang , Li Zeng , Xingjian Ma , Yichi Zhang

Dependence Logics in Temporal Settings

Many forms of dependence manifest themselves over time, with behavior of variables in dynamical systems as a paradigmatic example. This paper studies temporal dependence in dynamical systems from a logical perspective, by enriching a…

Logic in Computer Science · Computer Science 2024-03-29 Alexandru Baltag , Johan van Benthem , Dazhu Li

Finitely Convergent Deterministic and Stochastic Iterative Methods for Solving Convex Feasibility Problems

We propose finitely convergent methods for solving convex feasibility problems defined over a possibly infinite pool of constraints. Following other works in this area, we assume that the interior of the solution set is nonempty and that…

Optimization and Control · Mathematics 2020-09-22 Victor I. Kolobov , Simeon Reich , Rafał Zalas

Deliberation Scheduling for Time-Critical Sequential Decision Making

We describe a method for time-critical decision making involving sequential tasks and stochastic processes. The method employs several iterative refinement routines for solving different aspects of the decision making problem. This paper…

Artificial Intelligence · Computer Science 2013-03-08 Thomas L. Dean , Leslie Pack Kaelbling , Jak Kirman , Ann Nicholson

On the Bayesian calibration of expensive computer models with input dependent parameters

Computer models, aiming at simulating a complex real system, are often calibrated in the light of data to improve performance. Standard calibration methods assume that the optimal values of calibration parameters are invariant to the model…

Methodology · Statistics 2017-09-01 Georgios Karagiannis , Bledar A. Konomi , Guang Lin

Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception

Recently developed pretrained models can encode rich world knowledge expressed in multiple modalities, such as text and images. However, the outputs of these models cannot be integrated into algorithms to solve sequential decision-making…

Artificial Intelligence · Computer Science 2024-06-19 Yunhao Yang , Cyrus Neary , Ufuk Topcu

Deterministic control of randomly-terminated processes

We consider both discrete and continuous "uncertain horizon" deterministic control processes, for which the termination time is a random variable. We examine the dynamic programming equations for the value function of such processes,…

Optimization and Control · Mathematics 2016-01-06 June Andrews , Alexander Vladimirsky

Parallel Model Predictive Control for Deterministic Systems

In this note, we consider infinite horizon optimal control problems with deterministic systems. Since exact solutions to these problems are often intractable, we propose a parallel model predictive control (MPC) method that provides an…

Optimization and Control · Mathematics 2025-04-29 Yuchao Li , Aren Karapetyan , Niklas Schmid , John Lygeros , Karl H. Johansson , Jonas Mårtensson

Correct-by-Construction Control for Stochastic and Uncertain Dynamical Models via Formal Abstractions

Automated synthesis of correct-by-construction controllers for autonomous systems is crucial for their deployment in safety-critical scenarios. Such autonomous systems are naturally modeled as stochastic dynamical models. The general…

Systems and Control · Electrical Eng. & Systems 2023-11-17 Thom Badings , Nils Jansen , Licio Romao , Alessandro Abate

A Machine Learning Algorithm for Finite-Horizon Stochastic Control Problems in Economics

We propose a machine learning algorithm for solving finite-horizon stochastic control problems based on a deep neural network representation of the optimal policy functions. The algorithm has three features: (1) It can solve…

General Economics · Economics 2024-12-09 Xianhua Peng , Steven Kou , Lekang Zhang

Optimal control of forward-backward mean-field stochastic delayed systems

We study methods for solving stochastic control problems of systems of forward-backward mean-field equations with delay, in finite or infinite horizon. Necessary and sufficient maximum principles under partial information are given. The…

Optimization and Control · Mathematics 2016-10-31 Nacira Agram , Elin Engen Rose

Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington , Paul N. Beuchat , John Lygeros

A Unified Approach for Solving Sequential Selection Problems

In this paper we develop a unified approach for solving a wide class of sequential selection problems. This class includes, but is not limited to, selection problems with no-information, rank-dependent rewards, and considers both fixed as…

Probability · Mathematics 2020-01-27 Alexander Goldenshluger , Yaakov Malinovsky , Assaf Zeevi