English
Related papers

Related papers: Sequential decision problems, dependent types and …

200 papers

In control theory, to solve a finite-horizon sequential decision problem (SDP) commonly means to find a list of decision rules that result in an optimal expected total reward (or cost) when taking a given number of decision steps. SDPs are…

Logic in Computer Science · Computer Science 2023-06-22 Nuria Brede , Nicola Botta

Graded monads refine traditional monads using effect annotations in order to describe quantitatively the computational effects that a program can generate. They have been successfully applied to a variety of formal systems for reasoning…

Logic in Computer Science · Computer Science 2026-01-22 Satoshi Kura , Marco Gaboardi , Taro Sekiyama , Hiroshi Unno

In this paper, we consider discrete-time infinite horizon problems of optimal control to a terminal set of states. These are the problems that are often taken as the starting point for adaptive dynamic programming. Under very general…

Systems and Control · Computer Science 2015-10-05 Dimitri P. Bertsekas

Adaptive optimal control of nonlinear dynamic systems with deterministic and known dynamics under a known undiscounted infinite-horizon cost function is investigated. Policy iteration scheme initiated using a stabilizing initial control is…

Systems and Control · Computer Science 2015-05-21 Ali Heydari

We introduce a unified probabilistic framework for solving sequential decision making problems ranging from Bayesian optimisation to contextual bandits and reinforcement learning. This is accomplished by a probabilistic model-based approach…

Contextual online decision-making problems with constraints appear in a wide range of real-world applications, such as adaptive experimental design under safety constraints, personalized recommendation with resource limits, and dynamic…

Machine Learning · Statistics 2025-05-23 Haichen Hu , David Simchi-Levi , Navid Azizan

One can perform equational reasoning about computational effects with a purely functional programming language thanks to monads. Even though equational reasoning for effectful programs is desirable, it is not yet mainstream. This is partly…

Logic in Computer Science · Computer Science 2025-01-15 Reynald Affeldt , Jacques Garrigue , Takafumi Saikawa

We study finite-horizon continuous-time policy evaluation from discrete closed-loop trajectories under time-inhomogeneous dynamics. The target value surface solves a backward parabolic equation, but the Bellman baseline obtained from…

Machine Learning · Statistics 2026-05-11 Yaowei Zheng , Richong Zhang , Shenxi Wu , Shirui Bian , Haosong Zhang , Li Zeng , Xingjian Ma , Yichi Zhang

Many forms of dependence manifest themselves over time, with behavior of variables in dynamical systems as a paradigmatic example. This paper studies temporal dependence in dynamical systems from a logical perspective, by enriching a…

Logic in Computer Science · Computer Science 2024-03-29 Alexandru Baltag , Johan van Benthem , Dazhu Li

We propose finitely convergent methods for solving convex feasibility problems defined over a possibly infinite pool of constraints. Following other works in this area, we assume that the interior of the solution set is nonempty and that…

Optimization and Control · Mathematics 2020-09-22 Victor I. Kolobov , Simeon Reich , Rafał Zalas

We describe a method for time-critical decision making involving sequential tasks and stochastic processes. The method employs several iterative refinement routines for solving different aspects of the decision making problem. This paper…

Artificial Intelligence · Computer Science 2013-03-08 Thomas L. Dean , Leslie Pack Kaelbling , Jak Kirman , Ann Nicholson

Computer models, aiming at simulating a complex real system, are often calibrated in the light of data to improve performance. Standard calibration methods assume that the optimal values of calibration parameters are invariant to the model…

Methodology · Statistics 2017-09-01 Georgios Karagiannis , Bledar A. Konomi , Guang Lin

Recently developed pretrained models can encode rich world knowledge expressed in multiple modalities, such as text and images. However, the outputs of these models cannot be integrated into algorithms to solve sequential decision-making…

Artificial Intelligence · Computer Science 2024-06-19 Yunhao Yang , Cyrus Neary , Ufuk Topcu

We consider both discrete and continuous "uncertain horizon" deterministic control processes, for which the termination time is a random variable. We examine the dynamic programming equations for the value function of such processes,…

Optimization and Control · Mathematics 2016-01-06 June Andrews , Alexander Vladimirsky

In this note, we consider infinite horizon optimal control problems with deterministic systems. Since exact solutions to these problems are often intractable, we propose a parallel model predictive control (MPC) method that provides an…

Optimization and Control · Mathematics 2025-04-29 Yuchao Li , Aren Karapetyan , Niklas Schmid , John Lygeros , Karl H. Johansson , Jonas Mårtensson

Automated synthesis of correct-by-construction controllers for autonomous systems is crucial for their deployment in safety-critical scenarios. Such autonomous systems are naturally modeled as stochastic dynamical models. The general…

Systems and Control · Electrical Eng. & Systems 2023-11-17 Thom Badings , Nils Jansen , Licio Romao , Alessandro Abate

We propose a machine learning algorithm for solving finite-horizon stochastic control problems based on a deep neural network representation of the optimal policy functions. The algorithm has three features: (1) It can solve…

General Economics · Economics 2024-12-09 Xianhua Peng , Steven Kou , Lekang Zhang

We study methods for solving stochastic control problems of systems of forward-backward mean-field equations with delay, in finite or infinite horizon. Necessary and sufficient maximum principles under partial information are given. The…

Optimization and Control · Mathematics 2016-10-31 Nacira Agram , Elin Engen Rose

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington , Paul N. Beuchat , John Lygeros

In this paper we develop a unified approach for solving a wide class of sequential selection problems. This class includes, but is not limited to, selection problems with no-information, rank-dependent rewards, and considers both fixed as…

Probability · Mathematics 2020-01-27 Alexander Goldenshluger , Yaakov Malinovsky , Assaf Zeevi
‹ Prev 1 2 3 10 Next ›