Related papers: Rollout Algorithms and Approximate Dynamic Program…

Stochastic Optimization with Parametric Cost Function Approximations

A widely used heuristic for solving stochastic optimization problems is to use a deterministic rolling horizon procedure, which has been modified to handle uncertainty (e.g. buffer stocks, schedule slack). This approach has been criticized…

Optimization and Control · Mathematics 2017-03-16 Raymond T. Perkins, Warren B. Powell

An Approximate Dynamic Programming Approach for Dual Stochastic Model Predictive Control

Dual control explicitly addresses the problem of trading off active exploration and exploitation in the optimal control of partially unknown systems. While the problem can be cast in the framework of stochastic dynamic programming, exact…

Systems and Control · Electrical Eng. & Systems 2019-11-12 Elena Arcari, Lukas Hewing, Melanie N. Zeilinger

Stochastic Optimal Control as Approximate Input Inference

Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization…

Machine Learning · Computer Science 2020-04-23 Joe Watson, Hany Abdulsamad, Jan Peters

Efficient Rollout Strategies for Bayesian Optimization

Bayesian optimization (BO) is a class of sample-efficient global optimization methods, where a probabilistic model conditioned on previous observations is used to determine future evaluations via the optimization of an acquisition function.…

Machine Learning · Computer Science 2020-06-22 Eric Hans Lee, David Eriksson, Bolong Cheng, Michael McCourt, David Bindel

Average-Case Performance of Rollout Algorithms for Knapsack Problems

Rollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each…

Data Structures and Algorithms · Computer Science 2013-11-27 Andrew Mastin, Patrick Jaillet

Efficient Dynamic Allocation Policy for Robust Ranking and Selection under Stochastic Control Framework

This research considers ranking and selection with input uncertainty. The objective is to maximize the posterior probability of correctly selecting the best alternative under a fixed simulation budget, where each alternative is measured…

Optimization and Control · Mathematics 2023-05-15 Hui Xiao, Zhihong Wei

Ranking and Selection as Stochastic Control

Under a Bayesian framework, we formulate the fully sequential sampling and selection decision in statistical ranking and selection as a stochastic control problem, and derive the associated Bellman equation. Using value function…

Machine Learning · Computer Science 2017-10-10 Yijie Peng, Edwin K. P. Chong, Chun-Hung Chen, Michael C. Fu

Inexact Sequential Quadratic Optimization for Minimizing a Stochastic Objective Function Subject to Deterministic Nonlinear Equality Constraints

An algorithm is proposed, analyzed, and tested experimentally for solving stochastic optimization problems in which the decision variables are constrained to satisfy equations defined by deterministic, smooth, and nonlinear functions. It is…

Optimization and Control · Mathematics 2021-07-09 Frank E. Curtis, Daniel P. Robinson, Baoyu Zhou

Stochastic optimization with decision-dependent distributions

Stochastic optimization problems often involve data distributions that change in reaction to the decision variables. This is the case for example when members of the population respond to a deployed classifier by manipulating their features…

Optimization and Control · Mathematics 2020-12-15 Dmitriy Drusvyatskiy, Lin Xiao

Approximation Algorithms for Optimization of Combinatorial Dynamical Systems

This paper considers an optimization problem for a dynamical system whose evolution depends on a collection of binary decision variables. We develop scalable approximation algorithms with provable suboptimality bounds to provide…

Optimization and Control · Mathematics 2016-10-31 Insoon Yang, Samuel A. Burden, Ram Rajagopal, S. Shankar Sastry, Claire J. Tomlin

Robust Dynamic Programming for Temporal Logic Control of Stochastic Systems

Discrete-time stochastic systems are an essential modelling tool for many engineering systems. We consider stochastic control systems that are evolving over continuous spaces. For this class of models, methods for the formal verification…

Systems and Control · Computer Science 2018-11-29 Sofie Haesaert, Sadegh Soudjani

Data-driven Rollout for Deterministic Optimal Control

We consider deterministic infinite horizon optimal control problems with nonnegative stage costs. We draw inspiration from the learning model predictive control scheme designed for continuous dynamics and iterative tasks, and propose a rollout…

Optimization and Control · Mathematics 2021-09-30 Yuchao Li, Karl H. Johansson, Jonas Mårtensson, Dimitri P. Bertsekas

Efficient Sampling Policy for Selecting a Good Enough Subset

The note studies the problem of selecting a good enough subset out of a finite number of alternatives under a fixed simulation budget. Our work aims to maximize the posterior probability of correctly selecting a good subset. We formulate…

Optimization and Control · Mathematics 2023-05-09 Gongbo Zhang, Bin Chen, Qing-shan Jia, Yijie Peng

Stochastic Constraint Programming

To model combinatorial decision problems involving uncertainty and probability, we introduce stochastic constraint programming. Stochastic constraint programs contain both decision variables (which we can set) and stochastic variables…

Artificial Intelligence · Computer Science 2009-03-09 Toby Walsh

Probabilistic design of optimal sequential decision-making algorithms in learning and control

This survey is focused on certain sequential decision-making problems that involve optimizing over probability functions. We discuss the relevance of these problems for learning and control. The survey is organized around a framework that…

Optimization and Control · Mathematics 2023-01-13 Emiland Garrabe, Giovanni Russo

Deep combinatorial optimisation for optimal stopping time problems : application to swing options pricing

A new method for stochastic control based on neural networks and using randomisation of discrete random variables is proposed and applied to optimal stopping time problems. The method models directly the policy and does not need the…

Computational Finance · Quantitative Finance 2021-01-11 Thomas Deschatre, Joseph Mikael

Optimization-Driven Adaptive Experimentation

Real-world experiments involve batched & delayed feedback, non-stationarity, multiple objectives & constraints, and (often some) personalization. Tailoring adaptive methods to address these challenges on a per-problem basis is infeasible,…

Machine Learning · Computer Science 2024-11-11 Ethan Che, Daniel R. Jiang, Hongseok Namkoong, Jimmy Wang

Adaptive Sequential Optimization with Applications to Machine Learning

A framework is introduced for solving a sequence of slowly changing optimization problems, including those arising in regression and classification applications, using optimization algorithms such as stochastic gradient descent (SGD). The…

Machine Learning · Computer Science 2015-09-25 Craig Wilson, Venugopal V. Veeravalli

Bayesian Adaptive Calibration and Optimal Design

The process of calibrating computer models of natural phenomena is essential for applications in the physical sciences, where plenty of domain knowledge can be embedded into simulations and then calibrated against real observations. Current…

Machine Learning · Computer Science 2025-01-20 Rafael Oliveira, Dino Sejdinovic, David Howard, Edwin V. Bonilla

Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis

This paper develops algorithms for high-dimensional stochastic control problems based on deep learning and dynamic programming. Unlike classical approximate dynamic programming approaches, we first approximate the optimal policy by means of…

Probability · Mathematics 2021-09-21 Côme Huré, Huyên Pham, Achref Bachouch, Nicolas Langrené