Related papers: Reinforcement Learning via Parametric Cost Functio…

The Parametric Cost Function Approximation: A new approach for multistage stochastic programming

The most common approaches for solving multistage stochastic programming problems in the research literature have been to either use value functions ("dynamic programming") or scenario trees ("stochastic programming") to approximate the…

Optimization and Control · Mathematics 2022-01-04 Warren B Powell, Saeed Ghadimi

Stochastic Optimization with Parametric Cost Function Approximations

A widely used heuristic for solving stochastic optimization problems is to use a deterministic rolling horizon procedure, which has been modified to handle uncertainty (e.g. buffer stocks, schedule slack). This approach has been criticized…

Optimization and Control · Mathematics 2017-03-16 Raymond T. Perkins, Warren B. Powell

Stochastic Search for a Parametric Cost Function Approximation: Energy storage with rolling forecasts

Rolling forecasts have been almost overlooked in the renewable energy storage literature. In this paper, we provide a new approach for handling uncertainty not just in the accuracy of a forecast, but in the evolution of forecasts over time.…

Optimization and Control · Mathematics 2022-04-18 Saeed Ghadimi, Warren B. Powell

On the Impact of Deep Learning-based Time-series Forecasts on Multistage Stochastic Programming Policies

Multistage stochastic programming provides a modeling framework for sequential decision-making problems that involve uncertainty. One typically overlooked aspect of this methodology is how uncertainty is incorporated into modeling.…

Optimization and Control · Mathematics 2021-09-24 Juyoung Wang, Mucahit Cevik, Merve Bodur

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Policy gradient (PG) methods are successful approaches to deal with continuous reinforcement learning (RL) problems. They learn stochastic parametric (hyper)policies by either exploring in the space of actions or in the space of parameters.…

Machine Learning · Computer Science 2024-05-31 Alessandro Montenegro, Marco Mussi, Alberto Maria Metelli, Matteo Papini

Scenario trees and policy selection for multistage stochastic programming using machine learning

We propose a hybrid algorithmic strategy for complex stochastic optimization problems, which combines the use of scenario trees from multistage stochastic programming with machine learning techniques for learning a policy in the form of a…

Optimization and Control · Mathematics 2019-10-25 Boris Defourny, Damien Ernst, Louis Wehenkel

Forward-Backward Quantization of Scenario Processes in Multi-Stage Stochastic Optimization

Multi-stage stochastic optimization lies at the core of decision-making under uncertainty. As the analytical solution is available only in exceptional cases, dynamic optimization aims to efficiently find approximations but often neglects…

Optimization and Control · Mathematics 2025-08-26 Anna Timonina-Farkas

Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation

We provide performance guarantees for a variant of simulation-based policy iteration for controlling Markov decision processes that involves the use of stochastic approximation algorithms along with state-of-the-art techniques that are…

Machine Learning · Computer Science 2022-10-17 Anna Winnicki, R. Srikant

The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation

Function approximation is widely used in reinforcement learning to handle the computational difficulties associated with very large state spaces. However, function approximation introduces errors which may lead to instabilities when using…

Machine Learning · Computer Science 2022-12-15 Anna Winnicki, Joseph Lubars, Michael Livesay, R. Srikant

A Reinforcement Learning Approach to the Stochastic Cutting Stock Problem

We propose a formulation of the stochastic cutting stock problem as a discounted infinite-horizon Markov decision process. At each decision epoch, given current inventory of items, an agent chooses in which patterns to cut objects in stock…

Optimization and Control · Mathematics 2022-06-29 Anselmo R. Pitombeira-Neto, Arthur H. Fonseca Murta

Machine Learning for Stochastic Parametrisation

Atmospheric models used for weather and climate prediction are traditionally formulated in a deterministic manner. In other words, given a particular state of the resolved scale variables, the most likely forcing from the sub-grid scale…

Machine Learning · Computer Science 2024-02-16 Hannah M. Christensen, Salah Kouhen, Greta Miller, Raghul Parthipan

Multi-timescale Stochastic Programming with Applications in Power Systems

This paper introduces a multi-timescale stochastic programming framework designed to address decision-making challenges in power systems, particularly those with high renewable energy penetration. The framework models interactions across…

Optimization and Control · Mathematics 2025-08-13 Yihang Zhang, Suvrajeet Sen

Learning From Scenarios for Stochastic Repairable Scheduling

When optimizing problems with uncertain parameter values in a linear objective, decision-focused learning enables end-to-end learning of these values. We are interested in a stochastic scheduling problem, in which processing times are…

Machine Learning · Computer Science 2024-08-16 Kim van den Houten, David M. J. Tax, Esteban Freydell, Mathijs de Weerdt

Differentiability and Regularization of Parametric Convex Value Functions in Stochastic Multistage Optimization

In multistage decision problems, it is often the case that an initial strategic decision (such as investment) is followed by many operational ones (operating the investment). Such initial strategic decision can be seen as a parameter…

Optimization and Control · Mathematics 2026-03-17 Adrien Le Franc, Pierre Carpentier, Jean-Philippe Chancelier, Michel de Lara

Stochastic Constraint Programming: A Scenario-Based Approach

To model combinatorial decision problems involving uncertainty and probability, we introduce scenario based stochastic constraint programming. Stochastic constraint programs contain both decision variables, which we can set, and stochastic…

Artificial Intelligence · Computer Science 2009-03-09 S. Armagan Tarim, Suresh Manandhar, Toby Walsh

Relating Reinforcement Learning to Dynamic Programming-Based Planning

This paper bridges some of the gap between optimal planning and reinforcement learning (RL), both of which share roots in dynamic programming applied to sequential decision making or optimal control. Whereas planning typically favors…

Robotics · Computer Science 2026-03-10 Filip V. Georgiev, Kalle G. Timperi, Başak Sakçak, Steven M. LaValle

Adaptive Two-stage Stochastic Programming with an Analysis on Capacity Expansion Planning Problem

Multi-stage stochastic programming is a well-established framework for sequential decision making under uncertainty by seeking policies that are fully adapted to the uncertainty. Often such flexible policies are not desirable, and the…

Optimization and Control · Mathematics 2024-08-06 Beste Basciftci, Shabbir Ahmed, Nagi Gebraeel

Neural Network Approaches for Parameterized Optimal Control

We consider numerical approaches for deterministic, finite-dimensional optimal control problems whose dynamics depend on unknown or uncertain parameters. We seek to amortize the solution over a set of relevant parameters in an offline stage…

Optimization and Control · Mathematics 2024-02-16 Deepanshu Verma, Nick Winovich, Lars Ruthotto, Bart van Bloemen Waanders

Stochastic Optimization Forests

We study contextual stochastic optimization problems, where we leverage rich auxiliary observations (e.g., product characteristics) to improve decision making with uncertain variables (e.g., demand). We show how to train forest decision…

Optimization and Control · Mathematics 2022-03-17 Nathan Kallus, Xiaojie Mao

A Reinforcement Learning based Path Planning Approach in 3D Environment

Optimal motion planning involves obstacles avoidance where path planning is the key to success in optimal motion planning. Due to the computational demands, most of the path planning algorithms can not be employed for real-time based…

Robotics · Computer Science 2022-02-15 Geesara Kulathunga