Related papers: Supervised Learning for Stochastic Optimal Control

Learning Optimal Control via Forward and Backward Stochastic Differential Equations

In this paper we present a novel sampling-based numerical scheme designed to solve a certain class of stochastic optimal control problems, utilizing forward and backward stochastic differential equations (FBSDEs). By means of a nonlinear…

Systems and Control · Computer Science 2020-06-18 Ioannis Exarchos , Evangelos A. Theodorou

Solving stochastic optimal control problem via stochastic maximum principle with deep learning method

In this paper, we aim to solve the high dimensional stochastic optimal control problem from the view of the stochastic maximum principle via deep learning. By introducing the extended Hamiltonian system which is essentially an FBSDE with a…

Optimization and Control · Mathematics 2021-06-23 Shaolin Ji , Shige Peng , Ying Peng , Xichuan Zhang

Deep Graphic FBSDEs for Opinion Dynamics Stochastic Control

In this paper, we present a scalable deep learning approach to solve opinion dynamics stochastic optimal control problems with mean field term coupling in the dynamics and cost function. Our approach relies on the probabilistic…

Multiagent Systems · Computer Science 2022-04-19 Tianrong Chen , Ziyi Wang , Evangelos A. Theodorou

Deep Learning for Continuous-Time Stochastic Control with Jumps

In this paper, we introduce a model-based deep-learning approach to solve finite-horizon continuous-time stochastic control problems with jumps. We iteratively train two neural networks: one to represent the optimal policy and the other to…

Machine Learning · Computer Science 2026-01-16 Patrick Cheridito , Jean-Loup Dupret , Donatien Hainaut

Particle-based algorithm for stochastic optimal control

The solution to a stochastic optimal control problem can be determined by computing the value function from a discretization of the associated Hamilton-Jacobi-Bellman equation. Alternatively, the problem can be reformulated in terms of a…

Optimization and Control · Mathematics 2024-02-29 Sebastian Reich

Solving a class of stochastic optimal control problems by physics-informed neural networks

The aim of this work is to develop a deep learning method for solving high-dimensional stochastic control problems based on the Hamilton--Jacobi--Bellman (HJB) equation and physics-informed learning. Our approach is to parameterize the…

Optimization and Control · Mathematics 2025-06-23 Zhe Jiao , Wantao Jia , Weiqiu Zhu

A Neural Network Approach for Stochastic Optimal Control

We present a neural network approach for approximating the value function of high-dimensional stochastic control problems. Our training process simultaneously updates our value function estimate and identifies the part of the state space…

Optimization and Control · Mathematics 2024-05-08 Xingjian Li , Deepanshu Verma , Lars Ruthotto

Constrained BSDEs representation of the value function in optimal control of pure jump Markov processes

We consider a classical finite horizon optimal control problem for continuous-time pure jump Markov processes described by means of a rate transition measure depending on a control parameter and controlled by a feedback law. For this class…

Probability · Mathematics 2015-01-20 Elena Bandini , Marco Fuhrman

Deep 2FBSDEs For Systems With Control Multiplicative Noise

We present a deep recurrent neural network architecture to solve a class of stochastic optimal control problems described by fully nonlinear Hamilton Jacobi Bellmanpartial differential equations. Such PDEs arise when one considers…

Machine Learning · Computer Science 2019-12-24 Marcus A Pereira , Ziyi Wang , Tianrong Chen , Emily Reed , Evangelos A Theodorou

Learning Stochastic Parametric Differentiable Predictive Control Policies

The problem of synthesizing stochastic explicit model predictive control policies is known to be quickly intractable even for systems of modest complexity when using classical control-theoretic methods. To address this challenge, we present…

Machine Learning · Computer Science 2022-05-24 Ján Drgoňa , Sayak Mukherjee , Aaron Tuor , Mahantesh Halappanavar , Draguna Vrabie

Value Function Estimators for Feynman-Kac Forward-Backward SDEs in Stochastic Optimal Control

Two novel numerical estimators are proposed for solving forward-backward stochastic differential equations (FBSDEs) appearing in the Feynman-Kac representation of the value function in stochastic optimal control problems. In contrast to the…

Optimization and Control · Mathematics 2021-10-01 Kelsey P. Hawkins , Ali Pakniyat , Panagiotis Tsiotras

Machine Learning and Hamilton-Jacobi-Bellman Equation for Optimal Decumulation: a Comparison Study

We propose a novel data-driven neural network (NN) optimization framework for solving an optimal stochastic control problem under stochastic constraints. Customized activation functions for the output layers of the NN are applied, which…

Optimization and Control · Mathematics 2023-06-21 Marc Chen , Mohammad Shirazi , Peter A. Forsyth , Yuying Li

Deep Forward-Backward SDEs for Min-max Control

This paper presents a novel approach to numerically solve stochastic differential games for nonlinear systems. The proposed approach relies on the nonlinear Feynman-Kac theorem that establishes a connection between parabolic deterministic…

Optimization and Control · Mathematics 2019-06-13 Ziyi Wang , Keuntaek Lee , Marcus A. Pereira , Ioannis Exarchos , Evangelos A. Theodorou

Is RL fine-tuning harder than regression? A PDE learning approach for diffusion models

We study the problem of learning the optimal control policy for fine-tuning a given diffusion process, using general value function approximation. We develop a new class of algorithms by solving a variational inequality problem based on the…

Machine Learning · Computer Science 2025-09-03 Wenlong Mou

Neural Policy Iteration for Stochastic Optimal Control: A Physics-Informed Approach

We propose a physics-informed neural network policy iteration (PINN-PI) framework for solving stochastic optimal control problems governed by second-order Hamilton--Jacobi--Bellman (HJB) equations. At each iteration, a neural network is…

Machine Learning · Computer Science 2025-08-05 Yeongjong Kim , Yeoneung Kim , Minseok Kim , Namkyeong Cho

Backward stochastic differential equations and optimal control of marked point processes

We study a class of backward stochastic differential equations (BSDEs) driven by a random measure or, equivalently, by a marked point process. Under appropriate assumptions we prove well-posedness and continuous dependence of the solution…

Probability · Mathematics 2012-05-24 Fulvia Confortola , Marco Fuhrman

BSDE Representation and Randomized Dynamic Programming Principle for Stochastic Control Problems of Infinite-Dimensional Jump-Diffusions

We consider a general class of stochastic optimal control problems, where the state process lives in a real separable Hilbert space and is driven by a cylindrical Brownian motion and a Poisson random measure; no special structure is imposed…

Probability · Mathematics 2018-10-04 Elena Bandini , Fulvia Confortola , Andrea Cosso

Stochastic Optimization for Machine Learning

It has been found that stochastic algorithms often find good solutions much more rapidly than inherently-batch approaches. Indeed, a very useful rule of thumb is that often, when solving a machine learning problem, an iterative technique…

Machine Learning · Computer Science 2013-08-19 Andrew Cotter

Randomized dynamic programming principle and Feynman-Kac representation for optimal control of McKean-Vlasov dynamics

We analyze a stochastic optimal control problem, where the state process follows a McKean-Vlasov dynamics and the diffusion coefficient can be degenerate. We prove that its value function V admits a nonlinear Feynman-Kac representation in…

Probability · Mathematics 2016-11-15 Erhan Bayraktar , Andrea Cosso , Huyên Pham

Learning-Based Stable Optimal Control for Infinite-Time Nonlinear Regulation Problems

Infinite-time nonlinear optimal regulation control is widely utilized in aerospace engineering as a systematic method for synthesizing stable controllers. However, conventional methods often rely on linearization hypothesis, while recent…

Systems and Control · Electrical Eng. & Systems 2025-06-13 Han Wang , Di Wu , Lin Cheng , Shengping Gong , Xu Huang