Related papers: Direct Policy Optimization using Deterministic Sam…

Deterministic Trajectory Optimization through Probabilistic Optimal Control

In this article, we discuss two algorithms tailored to discrete-time deterministic finite-horizon nonlinear optimal control problems or so-called deterministic trajectory optimization problems. Both algorithms can be derived from an…

Optimization and Control · Mathematics 2024-12-10 Mohammad Mahmoudi Filabadi , Tom Lefebvre , Guillaume Crevecoeur

Importance sampling-based approximate optimal planning and control

In this paper, we propose a sampling-based planning and optimal control method of nonlinear systems under non-differentiable constraints. Motivated by developing scalable planning algorithms, we consider the optimal motion plan to be a…

Systems and Control · Computer Science 2016-12-19 Jie Fu

A Decoupled Data Based Approach to Stochastic Optimal Control Problems

This paper studies the stochastic optimal control problem for systems with unknown dynamics. A novel decoupled data based control (D2C) approach is proposed, which solves the problem in a decoupled "open loop-closed loop" fashion that is…

Systems and Control · Computer Science 2018-09-11 Dan Yu , Mohammandhussen Rafieisakhaei , Suman Chakravorty

Computational methods for stochastic control with metric interval temporal logic specifications

This paper studies an optimal control problem for continuous-time stochastic systems subject to reachability objectives specified in a subclass of metric interval temporal logic specifications, a temporal logic with real-time constraints.…

Systems and Control · Computer Science 2015-04-21 Jie Fu , Ufuk Topcu

Policy Decomposition: Approximate Optimal Control with Suboptimality Estimates

Numerically computing global policies to optimal control problems for complex dynamical systems is mostly intractable. In consequence, a number of approximation methods have been developed. However, none of the current methods can quantify…

Robotics · Computer Science 2021-03-05 Ashwin Khadke , Hartmut Geyer

Constrained Sampling-based Trajectory Optimization using Stochastic Approximation

We propose a sampling-based trajectory optimization methodology for constrained problems. We extend recent works on stochastic search to deal with box control constraints,as well as nonlinear state constraints for discrete dynamical…

Optimization and Control · Mathematics 2019-11-13 George I. Boutselis , Ziyi Wang , Evangelos A. Theodorou

D2C 2.0: Decoupled Data-Based Approach for Learning to Control Stochastic Nonlinear Systems via Model-Free ILQR

In this paper, we propose a structured linear parameterization of a feedback policy to solve the model-free stochastic optimal control problem. This parametrization is corroborated by a decoupling principle that is shown to be near-optimal…

Optimization and Control · Mathematics 2020-02-19 Karthikeya S Parunandi , Aayushman Sharma , Suman Chakravorty , Dileep Kalathil

Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems

This paper addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system with continuous state space, continuous action space and unknown dynamics. This class of problems are typically addressed in…

Machine Learning · Computer Science 2019-04-18 Ran Wang , Karthikeya Parunandi , Dan Yu , Dileep Kalathil , Suman Chakravorty

Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms

This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the optimal control policy for this problem can be obtained as a function of a value…

Robotics · Computer Science 2014-05-30 Oktay Arslan , Evangelos Theodorou , Panagiotis Tsiotras

Near Optimal Hamiltonian-Control and Learning via Chattering

Many applications require solving non-linear control problems that are classically not well behaved. This paper develops a simple and efficient chattering algorithm that learns near optimal decision policies through an open-loop feedback…

Machine Learning · Computer Science 2017-03-21 Peeyush Kumar , Wolf Kohn , Zelda B. Zabinsky

Deep combinatorial optimisation for optimal stopping time problems : application to swing options pricing

A new method for stochastic control based on neural networks and using randomisation of discrete random variables is proposed and applied to optimal stopping time problems. The method models directly the policy and does not need the…

Computational Finance · Quantitative Finance 2021-01-11 Thomas Deschatre , Joseph Mikael

The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

A common pipeline in learning-based control is to iteratively estimate a model of system dynamics, and apply a trajectory optimization algorithm - e.g.~$\mathtt{iLQR}$ - on the learned model to minimize a target cost. This paper conducts a…

Machine Learning · Computer Science 2023-05-17 Daniel Pfrommer , Max Simchowitz , Tyler Westenbroek , Nikolai Matni , Stephen Tu

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning,…

Robotics · Computer Science 2012-02-27 Vu Anh Huynh , Sertac Karaman , Emilio Frazzoli

Entropy Regularised Deterministic Optimal Control: From Path Integral Solution to Sample-Based Trajectory Optimisation

Sample-based trajectory optimisers are a promising tool for the control of robotics with non-differentiable dynamics and cost functions. Contemporary approaches derive from a restricted subclass of stochastic optimal control where the…

Robotics · Computer Science 2021-10-07 Tom Lefebvre , Guillaume Crevecoeur

Data-Driven Control of Unknown Systems: A Linear Programming Approach

We consider the problem of discounted optimal state-feedback regulation for general unknown deterministic discrete-time systems. It is well known that open-loop instability of systems, non-quadratic cost functions and complex nonlinear…

Systems and Control · Electrical Eng. & Systems 2020-03-31 Alexandros Tanzanakis , John Lygeros

Direct Data-Driven Linear Quadratic Tracking via Policy Optimization

Direct data-driven optimal control provides an elegant end-to-end paradigm, yet its real-time applicability is often hindered by the growing dimensionality of online decision variables. Recent breakthroughs, notably Data-EnablEd Policy…

Systems and Control · Electrical Eng. & Systems 2026-05-18 Shubo Kang , Keyou You

A new scalable algorithm for computational optimal control under uncertainty

We address the design and synthesis of optimal control strategies for high-dimensional stochastic dynamical systems. Such systems may be deterministic nonlinear systems evolving from random initial states, or systems driven by random…

Numerical Analysis · Mathematics 2020-08-26 Panos Lambrianides , Qi Gong , Daniele Venturi

A policy iteration algorithm for non-Markovian control problems

In this paper, we propose a new policy iteration algorithm to compute the value function and the optimal controls of continuous time stochastic control problems. The algorithm relies on successive approximations using linear-quadratic…

Optimization and Control · Mathematics 2024-09-09 Dylan Possamaï , Ludovic Tangpi

Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

Robotic systems must be able to quickly and robustly make decisions when operating in uncertain and dynamic environments. While Reinforcement Learning (RL) can be used to compute optimal policies with little prior knowledge about the…

Robotics · Computer Science 2016-09-13 Yunpeng Pan , Xinyan Yan , Evangelos Theodorou , Byron Boots

Implicit Trajectory Planning for Feedback Linearizable Systems: A Time-varying Optimization Approach

We develop an optimization-based framework for joint real-time trajectory planning and feedback control of feedback-linearizable systems. To achieve this goal, we define a target trajectory as the optimal solution of a time-varying…

Systems and Control · Electrical Eng. & Systems 2020-03-17 Tianqi Zheng , John Simpson-Porco , Enrique Mallada