Related papers: Deep Value Model Predictive Control

DMPC: A Data-and Model-Driven Approach to Predictive Control

This work presents DMPC (Data-and Model-Driven Predictive Control) to solve control problems in which some of the constraints or parts of the objective function are known, while others are entirely unknown to the controller. It is assumed…

Systems and Control · Electrical Eng. & Systems 2021-03-02 Hassan Jafarzadeh , Cody Fleming

Cooperative nonlinear distributed model predictive control with dissimilar control horizons

In this paper, we introduce a nonlinear distributed model predictive control (DMPC) algorithm, which allows for dissimilar and time-varying control horizons among agents, thereby addressing a common limitation in current DMPC schemes. We…

Systems and Control · Electrical Eng. & Systems 2024-10-15 Paula Chanfreut , José M. Maestre , Quanyan Zhu , W. P. M. H. Heemels

Actor-Critic Model Predictive Control: Differentiable Optimization meets Reinforcement Learning for Agile Flight

A key open challenge in agile quadrotor flight is how to combine the flexibility and task-level generality of model-free reinforcement learning (RL) with the structure and online replanning capabilities of model predictive control (MPC),…

Robotics · Computer Science 2026-01-21 Angel Romero , Elie Aljalbout , Yunlong Song , Davide Scaramuzza

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Model-Predictive Control (MPC) is a powerful tool for controlling complex, real-world systems that uses a model to make predictions about future behavior. For each state encountered, MPC solves an online optimization problem to choose a…

Machine Learning · Computer Science 2021-04-15 Mohak Bhardwaj , Sanjiban Choudhury , Byron Boots

A Study On Distributed Model Predictive Consensus

We investigate convergence properties of a proposed distributed model predictive control (DMPC) scheme, where agents negotiate to compute an optimal consensus point using an incremental subgradient method based on primal decomposition as…

Multiagent Systems · Computer Science 2008-03-03 Tamas Keviczky , Karl Henrik Johansson

An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm

This scientific paper propose a novel portfolio optimization model using an improved deep reinforcement learning algorithm. The objective function of the optimization model is the weighted sum of the expectation and value at risk(VaR) of…

Machine Learning · Computer Science 2022-08-30 Boyi Jin

Model-Augmented Actor-Critic: Backpropagating through Paths

Current model-based reinforcement learning approaches use the model simply as a learned black-box simulator to augment the data for policy optimization or value function learning. In this paper, we show how to make more effective use of the…

Machine Learning · Computer Science 2020-05-19 Ignasi Clavera , Violet Fu , Pieter Abbeel

Trajectory Optimization for Nonlinear Multi-Agent Systems using Decentralized Learning Model Predictive Control

We present a decentralized minimum-time trajectory optimization scheme based on learning model predictive control for multi-agent systems with nonlinear decoupled dynamics and coupled state constraints. By performing the same task…

Systems and Control · Electrical Eng. & Systems 2020-12-21 Edward L. Zhu , Yvonne R. Stürz , Ugo Rosolia , Francesco Borrelli

Bootstrapped Model Predictive Control

Model Predictive Control (MPC) has been demonstrated to be effective in continuous control tasks. When a world model and a value function are available, planning a sequence of actions ahead of time leads to a better policy. Existing methods…

Machine Learning · Computer Science 2025-04-07 Yuhang Wang , Hanwei Guo , Sizhe Wang , Long Qian , Xuguang Lan

Distributional Advantage Actor-Critic

In traditional reinforcement learning, an agent maximizes the reward collected during its interaction with the environment by approximating the optimal policy through the estimation of value functions. Typically, given a state s and action…

Machine Learning · Computer Science 2018-06-20 Shangda Li , Selina Bing , Steven Yang

Goal-Conditioned Terminal Value Estimation for Real-time and Multi-task Model Predictive Control

While MPC enables nonlinear feedback control by solving an optimal control problem at each timestep, the computational burden tends to be significantly large, making it difficult to optimize a policy within the control period. To address…

Robotics · Computer Science 2024-10-10 Mitsuki Morita , Satoshi Yamamori , Satoshi Yagi , Norikazu Sugimoto , Jun Morimoto

Learning High-Level Policies for Model Predictive Control

The combination of policy search and deep neural networks holds the promise of automating a variety of decision-making tasks. Model Predictive Control (MPC) provides robust solutions to robot control tasks by making use of a dynamical model…

Robotics · Computer Science 2021-05-11 Yunlong Song , Davide Scaramuzza

Learning to Solve Parametric Mixed-Integer Optimal Control Problems via Differentiable Predictive Control

We propose a novel approach to solving input- and state-constrained parametric mixed-integer optimal control problems using Differentiable Predictive Control (DPC). Our approach follows the differentiable programming paradigm by learning an…

Systems and Control · Electrical Eng. & Systems 2025-06-25 Ján Boldocký , Shahriar Dadras Javan , Martin Gulan , Martin Mönnigmann , Ján Drgoňa

Sensitivity-Based Distributed Model Predictive Control for Nonlinear Systems under Inexact Optimization

This paper presents a distributed model predictive control (DMPC) scheme for nonlinear continuous-time systems. The underlying distributed optimal control problem is cooperatively solved in parallel via a sensitivity-based algorithm. The…

Optimization and Control · Mathematics 2024-06-06 Maximilian Pierer von Esch , Andreas Völz , Knut Graichen

The Value of Planning for Infinite-Horizon Model Predictive Control

Model Predictive Control (MPC) is a classic tool for optimal control of complex, real-world systems. Although it has been successfully applied to a wide range of challenging tasks in robotics, it is fundamentally limited by the prediction…

Robotics · Computer Science 2021-04-08 Nathan Hatch , Byron Boots

Distributed Model Predicted Control of Multi-agent Systems with Applications to Multi-vehicle Cooperation

This paper proposes a distributed model predicted control (DMPC) approach for consensus control of multi-agent systems (MASs) with linear agent dynamics and bounded control input constraints. Within the proposed DMPC framework, each agent…

Systems and Control · Electrical Eng. & Systems 2020-09-16 Yougang Bian , Changkun Du , Manjiang Hu , Haikuo Liu

Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework

In this paper, we propose actor-director-critic, a new framework for deep reinforcement learning. Compared with the actor-critic framework, the director role is added, and action classification and action evaluation are applied…

Machine Learning · Computer Science 2023-01-11 Zongwei Liu , Yonghong Song , Yuanlin Zhang

Value Improved Actor Critic Algorithms

To learn approximately optimal acting policies for decision problems, modern Actor Critic algorithms rely on deep Neural Networks (DNNs) to parameterize the acting policy and greedification operators to iteratively improve it. The reliance…

Machine Learning · Computer Science 2026-01-19 Yaniv Oren , Moritz A. Zanger , Pascal R. van der Vaart , Mustafa Mert Celikok , Matthijs T. J. Spaan , Wendelin Bohmer

Active exploration in adaptive model predictive control

A dual adaptive model predictive control (MPC) algorithm is presented for linear, time-invariant systems subject to bounded disturbances and parametric uncertainty in the state-space matrices. Online set-membership identification is…

Systems and Control · Electrical Eng. & Systems 2021-02-23 Anilkumar Parsi , Andrea Iannelli , Roy S. Smith

A Semismooth Predictor Corrector Method for Suboptimal Model Predictive Control

Suboptimal model predictive control is a technique that can reduce the computational cost of model predictive control (MPC) by exploiting its robustness to incomplete optimization. Instead of solving the optimal control problem exactly,…

Systems and Control · Computer Science 2019-05-08 Dominic Liao-McPherson , Marco Nicotra , Ilya Kolmanovsky