Related papers: Reinforcement Learning for Mixed-Integer Problems …

Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids

This work proposes an approach that integrates reinforcement learning and model predictive control (MPC) to solve finite-horizon optimal control problems in mixed-logical dynamical systems efficiently. Optimization-based control of such…

Systems and Control · Electrical Eng. & Systems 2025-04-15 Caio Fabio Oliveira da Silva , Azita Dabiri , Bart De Schutter

Practical Reinforcement Learning of Stabilizing Economic MPC

Reinforcement Learning (RL) has demonstrated a huge potential in learning optimal policies without any prior knowledge of the process to be controlled. Model Predictive Control (MPC) is a popular control technique which is able to deal with…

Systems and Control · Computer Science 2019-04-10 Mario Zanon , Sébastien Gros , Alberto Bemporad

Learning for MPC with Stability & Safety Guarantees

The combination of learning methods with Model Predictive Control (MPC) has attracted a significant amount of attention in the recent literature. The hope of this combination is to reduce the reliance of MPC schemes on accurate models, and…

Machine Learning · Computer Science 2022-07-25 Sébastien Gros , Mario Zanon

Information Theoretic Model Predictive Q-Learning

Model-free Reinforcement Learning (RL) works well when experience can be collected cheaply and model-based RL is effective when system dynamics can be modeled accurately. However, both assumptions can be violated in real world problems such…

Machine Learning · Computer Science 2020-05-07 Mohak Bhardwaj , Ankur Handa , Dieter Fox , Byron Boots

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Model-Predictive Control (MPC) is a powerful tool for controlling complex, real-world systems that uses a model to make predictions about future behavior. For each state encountered, MPC solves an online optimization problem to choose a…

Machine Learning · Computer Science 2021-04-15 Mohak Bhardwaj , Sanjiban Choudhury , Byron Boots

Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search

Model predictive control (MPC) is an effective method for controlling robotic systems, particularly autonomous aerial vehicles such as quadcopters. However, application of MPC can be computationally demanding, and typically requires…

Machine Learning · Computer Science 2016-02-17 Tianhao Zhang , Gregory Kahn , Sergey Levine , Pieter Abbeel

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms

In this work, we consider policy-based methods for solving the reinforcement learning problem, and establish the sample complexity guarantees. A policy-based algorithm typically consists of an actor and a critic. We consider using various…

Machine Learning · Computer Science 2023-01-16 Zaiwei Chen , Siva Theja Maguluri

Optimization of the Model Predictive Control Update Interval Using Reinforcement Learning

In control applications there is often a compromise that needs to be made with regards to the complexity and performance of the controller and the computational resources that are available. For instance, the typical hardware platform in…

Systems and Control · Electrical Eng. & Systems 2020-11-30 Eivind Bøhn , Sebastien Gros , Signe Moe , Tor Arne Johansen

Safe and Efficient Model Predictive Control Using Neural Networks: An Interior Point Approach

Model predictive control (MPC) provides a useful means for controlling systems with constraints, but suffers from the computational burden of repeatedly solving an optimization problem in real time. Offline (explicit) solutions for MPC…

Systems and Control · Electrical Eng. & Systems 2022-09-14 Daniel Tabas , Baosen Zhang

Bridging the gap between QP-based and MPC-based RL

Reinforcement learning methods typically use Deep Neural Networks to approximate the value functions and policies underlying a Markov Decision Process. Unfortunately, DNN-based RL suffers from a lack of explainability of the resulting…

Systems and Control · Electrical Eng. & Systems 2022-05-19 Shambhuraj Sawant , Sebastien Gros

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment,…

Machine Learning · Computer Science 2020-03-17 Ryan Lowe , Yi Wu , Aviv Tamar , Jean Harb , Pieter Abbeel , Igor Mordatch

Computationally efficient solution of mixed integer model predictive control problems via machine learning aided Benders Decomposition

Mixed integer Model Predictive Control (MPC) problems arise in the operation of systems where discrete and continuous decisions must be taken simultaneously to compensate for disturbances. The efficient solution of mixed integer MPC…

Optimization and Control · Mathematics 2024-04-09 Ilias Mitrai , Prodromos Daoutidis

Multi-agent Battery Storage Management using MPC-based Reinforcement Learning

In this paper, we present the use of Model Predictive Control (MPC) based on Reinforcement Learning (RL) to find the optimal policy for a multi-agent battery storage system. A time-varying prediction of the power price and production-demand…

Systems and Control · Electrical Eng. & Systems 2021-06-08 A. Bahari Kordabad , W. Cai , S. Gros

MPCritic: A plug-and-play MPC architecture for reinforcement learning

The reinforcement learning (RL) and model predictive control (MPC) communities have developed vast ecosystems of theoretical approaches and computational tools for solving optimal control problems. Given their conceptual similarities but…

Machine Learning · Computer Science 2025-09-04 Nathan P. Lawrence , Thomas Banker , Ali Mesbah

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

Actor-Critic based Improper Reinforcement Learning

We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform…

Machine Learning · Computer Science 2022-07-20 Mohammadi Zaki , Avinash Mohan , Aditya Gopalan , Shie Mannor

Differentiable MPC for End-to-end Planning and Control

We present foundations for using Model Predictive Control (MPC) as a differentiable policy class for reinforcement learning in continuous state and action spaces. This provides one way of leveraging and combining the advantages of…

Machine Learning · Computer Science 2019-10-15 Brandon Amos , Ivan Dario Jimenez Rodriguez , Jacob Sacks , Byron Boots , J. Zico Kolter

MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage

In this paper, we are interested in optimal control problems with purely economic costs, which often yield optimal policies having a (nearly) bang-bang structure. We focus on policy approximations based on Model Predictive Control (MPC) and…

Machine Learning · Computer Science 2021-04-07 Arash Bahari Kordabad , Wenqi Cai , Sebastien Gros

Towards Safe Reinforcement Learning Using NMPC and Policy Gradients: Part I - Stochastic case

We present a methodology to deploy the stochastic policy gradient method, using actor-critic techniques, when the optimal policy is approximated using a parametric optimization problem, allowing one to enforce safety via hard constraints.…

Systems and Control · Electrical Eng. & Systems 2024-09-23 Sebastien Gros , Mario Zanon