Related papers: Stabilizing Value Iteration with and without Appro…

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Adaptive optimal control using value iteration initiated from a stabilizing control policy is theoretically analyzed in terms of stability of the system during the learning stage without ignoring the effects of approximation errors. This…

Optimization and Control · Mathematics 2017-10-25 Ali Heydari

When to stop value iteration: stability and near-optimality versus computation

Value iteration (VI) is a ubiquitous algorithm for optimal control, planning, and reinforcement learning schemes. Under the right assumptions, VI is a vital tool to generate inputs with desirable properties for the controlled system, like…

Optimization and Control · Mathematics 2020-11-23 Mathieu Granzotto , Romain Postoyan , Dragan Nešić , Lucian Buşoniu , Jamal Daafouz

A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness and Stability

Unlike traditional model-based reinforcement learning approaches that estimate system parameters from data, non-model-based data-driven control learns the optimal policy directly from input-state data without any intermediate model…

Optimization and Control · Mathematics 2026-05-05 Leilei Cui , Zhong-Ping Jiang , Petter N. Kolm , Grégoire G. Macqueron

Convergence Analysis of Policy Iteration

Adaptive optimal control of nonlinear dynamic systems with deterministic and known dynamics under a known undiscounted infinite-horizon cost function is investigated. Policy iteration scheme initiated using a stabilizing initial control is…

Systems and Control · Computer Science 2015-05-21 Ali Heydari

Theoretical and Numerical Analysis of Approximate Dynamic Programming with Approximation Errors

This study is aimed at answering the famous question of how the approximation errors at each iteration of Approximate Dynamic Programming (ADP) affect the quality of the final results considering the fact that errors at each iteration…

Systems and Control · Computer Science 2015-05-18 Ali Heydari

Time-varying optimal control under measurement errors

Solving optimal control problems to determine a stabilizing controller involves a significant computational effort. Time-varying optimal control provides a remedy by designing a tracking system, given as an ordinary differential equation,…

Systems and Control · Electrical Eng. & Systems 2026-04-16 Patrick Schmidt , Stefan Streif

A posteriori error estimators for stabilized finite element approximations of an optimal control problem

We derive a posteriori error estimators for an optimal control problem governed by a convection-reaction-diffusion equation; control constraints are also considered. We consider a family of low-order stabilized finite element methods to…

Numerical Analysis · Mathematics 2017-04-24 Alejandro Allendes , Enrique Otarola , Richard Rankin

Practical sample-and-hold stabilization of nonlinear systems under approximate optimizers

It is a known fact that not all controllable systems can be asymptotically stabilized by a continuous static feedback. Several approaches have been developed throughout the last decades, including time-varying, dynamical and even…

Optimization and Control · Mathematics 2018-06-25 Pavel Osinenko , Lukas Beckenbach , Stefan Streif

On the Convergence of Approximate and Regularized Policy Iteration Schemes

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks. The regularized formulation modifies the standard RL…

Machine Learning · Statistics 2019-10-15 Elena Smirnova , Elvis Dohmatob

Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator

This paper revisits and extends the convergence and robustness properties of value and policy iteration algorithms for discrete-time linear quadratic regulator problems. In the model-based case, we extend current results concerning the…

Systems and Control · Electrical Eng. & Systems 2025-04-11 Bowen Song , Chenxuan Wu , Andrea Iannelli

Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm

Simple stochastic games can be solved by value iteration (VI), which yields a sequence of under-approximations of the value of the game. This sequence is guaranteed to converge to the value only in the limit. Since no stopping criterion is…

Logic in Computer Science · Computer Science 2021-02-02 Edon Kelmendi , Julia Krämer , Jan Kretinsky , Maximilian Weininger

Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement…

Systems and Control · Electrical Eng. & Systems 2025-03-20 Dongdong Li , Jiuxiang Dong

A stabilizing iteration scheme for model predictive control based on relaxed barrier functions

We propose and analyze a stabilizing iteration scheme for the algorithmic implementation of model predictive control for linear discrete-time systems. Polytopic input and state constraints are considered and handled by means of so-called…

Optimization and Control · Mathematics 2016-04-07 Christian Feller , Christian Ebenbauer

Constrained Optimal Tracking Control of Unknown Systems: A Multi-Step Linear Programming Approach

We study the problem of optimal state-feedback tracking control for unknown discrete-time deterministic systems with input constraints. To handle input constraints, state-of-art methods utilize a certain nonquadratic stage cost function,…

Systems and Control · Electrical Eng. & Systems 2020-12-09 Alexandros Tanzanakis , John Lygeros

Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees

Given a discounted cost, we study deterministic discrete-time systems whose inputs are generated by policy iteration (PI). We provide novel near-optimality and stability properties, while allowing for non stabilizing initial policies. That…

Optimization and Control · Mathematics 2024-03-29 Jonathan de Brusse , Mathieu Granzotto , Romain Postoyan , Dragan Nešić

Value Iteration in Continuous Actions, States and Time

Classical value iteration approaches are not applicable to environments with continuous states and actions. For such environments, the states and actions are usually discretized, which leads to an exponential increase in computational…

Machine Learning · Computer Science 2021-05-12 Michael Lutter , Shie Mannor , Jan Peters , Dieter Fox , Animesh Garg

Output-Feedback Stabilizing Policy Iteration for Convergence Assurance of Unknown Discrete-Time Systems with Unmeasurable States

This note proposes a data-driven output-feedback stabilizing policy iteration for unknown linear discrete-time systems with unmeasurable states. Existing policy iteration methods for optimal control must start from a stabilizing control…

Systems and Control · Electrical Eng. & Systems 2025-12-01 Dongdong Li , Jiuxiang Dong

Cutting Your Losses: Learning Fault-Tolerant Control and Optimal Stopping under Adverse Risk

Recently, there has been a surge in interest in safe and robust techniques within reinforcement learning (RL). Current notions of risk in RL fail to capture the potential for systemic failures such as abrupt stoppages from system failures…

Systems and Control · Computer Science 2019-10-09 David Mguni

Approximate infinite-horizon predictive control

Predictive control is frequently used for control problems involving constraints. Being an optimization based technique utilizing a user specified so-called stage cost, performance properties, i.e., bounds on the infinite horizon…

Systems and Control · Electrical Eng. & Systems 2022-09-09 Lukas Beckenbach , Stefan Streif

Value-Gradient Iteration with Quadratic Approximate Value Functions

We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the…

Optimization and Control · Mathematics 2023-11-10 Alan Yang , Stephen Boyd