English
Related papers

Related papers: Stabilizing Value Iteration with and without Appro…

200 papers

Adaptive optimal control using value iteration initiated from a stabilizing control policy is theoretically analyzed in terms of stability of the system during the learning stage without ignoring the effects of approximation errors. This…

Optimization and Control · Mathematics 2017-10-25 Ali Heydari

Value iteration (VI) is a ubiquitous algorithm for optimal control, planning, and reinforcement learning schemes. Under the right assumptions, VI is a vital tool to generate inputs with desirable properties for the controlled system, like…

Optimization and Control · Mathematics 2020-11-23 Mathieu Granzotto , Romain Postoyan , Dragan Nešić , Lucian Buşoniu , Jamal Daafouz

Unlike traditional model-based reinforcement learning approaches that estimate system parameters from data, non-model-based data-driven control learns the optimal policy directly from input-state data without any intermediate model…

Optimization and Control · Mathematics 2026-05-05 Leilei Cui , Zhong-Ping Jiang , Petter N. Kolm , Grégoire G. Macqueron

Adaptive optimal control of nonlinear dynamic systems with deterministic and known dynamics under a known undiscounted infinite-horizon cost function is investigated. Policy iteration scheme initiated using a stabilizing initial control is…

Systems and Control · Computer Science 2015-05-21 Ali Heydari

This study is aimed at answering the famous question of how the approximation errors at each iteration of Approximate Dynamic Programming (ADP) affect the quality of the final results considering the fact that errors at each iteration…

Systems and Control · Computer Science 2015-05-18 Ali Heydari

Solving optimal control problems to determine a stabilizing controller involves a significant computational effort. Time-varying optimal control provides a remedy by designing a tracking system, given as an ordinary differential equation,…

Systems and Control · Electrical Eng. & Systems 2026-04-16 Patrick Schmidt , Stefan Streif

We derive a posteriori error estimators for an optimal control problem governed by a convection-reaction-diffusion equation; control constraints are also considered. We consider a family of low-order stabilized finite element methods to…

Numerical Analysis · Mathematics 2017-04-24 Alejandro Allendes , Enrique Otarola , Richard Rankin

It is a known fact that not all controllable systems can be asymptotically stabilized by a continuous static feedback. Several approaches have been developed throughout the last decades, including time-varying, dynamical and even…

Optimization and Control · Mathematics 2018-06-25 Pavel Osinenko , Lukas Beckenbach , Stefan Streif

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks. The regularized formulation modifies the standard RL…

Machine Learning · Statistics 2019-10-15 Elena Smirnova , Elvis Dohmatob

This paper revisits and extends the convergence and robustness properties of value and policy iteration algorithms for discrete-time linear quadratic regulator problems. In the model-based case, we extend current results concerning the…

Systems and Control · Electrical Eng. & Systems 2025-04-11 Bowen Song , Chenxuan Wu , Andrea Iannelli

Simple stochastic games can be solved by value iteration (VI), which yields a sequence of under-approximations of the value of the game. This sequence is guaranteed to converge to the value only in the limit. Since no stopping criterion is…

Logic in Computer Science · Computer Science 2021-02-02 Edon Kelmendi , Julia Krämer , Jan Kretinsky , Maximilian Weininger

Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement…

Systems and Control · Electrical Eng. & Systems 2025-03-20 Dongdong Li , Jiuxiang Dong

We propose and analyze a stabilizing iteration scheme for the algorithmic implementation of model predictive control for linear discrete-time systems. Polytopic input and state constraints are considered and handled by means of so-called…

Optimization and Control · Mathematics 2016-04-07 Christian Feller , Christian Ebenbauer

We study the problem of optimal state-feedback tracking control for unknown discrete-time deterministic systems with input constraints. To handle input constraints, state-of-art methods utilize a certain nonquadratic stage cost function,…

Systems and Control · Electrical Eng. & Systems 2020-12-09 Alexandros Tanzanakis , John Lygeros

Given a discounted cost, we study deterministic discrete-time systems whose inputs are generated by policy iteration (PI). We provide novel near-optimality and stability properties, while allowing for non stabilizing initial policies. That…

Optimization and Control · Mathematics 2024-03-29 Jonathan de Brusse , Mathieu Granzotto , Romain Postoyan , Dragan Nešić

Classical value iteration approaches are not applicable to environments with continuous states and actions. For such environments, the states and actions are usually discretized, which leads to an exponential increase in computational…

Machine Learning · Computer Science 2021-05-12 Michael Lutter , Shie Mannor , Jan Peters , Dieter Fox , Animesh Garg

This note proposes a data-driven output-feedback stabilizing policy iteration for unknown linear discrete-time systems with unmeasurable states. Existing policy iteration methods for optimal control must start from a stabilizing control…

Systems and Control · Electrical Eng. & Systems 2025-12-01 Dongdong Li , Jiuxiang Dong

Recently, there has been a surge in interest in safe and robust techniques within reinforcement learning (RL). Current notions of risk in RL fail to capture the potential for systemic failures such as abrupt stoppages from system failures…

Systems and Control · Computer Science 2019-10-09 David Mguni

Predictive control is frequently used for control problems involving constraints. Being an optimization based technique utilizing a user specified so-called stage cost, performance properties, i.e., bounds on the infinite horizon…

Systems and Control · Electrical Eng. & Systems 2022-09-09 Lukas Beckenbach , Stefan Streif

We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions as a substitute for the…

Optimization and Control · Mathematics 2023-11-10 Alan Yang , Stephen Boyd
‹ Prev 1 2 3 10 Next ›