Related papers: Dynamical System Optimization

Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces

Direct optimization is an appealing framework that replaces integration with optimization of a random objective for approximating gradients in models with discrete random variables. A$^\star$ sampling is a framework for optimizing such…

Machine Learning · Computer Science 2020-10-26 Guy Lorberbom , Chris J. Maddison , Nicolas Heess , Tamir Hazan , Daniel Tarlow

Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients

Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement…

Systems and Control · Electrical Eng. & Systems 2025-03-20 Dongdong Li , Jiuxiang Dong

Policy Gradient Algorithms Implicitly Optimize by Continuation

Direct policy optimization in reinforcement learning is usually solved with policy-gradient algorithms, which optimize policy parameters via stochastic gradient ascent. This paper provides a new theoretical interpretation and justification…

Machine Learning · Computer Science 2023-10-24 Adrien Bolland , Gilles Louppe , Damien Ernst

Dynamic Programming: From Local Optimality to Global Optimality

In the theory of dynamic programming, an optimal policy is a policy whose lifetime value dominates that of all other policies from every possible initial condition in the state space. This raises a natural question: when does optimality…

Optimization and Control · Mathematics 2025-05-13 John Stachurski , Jingni Yang , Ziyue Yang

A new scalable algorithm for computational optimal control under uncertainty

We address the design and synthesis of optimal control strategies for high-dimensional stochastic dynamical systems. Such systems may be deterministic nonlinear systems evolving from random initial states, or systems driven by random…

Numerical Analysis · Mathematics 2020-08-26 Panos Lambrianides , Qi Gong , Daniele Venturi

Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

We focus on developing efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation.…

Machine Learning · Computer Science 2023-11-07 Tyler Westenbroek , Jacob Levy , David Fridovich-Keil

Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems

Reinforcement learning is a promising approach to learning robotics controllers. It has recently been shown that algorithms based on finite-difference estimates of the policy gradient are competitive with algorithms based on the policy…

Machine Learning · Computer Science 2021-10-12 Osbert Bastani

Quasi-Newton Iteration in Deterministic Policy Gradient

This paper presents a model-free approximation for the Hessian of the performance of deterministic policies to use in the context of Reinforcement Learning based on Quasi-Newton steps in the policy parameters. We show that the approximate…

Machine Learning · Computer Science 2022-03-29 Arash Bahari Kordabad , Hossein Nejatbakhsh Esfahani , Wenqi Cai , Sebastien Gros

Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution

Evolutionary algorithms, such as Differential Evolution, excel in solving real-parameter optimization challenges. However, the effectiveness of a single algorithm varies across different problem instances, necessitating considerable efforts…

Neural and Evolutionary Computing · Computer Science 2024-03-08 Hongshu Guo , Yining Ma , Zeyuan Ma , Jiacheng Chen , Xinglin Zhang , Zhiguang Cao , Jun Zhang , Yue-Jiao Gong

Automated Dynamic Algorithm Configuration

The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually…

Artificial Intelligence · Computer Science 2022-05-30 Steven Adriaensen , André Biedenkapp , Gresa Shala , Noor Awad , Theresa Eimer , Marius Lindauer , Frank Hutter

Structure, Analysis, and Synthesis of First-Order Algorithms

Optimization algorithms can be interpreted through the lens of dynamical systems as the interconnection of linear systems and a set of subgradient nonlinearities. This dynamical systems formulation allows for the analysis and synthesis of…

Optimization and Control · Mathematics 2026-03-27 Jared Miller , Carsten Scherer , Fabian Jakob , Andrea Iannelli

Action-State Dependent Dynamic Model Selection

A model among many may only be best under certain states of the world. Switching from a model to another can also be costly. Finding a procedure to dynamically choose a model in these circumstances requires to solve a complex estimation…

Machine Learning · Computer Science 2023-10-10 Francesco Cordoni , Alessio Sancetta

Trajectory-Based Off-Policy Deep Reinforcement Learning

Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks. However, these methods are also data-inefficient, afflicted with high variance gradient estimates, and frequently…

Machine Learning · Computer Science 2019-05-15 Andreas Doerr , Michael Volpp , Marc Toussaint , Sebastian Trimpe , Christian Daniel

Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective

Attempts from different disciplines to provide a fundamental understanding of deep learning have advanced rapidly in recent years, yet a unified framework remains relatively limited. In this article, we provide one possible way to align…

Machine Learning · Computer Science 2019-10-01 Guan-Horng Liu , Evangelos A. Theodorou

Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics

Guided policy search algorithms have been proven to work with incredible accuracy for not only controlling a complicated dynamical system, but also learning optimal policies from various unseen instances. One assumes true nature of the…

Systems and Control · Electrical Eng. & Systems 2020-10-02 Prakash Mallick , Zhiyong Chen , Mohsen Zamani

Scalable Model-based Policy Optimization for Decentralized Networked Systems

Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks. Such a challenge is more outstanding in multi-agent tasks, as each step of operation is more costly…

Machine Learning · Computer Science 2022-09-05 Yali Du , Chengdong Ma , Yuchen Liu , Runji Lin , Hao Dong , Jun Wang , Yaodong Yang

Dynamic Optimization and Learning for Renewal Systems

We consider the problem of optimizing time averages in systems with independent and identically distributed behavior over renewal frames. This includes scheduling and task processing to maximize utility in stochastic networks with variable…

Optimization and Control · Mathematics 2010-11-30 Michael J. Neely

Deep Reinforcement-Learning-based Driving Policy for Autonomous Road Vehicles

In this work the problem of path planning for an autonomous vehicle that moves on a freeway is considered. The most common approaches that are used to address this problem are based on optimal control methods, which make assumptions about…

Robotics · Computer Science 2020-02-19 Konstantinos Makantasis , Maria Kontorinaki , Ioannis Nikolos

On Hyper-parameter Tuning for Stochastic Optimization Algorithms

This paper proposes the first-ever algorithmic framework for tuning hyper-parameters of stochastic optimization algorithm based on reinforcement learning. Hyper-parameters impose significant influences on the performance of stochastic…

Machine Learning · Computer Science 2020-03-11 Haotian Zhang , Jianyong Sun , Zongben Xu

Learning Beyond Optimization: Stress-Gated Dynamical Regime Regulation in Autonomous Systems

Despite their apparent diversity, modern machine learning methods can be reduced to a remarkably simple core principle: learning is achieved by continuously optimizing parameters to minimize or maximize a scalar objective function. This…

Machine Learning · Computer Science 2026-02-24 Sheng Ran