Related papers: Accelerating Model-Free Policy Optimization Using …

Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach

Model-free learning-based control methods have seen great success recently. However, such methods typically suffer from poor sample complexity and limited convergence guarantees. This is in sharp contrast to classical model-based control,…

Optimization and Control · Mathematics 2020-06-16 Guannan Qu , Chenkai Yu , Steven Low , Adam Wierman

Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators

Nonlinear control systems with partial information to the decision maker are prevalent in a variety of applications. As a step toward studying such nonlinear systems, this work explores reinforcement learning methods for finding the optimal…

Machine Learning · Computer Science 2025-04-11 Yinbin Han , Meisam Razaviyayn , Renyuan Xu

A Smoothing Stochastic Gradient Method for Composite Optimization

We consider the unconstrained optimization problem whose objective function is composed of a smooth and a non-smooth conponents where the smooth component is the expectation a random function. This type of problem arises in some interesting…

Optimization and Control · Mathematics 2011-07-01 Qihang Lin , Xi Chen , Javier Pena

Gradient-Free Distributed Optimization with Exact Convergence

In this paper, a gradient-free distributed algorithm is introduced to solve a set constrained optimization problem under a directed communication network. Specifically, at each time-step, the agents locally compute a so-called…

Optimization and Control · Mathematics 2021-09-06 Yipeng Pang , Guoqiang Hu

Development of a method for solving structural optimization problems

In practice, optimization tasks have some structure that allows developing new algorithms for every problem with faster convergence rates. Using the structure of optimization tasks, we can propose algorithms with more optimistic convergence…

Optimization and Control · Mathematics 2020-09-01 Alexander Tyurin

Internal Model-Based Online Optimization

In this paper we propose a model-based approach to the design of online optimization algorithms, with the goal of improving the tracking of the solution trajectory (trajectories) w.r.t. state-of-the-art methods. We focus first on quadratic…

Optimization and Control · Mathematics 2023-07-24 Nicola Bastianello , Ruggero Carli , Sandro Zampieri

Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Many engineering problems have multiple objectives, and the overall aim is to optimize a non-linear function of these objectives. In this paper, we formulate the problem of maximizing a non-linear concave function of multiple long-term…

Machine Learning · Computer Science 2025-09-23 Qinbo Bai , Mridul Agarwal , Vaneet Aggarwal

Decomposition and Adaptive Sampling for Data-Driven Inverse Linear Optimization

This work addresses inverse linear optimization where the goal is to infer the unknown cost vector of a linear program. Specifically, we consider the data-driven setting in which the available data are noisy observations of optimal…

Optimization and Control · Mathematics 2021-12-07 Rishabh Gupta , Qi Zhang

Gradient Sliding for Composite Optimization

We consider in this paper a class of composite optimization problems whose objective function is given by the summation of a general smooth and nonsmooth component, together with a relatively simple nonsmooth term. We present a new class of…

Optimization and Control · Mathematics 2015-10-27 Guanghui Lan

Model-Free Imitation Learning with Policy Optimization

In imitation learning, an agent learns how to behave in an environment with an unknown cost function by mimicking expert demonstrations. Existing imitation learning algorithms typically involve solving a sequence of planning or…

Machine Learning · Computer Science 2016-06-17 Jonathan Ho , Jayesh K. Gupta , Stefano Ermon

A template for gradient norm minimization

The gradient mapping norm is a strong and easily verifiable stopping criterion for first-order methods on composite problems. When the objective exhibits the quadratic growth property, the gradient mapping norm minimization problem can be…

Optimization and Control · Mathematics 2024-10-31 Mihai I. Florea

An Inexact Preconditioned Zeroth-order Proximal Method for Composite Optimization

In this paper, we consider the composite optimization problem, where the objective function integrates a continuously differentiable loss function with a nonsmooth regularization term. Moreover, only the function values for the…

Optimization and Control · Mathematics 2024-01-09 Shanglin Liu , Lei Wang , Nachuan Xiao , Xin Liu

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret

We consider the task of learning to control a linear dynamical system under fixed quadratic costs, known as the Linear Quadratic Regulator (LQR) problem. While model-free approaches are often favorable in practice, thus far only model-based…

Machine Learning · Computer Science 2021-02-26 Asaf Cassel , Tomer Koren

On improving generalization in a class of learning problems with the method of small parameters for weakly-controlled optimal gradient systems

In this paper, we provide a mathematical framework for improving generalization in a class of learning problems which is related to point estimations for modeling of high-dimensional nonlinear functions. In particular, we consider a…

Optimization and Control · Mathematics 2024-12-13 Getachew K. Befekadu

Practicable Simulation-Free Model Order Reduction by Nonlinear Moment Matching

In this paper, a practicable simulation-free model order reduction method by nonlinear moment matching is developed. Based on the steady-state interpretation of linear moment matching, we comprehensively explain the extension of this…

Systems and Control · Electrical Eng. & Systems 2024-12-20 Maria Cruz Varona , Raphael Gebhart , Julian Suk , Boris Lohmann

Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator

Direct policy gradient methods for reinforcement learning and continuous control problems are a popular approach for a variety of reasons: 1) they are easy to implement without explicit knowledge of the underlying model 2) they are an…

Machine Learning · Computer Science 2019-03-26 Maryam Fazel , Rong Ge , Sham M. Kakade , Mehran Mesbahi

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

We study derivative-free methods for policy optimization over the class of linear policies. We focus on characterizing the convergence rate of these methods when applied to linear-quadratic systems, and study various settings of driving…

Machine Learning · Computer Science 2020-05-19 Dhruv Malik , Ashwin Pananjady , Kush Bhatia , Koulik Khamaru , Peter L. Bartlett , Martin J. Wainwright

Gradient Approximation and Multi-Variable Derivative-Free Optimization based on Non-Commutative Maps

In this work, multi-variable derivative-free optimization algorithms for unconstrained optimization problems are developed. A novel procedure for approximating the gradient of multi-variable objective functions based on non-commutative maps…

Optimization and Control · Mathematics 2021-11-17 Jan Feiling , Mohamed-Ali Belabbas , Christian Ebenbauer

Gradient-Based Mixed Planning with Symbolic and Numeric Action Parameters

Dealing with planning problems with both logical relations and numeric changes in real-world dynamic environments is challenging. Existing numeric planning systems for the problem often discretize numeric variables or impose convex…

Artificial Intelligence · Computer Science 2022-10-11 Kebing Jin , Hankz Hankui Zhuo , Zhanhao Xiao , Hai Wan , Subbarao Kambhampati

On the Convergence and Complexity of Proximal Gradient and Accelerated Proximal Gradient Methods under Adaptive Gradient Estimation

In this paper, we propose a proximal gradient method and an accelerated proximal gradient method for solving composite optimization problems, where the objective function is the sum of a smooth and a convex, possibly nonsmooth, function. We…

Optimization and Control · Mathematics 2025-07-22 Raghu Bollapragada , Shagun Gupta