Related papers: Regret Analysis: a control perspective

Adaptive Regret for Control of Time-Varying Dynamics

We consider the problem of online control of systems with time-varying linear dynamics. This is a general formulation that is motivated by the use of local linearization in control of nonlinear dynamical systems. To state meaningful…

Machine Learning · Computer Science 2022-02-15 Paula Gradu , Elad Hazan , Edgar Minasyan

A Local Regret in Nonconvex Online Learning

We consider an online learning process to forecast a sequence of outcomes for nonconvex models. A typical measure to evaluate online learning algorithms is regret but such standard definition of regret is intractable for nonconvex models…

Machine Learning · Computer Science 2018-11-30 Sergul Aydore , Lee Dicker , Dean Foster

Adaptive Gradient Online Control

In this work we consider the online control of a known linear dynamic system with adversarial disturbance and adversarial controller cost. The goal in online control is to minimize the regret, defined as the difference between cumulative…

Optimization and Control · Mathematics 2021-10-15 Deepan Muthirayan , Jianjun Yuan , Pramod P. Khargonekar

Minimizing Dynamic Regret and Adaptive Regret Simultaneously

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, thus being suboptimal for changing environments. To address this…

Machine Learning · Computer Science 2020-02-07 Lijun Zhang , Shiyin Lu , Tianbao Yang

Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis

This paper studies the online optimal control problem with time-varying convex stage costs for a time-invariant linear dynamical system, where a finite lookahead window of accurate predictions of the stage costs are available at each time.…

Optimization and Control · Mathematics 2019-10-23 Yingying Li , Xin Chen , Na Li

Provable Regret Bounds for Deep Online Learning and Control

The theory of deep learning focuses almost exclusively on supervised learning, non-convex optimization using stochastic gradient descent, and overparametrized neural networks. It is common belief that the optimizer dynamics, network…

Machine Learning · Computer Science 2022-02-18 Xinyi Chen , Edgar Minasyan , Jason D. Lee , Elad Hazan

Online Optimisation for Online Learning and Control -- From No-Regret to Generalised Error Convergence

This paper presents early work aiming at the development of a new framework for the design and analysis of algorithms for online learning based prediction and control. Firstly, we consider the task of predicting values of a function or time…

Optimization and Control · Mathematics 2019-03-26 Jan-P. Calliess

Revisiting Regret Benchmarks in Online Non-Stochastic Control

In the online non-stochastic control problem, an agent sequentially selects control inputs for a linear dynamical system when facing unknown and adversarially selected convex costs and disturbances. A common metric for evaluating control…

Optimization and Control · Mathematics 2025-04-24 Vijeth Hebbar , Cédric Langbort

Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

We investigate online convex optimization in non-stationary environments and choose dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2024-04-09 Peng Zhao , Yu-Jie Zhang , Lijun Zhang , Zhi-Hua Zhou

On the Computational Efficiency of Adaptive and Dynamic Regret Minimization

In online convex optimization, the player aims to minimize regret, or the difference between her loss and that of the best fixed decision in hindsight over the entire repeated game. Algorithms that minimize (standard) regret may converge to…

Machine Learning · Computer Science 2023-02-14 Zhou Lu , Elad Hazan

Rate-Optimal Online Convex Optimization in Adaptive Linear Control

We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains…

Machine Learning · Computer Science 2022-06-06 Asaf Cassel , Alon Cohen , Tomer Koren

An Online Learning Analysis of Minimax Adaptive Control

We present an online learning analysis of minimax adaptive control for the case where the uncertainty includes a finite set of linear dynamical systems. Precisely, for each system inside the uncertainty set, we define the model-based regret…

Systems and Control · Electrical Eng. & Systems 2023-09-12 Venkatraman Renganathan , Andrea Iannelli , Anders Rantzer

A Modern Introduction to Online Learning

In this book, I introduce the basic concepts of Online Learning through the modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under worst-case assumptions. I present first-order…

Machine Learning · Computer Science 2026-04-28 Francesco Orabona

Logarithmic Regret for Online Control

We study optimal regret bounds for control in linear dynamical systems under adversarially changing strongly convex cost functions, given the knowledge of transition dynamics. This includes several well studied and fundamental frameworks…

Machine Learning · Computer Science 2019-09-12 Naman Agarwal , Elad Hazan , Karan Singh

Regret Analysis of Online Gradient Descent-based Iterative Learning Control with Model Mismatch

In Iterative Learning Control (ILC), a sequence of feedforward control actions is generated at each iteration on the basis of partial model knowledge and past measurements with the goal of steering the system toward a desired reference…

Systems and Control · Electrical Eng. & Systems 2022-04-12 Efe C. Balta , Andrea Iannelli , Roy S. Smith , John Lygeros

Online Learning for Predictive Control with Provable Regret Guarantees

We study the problem of online learning in predictive control of an unknown linear dynamical system with time varying cost functions which are unknown apriori. Specifically, we study the online learning problem where the control algorithm…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan , Jianjun Yuan , Dileep Kalathil , Pramod P. Khargonekar

Online Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems

In this paper, we address tracking of a time-varying parameter with unknown dynamics. We formalize the problem as an instance of online optimization in a dynamic setting. Using online gradient descent, we propose a method that sequentially…

Machine Learning · Computer Science 2016-03-17 Aryan Mokhtari , Shahin Shahrampour , Ali Jadbabaie , Alejandro Ribeiro

Online Convex Optimization Perspective for Learning from Dynamically Revealed Preferences

We study the problem of online learning (OL) from revealed preferences: a learner wishes to learn a non-strategic agent's private utility function through observing the agent's utility-maximizing actions in a changing environment. We adopt…

Optimization and Control · Mathematics 2021-06-07 Violet Xinying Chen , Fatma Kılınç-Karzan

Balancing Exploration for Online Receding Horizon Learning Control with Provable Regret Guarantees

We address the problem of simultaneously learning and control in an online receding horizon control setting. We consider the control of an unknown linear dynamical system with general cost functions and affine constraints on the control…

Optimization and Control · Mathematics 2022-11-02 Deepan Muthirayan , Jianjun Yuan , Pramod P. Khargonekar

Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

We study the problem of system identification and adaptive control in partially observable linear dynamical systems. Adaptive and closed-loop system identification is a challenging problem due to correlations introduced in data collection.…

Machine Learning · Computer Science 2020-06-25 Sahin Lale , Kamyar Azizzadenesheli , Babak Hassibi , Anima Anandkumar