Related papers: Online Learning in Dynamically Changing Environmen…

A closer look at temporal variability in dynamic online learning

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo, Francesco Orabona

Optimal Dynamic Regret in Exp-Concave Online Learning

We consider the problem of Zinkevich (2003)-style dynamic regret minimization in online learning with exp-concave losses. We show that whenever improper learning is allowed, a Strongly Adaptive online learner achieves the dynamic regret…

Machine Learning · Computer Science 2021-07-06 Dheeraj Baby, Yu-Xiang Wang

Online learning with dynamics: A minimax perspective

We study the problem of online learning with dynamics, where a learner interacts with a stateful environment over multiple rounds. In each round of the interaction, the learner selects a policy to deploy and incurs a cost that depends on…

Machine Learning · Computer Science 2020-12-04 Kush Bhatia, Karthik Sridharan

Dynamic Regret of Convex and Smooth Functions

We investigate online convex optimization in non-stationary environments and choose the dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2020-12-01 Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

No-Regret Learnability for Piecewise Linear Losses

In the convex optimization approach to online regret minimization, many methods have been developed to guarantee an $O(\sqrt{T})$ bound on regret for subdifferentiable convex loss functions with bounded subgradients, by using a reduction to…

Machine Learning · Computer Science 2016-09-20 Arthur Flajolet, Patrick Jaillet

Temporal Variability in Implicit Online Learning

In the setting of online learning, Implicit algorithms turn out to be highly successful from a practical standpoint. However, the tightest regret analyses only show marginal improvements over Online Mirror Descent. In this work, we shed…

Machine Learning · Computer Science 2020-11-10 Nicolò Campolongo, Francesco Orabona

Towards Fair Disentangled Online Learning for Changing Environments

In the problem of online learning for changing environments, data are sequentially received one after another over time, and their distribution assumptions may vary frequently. Although existing methods demonstrate the effectiveness of…

Machine Learning · Computer Science 2023-07-18 Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Christan Grant, Feng Chen

Unconstrained Online Learning with Unbounded Losses

Algorithms for online learning typically require one or more boundedness assumptions: that the domain is bounded, that the losses are Lipschitz, or both. In this paper, we develop a new setting for online learning with unbounded domains and…

Machine Learning · Computer Science 2023-07-18 Andrew Jacobsen, Ashok Cutkosky

Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability

Non-stationary online learning has drawn much attention in recent years. Despite considerable progress, dynamic regret minimization has primarily focused on convex functions, leaving the functions with stronger curvature (e.g., squared or…

Machine Learning · Computer Science 2025-06-13 Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama

Minimal Expected Regret in Linear Quadratic Control

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra, Alexandre Proutiere

Efficient Methods for Non-stationary Online Learning

Non-stationary online learning has drawn much attention in recent years. In particular, dynamic regret and adaptive regret are proposed as two principled performance measures for online convex optimization in non-stationary environments. To…

Machine Learning · Computer Science 2025-09-10 Peng Zhao, Yan-Feng Xie, Lijun Zhang, Zhi-Hua Zhou

Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

We investigate online convex optimization in non-stationary environments and choose dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2024-04-09 Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

Online Learning with Unknown Constraints

We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight…

Machine Learning · Computer Science 2024-03-08 Karthik Sridharan, Seung Won Wilson Yoo

Online Learning with Continuous Variations: Dynamic Regret and Reductions

Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture certain regularity in online problems in practice. Motivated by this, we establish a new setup, called…

Machine Learning · Computer Science 2022-04-06 Ching-An Cheng, Jonathan Lee, Ken Goldberg, Byron Boots

Optimal Dynamic Regret in Proper Online Learning with Strongly Convex Losses and Beyond

We study the framework of universal dynamic regret minimization with strongly convex losses. We answer an open problem in Baby and Wang 2021 by showing that in a proper learning setup, Strongly Adaptive algorithms can achieve the near…

Machine Learning · Computer Science 2022-01-25 Dheeraj Baby, Yu-Xiang Wang

High-Probability Risk Bounds via Sequential Predictors

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical…

Machine Learning · Computer Science 2023-08-16 Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

Unconstrained Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth…

Machine Learning · Computer Science 2020-08-17 Ting-Jui Chang, Shahin Shahrampour

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

Existing online learning algorithms for adversarial Markov Decision Processes achieve ${O}(\sqrt{T})$ regret after $T$ rounds of interactions even if the loss functions are chosen arbitrarily by an adversary, with the caveat that the…

Machine Learning · Computer Science 2023-10-27 Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo

Online Learning for Predictive Control with Provable Regret Guarantees

We study the problem of online learning in predictive control of an unknown linear dynamical system with time-varying cost functions which are unknown a priori. Specifically, we study the online learning problem where the control algorithm…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan, Jianjun Yuan, Dileep Kalathil, Pramod P. Khargonekar

Dynamic Regret for Online Composite Optimization

This paper investigates online composite optimization in dynamic environments, where each objective or loss function contains a time-varying nondifferentiable regularizer. To resolve it, an online proximal gradient algorithm is studied for…

Optimization and Control · Mathematics 2023-03-24 Ruijie Hou, Xiuxian Li, Yang Shi