Related papers

Related papers: Online Learning in Dynamically Changing Environmen…

200 papers

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo, Francesco Orabona

We consider the problem of Zinkevich (2003)-style dynamic regret minimization in online learning with exp-concave losses. We show that whenever improper learning is allowed, a Strongly Adaptive online learner achieves the dynamic regret…

Machine Learning · Computer Science 2021-07-06 Dheeraj Baby, Yu-Xiang Wang

We study the problem of online learning with dynamics, where a learner interacts with a stateful environment over multiple rounds. In each round of the interaction, the learner selects a policy to deploy and incurs a cost that depends on…

Machine Learning · Computer Science 2020-12-04 Kush Bhatia, Karthik Sridharan

We investigate online convex optimization in non-stationary environments and choose the dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2020-12-01 Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

In the convex optimization approach to online regret minimization, many methods have been developed to guarantee an $O(\sqrt{T})$ bound on regret for subdifferentiable convex loss functions with bounded subgradients, by using a reduction to…

Machine Learning · Computer Science 2016-09-20 Arthur Flajolet, Patrick Jaillet

In the setting of online learning, Implicit algorithms turn out to be highly successful from a practical standpoint. However, the tightest regret analyses only show marginal improvements over Online Mirror Descent. In this work, we shed…

Machine Learning · Computer Science 2020-11-10 Nicolò Campolongo, Francesco Orabona

In the problem of online learning for changing environments, data are received sequentially over time, and their distributional assumptions may vary frequently. Although existing methods demonstrate the effectiveness of…

Machine Learning · Computer Science 2023-07-18 Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Christan Grant, Feng Chen

Algorithms for online learning typically require one or more boundedness assumptions: that the domain is bounded, that the losses are Lipschitz, or both. In this paper, we develop a new setting for online learning with unbounded domains and…

Machine Learning · Computer Science 2023-07-18 Andrew Jacobsen, Ashok Cutkosky

Non-stationary online learning has drawn much attention in recent years. Despite considerable progress, dynamic regret minimization has primarily focused on convex functions, leaving the functions with stronger curvature (e.g., squared or…

Machine Learning · Computer Science 2025-06-13 Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra, Alexandre Proutiere

Non-stationary online learning has drawn much attention in recent years. In particular, dynamic regret and adaptive regret are proposed as two principled performance measures for online convex optimization in non-stationary environments. To…

Machine Learning · Computer Science 2025-09-10 Peng Zhao, Yan-Feng Xie, Lijun Zhang, Zhi-Hua Zhou

We investigate online convex optimization in non-stationary environments and choose dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2024-04-09 Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight…

Machine Learning · Computer Science 2024-03-08 Karthik Sridharan, Seung Won Wilson Yoo

Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture certain regularity in online problems in practice. Motivated by this, we establish a new setup, called…

Machine Learning · Computer Science 2022-04-06 Ching-An Cheng, Jonathan Lee, Ken Goldberg, Byron Boots

We study the framework of universal dynamic regret minimization with strongly convex losses. We answer an open problem in Baby and Wang (2021) by showing that in a proper learning setup, Strongly Adaptive algorithms can achieve the near…

Machine Learning · Computer Science 2022-01-25 Dheeraj Baby, Yu-Xiang Wang

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical…

Machine Learning · Computer Science 2023-08-16 Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth…

Machine Learning · Computer Science 2020-08-17 Ting-Jui Chang, Shahin Shahrampour

Existing online learning algorithms for adversarial Markov Decision Processes achieve $O(\sqrt{T})$ regret after $T$ rounds of interactions even if the loss functions are chosen arbitrarily by an adversary, with the caveat that the…

Machine Learning · Computer Science 2023-10-27 Tiancheng Jin, Junyan Liu, Chloé Rouyer, William Chang, Chen-Yu Wei, Haipeng Luo

We study the problem of online learning in predictive control of an unknown linear dynamical system with time-varying cost functions that are unknown a priori. Specifically, we study the online learning problem where the control algorithm…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan, Jianjun Yuan, Dileep Kalathil, Pramod P. Khargonekar

This paper investigates online composite optimization in dynamic environments, where each objective or loss function contains a time-varying nondifferentiable regularizer. To address this problem, an online proximal gradient algorithm is studied for…

Optimization and Control · Mathematics 2023-03-24 Ruijie Hou, Xiuxian Li, Yang Shi