Related papers: Temporal Variability in Implicit Online Learning

200 papers

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo, Francesco Orabona

This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called forward regret that intuitively measures how good an online learning…

Machine Learning · Computer Science 2012-11-28 Ankan Saha, Prateek Jain, Ambuj Tewari

We consider the problem of online control of systems with time-varying linear dynamics. This is a general formulation that is motivated by the use of local linearization in control of nonlinear dynamical systems. To state meaningful…

Machine Learning · Computer Science 2022-02-15 Paula Gradu, Elad Hazan, Edgar Minasyan

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel, Babak Hassibi

We consider a family of learning strategies for online optimization problems that evolve in continuous time and we show that they lead to no regret. From a more traditional, discrete-time viewpoint, this continuous-time approach allows us…

Optimization and Control · Mathematics 2014-02-28 Joon Kwon, Panayotis Mertikopoulos

We study the problem of online learning and online regret minimization when samples are drawn from a general unknown non-stationary process. We introduce the concept of a dynamic changing process with cost $K$, where the conditional…

Machine Learning · Computer Science 2023-11-14 Changlong Wu, Ananth Grama, Wojciech Szpankowski

In learning theory, the performance of an online policy is commonly measured in terms of the static regret metric, which compares the cumulative loss of an online policy to that of an optimal benchmark in hindsight. In the definition of…

Information Theory · Computer Science 2022-08-23 Ativ Joshi, Abhishek Sinha

Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture certain regularity in online problems in practice. Motivated by this, we establish a new setup, called…

Machine Learning · Computer Science 2022-04-06 Ching-An Cheng, Jonathan Lee, Ken Goldberg, Byron Boots

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

We propose a general framework for studying adaptive regret bounds in the online learning framework, including model selection bounds and data-dependent bounds. Given a data- or model-dependent bound we ask, "Does there exist some algorithm…

Machine Learning · Computer Science 2020-02-14 Dylan J. Foster, Alexander Rakhlin, Karthik Sridharan

The performance of online convex optimization algorithms in a dynamic environment is often expressed in terms of the dynamic regret, which measures the decision maker's performance against a sequence of time-varying comparators. In the…

Machine Learning · Computer Science 2022-02-28 Nima Eshraghi, Ben Liang

We extend and combine several tools of the literature to design fast, adaptive, anytime and scale-free online learning algorithms. Scale-free regret bounds must scale linearly with the maximum loss, both toward large losses and toward very…

Machine Learning · Computer Science 2024-10-22 Laurent Orseau, Marcus Hutter

In online learning, the dynamic regret metric chooses the reference (optimal) solution that may change over time, while the typical (static) regret metric assumes the reference solution to be constant over the whole time horizon. The…

Machine Learning · Computer Science 2019-09-04 Yawei Zhao, Shuang Qiu, Ji Liu

We study the problem of uncertainty quantification via prediction sets, in an online setting where the data distribution may vary arbitrarily over time. Recent work develops online conformal prediction techniques that leverage regret…

Machine Learning · Computer Science 2023-02-16 Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, thus being suboptimal for changing environments. To address this…

Machine Learning · Computer Science 2020-02-07 Lijun Zhang, Shiyin Lu, Tianbao Yang

We consider the online version of the isotonic regression problem. Given a set of linearly ordered points (e.g., on the real line), the learner must predict labels sequentially at adversarially chosen positions and is evaluated by her total…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski, Wouter M. Koolen, Alan Malek

We investigate online convex optimization in non-stationary environments and choose the dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible…

Machine Learning · Computer Science 2020-12-01 Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

We study the problem of dynamic regret minimization in online convex optimization, in which the objective is to minimize the difference between the cumulative loss of an algorithm and that of an arbitrary sequence of comparators. While the…

Machine Learning · Computer Science 2024-11-05 Andrew Jacobsen, Francesco Orabona

Maintaining predictive accuracy in non-stationary environments requires online model selection to adapt autonomously to unknown distribution shifts. However, existing tuning-free algorithms face a fundamental trade-off between robustness…

Machine Learning · Computer Science 2026-05-27 Kei Takemura, Ryuta Matsuno, Keita Sakuma

In the problem of online learning for changing environments, data are sequentially received one after another over time, and their distribution assumptions may vary frequently. Although existing methods demonstrate the effectiveness of…

Machine Learning · Computer Science 2023-07-18 Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Christan Grant, Feng Chen