Related papers: Normalized Online Learning

200 papers

We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale. This has several useful effects: there is no need…

Machine Learning · Computer Science 2013-05-30 Stephane Ross, Paul Mineiro, John Langford

We extend and combine several tools of the literature to design fast, adaptive, anytime and scale-free online learning algorithms. Scale-free regret bounds must scale linearly with the maximum loss, both toward large losses and toward very…

Machine Learning · Computer Science 2024-10-22 Laurent Orseau, Marcus Hutter

One of the main strengths of online algorithms is their ability to adapt to arbitrary data sequences. This is especially important in nonparametric settings, where performance is measured against rich classes of comparator functions that…

Machine Learning · Computer Science 2020-11-03 Ilja Kuzborskij, Nicolò Cesa-Bianchi

We consider online learning with linear models, where the algorithm predicts on sequentially revealed instances (feature vectors), and is compared against the best linear function (comparator) in hindsight. Popular algorithms in this…

Machine Learning · Computer Science 2019-02-21 Michał Kempka, Wojciech Kotłowski, Manfred K. Warmuth

This paper presents early work aiming at the development of a new framework for the design and analysis of algorithms for online learning based prediction and control. Firstly, we consider the task of predicting values of a function or time…

Optimization and Control · Mathematics 2019-03-26 Jan-P. Calliess

We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data. We show that the generalization error of any stable online algorithm concentrates around its regret--an easily…

Machine Learning · Statistics 2012-06-08 Alekh Agarwal, John C. Duchi

This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called "forward regret" that intuitively measures how good an online learning…

Machine Learning · Computer Science 2012-11-28 Ankan Saha, Prateek Jain, Ambuj Tewari

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

We propose a new partial-observability model for online learning problems where the learner, besides its own loss, also observes some noisy feedback about the other actions, depending on the underlying structure of the problem. We represent…

Machine Learning · Computer Science 2026-04-16 Tomáš Kocák, Gergely Neu, Michal Valko

We propose a general framework for studying adaptive regret bounds in the online learning framework, including model selection bounds and data-dependent bounds. Given a data- or model-dependent bound we ask, "Does there exist some algorithm…

Machine Learning · Computer Science 2020-02-14 Dylan J. Foster, Alexander Rakhlin, Karthik Sridharan

We consider a variant of online convex optimization in which both the instances (input vectors) and the comparator (weight vector) are unconstrained. We exploit a natural scale invariance symmetry in our unconstrained setting: the…

Machine Learning · Computer Science 2017-08-24 Wojciech Kotłowski

We show how to take any two parameter-free online learning algorithms with different regret guarantees and obtain a single algorithm whose regret is the minimum of the two base algorithms. Our method is embarrassingly simple: just add the…

Machine Learning · Statistics 2019-02-26 Ashok Cutkosky

We design algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. We achieve adaptiveness to norms of loss vectors by scale…

Machine Learning · Computer Science 2015-07-03 Francesco Orabona, David Pal

We design and analyze algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. Our algorithms are instances of the Follow the…

Machine Learning · Computer Science 2016-12-15 Francesco Orabona, Dávid Pál

The vast majority of optimization and online learning algorithms today require some prior information about the data (often in the form of bounds on gradients or on the optimal parameter value). When this information is not available, these…

Machine Learning · Computer Science 2017-06-07 Ashok Cutkosky, Kwabena Boahen

This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least…

Machine Learning · Statistics 2017-08-08 Kwang-Sung Jun, Francesco Orabona, Rebecca Willett, Stephen Wright

We consider prediction with expert advice when the loss vectors are assumed to lie in a set described by the sum of atomic norm balls. We derive a regret bound for a general version of the online mirror descent (OMD) algorithm that uses a…

Machine Learning · Computer Science 2017-11-15 Siddharth Barman, Aditya Gopalan, Aadirupa Saha

We consider revenue maximization in online auction/pricing problems. A seller sells an identical item in each period to a new buyer, or a new set of buyers. For the online posted pricing problem, we show regret bounds that scale with the…

Computer Science and Game Theory · Computer Science 2018-09-13 Sébastien Bubeck, Nikhil R. Devanur, Zhiyi Huang, Rad Niazadeh

We present methods for online linear optimization that take advantage of benign (as opposed to worst-case) sequences. Specifically if the sequence encountered by the learner is described well by a known "predictable process", the algorithms…

Machine Learning · Statistics 2014-05-27 Alexander Rakhlin, Karthik Sridharan

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel, Babak Hassibi