Related papers: Online Linear Optimization with Many Hints

200 papers

We consider a variant of the classical online linear optimization problem in which at every step, the online player receives a "hint" vector before choosing the action for that round. Rather surprisingly, it was shown that if the hint…

Machine Learning · Computer Science 2020-10-05 Aditya Bhaskara , Ashok Cutkosky , Ravi Kumar , Manish Purohit

We consider the online linear optimization problem, where at every step the algorithm plays a point $x_t$ in the unit ball, and suffers loss $\langle c_t, x_t\rangle$ for some cost vector $c_t$ that is then revealed to the algorithm. Recent…

Machine Learning · Computer Science 2021-11-10 Aditya Bhaskara , Ashok Cutkosky , Ravi Kumar , Manish Purohit

In this paper, we broaden the horizon of online convex optimization (OCO), and consider multi-objective OCO, where there are $K$ distinct loss function sequences, and an algorithm has to choose its action at time $t$, before the $K$ loss…

Machine Learning · Computer Science 2026-02-11 Rahul Vaze , Sumiran Mishra

A well-studied generalization of the standard online convex optimization (OCO) framework is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the…

Machine Learning · Computer Science 2024-10-29 Abhishek Sinha , Rahul Vaze

We study the problem of online convex optimization (OCO) under unknown linear constraints that are either static, or stochastically time-varying. For this problem, we introduce an algorithm that we term Optimistically Safe OCO (OSOCO) and…

Machine Learning · Computer Science 2025-07-16 Spencer Hutchinson , Tianyi Chen , Mahnoosh Alizadeh

A new algorithm for regret minimization in online convex optimization is described. The regret of the algorithm after $T$ time periods is $O(\sqrt{T \log T})$ - which is the minimum possible up to a logarithmic term. In addition, the new…

Machine Learning · Computer Science 2023-07-24 Elad Hazan , Nimrod Megiddo

This paper addresses Online Convex Optimization (OCO) problems where the constraints have additive perturbations that (i) vary over time and (ii) are not known at the time to make a decision. Perturbations may not be i.i.d. generated and…

Optimization and Control · Mathematics 2019-06-04 Víctor Valls , George Iosifidis , Douglas J. Leith , Leandros Tassiulas

We consider online learning problems where the aim is to achieve regret which is efficient in the sense that it is the same order as the lowest regret amongst K experts. This is a substantially stronger requirement than achieving…

Machine Learning · Computer Science 2019-11-12 Daron Anderson , Douglas J. Leith

The framework of online learning with memory naturally captures learning problems with temporal constraints, and was previously studied for the experts setting. In this work we extend the notion of learning with memory to the general Online…

Machine Learning · Computer Science 2014-06-11 Oren Anava , Elad Hazan , Shie Mannor

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel , Babak Hassibi

In this paper we propose a framework for solving constrained online convex optimization problems. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set…

Machine Learning · Computer Science 2012-10-01 Mehrdad Mahdavi , Rong Jin , Tianbao Yang

We study the setting of optimizing with bandit feedback with additional prior knowledge provided to the learner in the form of an initial hint of the optimal action. We present a novel algorithm for stochastic linear bandits that uses this…

Machine Learning · Computer Science 2022-03-09 Ashok Cutkosky , Chris Dann , Abhimanyu Das , Qiuyi Zhang

In the convex optimization approach to online regret minimization, many methods have been developed to guarantee a $O(\sqrt{T})$ bound on regret for subdifferentiable convex loss functions with bounded subgradients, by using a reduction to…

Machine Learning · Computer Science 2016-09-20 Arthur Flajolet , Patrick Jaillet

We consider an assortment selection and pricing problem in which a seller has $N$ different items available for sale. In each round, the seller observes a $d$-dimensional contextual preference information vector for the user, and offers to…

Machine Learning · Computer Science 2025-03-18 Yigit Efe Erginbas , Thomas A. Courtade , Kannan Ramchandran

In \citep{Hazan-2008-extract}, the authors showed that the regret of online linear optimization can be bounded by the total variation of the cost vectors. In this paper, we extend this result to general online convex optimization. We first…

Machine Learning · Computer Science 2012-06-15 Tianbao Yang , Mehrdad Mahdavi , Rong Jin , Shenghuo Zhu

We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the…

Machine Learning · Statistics 2025-03-14 Jordan Lekeufack , Michael I. Jordan

We study the problem of online learning (OL) from revealed preferences: a learner wishes to learn a non-strategic agent's private utility function through observing the agent's utility-maximizing actions in a changing environment. We adopt…

Optimization and Control · Mathematics 2021-06-07 Violet Xinying Chen , Fatma Kılınç-Karzan

In this paper, we study a special bandit setting of online stochastic linear optimization, where only one bit of information is revealed to the learner at each round. This problem has found many applications including online advertisement…

Machine Learning · Computer Science 2015-09-28 Lijun Zhang , Tianbao Yang , Rong Jin , Zhi-Hua Zhou

Existing approaches to online convex optimization (OCO) make sequential one-slot-ahead decisions, which lead to (possibly adversarial) losses that drive subsequent decision iterates. Their performance is evaluated by the so-called regret…

Systems and Control · Computer Science 2017-11-22 Tianyi Chen , Qing Ling , Georgios B. Giannakis

We consider Constrained Online Convex Optimization (COCO) with adversarially chosen constraints. At each round, the learner chooses an action before observing the loss and constraint function for that round. The goal is to achieve small…

Machine Learning · Computer Science 2026-05-21 Dhruv Sarkar , Abhishek Sinha