Related papers: Efficient Online Linear Optimization with Approxim…

Online Improper Learning with an Approximation Oracle

We revisit the question of reducing online learning to approximate optimization of the offline problem. In this setting, we give two algorithms with near-optimal performance in the full information setting: they guarantee optimal regret and…

Machine Learning · Computer Science 2018-04-24 Elad Hazan , Wei Hu , Yuanzhi Li , Zhiyuan Li

Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound

In online inverse linear optimization, a learner observes time-varying sets of feasible actions and an agent's optimal actions, selected by solving linear optimization over the feasible actions. The learner sequentially makes predictions of…

Machine Learning · Computer Science 2025-05-23 Shinsaku Sakaue , Taira Tsuchiya , Han Bao , Taihei Oki

Beyond $\mathcal{O}(\sqrt{T})$ Regret: Decoupling Learning and Decision-making in Online Linear Programming

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of…

Machine Learning · Statistics 2025-01-07 Wenzhi Gao , Dongdong Ge , Chenyu Xue , Chunlin Sun , Yinyu Ye

Improved Regret Bounds for Projection-free Bandit Convex Optimization

We revisit the challenge of designing online algorithms for the bandit convex optimization problem (BCO) which are also scalable to high dimensional problems. Hence, we consider algorithms that are \textit{projection-free}, i.e., based on…

Machine Learning · Computer Science 2019-10-09 Dan Garber , Ben Kretzu

Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving,…

Machine Learning · Computer Science 2014-05-22 H. Brendan McMahan , Francesco Orabona

Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation

We consider the dynamic resource allocation problem where the decision space is finite-dimensional, yet the solution must satisfy a large or even infinite number of constraints revealed via streaming data or oracle feedback. We model this…

Machine Learning · Computer Science 2026-03-18 Yiming Zong , Jiashuo Jiang

A Low Complexity Algorithm with $O(\sqrt{T})$ Regret and $O(1)$ Constraint Violations for Online Convex Optimization with Long Term Constraints

This paper considers online convex optimization over a complicated constraint set, which typically consists of multiple functional constraints and a set constraint. The conventional online projection algorithm (Zinkevich, 2003) can be…

Optimization and Control · Mathematics 2020-05-19 Hao Yu , Michael J. Neely

Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds

We revisit the problem of online learning with sleeping experts/bandits: in each time step, only a subset of the actions are available for the algorithm to choose from (and learn about). The work of Kleinberg et al. (2010) showed that there…

Machine Learning · Computer Science 2021-04-27 Ehsan Emamjomeh-Zadeh , Chen-Yu Wei , Haipeng Luo , David Kempe

Logarithmic Regret from Sublinear Hints

We consider the online linear optimization problem, where at every step the algorithm plays a point $x_t$ in the unit ball, and suffers loss $\langle c_t, x_t\rangle$ for some cost vector $c_t$ that is then revealed to the algorithm. Recent…

Machine Learning · Computer Science 2021-11-10 Aditya Bhaskara , Ashok Cutkosky , Ravi Kumar , Manish Purohit

Improved Regret Bounds for Online Submodular Maximization

In this paper, we consider an online optimization problem over $T$ rounds where at each step $t\in[T]$, the algorithm chooses an action $x_t$ from the fixed convex and compact domain set $\mathcal{K}$. A utility function $f_t(\cdot)$ is…

Machine Learning · Computer Science 2021-06-16 Omid Sadeghi , Prasanna Raut , Maryam Fazel

Online Learning with Unknown Constraints

We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight…

Machine Learning · Computer Science 2024-03-08 Karthik Sridharan , Seung Won Wilson Yoo

Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation

We study reinforcement learning with linear function approximation and adversarially changing cost functions, a setup that has mostly been considered under simplifying assumptions such as full information feedback or exploratory…

Machine Learning · Computer Science 2023-01-31 Uri Sherman , Tomer Koren , Yishay Mansour

Online learning for min-max discrete problems

We study various discrete nonlinear combinatorial optimization problems in an online learning framework. In the first part, we address the question of whether there are negative results showing that getting a vanishing (or even vanishing…

Data Structures and Algorithms · Computer Science 2020-06-24 Evripidis Bampis , Dimitris Christou , Bruno Escoffier , Nguyen Kim Thang

The Computational Power of Optimization in Online Learning

We consider the fundamental problem of prediction with expert advice where the experts are "optimizable": there is a black-box optimization oracle that can be used to compute, in constant time, the leading expert in retrospect at any point…

Machine Learning · Computer Science 2016-01-28 Elad Hazan , Tomer Koren

An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints

We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the…

Machine Learning · Statistics 2025-03-14 Jordan Lekeufack , Michael I. Jordan

Computing Optimal Regularizers for Online Linear Optimization

Follow-the-Regularized-Leader (FTRL) algorithms are a popular class of learning algorithms for online linear optimization (OLO) that guarantee sub-linear regret, but the choice of regularizer can significantly impact dimension-dependent…

Machine Learning · Computer Science 2024-10-24 Khashayar Gatmiry , Jon Schneider , Stefanie Jegelka

Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces

We consider online convex optimization with a zero-order oracle feedback. In particular, the decision maker does not know the explicit representation of the time-varying cost functions, or their gradients. At each time step, she observes…

Optimization and Control · Mathematics 2020-05-05 Tatiana Tatarenko , Maryam Kamgarpour

Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints

In this paper we propose a framework for solving constrained online convex optimization problem. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set…

Machine Learning · Computer Science 2012-10-01 Mehrdad Mahdavi , Rong Jin , Tianbao Yang

Adaptive Algorithms for Online Convex Optimization with Long-term Constraints

We present an adaptive online gradient descent algorithm to solve online convex optimization problems with long-term constraints , which are constraints that need to be satisfied when accumulated over a finite number of rounds T , but can…

Machine Learning · Statistics 2015-12-24 Rodolphe Jenatton , Jim Huang , Cédric Archambeau

Regret in Online Combinatorial Optimization

We address online linear optimization problems when the possible actions of the decision maker are represented by binary vectors. The regret of the decision maker is the difference between her realized loss and the best loss she would have…

Machine Learning · Computer Science 2013-04-02 Jean-Yves Audibert , Sébastien Bubeck , Gábor Lugosi