Related papers: Quantum Algorithm for Online Convex Optimization

Quantum Algorithm for Online Exp-concave Optimization

We explore whether quantum advantages can be found for the zeroth-order feedback online exp-concave optimization problem, which is also known as bandit exp-concave optimization with multi-point feedback. We present quantum online…

Quantum Physics · Physics 2024-10-28 Jianhao He , Chengchang Liu , Xutong Liu , Lvzhou Li , John C. S. Lui

Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces

We consider online convex optimization with a zero-order oracle feedback. In particular, the decision maker does not know the explicit representation of the time-varying cost functions, or their gradients. At each time step, she observes…

Optimization and Control · Mathematics 2020-05-05 Tatiana Tatarenko , Maryam Kamgarpour

An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints

We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the…

Machine Learning · Statistics 2025-03-14 Jordan Lekeufack , Michael I. Jordan

Safe Online Convex Optimization with Multi-Point Feedback

Motivated by the stringent safety requirements that are often present in real-world applications, we study a safe online convex optimization setting where the player needs to simultaneously achieve sublinear regret and zero constraint…

Machine Learning · Computer Science 2024-07-17 Spencer Hutchinson , Mahnoosh Alizadeh

Optimal Regret Algorithm for Pseudo-1d Bandit Convex Optimization

We study online learning with bandit feedback (i.e. learner has access to only zeroth-order oracle) where cost/reward functions $\f_t$ admit a "pseudo-1d" structure, i.e. $\f_t(\w) = \loss_t(\pred_t(\w))$ where the output of $\pred_t$ is…

Machine Learning · Computer Science 2021-02-16 Aadirupa Saha , Nagarajan Natarajan , Praneeth Netrapalli , Prateek Jain

Improved Guarantees for Constrained Online Convex Optimization via Self-Contraction

We consider Constrained Online Convex Optimization (COCO) with adversarially chosen constraints. At each round, the learner chooses an action before observing the loss and constraint function for that round. The goal is to achieve small…

Machine Learning · Computer Science 2026-05-21 Dhruv Sarkar , Abhishek Sinha

Adaptive Algorithms for Online Convex Optimization with Long-term Constraints

We present an adaptive online gradient descent algorithm to solve online convex optimization problems with long-term constraints , which are constraints that need to be satisfied when accumulated over a finite number of rounds T , but can…

Machine Learning · Statistics 2015-12-24 Rodolphe Jenatton , Jim Huang , Cédric Archambeau

An optimal algorithm for bandit convex optimization

We consider the problem of online convex optimization against an arbitrary adversary with bandit feedback, known as bandit convex optimization. We give the first $\tilde{O}(\sqrt{T})$-regret algorithm for this setting based on a novel…

Machine Learning · Computer Science 2016-03-16 Elad Hazan , Yuanzhi Li

An Efficient Interior-Point Method for Online Convex Optimization

A new algorithm for regret minimization in online convex optimization is described. The regret of the algorithm after $T$ time periods is $O(\sqrt{T \log T})$ - which is the minimum possible up to a logarithmic term. In addition, the new…

Machine Learning · Computer Science 2023-07-24 Elad Hazan , Nimrod Megiddo

Optimal Algorithms for Online Convex Optimization with Adversarial Constraints

A well-studied generalization of the standard online convex optimization (OCO) framework is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the…

Machine Learning · Computer Science 2024-10-29 Abhishek Sinha , Rahul Vaze

Second Order Methods for Bandit Optimization and Control

Bandit convex optimization (BCO) is a general framework for online decision making under uncertainty. While tight regret bounds for general convex losses have been established, existing algorithms achieving these bounds have prohibitive…

Machine Learning · Computer Science 2024-10-04 Arun Suggala , Y. Jennifer Sun , Praneeth Netrapalli , Elad Hazan

A Reduction from Delayed to Immediate Feedback for Online Convex Optimization with Improved Guarantees

We develop a reduction-based framework for online learning with delayed feedback that recovers and improves upon existing results for both first-order and bandit convex optimization. Our approach introduces a continuous-time model under…

Machine Learning · Computer Science 2026-02-04 Alexander Ryabchenko , Idan Attias , Daniel M. Roy

Best of Both Worlds: Regret Minimization versus Minimax Play

In this paper, we investigate the existence of online learning algorithms with bandit feedback that simultaneously guarantee $O(1)$ regret compared to a given comparator strategy, and $\tilde{O}(\sqrt{T})$ regret compared to any fixed…

Machine Learning · Computer Science 2025-06-05 Adrian Müller , Jon Schneider , Stratis Skoulakis , Luca Viano , Volkan Cevher

A Unified Framework for Analyzing Meta-algorithms in Online Convex Optimization

In this paper, we analyze the problem of online convex optimization in different settings, including different feedback types (full-information/semi-bandit/bandit/etc) in either stochastic or non-stochastic setting and different notions of…

Machine Learning · Computer Science 2026-02-23 Mohammad Pedramfar , Vaneet Aggarwal

Improved Regret Bounds for Projection-free Bandit Convex Optimization

We revisit the challenge of designing online algorithms for the bandit convex optimization problem (BCO) which are also scalable to high dimensional problems. Hence, we consider algorithms that are \textit{projection-free}, i.e., based on…

Machine Learning · Computer Science 2019-10-09 Dan Garber , Ben Kretzu

The Computational Power of Optimization in Online Learning

We consider the fundamental problem of prediction with expert advice where the experts are "optimizable": there is a black-box optimization oracle that can be used to compute, in constant time, the leading expert in retrospect at any point…

Machine Learning · Computer Science 2016-01-28 Elad Hazan , Tomer Koren

Multi-Objective $\textit{min-max}$ Online Convex Optimization

In this paper, we broaden the horizon of online convex optimization (OCO), and consider multi-objective OCO, where there are $K$ distinct loss function sequences, and an algorithm has to choose its action at time $t$, before the $K$ loss…

Machine Learning · Computer Science 2026-02-11 Rahul Vaze , Sumiran Mishra

Contaminated Online Convex Optimization

In online convex optimization, some efficient algorithms have been designed for each of the individual classes of objective functions, e.g., convex, strongly convex, and exp-concave. However, existing regret analyses, including those of…

Optimization and Control · Mathematics 2024-12-13 Tomoya Kamijima , Shinji Ito

Optimal High-Probability Regret for Online Convex Optimization with Two-Point Bandit Feedback

We consider the problem of Online Convex Optimization (OCO) with two-point bandit feedback. In this setting, a player attempts to minimize a sequence of adversarially generated convex loss functions, while only observing the value of each…

Machine Learning · Computer Science 2026-04-07 Haishan Ye

Online Convex Optimization with Stochastic Constraints: Zero Constraint Violation and Bandit Feedback

This paper studies online convex optimization with stochastic constraints. We propose a variant of the drift-plus-penalty algorithm that guarantees $O(\sqrt{T})$ expected regret and zero constraint violation, after a fixed number of…

Optimization and Control · Mathematics 2023-07-17 Yeongjong Kim , Dabeen Lee