Related papers: Efficient Constrained Regret Minimization

A Unifying Framework for Online Optimization with Long-Term Constraints

We study online learning problems in which a decision maker has to take a sequence of decisions subject to $m$ long-term constraints. The goal of the decision maker is to maximize their total reward, while at the same time achieving small…

Machine Learning · Computer Science 2022-09-16 Matteo Castiglioni , Andrea Celli , Alberto Marchesi , Giulia Romano , Nicola Gatti

A Regret-Variance Trade-Off in Online Learning

We consider prediction with expert advice for strongly convex and bounded losses, and investigate trade-offs between regret and "variance" (i.e., squared difference of learner's predictions and best expert predictions). With $K$ experts,…

Machine Learning · Computer Science 2022-06-07 Dirk van der Hoeven , Nikita Zhivotovskiy , Nicolò Cesa-Bianchi

Optimistic and Adaptive Lagrangian Hedging

In online learning an algorithm plays against an environment with losses possibly picked by an adversary at each round. The generality of this framework includes problems that are not adversarial, for example offline optimization, or saddle…

Machine Learning · Computer Science 2021-02-04 Ryan D'Orazio , Ruitong Huang

Robust Min-Max (Regret) Optimization using Ordered Weighted Averaging

In decision-making under uncertainty, several criteria have been studied to aggregate the performance of a solution over multiple possible scenarios. This paper introduces a novel variant of ordered weighted averaging (OWA) for optimization…

Optimization and Control · Mathematics 2024-01-30 Werner Baak , Marc Goerigk , Adam Kasperski , Paweł Zieliński

Best of Many Worlds Guarantees for Online Learning with Knapsacks

We study online learning problems in which a decision maker wants to maximize their expected reward without violating a finite set of $m$ resource constraints. By casting the learning process over a suitably defined space of strategy…

Machine Learning · Computer Science 2023-03-13 Andrea Celli , Matteo Castiglioni , Christian Kroer

No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!

We study online decision making problems under resource constraints, where both reward and cost functions are drawn from distributions that may change adversarially over time. We focus on two canonical settings: $(i)$ online resource…

Machine Learning · Computer Science 2025-06-19 Francesco Emanuele Stradi , Matteo Castiglioni , Alberto Marchesi , Nicola Gatti , Christian Kroer

Meta-strategy for Learning Tuning Parameters with Guarantees

Online learning methods, like the online gradient algorithm (OGA) and exponentially weighted aggregation (EWA), often depend on tuning parameters that are difficult to set in practice. We consider an online meta-learning scenario, and we…

Machine Learning · Statistics 2021-11-15 Dimitri Meunier , Pierre Alquier

Efficient online algorithms for fast-rate regret bounds under sparsity

We consider the online convex optimization problem. In the setting of arbitrary sequences and finite set of parameters, we establish a new fast-rate quantile regret bound. Then we investigate the optimization into the L1-ball by…

Statistics Theory · Mathematics 2018-05-24 Pierre Gaillard , Olivier Wintenberger

Decision Variance in Online Learning

Online learning has traditionally focused on the expected rewards. In this paper, a risk-averse online learning problem under the performance measure of the mean-variance of the rewards is studied. Both the bandit and full information…

Machine Learning · Statistics 2019-03-15 Sattar Vakili , Alexis Boukouvalas , Qing Zhao

Online Non-convex Optimization with Long-term Non-convex Constraints

A novel Follow-the-Perturbed-Leader type algorithm is proposed and analyzed for solving general long-term constrained optimization problems in an online manner, where the target and constraint functions are oblivious adversarially generated…

Optimization and Control · Mathematics 2025-10-02 Shijie Pan , Jianyu Xu , Wenjie Huang

Dual Averaging on Compactly-Supported Distributions And Application to No-Regret Learning on a Continuum

We consider an online learning problem on a continuum. A decision maker is given a compact feasible set $S$, and is faced with the following sequential problem: at iteration~$t$, the decision maker chooses a distribution $x^{(t)} \in…

Machine Learning · Computer Science 2015-04-30 Walid Krichene

Regret Bounds for Robust Online Decision Making

We propose a framework which generalizes "decision making with structured observations" by allowing robust (i.e. multivalued) models. In this framework, each model associates each decision with a convex set of probability distributions over…

Machine Learning · Computer Science 2025-06-27 Alexander Appel , Vanessa Kosoy

Universal Dynamic Regret and Constraint Violation Bounds for Constrained Online Convex Optimization

We consider a generalization of the celebrated Online Convex Optimization (OCO) framework with adversarial online constraints. In this problem, an online learner interacts with an adversary sequentially over multiple rounds. At the…

Machine Learning · Computer Science 2026-01-07 Subhamon Supantha , Abhishek Sinha

Regret in Online Combinatorial Optimization

We address online linear optimization problems when the possible actions of the decision maker are represented by binary vectors. The regret of the decision maker is the difference between her realized loss and the best loss she would have…

Machine Learning · Computer Science 2013-04-02 Jean-Yves Audibert , Sébastien Bubeck , Gábor Lugosi

Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints

In this paper we propose a framework for solving constrained online convex optimization problem. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set…

Machine Learning · Computer Science 2012-10-01 Mehrdad Mahdavi , Rong Jin , Tianbao Yang

Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization

This paper considers the distributed online convex optimization problem with time-varying constraints over a network of agents. This is a sequential decision making problem with two sequences of arbitrarily varying convex loss and…

Optimization and Control · Mathematics 2022-12-29 Xinlei Yi , Xiuxian Li , Tao Yang , Lihua Xie , Tianyou Chai , Karl H. Johansson

Exponentially Weighted Algorithm for Online Network Resource Allocation with Long-Term Constraints

This paper studies an online optimal resource reservation problem in communication networks with job transfers where the goal is to minimize the reservation cost while maintaining the blocking cost under a certain budget limit. To tackle…

Optimization and Control · Mathematics 2024-05-07 Ahmed Sid-Ali , Ioannis Lambadaris , Yiqiang Q. Zhao , Gennady Shaikhet , Amirhossein Asgharnia

Regret Balancing for Bandit and RL Model Selection

We consider model selection in stochastic bandit and reinforcement learning problems. Given a set of base learning algorithms, an effective model selection strategy adapts to the best learning algorithm in an online fashion. We show that by…

Machine Learning · Computer Science 2020-06-11 Yasin Abbasi-Yadkori , Aldo Pacchiano , My Phan

Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning

We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of $T$ periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage…

Machine Learning · Computer Science 2024-05-21 Jiashuo Jiang

Dynamic Regret Analysis for Online Meta-Learning

The online meta-learning framework has arisen as a powerful tool for the continual lifelong learning setting. The goal for an agent is to quickly learn new tasks by drawing on prior experience, while it faces with tasks one after another.…

Machine Learning · Computer Science 2021-09-30 Parvin Nazari , Esmaile Khorram