Related papers: Improved algorithms for online load balancing

Competitive ratio versus regret minimization: achieving the best of both worlds

We consider online algorithms under both the competitive ratio criteria and the regret minimization one. Our main goal is to build a unified methodology that would be able to guarantee both criteria simultaneously. For a general class of…

Machine Learning · Computer Science 2019-04-09 Amit Daniely , Yishay Mansour

Online Linear Programming with Batching

We study Online Linear Programming (OLP) with batching. The planning horizon is cut into $K$ batches, and the decisions on customers arriving within a batch can be delayed to the end of their associated batch. Compared with OLP without…

Machine Learning · Computer Science 2024-08-02 Haoran Xu , Peter W. Glynn , Yinyu Ye

Online estimation and control with optimal pathlength regret

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel , Babak Hassibi

Online Game with Time-Varying Coupled Inequality Constraints

In this paper, online game is studied, where at each time, a group of players aim at selfishly minimizing their own time-varying cost function simultaneously subject to time-varying coupled constraints and local feasible set constraints.…

Computer Science and Game Theory · Computer Science 2023-06-29 Min Meng , Xiuxian Li , Yiguang Hong , Jie Chen , Long Wang

Online Mixed Discrete and Continuous Optimization: Algorithms, Regret Analysis and Applications

We study an online mixed discrete and continuous optimization problem where a decision maker interacts with an unknown environment for a number of $T$ rounds. At each round, the decision maker needs to first jointly choose a discrete and a…

Optimization and Control · Mathematics 2024-08-27 Lintao Ye , Ming Chi , Zhi-Wei Liu , Xiaoling Wang , Vijay Gupta

A note on continuous-time online learning

In online learning, the data is provided in a sequential order, and the goal of the learner is to make online decisions to minimize overall regrets. This note is concerned with continuous-time models and algorithms for several online…

Machine Learning · Statistics 2024-05-20 Lexing Ying

Hierarchies of Relaxations for Online Prediction Problems with Evolving Constraints

We study online prediction where regret of the algorithm is measured against a benchmark defined via evolving constraints. This framework captures online prediction on graphs, as well as other prediction problems with combinatorial…

Machine Learning · Computer Science 2015-06-15 Alexander Rakhlin , Karthik Sridharan

Online distributed algorithms for seeking generalized Nash equilibria in dynamic environments

In this paper, we study the distributed generalized Nash equilibrium seeking problem of non-cooperative games in dynamic environments. Each player in the game aims to minimize its own time-varying cost function subject to a local action…

Optimization and Control · Mathematics 2020-04-02 Kaihong Lu , Guangqi Li , Long Wang

A Unifying Framework for Online Optimization with Long-Term Constraints

We study online learning problems in which a decision maker has to take a sequence of decisions subject to $m$ long-term constraints. The goal of the decision maker is to maximize their total reward, while at the same time achieving small…

Machine Learning · Computer Science 2022-09-16 Matteo Castiglioni , Andrea Celli , Alberto Marchesi , Giulia Romano , Nicola Gatti

No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!

We study online decision making problems under resource constraints, where both reward and cost functions are drawn from distributions that may change adversarially over time. We focus on two canonical settings: $(i)$ online resource…

Machine Learning · Computer Science 2025-06-19 Francesco Emanuele Stradi , Matteo Castiglioni , Alberto Marchesi , Nicola Gatti , Christian Kroer

Regret Analysis of Distributed Online Control for LTI Systems with Adversarial Disturbances

This paper addresses the distributed online control problem over a network of linear time-invariant (LTI) systems (with possibly unknown dynamics) in the presence of adversarial perturbations. There exists a global network cost that is…

Optimization and Control · Mathematics 2023-10-06 Ting-Jui Chang , Shahin Shahrampour

Distributed Online Linear Regression

We study online linear regression problems in a distributed setting, where the data is spread over a network. In each round, each network node proposes a linear predictor, with the objective of fitting the \emph{network-wide} data. It then…

Machine Learning · Computer Science 2019-02-14 Deming Yuan , Alexandre Proutiere , Guodong Shi

Adaptive Regret Minimization in Bounded-Memory Games

Online learning algorithms that minimize regret provide strong guarantees in situations that involve repeatedly making decisions in an uncertain environment, e.g. a driver deciding what route to drive to work every day. While regret…

Computer Science and Game Theory · Computer Science 2013-09-06 Jeremiah Blocki , Nicolas Christin , Anupam Datta , Arunesh Sinha

Regret Bounds for Lifelong Learning

We consider the problem of transfer learning in an online setting. Different tasks are presented sequentially and processed by a within-task algorithm. We propose a lifelong learning strategy which refines the underlying data representation…

Machine Learning · Statistics 2019-10-14 Pierre Alquier , The Tien Mai , Massimiliano Pontil

Learning payoffs while routing in skill-based queues

Motivated by applications in service systems, we consider queueing systems where each customer must be handled by a server with the right skill set. We focus on optimizing the routing of customers to servers in order to maximize the total…

Machine Learning · Computer Science 2024-12-16 Sanne van Kempen , Jaron Sanders , Fiona Sloothaak , Maarten G. Wolf

Online Optimization for Network Resource Allocation and Comparison with Reinforcement Learning Techniques

We tackle in this paper an online network resource allocation problem with job transfers. The network is composed of many servers connected by communication links. The system operates in discrete time; at each time slot, the administrator…

Machine Learning · Statistics 2023-11-17 Ahmed Sid-Ali , Ioannis Lambadaris , Yiqiang Q. Zhao , Gennady Shaikhet , Amirhossein Asgharnia

Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization

This paper considers the distributed online convex optimization problem with time-varying constraints over a network of agents. This is a sequential decision making problem with two sequences of arbitrarily varying convex loss and…

Optimization and Control · Mathematics 2022-12-29 Xinlei Yi , Xiuxian Li , Tao Yang , Lihua Xie , Tianyou Chai , Karl H. Johansson

Distributed Online Optimization with Long-Term Constraints

We consider distributed online convex optimization problems, where the distributed system consists of various computing units connected through a time-varying communication graph. In each time step, each computing unit selects a constrained…

Machine Learning · Computer Science 2019-12-23 Deming Yuan , Alexandre Proutiere , Guodong Shi

Settling the Sample Complexity of Online Reinforcement Learning

A central issue lying at the heart of online reinforcement learning (RL) is data efficiency. While a number of recent works achieved asymptotically minimal regret in online RL, the optimality of these results is only guaranteed in a…

Machine Learning · Computer Science 2025-04-30 Zihan Zhang , Yuxin Chen , Jason D. Lee , Simon S. Du

A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming

We study nonstationary Online Linear Programming (OLP), where $n$ orders arrive sequentially with reward-resource consumption pairs that form a sequence of independent, but not necessarily identically distributed, random vectors. At the…

Data Structures and Algorithms · Computer Science 2026-03-17 Haoran Xu , Owen Shen , Peter Glynn , Yinyu Ye , Patrick Jaillet