Related papers: Online Linear Programming with Batching

Wait-Less Offline Tuning and Re-solving for Online Decision Making

Online linear programming (OLP) has found broad applications in revenue management and resource allocation. State-of-the-art OLP algorithms achieve low regret by repeatedly solving linear programming (LP) subproblems that incorporate…

Machine Learning · Statistics 2025-11-04 Jingruo Sun, Wenzhi Gao, Ellen Vitercik, Yinyu Ye

A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming

We study nonstationary Online Linear Programming (OLP), where $n$ orders arrive sequentially with reward-resource consumption pairs that form a sequence of independent, but not necessarily identically distributed, random vectors. At the…

Data Structures and Algorithms · Computer Science 2026-03-17 Haoran Xu, Owen Shen, Peter Glynn, Yinyu Ye, Patrick Jaillet

Online Linear Programming with Replenishment

We study an online linear programming (OLP) model in which inventory is not provided upfront but instead arrives gradually through an exogenous stochastic replenishment process. This replenishment-based formulation captures operational…

Optimization and Control · Mathematics 2026-01-22 Yuze Chen, Yuan Zhou, Baichuan Mo, Jie Ying, Yufei Ruan, Zhou Ye

Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation

We consider the dynamic resource allocation problem where the decision space is finite-dimensional, yet the solution must satisfy a large or even infinite number of constraints revealed via streaming data or oracle feedback. We model this…

Machine Learning · Computer Science 2026-03-18 Yiming Zong, Jiashuo Jiang

Infrequent Resolving Algorithm for Online Linear Programming

Online linear programming (OLP) has gained significant attention from both researchers and practitioners due to its extensive applications, such as online auction, network revenue management, order fulfillment and advertising. Existing OLP…

Data Structures and Algorithms · Computer Science 2025-11-18 Guokai Li, Zizhuo Wang, Jingwei Zhang

Beyond $\mathcal{O}(\sqrt{T})$ Regret: Decoupling Learning and Decision-making in Online Linear Programming

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of…

Machine Learning · Statistics 2025-01-07 Wenzhi Gao, Dongdong Ge, Chenyu Xue, Chunlin Sun, Yinyu Ye

Online Linear Programming: Dual Convergence, New Algorithms, and Regret Bounds

We study an online linear programming (OLP) problem under a random input model in which the columns of the constraint matrix along with the corresponding coefficients in the objective function are generated i.i.d. from an unknown…

Data Structures and Algorithms · Computer Science 2021-04-20 Xiaocheng Li, Yinyu Ye

Decoupling Learning and Decision-Making: Breaking the $\mathcal{O}(\sqrt{T})$ Barrier in Online Resource Allocation with First-Order Methods

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of…

Machine Learning · Computer Science 2025-01-08 Wenzhi Gao, Chunlin Sun, Chenyu Xue, Dongdong Ge, Yinyu Ye

Online Linear Optimization with Many Hints

We study an online linear optimization (OLO) problem in which the learner is provided access to $K$ "hint" vectors in each round prior to making a decision. In this setting, we devise an algorithm that obtains logarithmic regret whenever…

Machine Learning · Computer Science 2020-10-08 Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

Improved algorithms for online load balancing

We consider an online load balancing problem and its extensions in the framework of repeated games. On each round, the player chooses a distribution (task allocation) over $K$ servers, and then the environment reveals the load of each…

Data Structures and Algorithms · Computer Science 2020-07-22 Yaxiong Liu, Kohei Hatano, Eiji Takimoto

Online estimation and control with optimal pathlength regret

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel, Babak Hassibi

Learning payoffs while routing in skill-based queues

Motivated by applications in service systems, we consider queueing systems where each customer must be handled by a server with the right skill set. We focus on optimizing the routing of customers to servers in order to maximize the total…

Machine Learning · Computer Science 2024-12-16 Sanne van Kempen, Jaron Sanders, Fiona Sloothaak, Maarten G. Wolf

An Improved Analysis of LP-based Control for Revenue Management

In this paper, we study a class of revenue management problems where the decision maker aims to maximize the total revenue subject to budget constraints on multiple types of resources over a finite horizon. At each time, a new…

Optimization and Control · Mathematics 2022-03-18 Guanting Chen, Xiaocheng Li, Yinyu Ye

Online Resource Allocation in Episodic Markov Decision Processes

This paper studies a long-term resource allocation problem over multiple periods where each period requires a multi-stage decision-making process. We formulate the problem as an online allocation problem in an episodic finite-horizon…

Data Structures and Algorithms · Computer Science 2023-10-20 Duksang Lee, William Overman, Dabeen Lee

Online Joint Assortment-Inventory Optimization under MNL Choices

We study an online joint assortment-inventory optimization problem, in which we assume that the choice behavior of each customer follows the Multinomial Logit (MNL) choice model, and the attraction parameters are unknown a priori. The…

Machine Learning · Computer Science 2025-01-03 Yong Liang, Xiaojie Mao, Shiyuan Wang

A Distributed Online Pricing Strategy for Demand Response Programs

From the perspective of the utility (also referred to as the operator), we study a demand response problem under realistic settings in which the utility faces uncertainty and limited communication. Specifically, the utility does not know the cost…

Optimization and Control · Mathematics 2017-08-11 Pan Li, Hao Wang, Baosen Zhang

Smoothed Online Combinatorial Optimization Using Imperfect Predictions

Smoothed online combinatorial optimization considers a learner who repeatedly chooses a combinatorial decision to minimize an unknown changing cost function with a penalty on switching decisions in consecutive rounds. We study smoothed…

Machine Learning · Computer Science 2023-01-18 Kai Wang, Zhao Song, Georgios Theocharous, Sridhar Mahadevan

Efficient and Optimal No-Regret Caching under Partial Observation

Online learning algorithms have been successfully used to design caching policies with sublinear regret in the total number of requests, with no statistical assumption about the request sequence. Most existing algorithms involve…

Machine Learning · Computer Science 2025-03-05 Younes Ben Mazziane, Francescomaria Faticanti, Sara Alouf, Giovanni Neglia

A Batch-to-Online Transformation under Random-Order Model

We introduce a transformation framework that can be utilized to develop online algorithms with low $\epsilon$-approximate regret in the random-order model from offline approximation algorithms. We first give a general reduction theorem that…

Machine Learning · Computer Science 2023-10-27 Jing Dong, Yuichi Yoshida

Minimal Expected Regret in Linear Quadratic Control

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra, Alexandre Proutiere