Related papers: Online Regenerative Learning

Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation

We consider the dynamic resource allocation problem where the decision space is finite-dimensional, yet the solution must satisfy a large or even infinite number of constraints revealed via streaming data or oracle feedback. We model this…

Machine Learning · Computer Science 2026-03-18 Yiming Zong, Jiashuo Jiang

Online Linear Programming: Dual Convergence, New Algorithms, and Regret Bounds

We study an online linear programming (OLP) problem under a random input model in which the columns of the constraint matrix along with the corresponding coefficients in the objective function are generated i.i.d. from an unknown…

Data Structures and Algorithms · Computer Science 2021-04-20 Xiaocheng Li, Yinyu Ye

Second-Order Non-Stationary Online Learning for Regression

The goal of a learner, in standard online learning, is to have the cumulative loss not much larger compared with the best-performing function from some fixed class. Numerous algorithms were shown to have this gap arbitrarily close to zero,…

Machine Learning · Computer Science 2013-03-04 Nina Vaits, Edward Moroshko, Koby Crammer

Infrequent Resolving Algorithm for Online Linear Programming

Online linear programming (OLP) has gained significant attention from both researchers and practitioners due to its extensive applications, such as online auctions, network revenue management, order fulfillment and advertising. Existing OLP…

Data Structures and Algorithms · Computer Science 2025-11-18 Guokai Li, Zizhuo Wang, Jingwei Zhang

Stochastic Optimization for Machine Learning

It has been found that stochastic algorithms often find good solutions much more rapidly than inherently-batch approaches. Indeed, a very useful rule of thumb is that often, when solving a machine learning problem, an iterative technique…

Machine Learning · Computer Science 2013-08-19 Andrew Cotter

Online Linear Programming with Replenishment

We study an online linear programming (OLP) model in which inventory is not provided upfront but instead arrives gradually through an exogenous stochastic replenishment process. This replenishment-based formulation captures operational…

Optimization and Control · Mathematics 2026-01-22 Yuze Chen, Yuan Zhou, Baichuan Mo, Jie Ying, Yufei Ruan, Zhou Ye

A Single-Sample Polylogarithmic Regret Bound for Nonstationary Online Linear Programming

We study nonstationary Online Linear Programming (OLP), where $n$ orders arrive sequentially with reward-resource consumption pairs that form a sequence of independent, but not necessarily identically distributed, random vectors. At the…

Data Structures and Algorithms · Computer Science 2026-03-17 Haoran Xu, Owen Shen, Peter Glynn, Yinyu Ye, Patrick Jaillet

A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and…

Machine Learning · Computer Science 2015-03-17 Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell

Tracking Performance of Online Stochastic Learners

The utilization of online stochastic algorithms is popular in large-scale learning settings due to their ability to compute updates on the fly, without the need to store and process data in large batches. When a constant step-size is used,…

Optimization and Control · Mathematics 2023-07-19 Stefan Vlaski, Elsa Rizk, Ali H. Sayed

Wait-Less Offline Tuning and Re-solving for Online Decision Making

Online linear programming (OLP) has found broad applications in revenue management and resource allocation. State-of-the-art OLP algorithms achieve low regret by repeatedly solving linear programming (LP) subproblems that incorporate…

Machine Learning · Statistics 2025-11-04 Jingruo Sun, Wenzhi Gao, Ellen Vitercik, Yinyu Ye

ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems

Reinforcement Learning (RL) has achieved state-of-the-art results in domains such as robotics and games. We build on this previous work by applying RL algorithms to a selection of canonical online stochastic optimization problems with a…

Machine Learning · Computer Science 2019-12-03 Bharathan Balaji, Jordan Bell-Masterson, Enes Bilgin, Andreas Damianou, Pablo Moreno Garcia, Arpit Jain, Runfei Luo, Alvaro Maggiar, Balakrishnan Narayanaswamy, Chun Ye

Adaptive Online Planning for Continual Lifelong Learning

We study learning control in an online reset-free lifelong learning scenario, where mistakes can compound catastrophically into the future and the underlying dynamics of the environment may change. Traditional model-free policy learning…

Machine Learning · Computer Science 2020-06-30 Kevin Lu, Igor Mordatch, Pieter Abbeel

A Dynamic Near-Optimal Algorithm for Online Linear Programming

A natural optimization model that formulates many online resource allocation and revenue management problems is the online linear program (LP) in which the constraint matrix is revealed column by column along with the corresponding…

Data Structures and Algorithms · Computer Science 2014-04-10 Shipra Agrawal, Zizhuo Wang, Yinyu Ye

Second-Order Stochastic Optimization for Machine Learning in Linear Time

First-order stochastic methods are the state-of-the-art in large-scale machine learning optimization owing to efficient per-iteration complexity. Second-order methods, while able to provide faster convergence, have been much less explored…

Machine Learning · Statistics 2017-12-01 Naman Agarwal, Brian Bullins, Elad Hazan

Nearly Optimal Linear Convergence of Stochastic Primal-Dual Methods for Linear Programming

There is a recent interest in first-order methods for linear programming (LP). In this paper, we propose a stochastic algorithm using variance reduction and restarts for solving sharp primal-dual problems such as LP. We show that the…

Optimization and Control · Mathematics 2024-01-02 Haihao Lu, Jinwen Yang

Stochastic Learning Rate Optimization in the Stochastic Approximation and Online Learning Settings

In this work, multiplicative stochasticity is applied to the learning rate of stochastic optimization algorithms, giving rise to stochastic learning-rate schemes. In-expectation theoretical convergence results of Stochastic Gradient Descent…

Optimization and Control · Mathematics 2022-03-22 Theodoros Mamalis, Dusan Stipanovic, Petros Voulgaris

Adaptive Resolving Methods for Reinforcement Learning with Function Approximations

Reinforcement learning (RL) problems are fundamental in online decision-making and have been instrumental in finding an optimal policy for Markov decision processes (MDPs). Function approximations are usually deployed to handle large or…

Machine Learning · Computer Science 2025-05-20 Jiashuo Jiang, Yiming Zong, Yinyu Ye

Solving Linear Programs with Fast Online Learning Algorithms

This paper presents fast first-order methods for solving linear programs (LPs) approximately. We adapt online linear programming algorithms to offline LPs and obtain algorithms that avoid any matrix multiplication. We also introduce a…

Optimization and Control · Mathematics 2024-11-07 Wenzhi Gao, Dongdong Ge, Chunlin Sun, Yinyu Ye

A Guide to Stochastic Optimisation for Large-Scale Inverse Problems

Stochastic optimisation algorithms are the de facto standard for machine learning with large amounts of data. Handling only a subset of available data in each optimisation step dramatically reduces the per-iteration computational costs,…

Numerical Analysis · Mathematics 2024-12-19 Matthias J. Ehrhardt, Zeljko Kereta, Jingwei Liang, Junqi Tang

Robust Learning with Private Information

Firms increasingly delegate decisions to learning algorithms in platform markets. Standard algorithms perform well when platform policies are stationary, but firms often face ambiguity about whether policies are stationary or adapt…

Theoretical Economics · Economics 2026-02-11 Kyohei Okumura