English
Related papers

Related papers: Online Regenerative Learning

200 papers

We consider the dynamic resource allocation problem where the decision space is finite-dimensional, yet the solution must satisfy a large or even infinite number of constraints revealed via streaming data or oracle feedback. We model this…

Machine Learning · Computer Science 2026-03-18 Yiming Zong , Jiashuo Jiang

We study an online linear programming (OLP) problem under a random input model in which the columns of the constraint matrix along with the corresponding coefficients in the objective function are generated i.i.d. from an unknown…

Data Structures and Algorithms · Computer Science 2021-04-20 Xiaocheng Li , Yinyu Ye

The goal of a learner, in standard online learning, is to have the cumulative loss not much larger compared with the best-performing function from some fixed class. Numerous algorithms were shown to have this gap arbitrarily close to zero,…

Machine Learning · Computer Science 2013-03-04 Nina Vaits , Edward Moroshko , Koby Crammer

Online linear programming (OLP) has gained significant attention from both researchers and practitioners due to its extensive applications, such as online auction, network revenue management, order fulfillment and advertising. Existing OLP…

Data Structures and Algorithms · Computer Science 2025-11-18 Guokai Li , Zizhuo Wang , Jingwei Zhang

It has been found that stochastic algorithms often find good solutions much more rapidly than inherently-batch approaches. Indeed, a very useful rule of thumb is that often, when solving a machine learning problem, an iterative technique…

Machine Learning · Computer Science 2013-08-19 Andrew Cotter

We study an online linear programming (OLP) model in which inventory is not provided upfront but instead arrives gradually through an exogenous stochastic replenishment process. This replenishment-based formulation captures operational…

Optimization and Control · Mathematics 2026-01-22 Yuze Chen , Yuan Zhou , Baichuan Mo , Jie Ying , Yufei Ruan , Zhou Ye

We study nonstationary Online Linear Programming (OLP), where $n$ orders arrive sequentially with reward-resource consumption pairs that form a sequence of independent, but not necessarily identically distributed, random vectors. At the…

Data Structures and Algorithms · Computer Science 2026-03-17 Haoran Xu , Owen Shen , Peter Glynn , Yinyu Ye , Patrick Jaillet

Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and…

Machine Learning · Computer Science 2015-03-17 Stephane Ross , Geoffrey J. Gordon , J. Andrew Bagnell

The utilization of online stochastic algorithms is popular in large-scale learning settings due to their ability to compute updates on the fly, without the need to store and process data in large batches. When a constant step-size is used,…

Optimization and Control · Mathematics 2023-07-19 Stefan Vlaski , Elsa Rizk , Ali H. Sayed

Online linear programming (OLP) has found broad applications in revenue management and resource allocation. State-of-the-art OLP algorithms achieve low regret by repeatedly solving linear programming (LP) subproblems that incorporate…

Machine Learning · Statistics 2025-11-04 Jingruo Sun , Wenzhi Gao , Ellen Vitercik , Yinyu Ye

Reinforcement Learning (RL) has achieved state-of-the-art results in domains such as robotics and games. We build on this previous work by applying RL algorithms to a selection of canonical online stochastic optimization problems with a…

We study learning control in an online reset-free lifelong learning scenario, where mistakes can compound catastrophically into the future and the underlying dynamics of the environment may change. Traditional model-free policy learning…

Machine Learning · Computer Science 2020-06-30 Kevin Lu , Igor Mordatch , Pieter Abbeel

A natural optimization model that formulates many online resource allocation and revenue management problems is the online linear program (LP) in which the constraint matrix is revealed column by column along with the corresponding…

Data Structures and Algorithms · Computer Science 2014-04-10 Shipra Agrawal , Zizhuo Wang , Yinyu Ye

First-order stochastic methods are the state-of-the-art in large-scale machine learning optimization owing to efficient per-iteration complexity. Second-order methods, while able to provide faster convergence, have been much less explored…

Machine Learning · Statistics 2017-12-01 Naman Agarwal , Brian Bullins , Elad Hazan

There is a recent interest on first-order methods for linear programming (LP). In this paper,we propose a stochastic algorithm using variance reduction and restarts for solving sharp primal-dual problems such as LP. We show that the…

Optimization and Control · Mathematics 2024-01-02 Haihao Lu , Jinwen Yang

In this work, multiplicative stochasticity is applied to the learning rate of stochastic optimization algorithms, giving rise to stochastic learning-rate schemes. In-expectation theoretical convergence results of Stochastic Gradient Descent…

Optimization and Control · Mathematics 2022-03-22 Theodoros Mamalis , Dusan Stipanovic , Petros Voulgaris

Reinforcement learning (RL) problems are fundamental in online decision-making and have been instrumental in finding an optimal policy for Markov decision processes (MDPs). Function approximations are usually deployed to handle large or…

Machine Learning · Computer Science 2025-05-20 Jiashuo Jiang , Yiming Zong , Yinyu Ye

This paper presents fast first-order methods for solving linear programs (LPs) approximately. We adapt online linear programming algorithms to offline LPs and obtain algorithms that avoid any matrix multiplication. We also introduce a…

Optimization and Control · Mathematics 2024-11-07 Wenzhi Gao , Dongdong Ge , Chunlin Sun , Yinyu Ye

Stochastic optimisation algorithms are the de facto standard for machine learning with large amounts of data. Handling only a subset of available data in each optimisation step dramatically reduces the per-iteration computational costs,…

Numerical Analysis · Mathematics 2024-12-19 Matthias J. Ehrhardt , Zeljko Kereta , Jingwei Liang , Junqi Tang

Firms increasingly delegate decisions to learning algorithms in platform markets. Standard algorithms perform well when platform policies are stationary, but firms often face ambiguity about whether policies are stationary or adapt…

Theoretical Economics · Economics 2026-02-11 Kyohei Okumura
‹ Prev 1 2 3 10 Next ›