English
Related papers

Related papers: Improved Kernel Alignment Regret Bound for Online …

200 papers

In this paper, we improve the regret bound for online kernel selection under bandit feedback. Previous algorithm enjoys a $O((\Vert f\Vert^2_{\mathcal{H}_i}+1)K^{\frac{1}{3}}T^{\frac{2}{3}})$ expected bound for Lipschitz loss functions. We…

Machine Learning · Computer Science 2023-03-24 Junfan Li , Shizhong Liao

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of…

Machine Learning · Statistics 2025-01-07 Wenzhi Gao , Dongdong Ge , Chenyu Xue , Chunlin Sun , Yinyu Ye

We present a generalization of the adversarial linear bandits framework, where the underlying losses are kernel functions (with an associated reproducing kernel Hilbert space) rather than linear functions. We study a version of the…

Machine Learning · Statistics 2018-02-28 Aldo Pacchiano , Niladri S. Chatterji , Peter L. Bartlett

This paper considers online convex optimization over a complicated constraint set, which typically consists of multiple functional constraints and a set constraint. The conventional online projection algorithm (Zinkevich, 2003) can be…

Optimization and Control · Mathematics 2020-05-19 Hao Yu , Michael J. Neely

The trade-off between regret and computational cost is a fundamental problem for online kernel regression, and previous algorithms worked on the trade-off can not keep optimal regret bounds at a sublinear computational complexity. In this…

Machine Learning · Computer Science 2023-06-16 Junfan Li , Shizhong Liao

In online inverse linear optimization, a learner observes time-varying sets of feasible actions and an agent's optimal actions, selected by solving linear optimization over the feasible actions. The learner sequentially makes predictions of…

Machine Learning · Computer Science 2025-05-23 Shinsaku Sakaue , Taira Tsuchiya , Han Bao , Taihei Oki

Online kernel selection is a fundamental problem of online kernel methods.In this paper,we study online kernel selection with memory constraint in which the memory of kernel selection and online prediction procedures is limited to a fixed…

Machine Learning · Computer Science 2025-03-25 Junfan Li , Shizhong Liao

We study a generalization of the problem of online learning in adversarial linear contextual bandits by incorporating loss functions that belong to a reproducing kernel Hilbert space, which allows for a more flexible modeling of complex…

Machine Learning · Statistics 2023-10-04 Gergely Neu , Julia Olkhovskaya , Sattar Vakili

We are interested in a framework of online learning with kernels for low-dimensional but large-scale and potentially adversarial datasets. We study the computational and theoretical performance of online variations of kernel Ridge…

Machine Learning · Statistics 2019-05-30 Rémi Jézéquel , Pierre Gaillard , Alessandro Rudi

We present an efficient second-order algorithm with $\tilde{O}(\frac{1}{\eta}\sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by…

Machine Learning · Computer Science 2018-01-19 Alina Beygelzimer , Francesco Orabona , Chicheng Zhang

We revisit the problem of \textit{online linear optimization} in case the set of feasible actions is accessible through an approximated linear optimization oracle with a factor $\alpha$ multiplicative approximation guarantee. This setting…

Machine Learning · Computer Science 2017-09-12 Dan Garber

Consider an online convex optimization problem where the loss functions are self-concordant barriers, smooth relative to a convex function $h$, and possibly non-Lipschitz. We analyze the regret of online mirror descent with $h$. Then, based…

Machine Learning · Statistics 2023-09-22 Chung-En Tsai , Hao-Chung Cheng , Yen-Huan Li

We study Online Convex Optimization (OCO) with adversarial constraints, where an online algorithm must make sequential decisions to minimize both convex loss functions and cumulative constraint violations. We focus on a setting where the…

Machine Learning · Statistics 2025-03-14 Jordan Lekeufack , Michael I. Jordan

In this paper we propose a framework for solving constrained online convex optimization problem. Our motivation stems from the observation that most algorithms proposed for online convex optimization require a projection onto the convex set…

Machine Learning · Computer Science 2012-10-01 Mehrdad Mahdavi , Rong Jin , Tianbao Yang

We present an adaptive online gradient descent algorithm to solve online convex optimization problems with long-term constraints , which are constraints that need to be satisfied when accumulated over a finite number of rounds T , but can…

Machine Learning · Statistics 2015-12-24 Rodolphe Jenatton , Jim Huang , Cédric Archambeau

In this paper, we investigate the online non-convex optimization problem which generalizes the classic {online convex optimization problem by relaxing the convexity assumption on the cost function. For this type of problem, the classic…

Machine Learning · Computer Science 2017-09-14 Lin Yang , Cheng Tan , Wing Shing Wong

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving,…

Machine Learning · Computer Science 2014-05-22 H. Brendan McMahan , Francesco Orabona

In the convex optimization approach to online regret minimization, many methods have been developed to guarantee a $O(\sqrt{T})$ bound on regret for subdifferentiable convex loss functions with bounded subgradients, by using a reduction to…

Machine Learning · Computer Science 2016-09-20 Arthur Flajolet , Patrick Jaillet

We resolve an open question from (Christiano, 2014b) posed in COLT'14 regarding the optimal dependency of the regret achievable for online local learning on the size of the label set. In this framework the algorithm is shown a pair of items…

Machine Learning · Computer Science 2015-08-25 Pranjal Awasthi , Moses Charikar , Kevin A. Lai , Andrej Risteski

We consider the adversarial convex bandit problem and we build the first $\mathrm{poly}(T)$-time algorithm with $\mathrm{poly}(n) \sqrt{T}$-regret for this problem. To do so we introduce three new ideas in the derivative-free optimization…

Machine Learning · Computer Science 2016-07-19 Sébastien Bubeck , Ronen Eldan , Yin Tat Lee
‹ Prev 1 2 3 10 Next ›