Related papers: Exponential Weights Algorithms for Selective Learn…

A Regret-Variance Trade-Off in Online Learning

We consider prediction with expert advice for strongly convex and bounded losses, and investigate trade-offs between regret and "variance" (i.e., squared difference of learner's predictions and best expert predictions). With $K$ experts,…

Machine Learning · Computer Science 2022-06-07 Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

Exponentially Weighted Algorithm for Online Network Resource Allocation with Long-Term Constraints

This paper studies an online optimal resource reservation problem in communication networks with job transfers where the goal is to minimize the reservation cost while maintaining the blocking cost under a certain budget limit. To tackle…

Optimization and Control · Mathematics 2024-05-07 Ahmed Sid-Ali, Ioannis Lambadaris, Yiqiang Q. Zhao, Gennady Shaikhet, Amirhossein Asgharnia

A study on effectiveness of extreme learning machine

Extreme learning machine (ELM), proposed by Huang et al., has been shown to be a promising learning algorithm for single-hidden layer feedforward neural networks (SLFNs). Nevertheless, because of the random choice of input weights and biases, the…

Neural and Evolutionary Computing · Computer Science 2014-09-16 Yuguang Wang, Feilong Cao, Yubo Yuan

Online Isotonic Regression

We consider the online version of the isotonic regression problem. Given a set of linearly ordered points (e.g., on the real line), the learner must predict labels sequentially at adversarially chosen positions and is evaluated by her total…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski, Wouter M. Koolen, Alan Malek

Efficient Logistic Regression with Mixture of Sigmoids

This paper studies the Exponential Weights (EW) algorithm with an isotropic Gaussian prior for online logistic regression. We show that the near-optimal worst-case regret bound $O(d\log(Bn))$ for EW, established by Kakade and Ng (2005)…

Machine Learning · Computer Science 2026-04-06 Federico Di Gennaro, Saptarshi Chakraborty, Nikita Zhivotovskiy

The Many Faces of Exponential Weights in Online Learning

A standard introduction to online learning might place Online Gradient Descent at its center and then proceed to develop generalizations and extensions like Online Mirror Descent and second-order methods. Here we explore the alternative…

Machine Learning · Statistics 2021-08-31 Dirk van der Hoeven, Tim van Erven, Wojciech Kotłowski

Excess Risk Bounds for Exponentially Concave Losses

The overarching goal of this paper is to derive excess risk bounds for learning from exp-concave loss functions in passive and sequential learning settings. Exp-concave loss functions encompass several fundamental problems in machine…

Machine Learning · Computer Science 2014-02-11 Mehrdad Mahdavi, Rong Jin

Efficient Constrained Regret Minimization

Online learning constitutes a mathematical and compelling framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the…

Machine Learning · Computer Science 2012-10-05 Mehrdad Mahdavi, Tianbao Yang, Rong Jin

Fast rates with high probability in exp-concave statistical learning

We present an algorithm for the statistical learning setting with a bounded exp-concave loss in $d$ dimensions that obtains excess risk $O(d \log(1/\delta)/n)$ with probability at least $1 - \delta$. The core technique is to boost the…

Machine Learning · Computer Science 2016-10-17 Nishant A. Mehta

Effective Non-Random Extreme Learning Machine

The Extreme Learning Machine (ELM) is a growing statistical technique widely applied to regression problems. In essence, ELMs are single-layer neural networks where the hidden layer weights are randomly sampled from a specific distribution,…

Machine Learning · Statistics 2025-07-31 Daniela De Canditiis, Fabiano Veglianti

Deep Reinforcement Learning with Weighted Q-Learning

Reinforcement learning algorithms based on Q-learning are driving Deep Reinforcement Learning (DRL) research towards solving complex problems and achieving super-human performance on many of them. Nevertheless, Q-Learning is known to be…

Machine Learning · Computer Science 2022-06-14 Andrea Cini, Carlo D'Eramo, Jan Peters, Cesare Alippi

Exponential weights in multivariate regression and a low-rankness favoring prior

We establish theoretical guarantees for the expected prediction error of the exponential weighting aggregate in the case of multivariate regression, that is, when the label vector is multidimensional. We consider the regression model with…

Statistics Theory · Mathematics 2018-06-26 Arnak S. Dalalyan

Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning

Empirical risk minimization (ERM) is the workhorse of machine learning, whether for classification and regression or for off-policy policy learning, but its model-agnostic guarantees can fail when we use adaptively collected data, such as…

Machine Learning · Statistics 2021-06-04 Aurélien Bibaut, Antoine Chambaz, Maria Dimakopoulou, Nathan Kallus, Mark van der Laan

Extractor-Based Time-Space Lower Bounds for Learning

A matrix $M: A \times X \rightarrow \{-1,1\}$ corresponds to the following learning problem: An unknown element $x \in X$ is chosen uniformly at random. A learner tries to learn $x$ from a stream of samples, $(a_1, b_1), (a_2, b_2) \ldots$,…

Machine Learning · Computer Science 2017-08-10 Sumegha Garg, Ran Raz, Avishay Tal

Nested bandits

In many online decision processes, the optimizing agent is called to choose between large numbers of alternatives with many inherent similarities; in turn, these similarities imply closely correlated losses that may confound standard…

Machine Learning · Computer Science 2022-06-22 Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier, Houssam Zenati

Exponentially Weighted Moving Models

An exponentially weighted moving model (EWMM) for a vector time series fits a new data model each time period, based on an exponentially fading loss function on past observed data. The well-known and widely used exponentially weighted…

Computation · Statistics 2024-04-25 Eric Luxenberg, Stephen Boyd

Distribution Learning Meets Graph Structure Sampling

This work establishes a novel link between the problem of PAC-learning high-dimensional graphical models and the task of (efficient) counting and sampling of graph structures, using an online learning framework. We observe that if we apply…

Machine Learning · Computer Science 2025-11-14 Arnab Bhattacharyya, Sutanu Gayen, Philips George John, Sayantan Sen, N. V. Vinodchandran

Extreme Value Policy Optimization for Safe Reinforcement Learning

Ensuring safety is a critical challenge in applying Reinforcement Learning (RL) to real-world scenarios. Constrained Reinforcement Learning (CRL) addresses this by maximizing returns under predefined constraints, typically formulated as the…

Machine Learning · Computer Science 2026-01-21 Shiqing Gao, Yihang Zhou, Shuai Shao, Haoyu Luo, Yiheng Bing, Jiaxin Ding, Luoyi Fu, Xinbing Wang

Exponential Hardness of Reinforcement Learning with Linear Function Approximation

A fundamental question in reinforcement learning theory is: suppose the optimal value functions are linear in given features, can we learn them efficiently? This problem's counterpart in supervised learning, linear regression, can be solved…

Machine Learning · Computer Science 2023-02-28 Daniel Kane, Sihan Liu, Shachar Lovett, Gaurav Mahajan, Csaba Szepesvári, Gellért Weisz

Robust Tracking via Weighted Online Extreme Learning Machine

The tracking method based on the extreme learning machine (ELM) is efficient and effective. ELM randomly generates input weights and biases in the hidden layer, and then computes the output weights by reducing the iterative…

Machine Learning · Computer Science 2018-07-27 Jing Zhang, Huibing Wang, Yonggong Ren