Related papers: On aggregation for heavy-tailed classes

Aggregation for Regression Learning

This paper studies statistical aggregation procedures in the regression setting. A motivating factor is the existence of many different methods of estimation, leading to possibly competing estimators. We consider here three different types of…

Statistics Theory · Mathematics 2007-06-13 Florentina Bunea, Alexandre Tsybakov, Marten Wegkamp

Optimal learning with $Q$-aggregation

We consider a general supervised learning problem with strongly convex and Lipschitz loss and study the problem of model selection aggregation. In particular, given a finite dictionary of functions (learners) together with the prior, we…

Statistics Theory · Mathematics 2014-02-28 Guillaume Lecué, Philippe Rigollet

Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features

Linear TD($\lambda$) is one of the most fundamental reinforcement learning algorithms for policy evaluation. Previously, convergence rates are typically established under the assumption of linearly independent features, which does not hold…

Machine Learning · Computer Science 2025-10-15 Zixuan Xie, Xinyu Liu, Rohan Chandra, Shangtong Zhang

A naive aggregation algorithm for improving generalization in a class of learning problems

In this brief paper, we present a naive aggregation algorithm for a typical learning problem with expert advice setting, in which the task of improving generalization, i.e., model validation, is embedded in the learning process as a…

Machine Learning · Computer Science 2024-09-09 Getachew K Befekadu

Optimal rates of aggregation in classification under low noise assumption

In the same spirit as Tsybakov (2003), we define the optimality of an aggregation procedure in the problem of classification. Using an aggregate with exponential weights, we obtain an optimal rate of convex aggregation for the hinge risk…

Statistics Theory · Mathematics 2007-12-04 Guillaume Lecué

Deviation optimal learning using greedy Q-aggregation

Given a finite family of functions, the goal of model selection aggregation is to construct a procedure that mimics the function from this family that is the closest to an unknown regression function. More precisely, we consider a general…

Statistics Theory · Mathematics 2012-12-13 Dong Dai, Philippe Rigollet, Tong Zhang

Aggregation for Gaussian regression

This paper studies statistical aggregation procedures in the regression setting. A motivating factor is the existence of many different methods of estimation, leading to possibly competing estimators. We consider here three different types…

Statistics Theory · Mathematics 2009-09-29 Florentina Bunea, Alexandre B. Tsybakov, Marten H. Wegkamp

An optimal unrestricted learning procedure

We study learning problems involving arbitrary classes of functions $F$, distributions $X$ and targets $Y$. Because proper learning procedures, i.e., procedures that are only allowed to select functions in $F$, tend to perform poorly unless…

Machine Learning · Statistics 2018-04-17 Shahar Mendelson

Adaptive Sampling for Convex Regression

In this paper, we introduce the first principled adaptive-sampling procedure for learning a convex function in the $L_\infty$ norm, a problem that arises often in the behavioral and social sciences. We present a function-specific measure of…

Machine Learning · Computer Science 2018-08-28 Max Simchowitz, Kevin Jamieson, Jordan W. Suchow, Thomas L. Griffiths

Adaptive Gradient-Based Meta-Learning Methods

We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential…

Machine Learning · Computer Science 2019-12-10 Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

Federated Learning Aggregation: New Robust Algorithms with Guarantees

Federated Learning has been recently proposed for distributed model training at the edge. The principle of this approach is to aggregate models learned on distributed clients to obtain a new more general "average" model (FedAvg). The…

Machine Learning · Statistics 2022-07-20 Adnan Ben Mansour, Gaia Carenini, Alexandre Duplessis, David Naccache

Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity

Due to the non-smoothness of optimization problems in Machine Learning, generalized smoothness assumptions have been gaining a lot of attention in recent years. One of the most popular assumptions of this type is $(L_0,L_1)$-smoothness…

Optimization and Control · Mathematics 2024-12-30 Eduard Gorbunov, Nazarii Tupitsa, Sayantan Choudhury, Alen Aliev, Peter Richtárik, Samuel Horváth, Martin Takáč

Online Adaptive Methods, Universality and Acceleration

We present a novel method for convex unconstrained optimization that, without any modifications, ensures: (i) accelerated convergence rate for smooth objectives, (ii) standard convergence rate in the general (non-smooth) setting, and (iii)…

Machine Learning · Computer Science 2018-09-11 Kfir Y. Levy, Alp Yurtsever, Volkan Cevher

Suboptimality of Penalized Empirical Risk Minimization in Classification

Let $\mathcal{F}$ be a set of $M$ classification procedures with values in $[-1,1]$. Given a loss function, we want to construct a procedure which mimics at the best possible rate the best procedure in $\mathcal{F}$. This fastest rate is called optimal…

Statistics Theory · Mathematics 2008-12-02 Guillaume Lecué

Fast Rates by Transferring from Auxiliary Hypotheses

In this work we consider the learning setting where, in addition to the training set, the learner receives a collection of auxiliary hypotheses originating from other tasks. We focus on a broad class of ERM-based linear algorithms that can…

Machine Learning · Computer Science 2016-10-19 Ilja Kuzborskij, Francesco Orabona

On the Aggregation of Rules for Knowledge Graph Completion

Rule learning approaches for knowledge graph completion are efficient, interpretable and competitive with purely neural models. The rule aggregation problem is concerned with finding one plausibility score for a candidate fact which was…

Artificial Intelligence · Computer Science 2023-09-04 Patrick Betz, Stefan Lüdtke, Christian Meilicke, Heiner Stuckenschmidt

Approximation and learning by greedy algorithms

We consider the problem of approximating a given element $f$ from a Hilbert space $\mathcal{H}$ by means of greedy algorithms and the application of such procedures to the regression problem in statistical learning theory. We improve on the…

Statistics Theory · Mathematics 2009-09-29 Andrew R. Barron, Albert Cohen, Wolfgang Dahmen, Ronald A. DeVore

Greedy Learning to Optimize with Convergence Guarantees

Learning to optimize is an approach that leverages training data to accelerate the solution of optimization problems. Many approaches use unrolling to parametrize the update step and learn optimal parameters. Although L2O has shown…

Optimization and Control · Mathematics 2025-07-15 Patrick Fahy, Mohammad Golbabaee, Matthias J. Ehrhardt

Learning Options via Compression

Identifying statistical regularities in solutions to some tasks in multi-task reinforcement learning can accelerate the learning of new tasks. Skill learning offers one way of identifying these regularities by decomposing pre-collected…

Machine Learning · Computer Science 2022-12-12 Yiding Jiang, Evan Zheran Liu, Benjamin Eysenbach, Zico Kolter, Chelsea Finn

Convex and Network Flow Optimization for Structured Sparsity

We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of $\ell_2$- or $\ell_\infty$-norms over groups of variables. Whereas much effort has been put in developing fast optimization techniques…

Optimization and Control · Mathematics 2011-10-17 Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach