Related papers: Aggregation for Gaussian regression

Aggregation for Regression Learning

This paper studies statistical aggregation procedures in regression setting. A motivating factor is the existence of many different methods of estimation, leading to possibly competing estimators. We consider here three different types of…

Statistics Theory · Mathematics 2007-06-13 Florentina Bunea , Alexandre Tsybakov , Marten Wegkamp

Suboptimality of Penalized Empirical Risk Minimization in Classification

Let $\cF$ be a set of $M$ classification procedures with values in $[-1,1]$. Given a loss function, we want to construct a procedure which mimics at the best possible rate the best procedure in $\cF$. This fastest rate is called optimal…

Statistics Theory · Mathematics 2008-12-02 Guillaume Lecué

Variance function estimation in regression model via aggregation procedures

In the regression problem, we consider the problem of estimating the variance function by the means of aggregation methods. We focus on two particular aggregation setting: Model Selection aggregation (MS) and Convex aggregation (C) where…

Machine Learning · Statistics 2021-10-07 Ahmed Zaoui

An Aggregation Method for Sparse Logistic Regression

$L_1$ regularized logistic regression has now become a workhorse of data mining and bioinformatics: it is widely used for many classification problems, particularly ones with many features. However, $L_1$ regularization typically selects…

Machine Learning · Statistics 2015-02-12 Zhe Liu

Minimax Optimal Bayesian Aggregation

It is generally believed that ensemble approaches, which combine multiple algorithms or models, can outperform any single algorithm at machine learning tasks, such as prediction. In this paper, we propose Bayesian convex and linear…

Statistics Theory · Mathematics 2014-03-07 Yun Yang , David B. Dunson

Optimal rates of aggregation in classification under low noise assumption

In the same spirit as Tsybakov (2003), we define the optimality of an aggregation procedure in the problem of classification. Using an aggregate with exponential weights, we obtain an optimal rate of convex aggregation for the hinge risk…

Statistics Theory · Mathematics 2007-12-04 Guillaume Lecué

Generalized Robust Bayesian Committee Machine for Large-scale Gaussian Process Regression

In order to scale standard Gaussian process (GP) regression to large-scale datasets, aggregation models employ factorized training process and then combine predictions from distributed experts. The state-of-the-art aggregation models,…

Machine Learning · Statistics 2018-06-05 Haitao Liu , Jianfei Cai , Yi Wang , Yew-Soon Ong

Optimal exponential bounds for aggregation of estimators for the Kullback-Leibler loss

We study the problem of model selection type aggregation with respect to the Kullback-Leibler divergence for various probabilistic models. Rather than considering a convex combination of the initial estimators $f_1, \ldots, f_N$, our…

Statistics Theory · Mathematics 2016-01-22 Cristina Butucea , Jean-François Delmas , Anne Dutfoy , Richard Fischer

Gradient Sampling Methods with Inexact Subproblem Solutions and Gradient Aggregation

Gradient sampling (GS) has proved to be an effective methodology for the minimization of objective functions that may be nonconvex and/or nonsmooth. The most computationally expensive component of a contemporary GS method is the need to…

Optimization and Control · Mathematics 2021-08-10 Frank E. Curtis , Minhan Li

Consensual Aggregation on Random Projected High-dimensional Features for Regression

In this paper, we present a study of a kernel-based consensual aggregation on randomly projected high-dimensional features of predictions for regression. The aggregation scheme is composed of two steps: the high-dimensional features of…

Machine Learning · Statistics 2022-04-07 Sothea Has

Adaptive Minimax Estimation over Sparse $\ell_q$-Hulls

Given a dictionary of $M_n$ initial estimates of the unknown true regression function, we aim to construct linearly aggregated estimators that target the best performance among all the linear combinations under a sparse $q$-norm ($0 \leq q…

Statistics Theory · Mathematics 2012-01-16 Zhan Wang , Sandra Paterlini , Frank Gao , Yuhong Yang

Optimal bounds for aggregation of affine estimators

We study the problem of aggregation of estimators when the estimators are not independent of the data used for aggregation and no sample splitting is allowed. If the estimators are deterministic vectors, it is well known that the minimax…

Statistics Theory · Mathematics 2018-03-01 Pierre C. Bellec

Empirical risk minimization is optimal for the convex aggregation problem

Let $F$ be a finite model of cardinality $M$ and denote by $\operatorname {conv}(F)$ its convex hull. The problem of convex aggregation is to construct a procedure having a risk as close as possible to the minimal risk over $\operatorname…

Statistics Theory · Mathematics 2013-12-17 Guillaume Lecué

Randomized maximum-contrast selection: subagging for large-scale regression

We introduce a very general method for sparse and large-scale variable selection. The large-scale regression settings is such that both the number of parameters and the number of samples are extremely large. The proposed method is based on…

Statistics Theory · Mathematics 2019-07-31 Jelena Bradic

Aggregation Models with Optimal Weights for Distributed Gaussian Processes

Gaussian process (GP) models have received increasing attention in recent years due to their superb prediction accuracy and modeling flexibility. To address the computational burdens of GP models for large-scale datasets, distributed…

Machine Learning · Statistics 2026-02-11 Haoyuan Chen , Rui Tuo

Recursive Aggregation of Estimators by Mirror Descent Algorithm with Averaging

We consider a recursive algorithm to construct an aggregated estimator from a finite number of base decision rules in the classification problem. The estimator approximately minimizes a convex risk functional under the l1-constraint. It is…

Statistics Theory · Mathematics 2007-06-13 Anatoli Juditsky , Alexander Nazin , Alexandre Tsybakov , Nicolas Vayatis

Aggregating Correlated Estimations with (Almost) no Training

Many decision problems cannot be solved exactly and use several estimation algorithms that assign scores to the different available options. The estimation errors can have various correlations, from low (e.g. between two very different…

Machine Learning · Computer Science 2023-09-06 Theo Delemazure , François Durand , Fabien Mathieu

Ensemble Methods for Convex Regression with Applications to Geometric Programming Based Circuit Design

Convex regression is a promising area for bridging statistical estimation and deterministic convex optimization. New piecewise linear convex regression methods are fast and scalable, but can have instability when used to approximate…

Machine Learning · Computer Science 2012-06-22 Lauren Hannah , David Dunson

Mitigating the Participation Bias by Balancing Extreme Ratings

Rating aggregation plays a crucial role in various fields, such as product recommendations, hotel rankings, and teaching evaluations. However, traditional averaging methods can be affected by participation bias, where some raters do not…

Machine Learning · Computer Science 2025-02-07 Yongkang Guo , Yuqing Kong , Jialiang Liu

Adaptive density estimation for clustering with Gaussian mixtures

Gaussian mixture models are widely used to study clustering problems. These model-based clustering methods require an accurate estimation of the unknown data density by Gaussian mixtures. In Maugis and Michel (2009), a penalized maximum…

Statistics Theory · Mathematics 2015-03-19 Maugis Cathy , Michel Bertrand