Related papers: Reducing Sampling Ratios Improves Bagging in Spars…

A Characterization of Mean Squared Error for Estimator with Bagging

Bagging can significantly improve the generalization performance of unstable machine learning algorithms such as trees or neural networks. Though bagging is now widely used in practice and many empirical studies have explored its behavior,…

Machine Learning · Computer Science 2019-08-08 Martin Mihelich, Charles Dognin, Yan Shu, Michael Blot

Bagging in overparameterized learning: Risk characterization and risk monotonization

Bagging is a commonly used ensemble technique in statistics and machine learning to improve the performance of prediction procedures. In this paper, we study the prediction risk of variants of bagged predictors under the proportional…

Statistics Theory · Mathematics 2023-10-26 Pratik Patil, Jin-Hong Du, Arun Kumar Kuchibhotla

An Aggregation Method for Sparse Logistic Regression

$L_1$ regularized logistic regression has now become a workhorse of data mining and bioinformatics: it is widely used for many classification problems, particularly ones with many features. However, $L_1$ regularization typically selects…

Machine Learning · Statistics 2015-02-12 Zhe Liu

On The Sparse Bayesian Learning Of Linear Models

This work is a re-examination of the sparse Bayesian learning (SBL) of linear regression models of Tipping (2001) in a high-dimensional setting. We propose a hard-thresholded version of the SBL estimator that achieves, for orthogonal design…

Methodology · Statistics 2015-02-12 Yves Atchade, Chia Chye Yee

Bounding Optimality Gap in Stochastic Optimization via Bagging: Statistical Efficiency and Stability

We study a statistical method to estimate the optimal value, and the optimality gap of a given solution for stochastic optimization as an assessment of the solution quality. Our approach is based on bootstrap aggregating, or bagging,…

Optimization and Control · Mathematics 2022-12-06 Henry Lam, Huajie Qian

Subbagging Variable Selection for Big Data

This article introduces a subbagging (subsample aggregating) approach for variable selection in regression within the context of big data. The proposed subbagging approach not only ensures that variable selection is scalable given the…

Methodology · Statistics 2025-03-10 Xian Li, Xuan Liang, Tao Zou

Feature Adaptation for Sparse Linear Regression

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner, Frederic Koehler, Raghu Meka, Dhruv Rohatgi

When can we improve on sample average approximation for stochastic optimization?

We explore the performance of sample average approximation in comparison with several other methods for stochastic optimization when there is information available on the underlying true probability distribution. The methods we evaluate are…

Machine Learning · Computer Science 2019-07-22 Eddie Anderson, Harrison Nguyen

Cascade Bagging for Accuracy Prediction with Few Training Samples

An accuracy predictor is trained to predict the validation accuracy of a network from its architecture encoding. It can effectively assist in designing networks and improving Neural Architecture Search (NAS) efficiency. However, a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Ruyi Zhang, Ziwei Yang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

Sparse Linear Regression With Missing Data

This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the…

Machine Learning · Statistics 2015-03-31 Ravi Ganti, Rebecca M. Willett

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

This paper provides a statistical analysis of high-dimensional batch Reinforcement Learning (RL) using sparse linear function approximation. When there is a large number of candidate features, our result sheds light on the fact that…

Machine Learning · Computer Science 2020-11-10 Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang

Stability via resampling: statistical problems beyond the real line

Model averaging techniques based on resampling methods (such as bootstrapping or subsampling) have been utilized across many areas of statistics, often with the explicit goal of promoting stability in the resulting output. We provide a…

Statistics Theory · Mathematics 2024-05-28 Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

JOBS: Joint-Sparse Optimization from Bootstrap Samples

Classical signal recovery based on $\ell_1$ minimization solves the least squares problem with all available measurements via sparsity-promoting regularization. In practice, it is often the case that not all measurements are available or…

Machine Learning · Statistics 2018-12-12 Luoluo Liu, Sang Peter Chin, Trac D. Tran

Measuring the Algorithmic Convergence of Randomized Ensembles: The Regression Setting

When randomized ensemble methods such as bagging and random forests are implemented, a basic question arises: Is the ensemble large enough? In particular, the practitioner desires a rigorous guarantee that a given ensemble will perform…

Machine Learning · Statistics 2019-08-06 Miles E. Lopes, Suofei Wu, Thomas C. M. Lee

Statistical estimation and testing via the sorted L1 norm

We introduce a novel method for sparse regression and variable selection, which is inspired by modern ideas in multiple testing. Imagine we have observations from the linear model $y = X\beta + z$; we then suggest estimating the regression…

Methodology · Statistics 2013-10-30 Malgorzata Bogdan, Ewout van den Berg, Weijie Su, Emmanuel Candes

Enhancing Sparsity by Reweighted L1 Minimization

It is now well understood that (1) it is possible to reconstruct sparse signals exactly from what appear to be highly incomplete sets of linear measurements and (2) that this can be done by constrained L1 minimization. In this paper, we…

Methodology · Statistics 2007-11-13 Emmanuel J. Candes, Michael B. Wakin, Stephen P. Boyd

Aggregating density estimators: an empirical study

We present some new density estimation algorithms obtained by bootstrap aggregation like Bagging. Our algorithms are analyzed and empirically compared to other methods found in the statistical literature, like stacking and boosting for…

Methodology · Statistics 2012-07-23 Mathias Bourel, Badih Ghattas

A replica analysis of under-bagging

Under-bagging (UB), which combines under-sampling and bagging, is a popular ensemble learning method for training classifiers on imbalanced data. Using bagging to reduce the increased variance caused by the reduction in sample size due…

Machine Learning · Statistics 2025-05-19 Takashi Takahashi

Numerical Characterization of Support Recovery in Sparse Regression with Correlated Design

Sparse regression is frequently employed in diverse scientific settings as a feature selection method. A pervasive aspect of scientific data that hampers both feature selection and estimation is the presence of strong correlations between…

Methodology · Statistics 2021-03-25 Ankit Kumar, Sharmodeep Bhattacharyya, Kristofer Bouchard

Bayesian sparse multiple regression for simultaneous rank reduction and variable selection

We develop a Bayesian methodology aimed at simultaneously estimating low-rank and row-sparse matrices in a high-dimensional multiple-response linear regression model. We consider a carefully devised shrinkage prior on the matrix of…

Methodology · Statistics 2019-04-10 Antik Chakraborty, Anirban Bhattacharya, Bani K. Mallick