Related papers: Bootstrap Model Aggregation for Distributed Statis…

Bootstrap Model Averaging

Model averaging has gained significant attention in recent years due to its ability of fusing information from different models. The critical challenge in frequentist model averaging is the choice of weight vector. The bootstrap method,…

Methodology · Statistics 2024-12-10 Minghui Song , Guohua Zou , Alan T. K. Wan

M\"obius inversion and the iterated bootstrap

Estimating nonlinear functionals of probability distributions from samples is a fundamental statistical problem. The "plug-in" estimator obtained by applying the target functional to the empirical distribution of samples is biased.…

Statistics Theory · Mathematics 2026-02-20 Florian Schäfer

Distributed Networked Learning with Correlated Data

We consider a distributed estimation method in a setting with heterogeneous streams of correlated data distributed across nodes in a network. In the considered approach, linear models are estimated locally (i.e., with only local data)…

Machine Learning · Computer Science 2021-02-11 Lingzhou Hong , Alfredo Garcia , Ceyhun Eksin

A bootstrap method for estimating bias and variance in statistical multispecies models using highly disparate data sets

Statistical multispecies models of multiarea marine ecosystems use a variety of data sources to estimate parameters using composite or weighted likelihood functions with associated weighting issues and questions on how to obtain variance…

Applications · Statistics 2012-02-16 Lorna Taylor , Verena M. Trenkel , Vojtech Kupca , Gunnar Stefansson

Iterative Distributed Multinomial Regression

This article introduces an iterative distributed computing estimator for the multinomial logistic regression model with large choice sets. Compared to the maximum likelihood estimator, the proposed iterative distributed estimator achieves…

Econometrics · Economics 2024-12-03 Yanqin Fan , Yigit Okar , Xuetao Shi

Towards Arbitrary Noise Augmentation - Deep Learning for Sampling from Arbitrary Probability Distributions

Accurate noise modelling is important for training of deep learning reconstruction algorithms. While noise models are well known for traditional imaging techniques, the noise distribution of a novel sensor may be difficult to determine a…

Machine Learning · Computer Science 2018-07-11 Felix Horger , Tobias Würfl , Vincent Christlein , Andreas Maier

Distributed Parameter Estimation via Pseudo-likelihood

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu , Alexander Ihler

Subsampling to Enhance Efficiency in Input Uncertainty Quantification

In stochastic simulation, input uncertainty refers to the output variability arising from the statistical noise in specifying the input models. This uncertainty can be measured by a variance contribution in the output, which, in the…

Methodology · Statistics 2021-05-20 Henry Lam , Huajie Qian

Distributionally Robust Feature Selection

We study the problem of selecting limited features to observe such that models trained on them can perform well simultaneously across multiple subpopulations. This problem has applications in settings where collecting each feature is…

Machine Learning · Computer Science 2025-10-27 Maitreyi Swaroop , Tamar Krishnamurti , Bryan Wilder

The Interplay of Statistics and Noisy Optimization: Learning Linear Predictors with Random Data Weights

We analyze gradient descent with randomly weighted data points in a linear regression model, under a generic weighting distribution. This includes various forms of stochastic gradient descent, importance sampling, but also extends to…

Machine Learning · Statistics 2025-12-12 Gabriel Clara , Yazan Mash'al

Aggregating density estimators: an empirical study

We present some new density estimation algorithms obtained by bootstrap aggregation like Bagging. Our algorithms are analyzed and empirically compared to other methods found in the statistical literature, like stacking and boosting for…

Methodology · Statistics 2012-07-23 Mathias Bourel , Badih Ghattas

Robust Learning-Based Control via Bootstrapped Multiplicative Noise

Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties…

Machine Learning · Computer Science 2021-08-13 Benjamin Gravell , Tyler Summers

Fast Bootstrapping Nonparametric Maximum Likelihood for Latent Mixture Models

Estimating the mixing density of a latent mixture model is an important task in signal processing. Nonparametric maximum likelihood estimation is one popular approach to this problem. If the latent variable distribution is assumed to be…

Methodology · Statistics 2024-03-01 Shijie Wang , Minsuk Shin , Ray Bai

Bootstrapping Lasso in Generalized Linear Models

Generalized linear model or GLM constitutes a large class of models and essentially extends the ordinary linear regression by connecting the mean of the response variable with the covariate through appropriate link functions. On the other…

Methodology · Statistics 2026-02-03 Mayukh Choudhury , Debraj Das

Robust and Efficient Aggregation for Distributed Learning

Distributed learning paradigms, such as federated and decentralized learning, allow for the coordination of models across a collection of agents, and without the need to exchange raw data. Instead, agents compute model updates locally based…

Machine Learning · Computer Science 2022-04-04 Stefan Vlaski , Christian Schroth , Michael Muma , Abdelhak M. Zoubir

An Adaptively Resized Parametric Bootstrap for Inference in High-dimensional Generalized Linear Models

Accurate statistical inference in logistic regression models remains a critical challenge when the ratio between the number of parameters and sample size is not negligible. This is because approximations based on either classical asymptotic…

Methodology · Statistics 2022-08-19 Qian Zhao , Emmanuel J. Candes

Information-Theoretic Perspective of Federated Learning

An approach to distributed machine learning is to train models on local datasets and aggregate these models into a single, stronger model. A popular instance of this form of parallelization is federated learning, where the nodes…

Machine Learning · Computer Science 2019-11-19 Linara Adilova , Julia Rosenzweig , Michael Kamp

Distributed estimation from relative measurements of heterogeneous and uncertain quality

This paper studies the problem of estimation from relative measurements in a graph, in which a vector indexed over the nodes has to be reconstructed from pairwise measurements of differences between its components associated to nodes…

Systems and Control · Computer Science 2018-07-27 Chiara Ravazzi , Nelson P. K. Chan , Paolo Frasca

Smoothed pseudo-population bootstrap methods with applications to finite population quantiles

This paper introduces smoothed pseudo-population bootstrap methods for the purposes of variance estimation and the construction of confidence intervals for finite population quantiles. In an i.i.d. context, it has been shown that resampling…

Methodology · Statistics 2025-09-30 Vanessa McNealis , Christian Léger

Probability and Non-Probability Samples: Improving Regression Modeling by Using Data from Different Sources

Non-probability sampling, for example in the form of online panels, has become a fast and cheap method to collect data. While reliable inference tools are available for classical probability samples, non-probability samples can yield…

Methodology · Statistics 2022-04-05 Gerhard Tutz