Related papers: Bootstrap Model Aggregation for Distributed Statis…

200 papers

Model averaging has gained significant attention in recent years due to its ability to fuse information from different models. The critical challenge in frequentist model averaging is the choice of weight vector. The bootstrap method,…

Methodology · Statistics 2024-12-10 Minghui Song, Guohua Zou, Alan T. K. Wan

Estimating nonlinear functionals of probability distributions from samples is a fundamental statistical problem. The "plug-in" estimator obtained by applying the target functional to the empirical distribution of samples is biased.…

Statistics Theory · Mathematics 2026-02-20 Florian Schäfer

We consider a distributed estimation method in a setting with heterogeneous streams of correlated data distributed across nodes in a network. In the considered approach, linear models are estimated locally (i.e., with only local data)…

Machine Learning · Computer Science 2021-02-11 Lingzhou Hong, Alfredo Garcia, Ceyhun Eksin

Statistical multispecies models of multiarea marine ecosystems use a variety of data sources to estimate parameters using composite or weighted likelihood functions with associated weighting issues and questions on how to obtain variance…

Applications · Statistics 2012-02-16 Lorna Taylor, Verena M. Trenkel, Vojtech Kupca, Gunnar Stefansson

This article introduces an iterative distributed computing estimator for the multinomial logistic regression model with large choice sets. Compared to the maximum likelihood estimator, the proposed iterative distributed estimator achieves…

Econometrics · Economics 2024-12-03 Yanqin Fan, Yigit Okar, Xuetao Shi

Accurate noise modelling is important for training of deep learning reconstruction algorithms. While noise models are well known for traditional imaging techniques, the noise distribution of a novel sensor may be difficult to determine a…

Machine Learning · Computer Science 2018-07-11 Felix Horger, Tobias Würfl, Vincent Christlein, Andreas Maier

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu, Alexander Ihler

In stochastic simulation, input uncertainty refers to the output variability arising from the statistical noise in specifying the input models. This uncertainty can be measured by a variance contribution in the output, which, in the…

Methodology · Statistics 2021-05-20 Henry Lam, Huajie Qian

We study the problem of selecting limited features to observe such that models trained on them can perform well simultaneously across multiple subpopulations. This problem has applications in settings where collecting each feature is…

Machine Learning · Computer Science 2025-10-27 Maitreyi Swaroop, Tamar Krishnamurti, Bryan Wilder

We analyze gradient descent with randomly weighted data points in a linear regression model, under a generic weighting distribution. This includes various forms of stochastic gradient descent, importance sampling, but also extends to…

Machine Learning · Statistics 2025-12-12 Gabriel Clara, Yazan Mash'al

We present some new density estimation algorithms obtained by bootstrap aggregation like Bagging. Our algorithms are analyzed and empirically compared to other methods found in the statistical literature, like stacking and boosting for…

Methodology · Statistics 2012-07-23 Mathias Bourel, Badih Ghattas

Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties…

Machine Learning · Computer Science 2021-08-13 Benjamin Gravell, Tyler Summers

Estimating the mixing density of a latent mixture model is an important task in signal processing. Nonparametric maximum likelihood estimation is one popular approach to this problem. If the latent variable distribution is assumed to be…

Methodology · Statistics 2024-03-01 Shijie Wang, Minsuk Shin, Ray Bai

The generalized linear model (GLM) constitutes a large class of models and essentially extends ordinary linear regression by connecting the mean of the response variable with the covariate through appropriate link functions. On the other…

Methodology · Statistics 2026-02-03 Mayukh Choudhury, Debraj Das

Distributed learning paradigms, such as federated and decentralized learning, allow for the coordination of models across a collection of agents, and without the need to exchange raw data. Instead, agents compute model updates locally based…

Machine Learning · Computer Science 2022-04-04 Stefan Vlaski, Christian Schroth, Michael Muma, Abdelhak M. Zoubir

Accurate statistical inference in logistic regression models remains a critical challenge when the ratio between the number of parameters and sample size is not negligible. This is because approximations based on either classical asymptotic…

Methodology · Statistics 2022-08-19 Qian Zhao, Emmanuel J. Candes

An approach to distributed machine learning is to train models on local datasets and aggregate these models into a single, stronger model. A popular instance of this form of parallelization is federated learning, where the nodes…

Machine Learning · Computer Science 2019-11-19 Linara Adilova, Julia Rosenzweig, Michael Kamp

This paper studies the problem of estimation from relative measurements in a graph, in which a vector indexed over the nodes has to be reconstructed from pairwise measurements of differences between its components associated to nodes…

Systems and Control · Computer Science 2018-07-27 Chiara Ravazzi, Nelson P. K. Chan, Paolo Frasca

This paper introduces smoothed pseudo-population bootstrap methods for the purposes of variance estimation and the construction of confidence intervals for finite population quantiles. In an i.i.d. context, it has been shown that resampling…

Methodology · Statistics 2025-09-30 Vanessa McNealis, Christian Léger

Non-probability sampling, for example in the form of online panels, has become a fast and cheap method to collect data. While reliable inference tools are available for classical probability samples, non-probability samples can yield…

Methodology · Statistics 2022-04-05 Gerhard Tutz