Related papers: PopulAtion Parameter Averaging (PAPA)

WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average

The performance of deep neural networks is enhanced by ensemble methods, which average the output of several models. However, this comes at an increased cost at inference. Weight averaging methods aim at balancing the generalization of…

Machine Learning · Computer Science 2024-05-29 Louis Fournier, Adel Nabli, Masih Aminbeidokhti, Marco Pedersoli, Eugene Belilovsky, Edouard Oyallon

Beyond Simple Averaging: Improving NLP Ensemble Performance with Topological-Data-Analysis-Based Weighting

In machine learning, ensembles are important tools for improving the model performance. In natural language processing specifically, ensembles boost the performance of a method due to multiple large models available in open source. However,…

Machine Learning · Computer Science 2025-01-30 Polina Proskura, Alexey Zaytsev

PEP: Parameter Ensembling by Perturbation

Ensembling is now recognized as an effective approach for increasing the predictive performance and calibration of deep networks. We introduce a new approach, Parameter Ensembling by Perturbation (PEP), that constructs an ensemble of…

Machine Learning · Computer Science 2020-10-27 Alireza Mehrtash, Purang Abolmaesumi, Polina Golland, Tina Kapur, Demian Wassermann, William M. Wells

Collective Wisdom: Policy Averaging with an Application to the Newsvendor Problem

We propose a Policy Averaging Approach (PAA) that synthesizes the strengths of existing approaches to create more reliable, flexible and justifiable policies for stochastic optimization problems. An important component of the PAA is risk…

Applications · Statistics 2025-03-25 Xiangyu Cui, Nicholas G. Hall, Yun Shi, Tianyuan Su

Leveraging Population Outcomes to Improve the Generalization of Experimental Results

Generalizing causal estimates in randomized experiments to a broader target population is essential for guiding decisions by policymakers and practitioners in the social and biomedical sciences. While recent papers developed various…

Methodology · Statistics 2021-11-03 Melody Huang, Naoki Egami, Erin Hartman, Luke Miratrix

When Ensembling Smaller Models is More Efficient than Single Large Models

Ensembling is a simple and popular technique for boosting evaluation performance by training multiple models (e.g., with different initializations) and aggregating their predictions. This approach is commonly reserved for the largest…

Machine Learning · Computer Science 2020-05-05 Dan Kondratyuk, Mingxing Tan, Matthew Brown, Boqing Gong

Adaptive Stochastic Weight Averaging

Ensemble models often improve generalization performances in challenging tasks. Yet, traditional techniques based on prediction averaging incur three well-known disadvantages: the computational overhead of training multiple models,…

Machine Learning · Computer Science 2024-06-28 Caglar Demir, Arnab Sharma, Axel-Cyrille Ngonga Ngomo

Parameter Averaging in Link Prediction

Ensemble methods are widely employed to improve generalization in machine learning. This has also prompted the adoption of ensemble learning for the knowledge graph embedding (KGE) models in performing link prediction. Typical approaches to…

Machine Learning · Computer Science 2025-10-30 Rupesh Sapkota, Caglar Demir, Arnab Sharma, Axel-Cyrille Ngonga Ngomo

Optimizing Ensemble Weights and Hyperparameters of Machine Learning Models for Regression Problems

Aggregating multiple learners through an ensemble of models aims to make better predictions by capturing the underlying distribution of the data more accurately. Different ensembling methods, such as bagging, boosting, and stacking/blending,…

Machine Learning · Statistics 2020-11-03 Mohsen Shahhosseini, Guiping Hu, Hieu Pham

Neural network ensembles: Evaluation of aggregation algorithms

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto, P. F. Verdes, H. A. Ceccatto

Weighted averages in population annealing: analysis and general framework

Population annealing is a powerful sequential Monte Carlo algorithm designed to study the equilibrium behavior of general systems in statistical physics through massive parallelism. In addition to the remarkable scaling capabilities of the…

Statistical Mechanics · Physics 2022-10-19 Paul L. Ebert, Denis Gessert, Martin Weigel

On Uniform, Bayesian, and PAC-Bayesian Deep Ensembles

It is common practice to combine deep neural networks into ensembles. These deep ensembles can benefit from the cancellation of errors effect: Errors by ensemble members may average out, leading to better generalization performance than…

Machine Learning · Computer Science 2025-01-07 Nick Hauptvogel, Christian Igel

Resampling schemes in population annealing -- numerical results

Population annealing (PA) is a population-based algorithm that is designed for equilibrium simulations of thermodynamic systems with a rough free energy landscape. It is known to be more efficient in doing so than standard Markov chain…

Statistical Mechanics · Physics 2022-04-04 Denis Gessert, Martin Weigel, Wolfhard Janke

Trainable Weight Averaging: Accelerating Training and Improving Generalization

Weight averaging is a widely used technique for accelerating training and improving the generalization of deep neural networks (DNNs). While existing approaches like stochastic weight averaging (SWA) rely on pre-set weighting schemes, they…

Machine Learning · Computer Science 2025-02-11 Tao Li, Zhehao Huang, Yingwen Wu, Zhengbao He, Qinghua Tao, Xiaolin Huang, Chih-Jen Lin

Prediction-Powered Linear Regression: A Balance Between Interpretation and Prediction

Unlabeled data are increasingly prevalent in contemporary economic studies, yet their effective use for improving prediction remains challenging because the outcomes are often costly or even infeasible to observe. Machine learning methods…

Methodology · Statistics 2026-05-12 Fuzhi Xu, Xingyu Yan, Xinyu Zhang

Popular Ensemble Methods: An Empirical Study

An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more…

Artificial Intelligence · Computer Science 2011-06-02 R. Maclin, D. Opitz

Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well

We propose Stochastic Weight Averaging in Parallel (SWAP), an algorithm to accelerate DNN training. Our algorithm uses large mini-batches to compute an approximate solution quickly and then refines it by averaging the weights of multiple…

Machine Learning · Computer Science 2020-01-09 Vipul Gupta, Santiago Akle Serrano, Dennis DeCoste

Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Although empirical risk minimization (ERM) is widely applied in the machine learning community, its performance is limited on data with spurious correlation or subpopulation that is introduced by hidden attributes. Existing literature…

Machine Learning · Computer Science 2024-12-18 Hongyu Shen, Zhizhen Zhao

The Wisdom of the Crowd and Higher-Order Beliefs

We propose a new simple procedure called Population-Mean-Based Aggregation (PMBA) that enables a principal to "aggregate" information about an unknown state of the world from agents without understanding the information structure among…

Theoretical Economics · Economics 2026-04-29 Yi-Chun Chen, Manuel Mueller-Frank, Mallesh M Pai

Weight a Minute: Understanding Variability in PATE Estimates Across Target Populations

Clinical study populations often differ meaningfully from the broader populations to which results are intended to generalize. Weighting methods such as inverse probability of sampling weights (IPSW) reweight study participants to resemble…

Methodology · Statistics 2025-12-02 William Stewart, Carly L. Brantner, Elizabeth A. Stuart, Laine Thomas