English
Related papers

Related papers: PopulAtion Parameter Averaging (PAPA)

200 papers

The performance of deep neural networks is enhanced by ensemble methods, which average the output of several models. However, this comes at an increased cost at inference. Weight averaging methods aim at balancing the generalization of…

Machine Learning · Computer Science 2024-05-29 Louis Fournier , Adel Nabli , Masih Aminbeidokhti , Marco Pedersoli , Eugene Belilovsky , Edouard Oyallon

In machine learning, ensembles are important tools for improving the model performance. In natural language processing specifically, ensembles boost the performance of a method due to multiple large models available in open source. However,…

Machine Learning · Computer Science 2025-01-30 Polina Proskura , Alexey Zaytsev

Ensembling is now recognized as an effective approach for increasing the predictive performance and calibration of deep networks. We introduce a new approach, Parameter Ensembling by Perturbation (PEP), that constructs an ensemble of…

Machine Learning · Computer Science 2020-10-27 Alireza Mehrtash , Purang Abolmaesumi , Polina Golland , Tina Kapur , Demian Wassermann , William M. Wells

We propose a Policy Averaging Approach (PAA) that synthesizes the strengths of existing approaches to create more reliable, flexible and justifiable policies for stochastic optimization problems. An important component of the PAA is risk…

Applications · Statistics 2025-03-25 Xiangyu Cui , Nicholas G. Hall , Yun Shi , Tianyuan Su

Generalizing causal estimates in randomized experiments to a broader target population is essential for guiding decisions by policymakers and practitioners in the social and biomedical sciences. While recent papers developed various…

Methodology · Statistics 2021-11-03 Melody Huang , Naoki Egami , Erin Hartman , Luke Miratrix

Ensembling is a simple and popular technique for boosting evaluation performance by training multiple models (e.g., with different initializations) and aggregating their predictions. This approach is commonly reserved for the largest…

Machine Learning · Computer Science 2020-05-05 Dan Kondratyuk , Mingxing Tan , Matthew Brown , Boqing Gong

Ensemble models often improve generalization performances in challenging tasks. Yet, traditional techniques based on prediction averaging incur three well-known disadvantages: the computational overhead of training multiple models,…

Machine Learning · Computer Science 2024-06-28 Caglar Demir , Arnab Sharma , Axel-Cyrille Ngonga Ngomo

Ensemble methods are widely employed to improve generalization in machine learning. This has also prompted the adoption of ensemble learning for the knowledge graph embedding (KGE) models in performing link prediction. Typical approaches to…

Machine Learning · Computer Science 2025-10-30 Rupesh Sapkota , Caglar Demir , Arnab Sharma , Axel-Cyrille Ngonga Ngomo

Aggregating multiple learners through an ensemble of models aim to make better predictions by capturing the underlying distribution of the data more accurately. Different ensembling methods, such as bagging, boosting, and stacking/blending,…

Machine Learning · Statistics 2020-11-03 Mohsen Shahhosseini , Guiping Hu , Hieu Pham

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto , P. F. Verdes , H. A. Ceccatto

Population annealing is a powerful sequential Monte Carlo algorithm designed to study the equilibrium behavior of general systems in statistical physics through massive parallelism. In addition to the remarkable scaling capabilities of the…

Statistical Mechanics · Physics 2022-10-19 Paul L. Ebert , Denis Gessert , Martin Weigel

It is common practice to combine deep neural networks into ensembles. These deep ensembles can benefit from the cancellation of errors effect: Errors by ensemble members may average out, leading to better generalization performance than…

Machine Learning · Computer Science 2025-01-07 Nick Hauptvogel , Christian Igel

Population annealing (PA) is a population-based algorithm that is designed for equilibrium simulations of thermodynamic systems with a rough free energy landscape. It is known to be more efficient in doing so than standard Markov chain…

Statistical Mechanics · Physics 2022-04-04 Denis Gessert , Martin Weigel , Wolfhard Janke

Weight averaging is a widely used technique for accelerating training and improving the generalization of deep neural networks (DNNs). While existing approaches like stochastic weight averaging (SWA) rely on pre-set weighting schemes, they…

Machine Learning · Computer Science 2025-02-11 Tao Li , Zhehao Huang , Yingwen Wu , Zhengbao He , Qinghua Tao , Xiaolin Huang , Chih-Jen Lin

Unlabeled data are increasingly prevalent in contemporary economic studies, yet their effective use for improving prediction remains challenging because the outcomes are often costly or even infeasible to observe. Machine learning methods…

Methodology · Statistics 2026-05-12 Fuzhi Xu , Xingyu Yan , Xinyu Zhang

An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more…

Artificial Intelligence · Computer Science 2011-06-02 R. Maclin , D. Opitz

We propose Stochastic Weight Averaging in Parallel (SWAP), an algorithm to accelerate DNN training. Our algorithm uses large mini-batches to compute an approximate solution quickly and then refines it by averaging the weights of multiple…

Machine Learning · Computer Science 2020-01-09 Vipul Gupta , Santiago Akle Serrano , Dennis DeCoste

Despite empirical risk minimization (ERM) is widely applied in the machine learning community, its performance is limited on data with spurious correlation or subpopulation that is introduced by hidden attributes. Existing literature…

Machine Learning · Computer Science 2024-12-18 Hongyu Shen , Zhizhen Zhao

We propose a new simple procedure called Population-Mean-Based Aggregation (PMBA) that enables a principal to "aggregate" information about an unknown state of the world from agents without understanding the information structure among…

Theoretical Economics · Economics 2026-04-29 Yi-Chun Chen , Manuel Mueller-Frank , Mallesh M Pai

Clinical study populations often differ meaningfully from the broader populations to which results are intended to generalize. Weighting methods such as inverse probability of sampling weights (IPSW) reweight study participants to resemble…

Methodology · Statistics 2025-12-02 William Stewart , Carly L. Brantner , Elizabeth A. Stuart , Laine Thomas
‹ Prev 1 2 3 10 Next ›