English
Related papers

Related papers: Parameter Expansion and Efficient Inference

200 papers

Many applications require that we learn the parameters of a model from data. EM is a method used to learn the parameters of probabilistic models for which the data for some of the variables in the models is either missing or hidden. There…

Machine Learning · Computer Science 2013-01-30 Luis E. Ortiz , Leslie Pack Kaelbling

The Expectation--Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of…

Machine Learning · Statistics 2022-11-15 Hideitsu Hino , Shotaro Akaho , Noboru Murata

We provide a general theory of the expectation-maximization (EM) algorithm for inferring high dimensional latent variable models. In particular, we make two contributions: (i) For parameter estimation, we propose a novel high dimensional EM…

Machine Learning · Statistics 2015-01-28 Zhaoran Wang , Quanquan Gu , Yang Ning , Han Liu

Parameter estimation in logistic regression is a well-studied problem with the Newton-Raphson method being one of the most prominent optimization techniques used in practice. A number of monotone optimization methods including…

Computation · Statistics 2023-04-11 Nicholas C. Henderson , Zhongzhe Ouyang

Parameter-shared pre-trained language models (PLMs) have emerged as a successful approach in resource-constrained environments, enabling substantial reductions in model storage and memory costs without significant performance compromise.…

Computation and Language · Computer Science 2023-10-20 Weize Chen , Xiaoyue Xu , Xu Han , Yankai Lin , Ruobing Xie , Zhiyuan Liu , Maosong Sun , Jie Zhou

Improved EM strategies, based on the idea of efficient data augmentation (Meng and van Dyk 1997, 1998), are presented for ML estimation of mixture proportions. The resulting algorithms inherit the simplicity, ease of implementation, and…

Computation · Statistics 2010-02-22 Yaming Yu

The rise of deep learning has marked significant progress in fields such as computer vision, natural language processing, and medical imaging, primarily through the adaptation of pre-trained models for specific tasks. Traditional…

Machine Learning · Computer Science 2024-04-25 Charith Chandra Sai Balne , Sreyoshi Bhaduri , Tamoghna Roy , Vinija Jain , Aman Chadha

The convergence of expectation-maximization (EM)-based algorithms typically requires continuity of the likelihood function with respect to all the unknown parameters (optimization variables). The requirement is not met when parameters…

Signal Processing · Electrical Eng. & Systems 2024-04-18 Geethu Joseph

We consider efficient estimation of flexible transformation models with interval-censored data. To reduce the dimension of semi-parametric models, the unknown monotone transformation function is approximated via monotone splines. A…

Methodology · Statistics 2019-12-30 Minggen Lu , Yan Liu , Chin-Shang Li , Jianguo Sun

Abbreviation expansion is a strategy used to speed up communication by limiting the amount of typing and using a language model to suggest expansions. Here we look at personalizing a Large Language Model's (LLM) suggestions based on prior…

Computation and Language · Computer Science 2023-12-25 Katrin Tomanek , Shanqing Cai , Subhashini Venugopalan

Although scaling up the number of trainable parameters in both pre-training and fine-tuning can effectively improve the performance of large language models, it also leads to increased computational overhead. When delving into the parameter…

Computation and Language · Computer Science 2025-06-02 Naibin Gu , Yilong Chen , Zhenyu Zhang , Peng Fu , Zheng Lin , Shuohuan Wang , Yu Sun , Hua Wu , Weiping Wang , Haifeng Wang

Parameter-efficient fine-tuning (PEFT) techniques, such as adapter tuning, aim to fine-tune a pre-trained language model (PLM) using a minimal number of parameters for a specific task or profile. Although adapter tuning provides increased…

Machine Learning · Computer Science 2024-01-30 Namju Kwak , Taesup Kim

The Expectation Maximization (EM) algorithm is a key reference for inference in latent variable models; unfortunately, its computational cost is prohibitive in the large scale learning setting. In this paper, we propose an extension of the…

Machine Learning · Statistics 2020-11-26 Gersende Fort , Eric Moulines , Hoi-To Wai

Expectation Maximization (EM) is among the most popular algorithms for estimating parameters of statistical models. However, EM, which is an iterative algorithm based on the maximum likelihood principle, is generally only guaranteed to find…

Statistics Theory · Mathematics 2016-08-30 Ji Xu , Daniel Hsu , Arian Maleki

Statistical pragmatism embraces all efficient methods in statistical inference. Augmentation of the collected data is used herein to obtain representative population information from a large class of non-representative population's units.…

Statistics Theory · Mathematics 2015-12-04 Yannis G. Yatracos

As fine-tuning becomes impractical at scale, probing is emerging as the preferred evaluation protocol. However, standard linear probing can understate the capability of models whose pre-training optimizes local representations rather than…

The aim of the paper is to examine the computational complexity and algorithmics of enumeration, the task to output all solutions of a given problem, from the point of view of parameterized complexity. First we define formally different…

Computational Complexity · Computer Science 2013-06-11 Nadia Creignou , Arne Meier , Julian-Steffen Müller , Johannes Schmidt , Heribert Vollmer

The EM (Expectation-Maximization) algorithm is regarded as an MM (Majorization-Minimization) algorithm for maximum likelihood estimation of statistical models. Expanding this view, this paper demonstrates that by choosing an appropriate…

Optimization and Control · Mathematics 2026-02-12 Kensuke Asai , Jun-ya Gotoh

Bayesian variable selection methods are powerful techniques for fitting and inferring on sparse high-dimensional linear regression models. However, many are computationally intensive or require restrictive prior distributions on model…

Methodology · Statistics 2023-10-10 Alexander C. McLain , Anja Zgodic , Howard Bondell

Parameter-efficient tuning aims to mitigate the large memory requirements of adapting pretrained language models for downstream tasks. For example, one popular method, prefix-tuning, prepends trainable tokens to sequences while freezing the…

Computation and Language · Computer Science 2023-05-26 Jonathan Li , Will Aitken , Rohan Bhambhoria , Xiaodan Zhu
‹ Prev 1 2 3 10 Next ›