Related papers: Inside-Outside Estimation Meets Dynamic EM

Robust stochastic parsing using the inside-outside algorithm

The paper describes a parser of sequences of (English) part-of-speech labels which utilises a probabilistic grammar trained using the inside-outside algorithm. The initial (meta)grammar is defined by a linguist and further rules compatible…

cmp-lg · Computer Science 2008-02-03 Ted Briscoe, Nick Waegner

Inside-Outside Estimation of a Lexicalized PCFG for German

The paper describes an extensive experiment in inside-outside estimation of a lexicalized probabilistic context free grammar for German verb-final clauses. Grammar and formalism features which make the experiment feasible are described.…

Computation and Language · Computer Science 2007-05-23 Franz Beil, Glenn Carroll, Detlef Prescher, Stefan Riezler, Mats Rooth

Parsing Inside-Out

The inside-outside probabilities are typically used for reestimating Probabilistic Context Free Grammars (PCFGs), just as the forward-backward probabilities are typically used for reestimating HMMs. I show several novel uses, including…

cmp-lg · Computer Science 2007-05-23 Joshua Goodman

Divergence-Based Motivation for Online EM and Combining Hidden Variable Models

Expectation-Maximization (EM) is a prominent approach for parameter estimation of hidden (aka latent) variable models. Given the full batch of data, EM forms an upper-bound of the negative log-likelihood of the model at each iteration and…

Machine Learning · Computer Science 2020-02-24 Ehsan Amid, Manfred K. Warmuth

Geometry of EM and related iterative algorithms

The Expectation-Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of…

Machine Learning · Statistics 2022-11-15 Hideitsu Hino, Shotaro Akaho, Noboru Murata

Bayesian Grammar Induction for Language Modeling

We describe a corpus-based induction algorithm for probabilistic context-free grammars. The algorithm employs a greedy heuristic search within a Bayesian framework, and a post-pass using the Inside-Outside algorithm. We compare the…

cmp-lg · Computer Science 2008-02-03 Stanley F. Chen

Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation

Following the recent success of word embeddings, it has been argued that there is no such thing as an ideal representation for words, as different models tend to capture divergent and often mutually incompatible aspects like…

Computation and Language · Computer Science 2021-12-28 Mikel Artetxe, Gorka Labaka, Iñigo Lopez-Gazpio, Eneko Agirre

A Tutorial on the Expectation-Maximization Algorithm Including Maximum-Likelihood Estimation and EM Training of Probabilistic Context-Free Grammars

The paper gives a brief review of the expectation-maximization algorithm (Dempster 1977) in the comprehensible framework of discrete mathematics. In Section 2, two prominent estimation methods, the relative-frequency estimation and the…

Computation and Language · Computer Science 2007-05-23 Detlef Prescher

Deterministic Approximate EM Algorithm; Application to the Riemann Approximation EM and the Tempered EM

The Expectation Maximisation (EM) algorithm is widely used to optimise non-convex likelihood functions with latent variables. Many authors modified its simple design to fit more specific situations. For instance, the Expectation (E) step…

Statistics Theory · Mathematics 2022-05-03 Thomas Lartigue, Stanley Durrleman, Stéphanie Allassonnière

Convergence of Expectation-Maximization Algorithm with Mixed-Integer Optimization

The convergence of expectation-maximization (EM)-based algorithms typically requires continuity of the likelihood function with respect to all the unknown parameters (optimization variables). The requirement is not met when parameters…

Signal Processing · Electrical Eng. & Systems 2024-04-18 Geethu Joseph

Expectation-Maximization as a Spectrally Governed Relaxation Flow

The expectation-maximization (EM) algorithm combines global monotonicity, local linear convergence, and strong practical robustness, but these features are usually analyzed separately. Global descent is nonlinear, whereas local convergence…

Machine Learning · Statistics 2026-05-11 Qiao Wang

Evaluating Word Embedding Models: Methods and Experimental Results

Extensive evaluation on a large number of word embedding models for language processing applications is conducted in this work. First, we introduce popular word embedding models and discuss desired properties of word models and evaluation…

Computation and Language · Computer Science 2019-07-10 Bin Wang, Angela Wang, Fenxiao Chen, Yuncheng Wang, C.-C. Jay Kuo

Integrating Probabilistic Rules into Neural Networks: A Stochastic EM Learning Algorithm

The EM-algorithm is a general procedure to get maximum likelihood estimates if part of the observations on the variables of a network are missing. In this paper a stochastic version of the algorithm is adapted to probabilistic neural…

Artificial Intelligence · Computer Science 2013-03-26 Gerhard Paass

Accelerating EM: An Empirical Study

Many applications require that we learn the parameters of a model from data. EM is a method used to learn the parameters of probabilistic models for which the data for some of the variables in the models is either missing or hidden. There…

Machine Learning · Computer Science 2013-01-30 Luis E. Ortiz, Leslie Pack Kaelbling

Dynamic Contextualized Word Embeddings

Static word embeddings that represent words by a single vector cannot capture the variability of word meaning in different linguistic and extralinguistic contexts. Building on prior work on contextualized and dynamic word embeddings, we…

Computation and Language · Computer Science 2021-06-09 Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model

The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much…

Statistics Theory · Mathematics 2023-07-24 Ning Wang, Xin Zhang, Qing Mai

Parameter Learning of Logic Programs for Symbolic-Statistical Modeling

We propose a logical/mathematical framework for statistical parameter learning of parameterized logic programs, i.e. definite clause programs containing probabilistic facts with a parameterized distribution. It extends the traditional least…

Artificial Intelligence · Computer Science 2011-08-26 T. Sato, Y. Kameya

High Dimensional Expectation-Maximization Algorithm: Statistical Optimization and Asymptotic Normality

We provide a general theory of the expectation-maximization (EM) algorithm for inferring high dimensional latent variable models. In particular, we make two contributions: (i) For parameter estimation, we propose a novel high dimensional EM…

Machine Learning · Statistics 2015-01-28 Zhaoran Wang, Quanquan Gu, Yang Ning, Han Liu

Ensemble based approach to quantifying uncertainty of LLM based classifications

The output of Large Language Models (LLMs) is a function of the internal model's parameters and the input provided into the context window. The hypothesis presented here is that under a greedy sampling strategy the variance in the LLM's…

Artificial Intelligence · Computer Science 2025-02-20 Srijith Rajamohan, Ahmed Salhin, Josh Frazier, Rohit Kumar, Yu-Cheng Tsai, Todd Cook

Analyzing Correlations Between Intrinsic and Extrinsic Bias Metrics of Static Word Embeddings With Their Measuring Biases Aligned

We examine the abilities of intrinsic bias metrics of static word embeddings to predict whether Natural Language Processing (NLP) systems exhibit biased behavior. A word embedding is one of the fundamental NLP technologies that represents…

Computation and Language · Computer Science 2024-09-17 Taisei Katô, Yusuke Miyao