Related papers: A Partial EM Algorithm for Clustering White Breads

On the EM-Tau algorithm: a new EM-style algorithm with partial E-steps

The EM algorithm is one of many important tools in the field of statistics. While often used for imputing missing data, its widespread applications include other common statistical tasks, such as clustering. In clustering, the EM algorithm…

Machine Learning · Statistics 2017-11-22 Val Andrei Fajardo , Jiaxi Liang

Learning Shared Kernel Models: the Shared Kernel EM algorithm

Expectation maximisation (EM) is an unsupervised learning method for estimating the parameters of a finite mixture distribution. It works by introducing "hidden" or "latent" variables via Baum's auxiliary function $Q$ that allow the joint…

Machine Learning · Computer Science 2022-05-19 Graham W. Pulford

On the Semi-supervised Expectation Maximization

The Expectation Maximization (EM) algorithm is widely used as an iterative modification to maximum likelihood estimation when the data is incomplete. We focus on a semi-supervised case to learn the model from labeled and unlabeled samples.…

Machine Learning · Computer Science 2023-01-26 Erixhen Sula , Lizhong Zheng

Ten Steps of EM Suffice for Mixtures of Two Gaussians

The Expectation-Maximization (EM) algorithm is a widely used method for maximum likelihood estimation in models with latent variables. For estimating mixtures of Gaussians, its iteration can be viewed as a soft version of the k-means…

Machine Learning · Statistics 2017-06-06 Constantinos Daskalakis , Christos Tzamos , Manolis Zampetakis

EM for Mixture of Linear Regression with Clustered Data

Modern data-driven and distributed learning frameworks deal with diverse massive data generated by clients spread across heterogeneous environments. Indeed, data heterogeneity is a major bottleneck in scaling up many distributed learning…

Machine Learning · Computer Science 2023-08-23 Amirhossein Reisizadeh , Khashayar Gatmiry , Asuman Ozdaglar

Semiparametric Elliptical Mixture Clustering for High-Dimensional Data

Clustering high-dimensional data is especially challenging when cluster distributions are heavy tailed and only approximately elliptical. Existing high-dimensional methods are largely built for Gaussian or other light-tailed models, whereas…

Methodology · Statistics 2026-05-12 Long Feng , Dan Zhuang

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models with missing data. The study of such problems in the likelihood tradition has thus far been restricted to an EM algorithm method using full numerical integration.…

Methodology · Statistics 2021-11-16 Dale S. Kim

Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model

The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much…

Statistics Theory · Mathematics 2023-07-24 Ning Wang , Xin Zhang , Qing Mai

Identifying the number of clusters in discrete mixture models

Research on cluster analysis for categorical data continues to develop, with new clustering algorithms being proposed. However, in this context, the determination of the number of clusters is rarely addressed. In this paper, we propose a…

Methodology · Statistics 2014-09-29 Cláudia Silvestre , Margarida G. M. S. Cardoso , Mário A. T. Figueiredo

Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

The expectation-maximization (EM) algorithm is an iterative method for finding maximum likelihood estimates when data are incomplete or are treated as being incomplete. The EM algorithm and its variants are commonly used for parameter…

Computation · Statistics 2013-06-26 Ryan P. Browne , Sanjeena Subedi , Paul McNicholas

Quantum Expectation-Maximization for Gaussian Mixture Models

The Expectation-Maximization (EM) algorithm is a fundamental tool in unsupervised machine learning. It is often used as an efficient way to solve Maximum Likelihood (ML) estimation problems, especially for models with latent variables. It…

Quantum Physics · Physics 2020-07-08 Iordanis Kerenidis , Alessandro Luongo , Anupam Prakash

Unsupervised learning of regression mixture models with unknown number of components

Regression mixture models are widely studied in statistics, machine learning and data analysis. Fitting regression mixtures is challenging and is usually performed by maximum likelihood by using the expectation-maximization (EM) algorithm.…

Methodology · Statistics 2014-09-25 Faicel Chamroukhi

Analysis of a Generalized Expectation-Maximization Algorithm for Gaussian Mixture Models: A Control Systems Perspective

The Expectation-Maximization (EM) algorithm is one of the most popular methods used to solve the problem of parametric distribution-based clustering in unsupervised learning. In this paper, we propose to analyze a generalized EM (GEM)…

Optimization and Control · Mathematics 2021-05-19 Sarthak Chatterjee , Orlando Romero , Sérgio Pequito

Text-Based Product Matching -- Semi-Supervised Clustering Approach

Matching identical products present in multiple product feeds constitutes a crucial element of many tasks of e-commerce, such as comparing product offerings, dynamic price optimization, and selecting the assortment personalized for the…

Databases · Computer Science 2024-02-16 Alicja Martinek , Szymon Łukasik , Amir H. Gandomi

Geometry of EM and related iterative algorithms

The Expectation--Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of…

Machine Learning · Statistics 2022-11-15 Hideitsu Hino , Shotaro Akaho , Noboru Murata

Learning Consumer Preferences from Bundle Sales Data

Product bundling is a common selling mechanism used in online retailing. To set profitable bundle prices, the seller needs to learn consumer preferences from the transaction data. When customers purchase bundles or multiple products,…

Machine Learning · Statistics 2022-09-13 Ningyuan Chen , Setareh Farajollahzadeh , Guan Wang

Algorithme EM r\'egularis\'e

Expectation-Maximization (EM) algorithm is a widely used iterative algorithm for computing maximum likelihood estimate when dealing with Gaussian Mixture Model (GMM). When the sample size is smaller than the data dimension, this could lead…

Machine Learning · Statistics 2023-07-06 Pierre Houdouin , Matthieu Jonkcheere , Frederic Pascal

Learning to Recommend Using Non-Uniform Data

Learning user preferences for products based on their past purchases or reviews is at the cornerstone of modern recommendation engines. One complication in this learning task is that some users are more likely to purchase products or review…

Information Retrieval · Computer Science 2023-03-08 Wanning Chen , Mohsen Bayati

A Computational Note on the Application of the Supplemented EM Algorithm to Item Response Models

The EM algorithm is a method for finding the maximum likelihood estimate of a model in the presence of missing data. Unfortunately, EM does not produce a parameter covariance matrix for standard errors. Supplemented EM (SEM; Meng & Rubin,…

Computation · Statistics 2016-05-04 Joshua N. Pritikin

Neural Parameter Estimation with Incomplete Data

Advances in artificial intelligence (AI) and deep learning have led to neural networks being used to generate lightning-speed answers to complex science questions, paintings in the style of Monet, or stories like those of Twain. Leveraging…

Methodology · Statistics 2026-02-25 Matthew Sainsbury-Dale , Andrew Zammit-Mangion , Noel Cressie , Raphaël Huser