Related papers: Distributed inference for heterogeneous mixture mo…

EM for Mixture of Linear Regression with Clustered Data

Modern data-driven and distributed learning frameworks deal with diverse massive data generated by clients spread across heterogeneous environments. Indeed, data heterogeneity is a major bottleneck in scaling up many distributed learning…

Machine Learning · Computer Science 2023-08-23 Amirhossein Reisizadeh, Khashayar Gatmiry, Asuman Ozdaglar

Grouped Heterogeneous Mixture Modeling for Clustered Data

Clustered data is ubiquitous in a variety of scientific fields. In this paper, we propose a flexible and interpretable modeling approach, called grouped heterogeneous mixture modeling, for clustered data, which models cluster-wise…

Methodology · Statistics 2020-02-10 Shonosuke Sugasawa

A simple multithreaded implementation of the EM algorithm for mixture models

Finite mixture models have been widely used for the modelling and analysis of data from heterogeneous populations. Maximum likelihood estimation of the parameters is typically carried out via the Expectation-Maximization (EM) algorithm. The…

Computation · Statistics 2016-06-08 Sharon X Lee, Kaleb L Lee, Geoffrey J McLachlan

Online Expectation Maximization based algorithms for inference in hidden Markov models

The Expectation Maximization (EM) algorithm is a versatile tool for model parameter estimation in latent data models. When processing large data sets or data streams, however, EM becomes intractable since it requires the whole data set to be…

Statistics Theory · Mathematics 2012-10-18 Sylvain Le Corff, Gersende Fort

Multi-Node EM Algorithm for Finite Mixture Models

Finite mixture models are powerful tools for modelling and analyzing heterogeneous data. Parameter estimation is typically carried out using maximum likelihood estimation via the Expectation-Maximization (EM) algorithm. Recently, the…

Computation · Statistics 2020-05-15 Sharon X. Lee, Geoffrey J. McLachlan, Kaleb L. Leemaqz

A factor mixture analysis model for multivariate binary data

The paper proposes a latent variable model for binary data coming from an unobserved heterogeneous population. The heterogeneity is taken into account by replacing the traditional assumption of Gaussian distributed factors by a finite…

Methodology · Statistics 2010-10-13 Silvia Cagnone, Cinzia Viroli

Mixture models for data with unknown distributions

We describe and analyze a broad class of mixture models for real-valued multivariate data in which the probability density of observations within each component of the model is represented as an arbitrary combination of basis functions.…

Methodology · Statistics 2025-02-28 M. E. J. Newman

Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model

The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much…

Statistics Theory · Mathematics 2023-07-24 Ning Wang, Xin Zhang, Qing Mai

Learning Shared Kernel Models: the Shared Kernel EM algorithm

Expectation maximisation (EM) is an unsupervised learning method for estimating the parameters of a finite mixture distribution. It works by introducing "hidden" or "latent" variables via Baum's auxiliary function $Q$ that allow the joint…

Machine Learning · Computer Science 2022-05-19 Graham W. Pulford

Online EM Algorithm for Latent Data Models

In this contribution, we propose a generic online (also sometimes called adaptive or recursive) version of the Expectation-Maximisation (EM) algorithm applicable to latent variable models of independent observations. Compared to the…

Computation · Statistics 2017-03-02 Olivier Cappé, Eric Moulines

Network EM Algorithm for Gaussian Mixture Model in Decentralized Federated Learning

We systematically study various network Expectation-Maximization (EM) algorithms for the Gaussian mixture model within the framework of decentralized federated learning. Our theoretical investigation reveals that directly extending the…

Machine Learning · Statistics 2024-11-11 Shuyuan Wu, Bin Du, Xuetong Li, Hansheng Wang

Divergence-Based Motivation for Online EM and Combining Hidden Variable Models

Expectation-Maximization (EM) is a prominent approach for parameter estimation of hidden (aka latent) variable models. Given the full batch of data, EM forms an upper bound of the negative log-likelihood of the model at each iteration and…

Machine Learning · Computer Science 2020-02-24 Ehsan Amid, Manfred K. Warmuth

On the Behavior of the Expectation-Maximization Algorithm for Mixture Models

Finite mixture models are among the most popular statistical models used in different data science disciplines. Despite their broad applicability, inference under these models typically leads to computationally challenging non-convex…

Machine Learning · Computer Science 2018-09-25 Babak Barazandeh, Meisam Razaviyayn

Towards the Theory of Unsupervised Federated Learning: Non-asymptotic Analysis of Federated EM Algorithms

While supervised federated learning approaches have enjoyed significant success, the domain of unsupervised federated learning remains relatively underexplored. Several federated EM algorithms have gained popularity in practice, however,…

Machine Learning · Statistics 2025-04-18 Ye Tian, Haolei Weng, Yang Feng

Scalable Regularised Joint Mixture Models

In many applications, data can be heterogeneous in the sense of spanning latent groups with different underlying distributions. When predictive models are applied to such data the heterogeneity can affect both predictive performance and…

Machine Learning · Statistics 2022-05-04 Thomas Lartigue, Sach Mukherjee

Fast convergence of a Federated Expectation-Maximization Algorithm

Data heterogeneity has been a long-standing bottleneck in studying the convergence rates of Federated Learning algorithms. In order to better understand the issue of data heterogeneity, we study the convergence rate of the…

Machine Learning · Statistics 2026-03-24 Zhixu Tao, Rajita Chandak, Sanjeev Kulkarni

Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model

Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape,…

Machine Learning · Statistics 2021-03-23 Hao Chen, Lanshan Han, Alvin Lim

Heterogeneity-aware and communication-efficient distributed statistical inference

In multicenter research, individual-level data are often protected against sharing across sites. To overcome the barrier of data sharing, many distributed algorithms, which only require sharing aggregated information, have been developed.…

Methodology · Statistics 2021-03-25 Rui Duan, Yang Ning, Yong Chen

Semi-nonparametric Latent Class Choice Model with a Flexible Class Membership Component: A Mixture Model Approach

This study presents a semi-nonparametric Latent Class Choice Model (LCCM) with a flexible class membership component. The proposed model formulates the latent classes using mixture models as an alternative approach to the traditional random…

Econometrics · Economics 2023-08-07 Georges Sfeir, Maya Abou-Zeid, Filipe Rodrigues, Francisco Camara Pereira, Isam Kaysi

Non-parametric estimation of mixed discrete choice models

In this paper, different strands of literature are combined in order to obtain algorithms for semi-parametric estimation of discrete choice models that include the modelling of unobserved heterogeneity by using mixing distributions for the…

Methodology · Statistics 2022-12-12 Dietmar Bauer, Sebastian Büscher, Manuel Batram