Related papers: Block-Conditional Missing at Random Models for Mis…

Estimation of Classification Rules from Partially Classified Data

We consider the situation where the observed sample contains some observations whose class of origin is known (that is, they are classified with respect to the g underlying classes of interest), and where the remaining observations in the…

Machine Learning · Statistics 2020-04-15 Geoffrey J. McLachlan , Daniel Ahfock

Statistical Inference with Different Missing-data Mechanisms

When data are missing due to at most one cause from some time to next time, we can make sampling distribution inferences about the parameter of the data by modeling the missing-data mechanism correctly. Proverbially, in case its mechanism…

Methodology · Statistics 2014-07-21 Kosuke Morikawa , Yutaka Kano

Blockmodels: A R-package for estimating in Latent Block Model and Stochastic Block Model, with various probability functions, with or without covariates

Analysis of the topology of a graph, regular or bipartite one, can be done by clustering for regular ones or co-clustering for bipartite ones. The Stochastic Block Model and the Latent Block Model are two models, which are very similar for…

Computation · Statistics 2016-02-25 Jean-Benoist Leger

Model-based Clustering with Missing Not At Random Data

Model-based unsupervised learning, as any learning task, stalls as soon as missing data occurs. This is even more true when the missing data are informative, or said missing not at random (MNAR). In this paper, we propose model-based…

Machine Learning · Statistics 2023-12-25 Aude Sportisse , Matthieu Marbac , Fabien Laporte , Gilles Celeux , Claire Boyer , Julie Josse , Christophe Biernacki

Missing at random, likelihood ignorability and model completeness

This paper provides further insight into the key concept of missing at random (MAR) in incomplete data analysis. Following the usual selection modelling approach we envisage two models with separable parameters: a model for the response of…

Statistics Theory · Mathematics 2007-06-13 Guobing Lu , John B. Copas

DPER: Efficient Parameter Estimation for Randomly Missing Data

The missing data problem has been broadly studied in the last few decades and has various applications in different areas such as statistics or bioinformatics. Even though many methods have been developed to tackle this challenge, most of…

Machine Learning · Statistics 2021-06-10 Thu Nguyen , Khoi Minh Nguyen-Duy , Duy Ho Minh Nguyen , Binh T. Nguyen , Bruce Alan Wade

EPEM: Efficient Parameter Estimation for Multiple Class Monotone Missing Data

The problem of monotone missing data has been broadly studied during the last two decades and has many applications in different fields such as bioinformatics or statistics. Commonly used imputation techniques require multiple iterations…

Machine Learning · Computer Science 2020-09-25 Thu Nguyen , Duy H. M. Nguyen , Huy Nguyen , Binh T. Nguyen , Bruce A. Wade

Variational Inference for Stochastic Block Models from Sampled Data

This paper deals with non-observed dyads during the sampling of a network and consecutive issues in the inference of the Stochastic Block Model (SBM). We review sampling designs and recover Missing At Random (MAR) and Not Missing At Random…

Methodology · Statistics 2019-01-10 Timothée Tabouy , Pierre Barbillon , Julien Chiquet

Clustering data with values missing at random using scale mixtures of multivariate skew-normal distributions

Handling missing data is a major challenge in model-based clustering, especially when the data exhibit skewness and heavy tails. We address this by extending the finite mixture of scale mixtures of multivariate skew-normal (FMSMSN) family…

Methodology · Statistics 2025-07-29 Jason Pillay , Cristina Tortora , Antonio Punzo , Andriette Bekker

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models with missing data. The study of such problems in the likelihood tradition has thus far been restricted to an EM algorithm method using full numerical integration.…

Methodology · Statistics 2021-11-16 Dale S. Kim

Sufficient Identification Conditions and Semiparametric Estimation under Missing Not at Random Mechanisms

Conducting valid statistical analyses is challenging in the presence of missing-not-at-random (MNAR) data, where the missingness mechanism is dependent on the missing values themselves even conditioned on the observed data. Here, we…

Methodology · Statistics 2023-06-13 Anna Guo , Jiwei Zhao , Razieh Nabi

Parametric MMD Estimation with Missing Values: Robustness to Missingness and Data Model Misspecification

In the missing data literature, the Maximum Likelihood Estimator (MLE) is celebrated for its ignorability property under missing at random (MAR) data. However, its sensitivity to misspecification of the (complete) data model, even under…

Methodology · Statistics 2025-09-23 Badr-Eddine Chérief-Abdellatif , Jeffrey Näf

Integrating Probabilistic Rules into Neural Networks: A Stochastic EM Learning Algorithm

The EM-algorithm is a general procedure to get maximum likelihood estimates if part of the observations on the variables of a network are missing. In this paper a stochastic version of the algorithm is adapted to probabilistic neural…

Artificial Intelligence · Computer Science 2013-03-26 Gerhard Paass

A Stochastic Version of the EM Algorithm for Mixture Cure Rate Model with Exponentiated Weibull Family of Lifetimes

Handling missing values plays an important role in the analysis of survival data, especially, the ones marked by cure fraction. In this paper, we discuss the properties and implementation of stochastic approximations to the…

Methodology · Statistics 2021-07-22 Sandip Barui , Suvra Pal , Nutan Mishra , Katherine Davies

Evaluation of missing data mechanisms in two and three dimensional incomplete tables

The analysis of incomplete contingency tables is a practical and an interesting problem. In this paper, we provide characterizations for the various missing mechanisms of a variable in terms of response and non-response odds for two and…

Methodology · Statistics 2018-11-27 S. Ghosh , P. Vellaisamy

An overview of latent Markov models for longitudinal categorical data

We provide a comprehensive overview of latent Markov (LM) models for the analysis of longitudinal categorical data. The main assumption behind these models is that the response variables are conditionally independent given a latent process…

Statistics Theory · Mathematics 2010-03-16 F. Bartolucci , A. Farcomeni , F. Pennoni

Sequential identification of nonignorable missing data mechanisms

With nonignorable missing data, likelihood-based inference should be based on the joint distribution of the study variables and their missingness indicators. These joint models cannot be estimated from the data alone, thus requiring the…

Statistics Theory · Mathematics 2017-01-06 Mauricio Sadinle , Jerome P. Reiter

Modeling Missing at Random Neuropsychological Test Scores Using a Mixture of Binomial Product Experts

Multivariate bounded discrete data arises in many fields. In the setting of dementia studies, such data is collected when individuals complete neuropsychological tests. We outline a modeling and inference procedure that can model the joint…

Methodology · Statistics 2026-02-10 Daniel Suen , Yen-Chi Chen

Finite mixture modeling of censored and missing data using the multivariate skew-normal distribution

Finite mixture models have been widely used to model and analyze data from a heterogeneous populations. Moreover, data of this kind can be missing or subject to some upper and/or lower detection limits because of the restriction of…

Methodology · Statistics 2020-09-24 Francisco H. C. de Alencar , Christian E. Galarza , Larissa A. Matos , Victor H. Lachos

Identifying the number of clusters in discrete mixture models

Research on cluster analysis for categorical data continues to develop, with new clustering algorithms being proposed. However, in this context, the determination of the number of clusters is rarely addressed. In this paper, we propose a…

Methodology · Statistics 2014-09-29 Cláudia Silvestre , Margarida G. M. S. Cardoso , Mário A. T. Figueiredo