Related papers: Multivariate Density Estimation with Missing Data

Multiwavelet density estimation

Accurate density estimation methodologies play an integral role in a variety of scientific disciplines, with applications including simulation models, decision support tools, and exploratory data analysis. In the past, histograms and kernel…

Statistics Theory · Mathematics 2012-06-14 Judson B. Locke , Adrian M. Peter

Choosing Imputation Models

Imputing missing values is an important preprocessing step in data analysis, but the literature offers little guidance on how to choose between different imputation models. This letter suggests adopting the imputation model that generates a…

Methodology · Statistics 2021-07-13 Moritz Marbach

Variational Bayesian Multiple Imputation in High-Dimensional Regression Models With Missing Responses

Multiple imputation has become one of the standard methods in drawing inferences in many incomplete data applications. Applications of multiple imputation in relatively more complex settings, such as high-dimensional clustered data, require…

Methodology · Statistics 2025-04-08 Qiushuang Li , Recai Yucel

Imputation of Missing Data Using Linear Gaussian Cluster-Weighted Modeling

Missing data theory deals with the statistical methods in the occurrence of missing data. Missing data occurs when some values are not stored or observed for variables of interest. However, most of the statistical theory assumes that data…

Methodology · Statistics 2021-10-26 Luis Alejandro Masmela-Caita , Thais Paiva Galletti , Marcos Oliveira Prates

Handling missing data in model-based clustering

Gaussian Mixture models (GMMs) are a powerful tool for clustering, classification and density estimation when clustering structures are embedded in the data. The presence of missing values can largely impact the GMMs estimation process,…

Machine Learning · Statistics 2020-06-05 Alessio Serafini , Thomas Brendan Murphy , Luca Scrucca

Multiple imputation for multilevel data with continuous and binary variables

We present and compare multiple imputation methods for multilevel continuous and binary data where variables are systematically and sporadically missing. The methods are compared from a theoretical point of view and through an extensive…

Methodology · Statistics 2026-05-18 Vincent Audigier , Ian R. White , Shahab Jolani , Thomas P. A. Debray , Matteo Quartagno , James Carpenter , Stef van Buuren , Matthieu Resche-Rigon

Mixture models for data with unknown distributions

We describe and analyze a broad class of mixture models for real-valued multivariate data in which the probability density of observations within each component of the model is represented as an arbitrary combination of basis functions.…

Methodology · Statistics 2025-02-28 M. E. J. Newman

Imputation of missing data using multivariate Gaussian Linear Cluster-Weighted Modeling

Missing data arises when certain values are not recorded or observed for variables of interest. However, most of the statistical theory assume complete data availability. To address incomplete databases, one approach is to fill the gaps…

Methodology · Statistics 2023-08-15 Luis Alejandro Masmela-Caita , Thais Paiva Galletti , Marcos Oliveira Prates

Robust propensity score weighting estimation under missing at random

Missing data is frequently encountered in many areas of statistics. Propensity score weighting is a popular method for handling missing data. The propensity score method employs a response propensity model, but correct specification of the…

Methodology · Statistics 2024-03-28 Hengfang Wang , Jae Kwang Kim , Jeongseop Han , Youngjo Lee

Estimating conditional density of missing values using deep Gaussian mixture model

We consider the problem of estimating the conditional probability distribution of missing values given the observed ones. We propose an approach, which combines the flexibility of deep neural networks with the simplicity of Gaussian mixture…

Machine Learning · Computer Science 2020-11-20 Marcin Przewięźlikowski , Marek Śmieja , Łukasz Struski

Empirical Bayes conditional density estimation

The problem of nonparametric estimation of the conditional density of a response, given a vector of explanatory variables, is classical and of prominent importance in many prediction problems since the conditional density provides a more…

Methodology · Statistics 2015-04-21 Catia Scricciolo

Bayesian Semiparametric Multivariate Density Deconvolution

We consider the problem of multivariate density deconvolution when the interest lies in estimating the distribution of a vector-valued random variable but precise measurements of the variable of interest are not available, observations…

Methodology · Statistics 2016-12-06 Abhra Sarkar , Debdeep Pati , Bani K. Mallick , Raymond J. Carroll

Semiparametric fractional imputation using Gaussian mixture models for handling multivariate missing data

Item nonresponse is frequently encountered in practice. Ignoring missing data can lose efficiency and lead to misleading inference. Fractional imputation is a frequentist approach of imputation for handling missing data. However, the…

Methodology · Statistics 2018-09-18 Hejian Sang , Jae Kwang Kim

High Dimensional Binary Choice Model with Unknown Heteroskedasticity or Instrumental Variables

This paper proposes a new method for estimating high-dimensional binary choice models. We consider a semiparametric model that places no distributional assumptions on the error term, allows for heteroskedastic errors, and permits endogenous…

Econometrics · Economics 2025-07-15 Fu Ouyang , Thomas Tao Yang

Density Estimation and Classification via Bayesian Nonparametric Learning of Affine Subspaces

It is now practically the norm for data to be very high dimensional in areas such as genetics, machine vision, image analysis and many others. When analyzing such data, parametric models are often too inflexible while nonparametric…

Methodology · Statistics 2011-05-31 Abhishek Bhattacharya , Garritt Page , David Dunson

Combining local and global smoothing in multivariate density estimation

Non-parametric estimation of a multivariate density estimation is tackled via a method which combines traditional local smoothing with a form of global smoothing but without imposing a rigid structure. Simulation work delivers encouraging…

Methodology · Statistics 2016-10-10 Adelchi Azzalini

Density Regression with Conditional Support Points

Density regression characterizes the conditional density of the response variable given the covariates, and provides much more information than the commonly used conditional mean or quantile regression. However, it is often computationally…

Methodology · Statistics 2022-06-15 Yunlu Chen , Nan Zhang

Density estimation with atoms, and functional estimation for mixed discrete-continuous data

In classical density (or density-functional) estimation, it is standard to assume that the underlying distribution has a density with respect to the Lebesgue measure. However, when the data distribution is a mixture of continuous and…

Methodology · Statistics 2025-08-05 Aytijhya Saha , Aaditya Ramdas

Estimation of the invariant measure of a multidimensional diffusion from noisy observations

We introduce a new approach for estimating the invariant density of a multidimensional diffusion when dealing with high-frequency observations blurred by independent noises. We consider the intermediate regime, where observations occur at…

Statistics Theory · Mathematics 2024-04-19 Raphaël Maillet , Grégoire Szymanski

Bayesian dependent mixture models: A predictive comparison and survey

For exchangeable data, mixture models are an extremely useful tool for density estimation due to their attractive balance between smoothness and flexibility. When additional covariate information is present, mixture models can be extended…

Methodology · Statistics 2023-08-01 Sara Wade , Vanda Inacio , Sonia Petrone