English
Related papers

Related papers: Estimating Gaussian Copulas with Missing Data

200 papers

Missing data imputation forms the first critical step of many data analysis pipelines. The challenge is greatest for mixed data sets, including real, Boolean, and ordinal data, where standard techniques for imputation fail basic sanity…

Methodology · Statistics 2020-06-17 Yuxuan Zhao , Madeleine Udell

Quantitative studies in many fields involve the analysis of multivariate data of diverse types, including measurements that we may consider binary, ordinal and continuous. One approach to the analysis of such mixed data is to use a copula…

Statistics Theory · Mathematics 2007-06-13 Peter D. Hoff

Graphical models with bi-directed edges (<->) represent marginal independence: the absence of an edge between two vertices indicates that the corresponding variables are marginally independent. In this paper, we consider maximum likelihood…

Methodology · Statistics 2012-12-12 Mathias Drton , Thomas S. Richardson

Many real-world datasets contain missing entries and mixed data types including categorical and ordered (e.g. continuous and ordinal) variables. Imputing the missing entries is necessary, since many data analysis pipelines require complete…

Methodology · Statistics 2022-10-14 Yuxuan Zhao , Alex Townsend , Madeleine Udell

This paper investigates Gaussian copula mixture models (GCMM), which are an extension of Gaussian mixture models (GMM) that incorporate copula concepts. The paper presents the mathematical definition of GCMM and explores the properties of…

Machine Learning · Computer Science 2023-05-25 Ke Wan , Alain Kornhauser

This paper tackles the problem of missing data imputation for noisy and non-Gaussian data. A classical imputation method, the Expectation Maximization (EM) algorithm for Gaussian mixture models, has shown interesting properties when…

Machine Learning · Statistics 2023-05-23 Florian Mouret , Alexandre Hippert-Ferrer , Frédéric Pascal , Jean-Yves Tourneret

In this manuscript, we consider a finite multivariate nonparametric mixture model where the dependence between the marginal densities is modeled using the copula device. Pseudo EM stochastic algorithms were recently proposed to estimate all…

Computation · Statistics 2022-12-14 Michael Levine , Gildas Mazo

Missing observations are pervasive throughout empirical research, especially in the social sciences. Despite multiple approaches to dealing adequately with missing data, many scholars still fail to address this vital issue. In this paper,…

We present an approach for modeling and imputation of nonignorable missing data. Our approach uses Bayesian data integration to combine (1) a Gaussian copula model for all study variables and missingness indicators, which allows arbitrary…

Methodology · Statistics 2024-11-19 Joseph Feldman , Jerome P. Reiter , Daniel R. Kowal

Dramatic increases in the size and dimensionality of many recent data sets make crucial the need for sophisticated methods that can exploit inherent structure and handle missing values. In this article we derive an expectation-maximization…

Methodology · Statistics 2013-09-26 Hunter Glanz , Luis Carvalho

Modern datasets commonly feature both substantial missingness and many variables of mixed data types, which present significant challenges for estimation and inference. Complete case analysis, which proceeds using only the observations with…

Methodology · Statistics 2023-04-10 Joseph Feldman , Daniel R. Kowal

Often of primary interest in the analysis of multivariate data are the copula parameters describing the dependence among the variables, rather than the univariate marginal distributions. Since the ranks of a multivariate dataset are…

Statistics Theory · Mathematics 2014-03-13 Peter D. Hoff , Xiaoyue Niu , Jon A. Wellner

A method that uses order statistics to construct multivariate distributions with fixed marginals and which utilizes a representation of the Bernstein copula in terms of a finite mixture distribution is proposed. Expectation-maximization…

Computation · Statistics 2014-01-16 Xiaoling Dou , Satoshi Kuriki , Gwo Dong Lin , Donald Richards

Learning the joint dependence of discrete variables is a fundamental problem in machine learning, with many applications including prediction, clustering and dimensionality reduction. More recently, the framework of copula modeling has…

Machine Learning · Statistics 2013-11-15 Alfredo Kalaitzis , Ricardo Silva

This research deals with the estimation and imputation of missing data in longitudinal models with a Poisson response variable inflated with zeros. A methodology is proposed that is based on the use of maximum likelihood, assuming that data…

Methodology · Statistics 2024-09-18 D. S. Martinez-Lobo , O. O. Melo , N. A. Cruz

Estimating copulas with discrete marginal distributions is challenging, especially in high dimensions, because computing the likelihood contribution of each observation requires evaluating $2^{J}$ terms, with $J$ the number of discrete…

Methodology · Statistics 2018-11-12 D. Gunawan , M. -N. Tran , K. Suzuki , J. Dick , R. Kohn

Bayesian graphical models are a useful tool for understanding dependence relationships among many variables, particularly in situations with external prior information. In high-dimensional settings, the space of possible graphs becomes…

Machine Learning · Statistics 2019-02-07 Zehang Richard Li , Tyler H. McCormick

We propose, for multivariate Gaussian copula models with unknown margins and structured correlation matrices, a rank-based, semiparametrically efficient estimator for the Euclidean copula parameter. This estimator is defined as a one-step…

Methodology · Statistics 2014-10-02 Johan Segers , Ramon van den Akker , Bas J. M. Werker

Missing values with mixed data types is a common problem in a large number of machine learning applications such as processing of surveys and in different medical applications. Recently, Gaussian copula models have been suggested as a means…

Machine Learning · Statistics 2021-07-02 Benjamin Christoffersen , Mark Clements , Keith Humphreys , Hedvig Kjellström

Missing data is a common issue in various fields such as medicine, social sciences, and natural sciences, and it poses significant challenges for accurate statistical analysis. Although numerous imputation methods have been proposed to…

Methodology · Statistics 2025-07-23 Seongmin Kim , Jeunghun Oh , Hungkuk Ko , Jeongmin Park , Jaeyong Lee
‹ Prev 1 2 3 10 Next ›