Related papers: Bayesian Gaussian Copula Factor Models for Mixed D…

High Dimensional Semiparametric Latent Graphical Model for Mixed Data

Graphical models are commonly used tools for modeling multivariate random variables. While there exist many convenient multivariate distributions such as Gaussian distribution for continuous data, mixed data with the presence of discrete…

Machine Learning · Statistics 2014-04-30 Jianqing Fan , Han Liu , Yang Ning , Hui Zou

Bayesian Bootstrap based Gaussian Copula Model for Mixed Data with High Missing Rates

Missing data is a common issue in various fields such as medicine, social sciences, and natural sciences, and it poses significant challenges for accurate statistical analysis. Although numerous imputation methods have been proposed to…

Methodology · Statistics 2025-07-23 Seongmin Kim , Jeunghun Oh , Hungkuk Ko , Jeongmin Park , Jaeyong Lee

Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting…

Methodology · Statistics 2019-08-29 Panagiotis Papastamoulis

Bayesian Computation in Dynamic Latent Factor Models

Bayesian computation for filtering and forecasting analysis is developed for a broad class of dynamic models. The ability to scale-up such analyses in non-Gaussian, nonlinear multivariate time series models is advanced through the…

Methodology · Statistics 2022-06-07 Isaac Lavine , Andrew Cron , Mike West

Fast Bayesian inference in large Gaussian graphical models

Despite major methodological developments, Bayesian inference for Gaussian graphical models remains challenging in high dimension due to the tremendous size of the model space. This article proposes a method to infer the marginal and…

Methodology · Statistics 2018-04-10 Gwenaël G. R. Leday , Sylvia Richardson

Extending the rank likelihood for semiparametric copula estimation

Quantitative studies in many fields involve the analysis of multivariate data of diverse types, including measurements that we may consider binary, ordinal and continuous. One approach to the analysis of such mixed data is to use a copula…

Statistics Theory · Mathematics 2007-06-13 Peter D. Hoff

Factor copula models for non-Gaussian longitudinal data

This article presents factor copula approaches to model temporal dependency of non-Gaussian (continuous/discrete) longitudinal data. Factor copula models are canonical vine copulas which explain the underlying dependence structure of a…

Methodology · Statistics 2025-02-18 Subhajit Chattopadhyay

High-dimensional factor copula models with estimation of latent variables

Factor models are a parsimonious way to explain the dependence of variables using several latent variables. In Gaussian 1-factor and structural factor models (such as bi-factor, oblique factor) and their factor copula counterparts, factor…

Methodology · Statistics 2022-05-31 Xinyao Fan , Harry Joe

Variational Gaussian Copula Inference

We utilize copulas to constitute a unified framework for constructing and optimizing variational proposals in hierarchical Bayesian models. For models with continuous and non-Gaussian hidden variables, we propose a semiparametric and…

Machine Learning · Statistics 2016-05-19 Shaobo Han , Xuejun Liao , David B. Dunson , Lawrence Carin

Bayesian Variable Selection for Gaussian copula regression models

We develop a novel Bayesian method to select important predictors in regression models with multiple responses of diverse types. A sparse Gaussian copula regression model is used to account for the multivariate dependencies between any…

Methodology · Statistics 2020-09-22 Angelos Alexopoulos , Leonardo Bottolo

Model-based clustering of Gaussian copulas for mixed data

Clustering task of mixed data is a challenging problem. In a probabilistic framework, the main difficulty is due to a shortage of conventional distributions for such data. In this paper, we propose to achieve the mixed data clustering with…

Methodology · Statistics 2015-10-01 Matthieu Marbac , Christophe Biernacki , Vincent Vandewalle

On the quantification and efficient propagation of imprecise probabilities with copula dependence

This paper addresses the problem of quantification and propagation of uncertainties associated with dependence modeling when data for characterizing probability models are limited. Practically, the system inputs are often assumed to be…

Computation · Statistics 2020-04-14 Jiaxin Zhang , Michael D. Shields

Fast Variational Inference for Bayesian Factor Analysis in Single and Multi-Study Settings

Factors models are routinely used to analyze high-dimensional data in both single-study and multi-study settings. Bayesian inference for such models relies on Markov Chain Monte Carlo (MCMC) methods which scale poorly as the number of…

Methodology · Statistics 2025-04-29 Blake Hansen , Alejandra Avalos-Pacheco , Massimiliano Russo , Roberta De Vito

Nonparametric Copula Models for Multivariate, Mixed, and Missing Data

Modern datasets commonly feature both substantial missingness and many variables of mixed data types, which present significant challenges for estimation and inference. Complete case analysis, which proceeds using only the observations with…

Methodology · Statistics 2023-04-10 Joseph Feldman , Daniel R. Kowal

Non-Gaussian Discriminative Factor Models via the Max-Margin Rank-Likelihood

We consider the problem of discriminative factor analysis for data that are in general non-Gaussian. A Bayesian model based on the ranks of the data is proposed. We first introduce a new {\em max-margin} version of the rank-likelihood. A…

Machine Learning · Statistics 2015-05-20 Xin Yuan , Ricardo Henao , Ephraim L. Tsalik , Raymond J. Langley , Lawrence Carin

A dynamic copula model for probabilistic forecasting of non-Gaussian multivariate time series

Multivariate time series (MTS) data often include a heterogeneous mix of non-Gaussian distributional features (asymmetry, multimodality, heavy tails) and data types (continuous and discrete variables). Traditional MTS methods based on…

Methodology · Statistics 2025-02-25 John Zito , Daniel R. Kowal

Bayesian Variable Selection for Non-Gaussian Responses: A Marginally Calibrated Copula Approach

We propose a new highly flexible and tractable Bayesian approach to undertake variable selection in non-Gaussian regression models. It uses a copula decomposition for the joint distribution of observations on the dependent variable. This…

Methodology · Statistics 2020-09-07 Nadja Klein , Michael Stanley Smith

A Bayesian factor analysis model for high-dimensional microbiome count data

Dimension reduction techniques are among the most essential analytical tools in the analysis of high-dimensional data. Generalized principal component analysis (PCA) is an extension to standard PCA that has been widely used to identify…

Methodology · Statistics 2024-04-25 Ismaïla Ba , Maxime Turgeon , Simona Veniamin , Juan Joel , Richard Miller , Morag Graham , Christine Bonner , Charles N. Bernstein , Douglas L. Arnold , Amit Bar-Or , Ruth Ann Marrie , Julia O'Mahony , E. Ann Yeh , Brenda Banwell , Emmanuelle Waubant , Natalie Knox , Gary Van Domselaar , Ali I. Mirza , Heather Armstrong , Saman Muthukumarana , Kevin McGregor

Gaussian Copula Models for Nonignorable Missing Data Using Auxiliary Marginal Quantiles

We present an approach for modeling and imputation of nonignorable missing data. Our approach uses Bayesian data integration to combine (1) a Gaussian copula model for all study variables and missingness indicators, which allows arbitrary…

Methodology · Statistics 2024-11-19 Joseph Feldman , Jerome P. Reiter , Daniel R. Kowal

Asymptotically Exact and Fast Gaussian Copula Models for Imputation of Mixed Data Types

Missing values with mixed data types is a common problem in a large number of machine learning applications such as processing of surveys and in different medical applications. Recently, Gaussian copula models have been suggested as a means…

Machine Learning · Statistics 2021-07-02 Benjamin Christoffersen , Mark Clements , Keith Humphreys , Hedvig Kjellström