Related papers: Coupled Compound Poisson Factorization

Hierarchical Compound Poisson Factorization

Non-negative matrix factorization models based on a hierarchical Gamma-Poisson structure capture user and item behavior effectively in extremely sparse data sets, making them the ideal choice for collaborative filtering applications.…

Machine Learning · Computer Science 2016-05-27 Mehmet E. Basbug , Barbara E. Engelhardt

Random effects compound Poisson model to represent data with extra zeros

This paper describes a compound Poisson-based random effects structure for modeling zero-inflated data. Data with large proportion of zeros are found in many fields of applied statistics, for example in ecology when trying to model and…

Applications · Statistics 2009-07-29 Marie-Pierre Etienne , Eric Parent , Benoit Hugues , Bernier Jacques

A Unified Framework for Variable Selection in Model-Based Clustering with Missing Not at Random

Model-based clustering integrated with variable selection is a powerful tool for uncovering latent structures within complex data. However, its effectiveness is often hindered by challenges such as identifying relevant variables that define…

Methodology · Statistics 2025-11-05 Binh H. Ho , Long Nguyen Chi , TrungTin Nguyen , Binh T. Nguyen , Van Ha Hoang , Christopher Drovandi

On Tensors, Sparsity, and Nonnegative Factorizations

Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive…

Numerical Analysis · Mathematics 2013-09-16 Eric C. Chi , Tamara G. Kolda

Bayesian Robust Tensor Factorization for Incomplete Multiway Data

We propose a generative model for robust tensor factorization in the presence of both missing data and outliers. The objective is to explicitly infer the underlying low-CP-rank tensor capturing the global information and a sparse tensor…

Computer Vision and Pattern Recognition · Computer Science 2016-06-21 Qibin Zhao , Guoxu Zhou , Liqing Zhang , Andrzej Cichocki , Shun-ichi Amari

Modelling and understanding count processes through a Markov-modulated non-homogeneous Poisson process framework

The Markov-modulated Poisson process is utilised for count modelling in a variety of areas such as queueing, reliability, network and insurance claims analysis. In this paper, we extend the Markov-modulated Poisson process framework through…

Risk Management · Quantitative Finance 2020-08-06 Benjamin Avanzi , Greg Taylor , Bernard Wong , Alan Xian

Bayesian Hybrid Matrix Factorisation for Data Integration

We introduce a novel Bayesian hybrid matrix factorisation model (HMF) for data integration, based on combining multiple matrix factorisation methods, that can be used for in- and out-of-matrix prediction of missing values. The model is very…

Machine Learning · Statistics 2017-04-18 Thomas Brouwer , Pietro Lió

Estimation of Semiparametric Factor Models with Missing Data

We study semiparametric factor models in high-dimensional panels where the factor loadings consist of a nonparametric component explained by observed covariates and an idiosyncratic component capturing unobserved heterogeneity. A key…

Methodology · Statistics 2025-12-09 Sijie Zheng

Group-sparse Embeddings in Collective Matrix Factorization

CMF is a technique for simultaneously learning low-rank representations based on a collection of matrices with shared entities. A typical example is the joint modeling of user-item, item-property, and user-feature matrices in a recommender…

Machine Learning · Statistics 2014-11-19 Arto Klami , Guillaume Bouchard , Abhishek Tripathi

Robust Bayesian Tensor Factorization with Zero-Inflated Poisson Model and Consensus Aggregation

Tensor factorizations (TF) are powerful tools for the efficient representation and analysis of multidimensional data. However, classic TF methods based on maximum likelihood estimation underperform when applied to zero-inflated count data,…

Machine Learning · Statistics 2023-08-17 Daniel Chafamo , Vignesh Shanmugam , Neriman Tokcan

Model-based Clustering with Sparse Covariance Matrices

Finite Gaussian mixture models are widely used for model-based clustering of continuous data. Nevertheless, since the number of model parameters scales quadratically with the number of variables, these models can be easily…

Methodology · Statistics 2018-09-25 Michael Fop , Thomas Brendan Murphy , Luca Scrucca

Zero-inflated modeling with smoothing on counting tensors

We propose a unified probabilistic framework for sparse count tensors with excess zeros, motivated by single-cell Hi-C data. The observed data are naturally represented as a three-way tensor indexed by genomic loci pairs and cells,…

Methodology · Statistics 2026-04-27 Elena Tuzhilina , Yaoming Zhen

Coupled conditional backward sampling particle filter

The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous…

Computation · Statistics 2019-08-29 Anthony Lee , Sumeetpal S. Singh , Matti Vihola

Factor Analysis on Citation, Using a Combined Latent and Logistic Regression Model

We propose a combined model, which integrates the latent factor model and the logistic regression model, for the citation network. It is noticed that neither a latent factor model nor a logistic regression model alone is sufficient to…

Machine Learning · Statistics 2019-12-03 Namjoon Suh , Xiaoming Huo , Eric Heim , Lee Seversky

Nonparametric Pattern-Mixture Models for Inference with Missing Data

Pattern-mixture models provide a transparent approach for handling missing data, where the full-data distribution is factorized in a way that explicitly shows the parts that can be estimated from observed data alone, and the parts that…

Methodology · Statistics 2019-04-26 Yen-Chi Chen , Mauricio Sadinle

Additive Non-negative Matrix Factorization for Missing Data

Non-negative matrix factorization (NMF) has previously been shown to be a useful decomposition for multivariate data. We interpret the factorization in a new way and use it to generate missing attributes from test data. We provide a joint…

Numerical Analysis · Computer Science 2010-07-05 Mithun Das Gupta

Mixed and missing data: a unified treatment with latent graphical models

We propose to learn latent graphical models when data have mixed variables and missing values. This model could be used for further data analysis, including regression, classification, ranking etc. It also could be used for imputing missing…

Methodology · Statistics 2015-11-17 Xiao Li , Jinzhu Jia , Yuan Yao

Link Prediction via Generalized Coupled Tensor Factorisation

This study deals with the missing link prediction problem: the problem of predicting the existence of missing connections between entities of interest. We address link prediction using coupled analysis of relational datasets represented as…

Machine Learning · Computer Science 2012-08-31 Beyza Ermiş , Evrim Acar , A. Taylan Cemgil

A principal components method to impute missing values for mixed data

We propose a new method to impute missing values in mixed datasets. It is based on a principal components method, the factorial analysis for mixed data, which balances the influence of all the variables that are continuous and categorical…

Applications · Statistics 2013-02-20 Vincent Audigier , François Husson , Julie Josse

Regression Analysis for Multivariate Dependent Count Data Using Convolved Gaussian Processes

Research on Poisson regression analysis for dependent data has been developed rapidly in the last decade. One of difficult problems in a multivariate case is how to construct a cross-correlation structure and at the meantime make sure that…

Methodology · Statistics 2017-10-05 A'yunin Sofro , Jian Qing Shi , Chunzheng Cao