English
Related papers

Related papers: Robust Bayesian Tensor Factorization with Zero-Inf…

200 papers

Dimension reduction of high-dimensional microbiome data facilitates subsequent analysis such as regression and clustering. Most existing reduction methods cannot fully accommodate the special features of the data such as count-valued and…

Methodology · Statistics 2023-05-02 Tianchen Xu , Ryan T. Demmer , Gen Li

We propose a unified probabilistic framework for sparse count tensors with excess zeros, motivated by single-cell Hi-C data. The observed data are naturally represented as a three-way tensor indexed by genomic loci pairs and cells,…

Methodology · Statistics 2026-04-27 Elena Tuzhilina , Yaoming Zhen

Tensor factorization has been proved as an efficient unsupervised learning approach for health data analysis, especially for computational phenotyping, where the high-dimensional Electronic Health Records (EHRs) with patients' history of…

Machine Learning · Computer Science 2022-11-04 Jing Ma , Qiuchen Zhang , Jian Lou , Li Xiong , Sivasubramanium Bhavani , Joyce C. Ho

In this paper, I propose a new class of Zero-Inflated Poisson models into the family of Cluster Weighted Models (CWMs) called Zero-Inflated Poisson CWMs (ZIPCWM). ZIPCWM extends Poisson cluster weighted models and other mixture models. I…

Methodology · Statistics 2022-08-29 Kehinde Olobatuyi

How can we capture the hidden properties from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is a major tool to extract latent factors from a tensor and matrices…

Numerical Analysis · Computer Science 2017-12-06 Dongjin Choi , Jun-Gi Jang , U Kang

We present a general framework, the coupled compound Poisson factorization (CCPF), to capture the missing-data mechanism in extremely sparse data sets by coupling a hierarchical Poisson factorization with an arbitrary data-generating model.…

Machine Learning · Computer Science 2017-01-10 Mehmet E. Basbug , Barbara E. Engelhardt

We present a scalable Bayesian model for low-rank factorization of massive tensors with binary observations. The proposed model has the following key properties: (1) in contrast to the models based on the logistic or probit likelihood,…

Machine Learning · Statistics 2015-08-19 Changwei Hu , Piyush Rai , Lawrence Carin

Tensor decomposition is a popular technique for tensor completion, However most of the existing methods are based on linear or shallow model, when the data tensor becomes large and the observation data is very small, it is prone to over…

Numerical Analysis · Mathematics 2021-05-21 Qianxi Wu , An-Bao Xu

Probabilistic Temporal Tensor Factorization (PTTF) is an effective algorithm to model the temporal tensor data. It leverages a time constraint to capture the evolving properties of tensor data. Nowadays the exploding dataset demands a large…

Machine Learning · Statistics 2016-11-14 Guangxi Li , Zenglin Xu , Linnan Wang , Jinmian Ye , Irwin King , Michael Lyu

The rapid generation of complex, highly skewed, and zero-inflated multi-source count data poses significant challenges for variable selection, particularly in biomedical domains like tumor development and metabolic dysregulation. To address…

Applications · Statistics 2025-11-11 Shan Tang , Shanjun Mao , Shourong Ma , Falong Tan

In this manuscript, we introduce a tensor-based approach to Non-Negative Tensor Factorization (NTF). The method entails tensor dimension reduction through the utilization of the Einstein product. To maintain the regularity and sparsity of…

Numerical Analysis · Mathematics 2024-06-18 Anas El Hachimi , Khalide Jbilou , Ahmed Ratnani

Coupled decompositions are a widely used tool for data fusion. As the volume of data increases, so does the dimensionality of matrices and tensors, highlighting the need for more efficient coupled decomposition algorithms. This paper…

Numerical Analysis · Mathematics 2026-04-22 Erna Begovic , Anita Carevic , Ivana Sain Glibic

Zero-inflated count data arise in various fields, including health, biology, economics, and the social sciences. These data are often modelled using probabilistic distributions such as zero-inflated Poisson (ZIP), zero-inflated negative…

Methodology · Statistics 2025-03-31 Zahra AghahosseinaliShirazi , Pedro A. Rangel , Camila P. E. de Souza

We propose a generative model for robust tensor factorization in the presence of both missing data and outliers. The objective is to explicitly infer the underlying low-CP-rank tensor capturing the global information and a sparse tensor…

Computer Vision and Pattern Recognition · Computer Science 2016-06-21 Qibin Zhao , Guoxu Zhou , Liqing Zhang , Andrzej Cichocki , Shun-ichi Amari

The Poisson distribution is often used as a standard model for count data. Quite often, however, such data sets are not well fit by a Poisson model because they have more zeros than are compatible with this model. For these situations, a…

Statistics Theory · Mathematics 2008-12-18 M. J. Bayarri , James O. Berger , Gauri S. Datta

Low-rank tensor completion has been widely used in computer vision and machine learning. This paper develops a novel multi-modal core tensor factorization (MCTF) method combined with a tensor low-rankness measure and a better nonconvex…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Haijin Zeng

Tensor factorization models offer an effective approach to convert massive electronic health records into meaningful clinical concepts (phenotypes) for data analysis. These models need a large amount of diverse samples to avoid population…

Machine Learning · Computer Science 2017-10-13 Yejin Kim , Jimeng Sun , Hwanjo Yu , Xiaoqian Jiang

Because of the limitations of matrix factorization, such as losing spatial structure information, the concept of low-rank tensor factorization (LRTF) has been applied for the recovery of a low dimensional subspace from high dimensional…

Computer Vision and Pattern Recognition · Computer Science 2017-05-22 Xi'ai Chen , Zhi Han , Yao Wang , Qian Zhao , Deyu Meng , Lin Lin , Yandong Tang

Classification of multi-dimensional time series from real-world systems require fine-grained learning of complex features such as cross-dimensional dependencies and intra-class variations-all under the practical challenge of low training…

Machine Learning · Computer Science 2025-05-16 Anushiya Arunan , Yan Qin , Xiaoli Li , Yuen Chau

Understanding the association between dietary patterns and health outcomes, such as the cancer risk, is crucial to inform public health guidelines and shaping future dietary interventions. However, dietary intake data present several…

Methodology · Statistics 2025-10-10 Blake Hansen , Dafne Zorzetto , Valeria Edefonti , Roberta De Vito
‹ Prev 1 2 3 10 Next ›