Related papers: Parametric Modelling of Multivariate Count Data Us…

Review of Probability Distributions for Modeling Count Data

Count data take on non-negative integer values and are challenging to properly analyze using standard linear-Gaussian methods such as linear regression and principal components analysis. Generalized linear models enable direct modeling of…

Methodology · Statistics 2020-01-14 F. William Townes

A Generic Multivariate Distribution for Counting Data

Motivated by the need, in some Bayesian likelihood free inference problems, of imputing a multivariate counting distribution based on its vector of means and variance-covariance matrix, we define a generic multivariate discrete…

Applications · Statistics 2011-03-28 Marcos Capistrán , J. Andrés Christen

Graphical model-based clustering of categorical data

Clustering multivariate data is a pervasive task in many applied problems, particularly in social studies and life science. Model-based approaches to clustering rely on mixture models, where each mixture component corresponds to the kernel…

Methodology · Statistics 2026-01-22 Laura Ferrini , Federico Castelletti

A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution

The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world…

Methodology · Statistics 2016-12-28 David I. Inouye , Eunho Yang , Genevera I. Allen , Pradeep Ravikumar

Deep Multivariate Models with Parametric Conditionals

We consider deep multivariate models for heterogeneous collections of random variables. In the context of computer vision, such collections may e.g. consist of images, segmentations, image attributes, and latent variables. When developing…

Machine Learning · Computer Science 2026-02-03 Dmitrij Schlesinger , Boris Flach , Alexander Shekhovtsov

Nonparametric graphical model for counts

Although multivariate count data are routinely collected in many application areas, there is surprisingly little work developing flexible models for characterizing their dependence structure. This is particularly true when interest focuses…

Methodology · Statistics 2020-05-19 Arkaprava Roy , David B Dunson

Pattern graphs: a graphical approach to nonmonotone missing data

We introduce the concept of pattern graphs--directed acyclic graphs representing how response patterns are associated. A pattern graph represents an identifying restriction that is nonparametrically identified/saturated and is often a…

Methodology · Statistics 2020-12-04 Yen-Chi Chen

A new multivariate Poisson model

Multi-dimensional data frequently occur in many different fields, including risk management, insurance, biology, environmental sciences, and many more. In analyzing multivariate data, it is imperative that the underlying modelling…

Methodology · Statistics 2025-06-23 Orla A. Murphy , Juliana Schulz

Splitting models for multivariate count data

Considering discrete models, the univariate framework has been studied in depth compared to the multivariate one. This paper first proposes two criteria to define a sensu stricto multivariate discrete distribution. It then introduces the…

Statistics Theory · Mathematics 2018-02-07 Pierre Fernique , Jean Peyhardi , Jean-Baptiste Durand

A Generalized Multinomial Distribution from Dependent Categorical Random Variables

Categorical random variables are a common staple in machine learning methods and other applications across disciplines. Many times, correlation within categorical predictors exists, and has been noted to have an effect on various algorithm…

Probability · Mathematics 2017-01-25 Rachel Traylor

Synthetic Potential Outcomes and Causal Mixture Identifiability

Heterogeneous data from multiple populations, sub-groups, or sources is often represented as a ``mixture model'' with a single latent class influencing all of the observed covariates. Heterogeneity can be resolved at multiple levels by…

Machine Learning · Computer Science 2024-12-16 Bijan Mazaheri , Chandler Squires , Caroline Uhler

Probabilistic Graphical Models: A Concise Tutorial

Probabilistic graphical modeling is a branch of machine learning that uses probability distributions to describe the world, make predictions, and support decision-making under uncertainty. Underlying this modeling framework is an elegant…

Machine Learning · Computer Science 2025-07-24 Jacqueline Maasch , Willie Neiswanger , Stefano Ermon , Volodymyr Kuleshov

New multi-sample nonparametric tests for panel count data

This paper considers the problem of multi-sample nonparametric comparison of counting processes with panel count data, which arise naturally when recurrent events are considered. Such data frequently occur in medical follow-up studies and…

Statistics Theory · Mathematics 2009-04-21 N. Balakrishnan , Xingqiu Zhao

Nonparametric Identification and Estimation of Ratios of Multi-Category Means under Preferential Sampling

Multi-category data arise in diverse fields including marketing, chemistry, public policy, genomics, political science, and ecology. We consider the problem of estimating ratios of category-specific means in a fully nonparametric setting,…

Methodology · Statistics 2025-10-29 Grant Hopkins , Sarah Teichman , Ellen Graham , Amy D Willis

Multivariate Generating Functions for Information Spread on Multi-Type Random Graphs

We study the spread of information on multi-type directed random graphs. In such graphs the vertices are partitioned into distinct types (communities) that have different transmission rates between themselves and with other types. We…

Statistical Mechanics · Physics 2023-06-21 Yaron Oz , Ittai Rubinstein , Muli Safra

Multivariate Count Time Series Modelling

We review autoregressive models for the analysis of multivariate count time series. In doing so, we discuss the choice of a suitable distribution for a vectors of count random variables. This review focus on three main approaches taken for…

Methodology · Statistics 2021-09-21 Konstantinos Fokianos

Dealing with overdispersion in multivariate count data

The problem of overdispersion in multivariate count data is a challenging issue. Nowadays, it covers a central role mainly due to the relevance of modern technologies data, such as Next Generation Sequencing and textual data from the web or…

Methodology · Statistics 2025-02-24 Noemi Corsini , Cinzia Viroli

Generative vs. Discriminative modeling under the lens of uncertainty quantification

Learning a parametric model from a given dataset indeed enables to capture intrinsic dependencies between random variables via a parametric conditional probability distribution and in turn predict the value of a label variable given…

Machine Learning · Statistics 2024-06-14 Elouan Argouarc'h , François Desbouvries , Eric Barat , Eiji Kawasaki

Statistical inference for multivariate extremes via a geometric approach

A geometric representation for multivariate extremes, based on the shapes of scaled sample clouds in light-tailed margins and their so-called limit sets, has recently been shown to connect several existing extremal dependence concepts.…

Methodology · Statistics 2023-11-03 Jennifer Wadsworth , Ryan Campbell

Multivariate Species Sampling Models

Species sampling processes have long served as the fundamental framework for modeling random discrete distributions and exchangeable sequences. However, data arising from distinct but related sources require a broader notion of…

Statistics Theory · Mathematics 2026-02-03 Beatrice Franzolini , Antonio Lijoi , Igor Prünster , Giovanni Rebaudo