English
Related papers

Related papers: A Bayesian Zero-Inflated Dirichlet-Multinomial Reg…

200 papers

We provide novel probabilistic portrayals of two multivariate models designed to handle zero-inflation in count-compositional data. We develop a new unifying framework that represents both as finite mixture distributions. One of these…

Methodology · Statistics 2026-03-31 André F. B. Menezes , Andrew C. Parnell , Keefe Murphy

This paper proposes a new generalized linear model with the fractional binomial distribution. Zero-inflated Poisson/negative binomial distributions are used for count data with many zeros. To analyze the association of such a count variable…

Methodology · Statistics 2025-08-01 Jeonghwa Lee , Chloe Breece

With the development of next generation sequencing technology, researchers have now been able to study the microbiome composition using direct sequencing, whose output are bacterial taxa counts for each microbiome sample. One goal of…

Applications · Statistics 2013-05-24 Jun Chen , Hongzhe Li

The infant microbiome undergoes rapid changes in composition over time and is associated with long-term risks of conditions such as immune strength, allergy, asthma, and other health outcomes. Modeling the associations between exposures or…

Methodology · Statistics 2026-03-31 Brody Erlandson , Ander Wilson , Matthew D. Koslovsky

Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with…

Count-compositional data arise in many different fields, including high-throughput sequencing experiments, ecological surveys, and palaeoclimate studies, where a common, important goal is to understand how covariates relate to the observed…

Methodology · Statistics 2026-04-10 André F. B. Menezes , Andrew C. Parnell , Keefe Murphy

Compositional data are met in many different fields, such as economics, archaeometry, ecology, geology and political sciences. Regression where the dependent variable is a composition is usually carried out via a log-ratio transformation of…

Methodology · Statistics 2017-06-08 Michail Tsagris , Connie Stewart

The problem of overdispersion in multivariate count data is a challenging issue. Nowadays, it covers a central role mainly due to the relevance of modern technologies data, such as Next Generation Sequencing and textual data from the web or…

Methodology · Statistics 2025-02-24 Noemi Corsini , Cinzia Viroli

Count data are common in medical research. When these data have more zeros than expected by the most used count distributions, it is common to employ a zero-inflated regression model. However, the interpretability of these models is much…

Methodology · Statistics 2025-09-30 Gustavo H. A. Pereira , Jeremias Leão , Manoel Santos-Neto , Jianwen Cai

We use the theory of normal variance-mean mixtures to derive a data augmentation scheme for models that include gamma functions. Our methodology applies to many situations in statistics and machine learning, including Multinomial-Dirichlet…

Methodology · Statistics 2021-06-22 Jingyu He , Nicholas Polson , Jianeng Xu

Clustering multivariate binary data is of interest in many scientific fields, including ecology, biomedicine, and social policy. Beyond heuristic clustering algorithms, such data can be modelled using multivariate Bernoulli mixture models.…

Methodology · Statistics 2026-04-24 Luisa Ferrari , Maria Franco Villoria , Garritt L. Page , Alex Laini

Despite the increasing popularity of quantile regression models for continuous responses, models for count data have so far received little attention. The main quantile regression technique for count data involves adding uniform random…

Methodology · Statistics 2014-06-10 Charalampos Chanialidis , Ludger Evers , Tereza Neocleous

Dimension reduction techniques are among the most essential analytical tools in the analysis of high-dimensional data. Generalized principal component analysis (PCA) is an extension to standard PCA that has been widely used to identify…

In this work, we develop a novel Bayesian estimation method for the Dirichlet process (DP) mixture of the inverted Dirichlet distributions, which has been shown to be very flexible for modeling vectors with positive elements. The recently…

Machine Learning · Computer Science 2020-02-04 Zhanyu Ma , Yuping Lai

Bayesian hierarchical models are commonly employed for inference in count datasets, as they account for multiple levels of variation by incorporating prior distributions for parameters at different levels. Examples include Beta-Binomial,…

Methodology · Statistics 2024-11-04 Yuexi Wang , Nicholas G. Polson

We consider the complex data modeling problem motivated by the zero-inflated and overdispersed data from microbiome studies. Analyzing how microbiome abundance is associated with human biological features, such as BMI, is of great…

Methodology · Statistics 2025-03-31 Zirui Wang , Tianying Wang

In microbiome and genomic studies, the regression of compositional data has been a crucial tool for identifying microbial taxa or genes that are associated with clinical phenotypes. To account for the variation in sequencing depth, the…

Methodology · Statistics 2021-03-11 Pixu Shi , Yuchen Zhou , Anru R. Zhang

Microorganisms play critical roles in human health and disease. It is well known that microbes live in diverse communities in which they interact synergistically or antagonistically. Thus for estimating microbial associations with clinical…

Compositional data and multivariate count data with known totals are challenging to analyse due to the non-negativity and sum-to-one constraints on the sample space. It is often the case that many of the compositional components are highly…

Methodology · Statistics 2020-12-24 Janice L. Scealy , Andrew T. A. Wood

In Bayesian inference for mixture models with an unknown number of components, a finite mixture model is usually employed that assumes prior distributions for mixing weights and the number of components. This model is called a mixture of…

Methodology · Statistics 2025-12-25 Fumiya Iwashige , Shintaro Hashimoto
‹ Prev 1 2 3 10 Next ›