Related papers: Regression analysis with compositional data contai…

A Dirichlet Regression Model for Compositional Data with Zeros

Compositional data are met in many different fields, such as economics, archaeometry, ecology, geology and political sciences. Regression where the dependent variable is a composition is usually carried out via a log-ratio transformation of…

Methodology · Statistics 2017-06-08 Michail Tsagris, Connie Stewart

Modelling structural zeros in compositional data via a zero-censored multivariate normal model

We present a new model for analyzing compositional data with structural zeros. Inspired by \cite{butler2008} who suggested a model in the presence of zero values in the data we propose a model that treats the zero values in a different…

Methodology · Statistics 2022-08-30 Michail Tsagris

A novel, divergence based, regression for compositional data

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science amongst…

Methodology · Statistics 2015-11-25 Michail Tsagris

Modeling Compositional Regression with uncorrelated and correlated errors: a Bayesian approach

Compositional data consist of known composition vectors whose components are positive and defined in the interval (0,1), representing proportions or fractions of a "whole". The sum of these components must be equal to one. Compositional…

Applications · Statistics 2015-07-02 Taciana K. O. Shimizu, Francisco Louzada, Adriano K. Suzuki, Ricardo S. Ehlers

The k-NN algorithm for compositional data: a revised approach with and without zero values present

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science among others.…

Methodology · Statistics 2015-06-18 Michail Tsagris

Improved classification for compositional data using the $\alpha$-transformation

In compositional data analysis an observation is a vector containing non-negative values, only the relative sizes of which are considered to be of interest. Without loss of generality, a compositional vector can be taken to be a vector of…

Methodology · Statistics 2015-06-18 Michail Tsagris, Simon Preston, Andrew T. A. Wood

A Transformation-free Linear Regression for Compositional Outcomes and Predictors

Compositional data are common in many fields, both as outcomes and predictor variables. The inventory of models for the case when both the outcome and predictor variables are compositional is limited and the existing models are difficult to…

Methodology · Statistics 2020-04-20 Jacob Fiksel, Scott Zeger, Abhirup Datta

Flexible non-parametric regression models for compositional data

Compositional data arise in many real-life applications and versatile methods for properly analyzing this type of data in the regression context are needed. When parametric assumptions do not hold or are difficult to verify, non-parametric…

Methodology · Statistics 2023-09-07 Michail Tsagris, Abdulaziz Alenazi, Connie Stewart

Robust Nonparametric Regression for Compositional Data: the Simplicial-Real case

Statistical analysis on compositional data has gained a lot of attention due to their great potential of applications. A feature of these data is that they are multivariate vectors that lie in the simplex, that is, the components of each…

Methodology · Statistics 2025-05-22 Ana M. Bianco, Graciela Boente, Wenceslao González-Manteiga, Francisco Gude Sampedro, Ana Pérez-González

Collaborative Training of Tensors for Compositional Distributional Semantics

Type-based compositional distributional semantic models present an interesting line of research into functional representations of linguistic meaning. One of the drawbacks of such models, however, is the lack of training data required to…

Computation and Language · Computer Science 2017-05-08 Tamara Polajnar

Robust test statistics for data sets with missing correlation information

Not all experiments publish their results with a description of the correlations between the data points. This makes it difficult to do hypothesis tests or model fits with that data, since just assuming no correlation can lead to an over-…

Data Analysis, Statistics and Probability · Physics 2021-06-30 Lukas Koch

Rectified Fisher-Bingham Model for Compositional Data with Zeros

This paper introduces a rectified and renormalized Fisher-Bingham model for compositional data with zeros, motivated in part by the presence of zeros in microbiota studies. The approach represents compositions through a square-root…

Methodology · Statistics 2026-04-29 Eugene Han, Marahi Perez-Tamayo, Hannah D. Holscher, Ruoqing Zhu

Compositional Covariate Importance Testing via Partial Conjunction of Bivariate Hypotheses

Compositional data (i.e., data comprising random variables that sum up to a constant) arises in many applications including microbiome studies, chemical ecology, political science, and experimental designs. Yet when compositional data serve…

Methodology · Statistics 2025-01-03 Ritwik Bhaduri, Siyuan Ma, Lucas Janson

Compositional data analysis – linear algebra, visualization and interpretation

Compositional data analysis is concerned with multivariate data that have a constant sum, usually 1 or 100%. These are data often found in biochemistry and geochemistry, but also in the social sciences, when relative values are of interest…

Methodology · Statistics 2021-10-26 Michael Greenacre

Extending compositional data analysis from a graph signal processing perspective

Traditional methods for the analysis of compositional data consider the log-ratios between all different pairs of variables with equal weight, typically in the form of aggregated contributions. This is not meaningful in contexts where it is…

Methodology · Statistics 2022-01-27 Christopher Rieser, Peter Filzmoser

Fractional binomial regression model for count data with excess zeros

This paper proposes a new generalized linear model with the fractional binomial distribution. Zero-inflated Poisson/negative binomial distributions are used for count data with many zeros. To analyze the association of such a count variable…

Methodology · Statistics 2025-08-01 Jeonghwa Lee, Chloe Breece

Analysis of High Dimensional Compositional Data Containing Structural Zeros with Applications to Microbiome Data

This paper is motivated by the recent interest in the analysis of high dimensional microbiome data. A key feature of this data is the presence of 'structural zeros' which are microbes missing from an observation vector due to an…

Applications · Statistics 2016-05-23 Abhishek Kaul, Ori Davidov, Shyamal D. Peddada

Independent Component Analysis for Compositional Data

Compositional data represent a specific family of multivariate data, where the information of interest is contained in the ratios between parts rather than in absolute values of single parts. The analysis of such specific data is…

Methodology · Statistics 2021-07-07 Christoph Muehlmann, Kamila Fačevicová, Alžběta Gardlo, Hana Janečková, Klaus Nordhausen

A critical comparison of handling zeros in high-dimensional compositional count data

The growing use of high-throughput sequencing (HTS) has enabled the large-scale production of compositional count data, driving progress in microbiome research. However, such count data are often high-dimensional, over-dispersed, and…

Other Statistics · Statistics 2026-05-22 Wenqi Tang, Kamila Fačevicová, Klaus Nordhausen, Sara Taskinen

Random effects compound Poisson model to represent data with extra zeros

This paper describes a compound Poisson-based random effects structure for modeling zero-inflated data. Data with a large proportion of zeros are found in many fields of applied statistics, for example in ecology when trying to model and…

Applications · Statistics 2009-07-29 Marie-Pierre Etienne, Eric Parent, Benoit Hugues, Bernier Jacques