Related papers: Improved classification for compositional data usi…

The k-NN algorithm for compositional data: a revised approach with and without zero values present

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science among others.…

Methodology · Statistics 2015-06-18 Michail Tsagris

A data-based power transformation for compositional data

Compositional data analysis is carried out either by neglecting the compositional constraint and applying standard multivariate data analysis, or by transforming the data using the logs of the ratios of the components. In this work we…

Methodology · Statistics 2011-06-17 Michail T. Tsagris, Simon Preston, Andrew T. A. Wood

Gaussian asymptotic limits for the $\alpha$-transformation in the analysis of compositional data

Compositional data consists of vectors of proportions whose components sum to 1. Such vectors lie in the standard simplex, which is a manifold with boundary. One issue that has been rather controversial within the field of compositional…

Statistics Theory · Mathematics 2019-02-22 Yannis Pantazis, Michail Tsagris, Andrew T. A. Wood

A novel, divergence based, regression for compositional data

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science amongst…

Methodology · Statistics 2015-11-25 Michail Tsagris

Flexible non-parametric regression models for compositional data

Compositional data arise in many real-life applications and versatile methods for properly analyzing this type of data in the regression context are needed. When parametric assumptions do not hold or are difficult to verify, non-parametric…

Methodology · Statistics 2023-09-07 Michail Tsagris, Abdulaziz Alenazi, Connie Stewart

Modeling Compositional Regression with uncorrelated and correlated errors: a Bayesian approach

Compositional data consist of known compositions vectors whose components are positive and defined in the interval (0,1) representing proportions or fractions of a "whole". The sum of these components must be equal to one. Compositional…

Applications · Statistics 2015-07-02 Taciana K. O. Shimizu, Francisco Louzada, Adriano K. Suzuki, Ricardo S. Ehlers

Extending compositional data analysis from a graph signal processing perspective

Traditional methods for the analysis of compositional data consider the log-ratios between all different pairs of variables with equal weight, typically in the form of aggregated contributions. This is not meaningful in contexts where it is…

Methodology · Statistics 2022-01-27 Christopher Rieser, Peter Filzmoser

The $\alpha$-regression for compositional data: a unified framework for standard, temporal and spatial regression models including compositional predictors

The paper revisits the $\alpha$-regression framework for compositional data. The model uses a flexible power transformation parameterized by $\alpha$ to interpolate between raw data analysis and log-ratio methods, naturally handling zeros…

Methodology · Statistics 2026-05-14 Michail Tsagris, Yannis Pantazis

Compositional Cubes: A New Concept for Multi-factorial Compositions

Compositional data are commonly known as multivariate observations carrying relative information. Even though the case of vector or even two-factorial compositional data (compositional tables) is already well described in the literature,…

Methodology · Statistics 2022-01-26 Kamila Fačevicová, Peter Filzmoser, Karel Hron

Compositional data analysis -- linear algebra, visualization and interpretation

Compositional data analysis is concerned with multivariate data that have a constant sum, usually 1 or 100%. These are data often found in biochemistry and geochemistry, but also in the social sciences, when relative values are of interest…

Methodology · Statistics 2021-10-26 Michael Greenacre

A folded model for compositional data analysis

A folded type model is developed for analyzing compositional data. The proposed model involves an extension of the $\alpha$-transformation for compositional data and provides a new and flexible class of distributions for modeling data…

Machine Learning · Statistics 2019-02-27 Michail Tsagris, Connie Stewart

Simplicial clustering using the $\alpha$-transformation

We introduce two simplicial clustering approaches for compositional data, that are adaptations of the $K$-means and of the Gaussian mixture models algorithms, by employing the $\alpha$-transformation. By utilizing clustering validation…

Methodology · Statistics 2025-09-30 Michail Tsagris, Nikolaos Kontemeniotis

Compositional data analysis for modelling and forecasting mortality using the $\alpha$-transformation

Mortality forecasting is crucial for demographic planning and actuarial studies, especially for projecting population ageing and longevity risk. Classical approaches largely rely on extrapolative methods, such as the Lee-Carter (LC) model,…

Applications · Statistics 2026-02-24 Han Ying Lim, Dharini Pathmanathan, Sophie Dabo-Niang

Independent Component Analysis for Compositional Data

Compositional data represent a specific family of multivariate data, where the information of interest is contained in the ratios between parts rather than in absolute values of single parts. The analysis of such specific data is…

Methodology · Statistics 2021-07-07 Christoph Muehlmann, Kamila Fačevicová, Alžběta Gardlo, Hana Janečková, Klaus Nordhausen

Robust Nonparametric Regression for Compositional Data: the Simplicial-Real case

Statistical analysis on compositional data has gained a lot of attention due to their great potential of applications. A feature of these data is that they are multivariate vectors that lie in the simplex, that is, the components of each…

Methodology · Statistics 2025-05-22 Ana M. Bianco, Graciela Boente, Wenceslao González-Manteiga, Francisco Gude Sampedro, Ana Pérez-González

A Dirichlet Regression Model for Compositional Data with Zeros

Compositional data are met in many different fields, such as economics, archaeometry, ecology, geology and political sciences. Regression where the dependent variable is a composition is usually carried out via a log-ratio transformation of…

Methodology · Statistics 2017-06-08 Michail Tsagris, Connie Stewart

Instrumental Variable Estimation for Compositional Treatments

Many scientific datasets are compositional in nature. Important biological examples include species abundances in ecology, cell-type compositions derived from single-cell sequencing data, and amplicon abundance data in microbiome research.…

Machine Learning · Computer Science 2024-05-29 Elisabeth Ailer, Christian L. Müller, Niki Kilbertus

A Guideline for the Statistical Analysis of Compositional Data in Immunology

The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as…

Applications · Statistics 2022-04-22 Jinkyung Yoo, Zequn Sun, Michael Greenacre, Qin Ma, Dongjun Chung, Young Min Kim

On the choice of weights in aggregate compositional data analysis

In this paper, we distinguish between two kinds of compositional data sets: elementary and aggregate. This fact will help us to decide the choice of the weights to use in log interaction analysis of aggregate compositional vectors. We show…

Applications · Statistics 2023-01-27 Vartan Choulakian, Jules De Tibeiro, Pasquale Sarnacchiaro

A New Approach to Compositional Data Analysis using $L^{\infty}$-normalization with Applications to Vaginal Microbiome

We introduce a novel approach to compositional data analysis based on $L^{\infty}$-normalization, addressing challenges posed by zero-rich high-throughput data. Traditional methods like Aitchison's transformations require excluding zeros,…

Computation · Statistics 2025-03-28 Pawel Gajer, Jacques Ravel