Related papers: Compositional data analysis -- linear algebra, vis…

Independent Component Analysis for Compositional Data

Compositional data represent a specific family of multivariate data, where the information of interest is contained in the ratios between parts rather than in absolute values of single parts. The analysis of such specific data is…

Methodology · Statistics 2021-07-07 Christoph Muehlmann , Kamila Fačevicová , Alžběta Gardlo , Hana Janečková , Klaus Nordhausen

Compositional Cubes: A New Concept for Multi-factorial Compositions

Compositional data are commonly known as multivariate observations carrying relative information. Even though the case of vector or even two-factorial compositional data (compositional tables) is already well described in the literature,…

Methodology · Statistics 2022-01-26 Kamila Fačevicová , Peter Filzmoser , Karel Hron

Modeling Compositional Regression with uncorrelated and correlated errors: a Bayesian approach

Compositional data consist of known compositions vectors whose components are positive and defined in the interval (0,1) representing proportions or fractions of a "whole". The sum of these components must be equal to one. Compositional…

Applications · Statistics 2015-07-02 Taciana K. O. Shimizu , Francisco Louzada , Adriano K. Suzuki , Ricardo S. Ehlers

Compositional Covariate Importance Testing via Partial Conjunction of Bivariate Hypotheses

Compositional data (i.e., data comprising random variables that sum up to a constant) arises in many applications including microbiome studies, chemical ecology, political science, and experimental designs. Yet when compositional data serve…

Methodology · Statistics 2025-01-03 Ritwik Bhaduri , Siyuan Ma , Lucas Janson

Extending compositional data analysis from a graph signal processing perspective

Traditional methods for the analysis of compositional data consider the log-ratios between all different pairs of variables with equal weight, typically in the form of aggregated contributions. This is not meaningful in contexts where it is…

Methodology · Statistics 2022-01-27 Christopher Rieser , Peter Filzmoser

A Guideline for the Statistical Analysis of Compositional Data in Immunology

The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as…

Applications · Statistics 2022-04-22 Jinkyung Yoo , Zequn Sun , Michael Greenacre , Qin Ma , Dongjun Chung , Young Min Kim

Interpretation of Compositional Regression with Application to Time Budget Analysis

Regression with compositional response or covariates, or even regression between parts of a composition, is frequently employed in social sciences. Among other possible applications, it may help to reveal interesting features in time…

Statistics Theory · Mathematics 2016-09-27 Ivo Muller , Karel Hron , Eva Fiserova , Jan Smahaj , Panajotis Cakirpaloglu , Jana Vancakova

Regression models for compositional data: General log-contrast formulations, proximal optimization, and microbiome data applications

Compositional data sets are ubiquitous in science, including geology, ecology, and microbiology. In microbiome research, compositional data primarily arise from high-throughput sequence-based profiling experiments. These data comprise…

Statistics Theory · Mathematics 2019-03-05 Patrick L. Combettes , Christian L. Müller

Improved classification for compositional data using the $\alpha$-transformation

In compositional data analysis an observation is a vector containing non-negative values, only the relative sizes of which are considered to be of interest. Without loss of generality, a compositional vector can be taken to be a vector of…

Methodology · Statistics 2015-06-18 Michail Tsagris , Simon Preston , Andrew T. A. Wood

Instrumental Variable Estimation for Compositional Treatments

Many scientific datasets are compositional in nature. Important biological examples include species abundances in ecology, cell-type compositions derived from single-cell sequencing data, and amplicon abundance data in microbiome research.…

Machine Learning · Computer Science 2024-05-29 Elisabeth Ailer , Christian L. Müller , Niki Kilbertus

On the choice of weights in aggregate compositional data analysis

In this paper, we distinguish between two kinds of compositional data sets: elementary and aggregate. This fact will help us to decide the choice of the weights to use in log interaction analysis of aggregate compositional vectors. We show…

Applications · Statistics 2023-01-27 Vartan Choulakian , Jules De Tibeiro , Pasquale Sarnacchiaro

Robust Nonparametric Regression for Compositional Data: the Simplicial--Real case

Statistical analysis on compositional data has gained a lot of attention due to their great potential of applications. A feature of these data is that they are multivariate vectors that lie in the simplex, that is, the components of each…

Methodology · Statistics 2025-05-22 Ana M. Bianco , Graciela Boente , Wenceslao González--Manteiga , Francisco Gude Sampedro , Ana Pérez--González

Does Data Scaling Lead to Visual Compositional Generalization?

Compositional understanding is crucial for human intelligence, yet it remains unclear whether contemporary vision models exhibit it. The dominant machine learning paradigm is built on the premise that scaling data and model sizes will…

Machine Learning · Computer Science 2025-07-10 Arnas Uselis , Andrea Dittadi , Seong Joon Oh

Robust Principal Component Analysis for Compositional Tables

A data table which is arranged according to two factors can often be considered as a compositional table. An example is the number of unemployed people, split according to gender and age classes. Analyzed as compositions, the relevant…

Methodology · Statistics 2019-04-12 Julie Rendlová , Karel Hron , Kamila Fačevicová , Peter Filzmoser

Relational Composition in Neural Networks: A Survey and Call to Action

Many neural nets appear to represent data as linear combinations of "feature vectors." Algorithms for discovering these vectors have seen impressive recent success. However, we argue that this success is incomplete without an understanding…

Artificial Intelligence · Computer Science 2024-07-23 Martin Wattenberg , Fernanda B. Viégas

Large Covariance Estimation for Compositional Data via Composition-Adjusted Thresholding

High-dimensional compositional data arise naturally in many applications such as metagenomic data analysis. The observed data lie in a high-dimensional simplex, and conventional statistical methods often fail to produce sensible results due…

Methodology · Statistics 2016-01-19 Yuanpei Cao , Wei Lin , Hongzhe Li

Interaction Models and Generalized Score Matching for Compositional Data

Applications such as the analysis of microbiome data have led to renewed interest in statistical methods for compositional data, i.e., multivariate data in the form of probability vectors that contain relative proportions. In particular,…

Methodology · Statistics 2021-09-13 Shiqing Yu , Mathias Drton , Ali Shojaie

The k-NN algorithm for compositional data: a revised approach with and without zero values present

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science among others.…

Methodology · Statistics 2015-06-18 Michail Tsagris

Primal path algorithm for compositional data analysis

Compositional data have two unique characteristics compared to typical multivariate data: the observed values are nonnegative and their summand is exactly one. To reflect these characteristics, a specific regularized regression model with…

Machine Learning · Computer Science 2018-12-24 Jong-June Jeon , Yongdai Kim , Sungho Won , Hosik Choi

The Cost of Compositionality: A High-Performance Implementation of String Diagram Composition

String diagrams are an increasingly popular algebraic language for the analysis of graphical models of computations across different research fields. Whereas string diagrams have been thoroughly studied as semantic structures, much less…

Category Theory · Mathematics 2022-11-04 Paul Wilson , Fabio Zanasi