Related papers: Decomposition of variance in terms of conditional …

Orthogonal variance decomposition based feature selection

Existing feature selection methods fail to properly account for interactions between features when evaluating feature subsets. In this paper, we attempt to remedy this issue by using orthogonal variance decomposition to evaluate features.…

Machine Learning · Computer Science 2019-10-23 Firuz Kamalov

Learning sources of variability from high-dimensional observational studies

Causal inference studies whether the presence of a variable influences an observed outcome. As measured by quantities such as the "average treatment effect," this paradigm is employed across numerous biological fields, from vaccine and drug…

Methodology · Statistics 2023-11-30 Eric W. Bridgeford , Jaewon Chung , Brian Gilbert , Sambit Panda , Adam Li , Cencheng Shen , Alexandra Badea , Brian Caffo , Joshua T. Vogelstein

Causal Variance Decompositions for Measuring Health Inequalities

Recent causal inference literature has introduced causal effect decompositions to quantify sources of observed inequalities or disparities in outcomes, but these approaches are typically limited to pairwise comparisons. In healthcare…

Methodology · Statistics 2026-04-27 Lin Yu , Zhihui Liu , Kathy Han , Olli Saarela

A Multivariate Variance Components Model for Analysis of Covariance in Designed Experiments

Traditional methods for covariate adjustment of treatment means in designed experiments are inherently conditional on the observed covariate values. In order to develop a coherent general methodology for analysis of covariance, we propose a…

Methodology · Statistics 2010-01-19 James G. Booth , Walter T. Federer , Martin T. Wells , Russell D. Wolfinger

Variance decompositions for extensive-form games

Quantitative measures of randomness in games are useful for game design and have implications for gambling law. We treat the outcome of a game as a random variable and derive a closed-form expression and estimator for the variance in the…

Other Statistics · Statistics 2020-09-11 Alex Cloud , Eric Laber

Testing for the Important Components of Posterior Predictive Variance

We give a decomposition of the posterior predictive variance using the law of total variance and conditioning on a finite dimensional discrete random variable. This random variable summarizes various features of modeling that are used to…

Methodology · Statistics 2022-09-02 Dean Dustin , Bertrand Clarke

Debiasing Evaluations That are Biased by Evaluations

It is common to evaluate a set of items by soliciting people to rate them. For example, universities ask students to rate the teaching quality of their instructors, and conference organizers ask authors of submissions to evaluate the…

Machine Learning · Statistics 2020-12-02 Jingyan Wang , Ivan Stelmakh , Yuting Wei , Nihar B. Shah

Variance Estimation with Dependence and Heterogeneous Means

This paper considers the problem of estimating the variance of a sum of a triangular array of random vectors with heterogeneous means. When random vectors exhibit two-way cluster dependence or weak dependence, standard variance estimators…

Econometrics · Economics 2026-03-13 Luther Yap

Analysis of Learner Independent Variables for Estimating Assessment Items Difficulty Level

The quality of assessment determines the quality of learning, and is characterized by validity, reliability and difficulty. Mastery of learning is generally represented by the difficulty levels of assessment items. A very large number of…

Computers and Society · Computer Science 2022-06-10 Shilpi Banerjee , N. J. Rao

A novel decomposition to explain heterogeneity in observational and randomized studies of causality

This paper introduces a novel decomposition framework to explain heterogeneity in causal effects observed across different studies, considering both observational and randomized settings. We present a formal decomposition of between-study…

Methodology · Statistics 2025-12-18 Brian Gilbert , Ivan Dıaz , Kara E. Rudolph , Nicholas Williams , Tat-Thang Vo

Disentangling the independently controllable factors of variation by interacting with the world

It has been postulated that a good representation is one that disentangles the underlying explanatory factors of variation. However, it remains an open question what kind of training framework could potentially achieve that. Whereas most…

Machine Learning · Statistics 2018-02-27 Valentin Thomas , Emmanuel Bengio , William Fedus , Jules Pondard , Philippe Beaudoin , Hugo Larochelle , Joelle Pineau , Doina Precup , Yoshua Bengio

Value Profiles for Encoding Human Variation

Modelling human variation in rating tasks is crucial for personalization, pluralistic model alignment, and computational social science. We propose representing individuals using natural language value profiles -- descriptions of underlying…

Computation and Language · Computer Science 2025-10-01 Taylor Sorensen , Pushkar Mishra , Roma Patel , Michael Henry Tessler , Michiel Bakker , Georgina Evans , Iason Gabriel , Noah Goodman , Verena Rieser

Comparing Two Samples Through Stochastic Dominance: A Graphical Approach

Non-deterministic measurements are common in real-world scenarios: the performance of a stochastic optimization algorithm or the total reward of a reinforcement learning agent in a chaotic environment are just two examples in which…

Machine Learning · Statistics 2022-08-31 Etor Arza , Josu Ceberio , Ekhiñe Irurozki , Aritz Pérez

Causal mediation analysis decomposition of between-hospital variance

Causal variance decompositions for a given disease-specific quality indicator can be used to quantify differences in performance between hospitals or health care providers. While variance decompositions can demonstrate variation in quality…

Methodology · Statistics 2023-01-26 Bo Chen , Keith A. Lawson , Antonio Finelli , Olli Saarela

From Review to Rating: Exploring Dependency Measures for Text Classification

Various text analysis techniques exist, which attempt to uncover unstructured information from text. In this work, we explore using statistical dependence measures for textual classification, representing text as word vectors. Student…

Computation and Language · Computer Science 2018-08-01 Samuel Cunningham-Nelson , Mahsa Baktashmotlagh , Wageeh Boles

The Variational Deficiency Bottleneck

We introduce a bottleneck method for learning data representations based on information deficiency, rather than the more traditional information sufficiency. A variational upper bound allows us to implement this method efficiently. The…

Information Theory · Computer Science 2020-11-05 Pradeep Kr. Banerjee , Guido Montúfar

Description of a stochastic system by a nonadapted stochastic process

An approach for the description of stochastic systems is derived. Some of the variables in the system are studied forward in time, others backward in time. The approach is based on a perturbation expansion in the strength of the coupling…

Statistical Mechanics · Physics 2021-08-04 Piero Olla

Causal learning with sufficient statistics: an information bottleneck approach

The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional…

Machine Learning · Statistics 2020-10-13 Daniel Chicharro , Michel Besserve , Stefano Panzeri

Linear causal disentanglement via higher-order cumulants

Linear causal disentanglement is a recent method in causal representation learning to describe a collection of observed variables via latent variables with causal dependencies between them. It can be viewed as a generalization of both…

Machine Learning · Statistics 2024-07-08 Paula Leyes Carreno , Chiara Meroni , Anna Seigal

Compositional Covariate Importance Testing via Partial Conjunction of Bivariate Hypotheses

Compositional data (i.e., data comprising random variables that sum up to a constant) arises in many applications including microbiome studies, chemical ecology, political science, and experimental designs. Yet when compositional data serve…

Methodology · Statistics 2025-01-03 Ritwik Bhaduri , Siyuan Ma , Lucas Janson