Related papers: Perturbation-based Effect Measures for Composition…

Instrumental Variable Estimation for Compositional Treatments

Many scientific datasets are compositional in nature. Important biological examples include species abundances in ecology, cell-type compositions derived from single-cell sequencing data, and amplicon abundance data in microbiome research.…

Machine Learning · Computer Science 2024-05-29 Elisabeth Ailer , Christian L. Müller , Niki Kilbertus

On Semiparametric Instrumental Variable Estimation of Average Treatment Effects through Data Fusion

Suppose one is interested in estimating causal effects in the presence of potentially unmeasured confounding with the aid of a valid instrumental variable. This paper investigates the problem of making inferences about the average treatment…

Methodology · Statistics 2020-12-15 BaoLuo Sun , Wang Miao

Hypothesis-driven mediation analysis for compositional data: an application to gut microbiome

Biological sequencing data consist of read counts, e.g. of specified taxa and often exhibit sparsity (zero-count inflation) and overdispersion (extra-Poisson variability). As most sequencing techniques provide an arbitrary total count,…

Applications · Statistics 2024-07-01 Noora Kartiosuo , Jaakko Nevalainen , Olli Raitakari , Katja Pahkala , Kari Auranen

Compositional Covariate Importance Testing via Partial Conjunction of Bivariate Hypotheses

Compositional data (i.e., data comprising random variables that sum up to a constant) arises in many applications including microbiome studies, chemical ecology, political science, and experimental designs. Yet when compositional data serve…

Methodology · Statistics 2025-01-03 Ritwik Bhaduri , Siyuan Ma , Lucas Janson

Semiparametric theory

In this paper we give a brief review of semiparametric theory, using as a running example the common problem of estimating an average causal effect. Semiparametric models allow at least part of the data-generating process to be unspecified…

Methodology · Statistics 2017-09-20 Edward H. Kennedy

A Semiparametric Approach to Model Effect Modification

One fundamental statistical question for research areas such as precision medicine and health disparity is about discovering effect modification of treatment or exposure by observed covariates. We propose a semiparametric framework for…

Methodology · Statistics 2020-08-04 Muxuan Liang , Menggang Yu

A Bayesian Joint Model for Compositional Mediation Effect Selection in Microbiome Data

Analyzing multivariate count data generated by high-throughput sequencing technology in microbiome research studies is challenging due to the high-dimensional and compositional structure of the data and overdispersion. In practice,…

Applications · Statistics 2023-11-03 Jingyan Fu , Matthew D. Koslovsky , Andreas M. Neophytou , Marina Vannucci

Identification and Debiased Learning of Causal Effects with General Instrumental Variables

Instrumental variable methods are fundamental to causal inference when treatment assignment is confounded by unobserved variables. In this article, we develop a general nonparametric causal framework for identification and learning with…

Methodology · Statistics 2026-02-10 Shuyuan Chen , Peng Zhang , Yifan Cui

Estimating complex causal effects from incomplete observational data

Despite the major advances taken in causal modeling, causality is still an unfamiliar topic for many statisticians. In this paper, it is demonstrated from the beginning to the end how causal effects can be estimated from observational data…

Methodology · Statistics 2014-07-03 Juha Karvanen

Average partial effect estimation using double machine learning

Single-parameter summaries of variable effects in regression settings are desirable for ease of interpretation. However (partially) linear models for example, which would deliver these, may fit poorly to the data. On the other hand, an…

Statistics Theory · Mathematics 2025-07-28 Harvey Klyne , Rajen D. Shah

Efficient nonparametric causal inference with missing exposure information

Missing exposure information is a very common feature of many observational studies. Here we study identifiability and efficient estimation of causal effects on vector outcomes, in such cases where treatment is unconfounded but partially…

Methodology · Statistics 2020-02-04 Edward H. Kennedy

Estimating Average Causal Effects Under General Interference, with Application to a Social Network Experiment

This paper presents a randomization-based framework for estimating causal effects under interference between units, motivated by challenges that arise in analyzing experiments on social networks. The framework integrates three components:…

Statistics Theory · Mathematics 2018-06-21 Peter M. Aronow , Cyrus Samii

A nonparametric super-efficient estimator of the average treatment effect

Doubly robust estimators of causal effects are a popular means of estimating causal effects. Such estimators combine an estimate of the conditional mean of the outcome given treatment and confounders (the so-called outcome regression) with…

Methodology · Statistics 2019-01-17 David Benkeser , Weixin Cai , Mark J van der Laan

Perturbation selection and influence measures in local influence analysis

Cook's [J. Roy. Statist. Soc. Ser. B 48 (1986) 133--169] local influence approach based on normal curvature is an important diagnostic tool for assessing local influence of minor perturbations to a statistical model. However, no rigorous…

Statistics Theory · Mathematics 2008-12-18 Hongtu Zhu , Joseph G. Ibrahim , Sikyum Lee , Heping Zhang

Causal Effect Estimation after Propensity Score Trimming with Continuous Treatments

Propensity score trimming, which discards subjects with propensity scores below a threshold, is a common way to address positivity violations that complicate causal effect estimation. However, most works on trimming assume treatment is…

Methodology · Statistics 2024-07-31 Zach Branson , Edward H. Kennedy , Sivaraman Balakrishnan , Larry Wasserman

Compositional Models for Estimating Causal Effects

Many real-world systems can be usefully represented as sets of interacting components. Examples include computational systems, such as query processors and compilers, natural systems, such as cells and ecosystems, and social systems, such…

Artificial Intelligence · Computer Science 2025-03-18 Purva Pruthi , David Jensen

A Causal Framework for Decomposing Spurious Variations

One of the fundamental challenges found throughout the data sciences is to explain why things happen in specific ways, or through which mechanisms a certain variable $X$ exerts influences over another variable $Y$. In statistics and machine…

Methodology · Statistics 2023-06-09 Drago Plecko , Elias Bareinboim

Identification and Semiparametric Estimation of Conditional Means from Aggregate Data

We introduce a new method for estimating the mean of an outcome variable within groups when researchers only observe the average of the outcome and group indicators across a set of aggregation units, such as geographical areas. Existing…

Methodology · Statistics 2026-05-01 Cory McCartan , Shiro Kuriwaki

Semiparametric Estimation of Treatment Effects in Observational Studies with Heterogeneous Partial Interference

In many observational studies in social science and medicine, subjects or units are connected, and one unit's treatment and attributes may affect another's treatment and outcome, violating the stable unit treatment value assumption (SUTVA)…

Methodology · Statistics 2024-06-25 Zhaonan Qu , Ruoxuan Xiong , Jizhou Liu , Guido Imbens

Identifying Causal Effects Using Instrumental Variables from the Auxiliary Dataset

Instrumental variable approaches have gained popularity for estimating causal effects in the presence of unmeasured confounders. However, the availability of instrumental variables in the primary dataset is often challenged due to stringent…

Methodology · Statistics 2026-03-31 Kang Shuai , Shanshan Luo , Wei Li , Yangbo He