统计方法学 — Scifaro

Sequential Transport for Causal Mediation Analysis

We propose sequential transport (ST), a distributional framework for mediation analysis that combines optimal transport (OT) with a mediator directed acyclic graph (DAG). Instead of relying on cross-world counterfactual assumptions, ST…

统计方法学 · 统计学 2026-03-24 Agathe Fernandes Machado , Iryna Voitsitska , Arthur Charpentier , Ewen Gallic

Changepoint Detection As Model Selection: A General Framework

This dissertation presents a general framework for changepoint detection based on L0 model selection. The core method, Iteratively Reweighted Fused Lasso (IRFL), improves upon the generalized lasso by adaptively reweighting penalties to…

统计方法学 · 统计学 2026-03-24 Michael Grantham , Xueheng Shi , Bertrand Clarke

A Reduced Basis Decomposition Approach to Efficient Data Collection in Pairwise Comparison Studies

Comparative judgement studies elicit quality assessments through pairwise comparisons, typically analysed using the Bradley-Terry model. A challenge in these studies is experimental design, specifically, determining the optimal pairs to…

统计方法学 · 统计学 2026-03-24 Jiahua Jiang , Joseph Marsh , Rowland G Seymour

Ridge Boosting is Both Robust and Efficient

Estimators in statistics and machine learning must typically trade off between efficiency, having low variance for a fixed target, and distributional robustness, such as multiaccuracy, or having low bias over a range of possible targets. In…

统计方法学 · 统计学 2026-03-24 David Bruns-Smith , Zhongming Xie , Avi Feller

Cumulative Marginal Mean Model for Assessing Sequential Effects Using Digital Health Data

Mobile health (mHealth) leverages digital technologies, such as mobile phones, to capture objective, frequent, and real-world digital phenotypes from individuals, enabling the delivery of tailored interventions to accommodate substantial…

统计方法学 · 统计学 2026-03-24 Xingche Guo , Zexi Cai , Yuanjia Wang , Donglin Zeng

Bootstrapped Control Limits for Score-Based Concept Drift Control Charts

Monitoring for changes in a predictive relationship represented by a fitted supervised learning model (i.e., concept drift detection) is a widespread problem in modern data-driven applications. A general and powerful Fisher score-based…

统计方法学 · 统计学 2026-03-24 Jiezhong Wu , Daniel W. Apley

Hilbert space methods for approximating multi-output latent variable Gaussian processes

Gaussian processes are a powerful class of non-linear models, but have limited applicability for larger datasets due to their high computational complexity. In such cases, approximate methods are required, for example, the recently…

统计方法学 · 统计学 2026-03-24 Soham Mukherjee , Manfred Claassen , Paul-Christian Bürkner

Double machine learning to estimate the effects of multiple treatments and their interactions

Causal inference literature has extensively focused on binary treatments, with relatively fewer methods developed for multi-valued treatments. In particular, methods for multiple simultaneously assigned treatments remain understudied…

统计方法学 · 统计学 2026-03-24 Qingyan Xiang , Yubai Yuan , Dongyuan Song , Usman J. Wudil , Muktar H. Aliyu , C. William Wester , Bryan E. Shepherd

Quantifying uncertainty and stability among highly correlated predictors: a subspace perspective

We study the problem of linear feature selection when features are highly correlated. Such settings pose two fundamental challenges. First, how should model similarity be defined? Simply counting features in common can be misleading: two…

统计方法学 · 统计学 2026-03-24 Xiaozhu Zhang , Jacob Bien , Armeen Taeb

A Nonparametric Bayesian Local-Global Model for Enhanced Adverse Event Signal Detection in Spontaneous Reporting System Data

Spontaneous reporting system databases are key resources for post-marketing surveillance, providing real-world evidence (RWE) on the adverse events (AEs) of regulated drugs or other medical products. Various statistical methods have been…

统计方法学 · 统计学 2026-03-24 Xin-Wei Huang , Saptarshi Chakraborty

Principal Decomposition with Nested Submanifolds

Over the past decades, the increasing dimensionality of data has increased the need for effective data decomposition methods. Existing approaches, however, often rely on linear models or lack sufficient interpretability or flexibility. To…

统计方法学 · 统计学 2026-03-24 Jiaji Su , Zhigang Yao

Rethinking the Win Ratio: A Causal Framework for Hierarchical Outcome Analysis

Quantifying causal effects in the presence of complex and multivariate outcomes remains a key challenge in treatment evaluation. For hierarchical multivariate outcomes, the FDA recommends the Win Ratio and Generalized Pairwise Comparisons…

统计方法学 · 统计学 2026-03-24 Mathieu Even , Julie Josse

Kendall's tau and Spearman's rho for normal location-scale and skew-normal scale mixture copulas

We derive explicit formulas for Kendall's tau and Spearman's rho for two broad classes of asymmetric copulas: normal location-scale mixture copulas and skew-normal scale mixture copulas. These classes encompass widely used specifications,…

统计方法学 · 统计学 2026-03-24 Ye Lu

Bayesian defective Marshall-Olkin Gompertz model: an integrated approach to identifying cure fraction

Regression models have a substantial impact on interpretation of treatments, genetic characteristics and other potential risk factors in survival analysis. In many applications, the description of censoring and survival curve reveals the…

统计方法学 · 统计学 2026-03-24 Dionisio Alves-Neto , Vera Lucia Tomazella , Adriano Suzuki , Danilo Alvares

Discovering the critical number of respondents to validate an item in a questionnaire: The Binomial Cut-level Content Validity proposal

The question that drives this research is: "How to discover the number of respondents that are necessary to validate items of a questionnaire as actually essential to reach the questionnaire's proposal?" Among the efforts in this subject,…

统计方法学 · 统计学 2026-03-24 Helder Gomes Costa , Eduardo Shimoda , José Fabiano da Serra Costa , Aldo Shimoya , Edilvando Pereira Eufrazio

Variance reduction combining pre-experiment and in-experiment data

Online controlled experiments (A/B testing) are fundamental to data-driven decision-making in many companies. Improving the sensitivity of these experiments under fixed sample size constraints requires reducing the variance of the average…

统计方法学 · 统计学 2026-03-24 Zhexiao Lin , Pablo Crespo

Restricted Spatial Regression is Reasonable Statistical Practice: Clarifications, Interpretations, and New Developments

The spatial linear mixed model (SLMM) consists of fixed and spatial random effects that may be linearly dependent. Partially motivated as a means to address potential issues with confounding, the Restricted spatial regression (RSR) model…

统计方法学 · 统计学 2026-03-24 Jonathan R. Bradley

Functional Principal Component Analysis for Sparse Censored Data

Functional principal component analysis (FPCA) is a key tool in the study of functional data, driving both exploratory analyses and feature construction for use in formal modeling and testing procedures. However, existing methods for FPCA…

统计方法学 · 统计学 2026-03-24 Caitrin Murphy , Eric Laber , Rhonda Merwin , Brian Reich , Jake Koerner

Moving Aggregate Modified Autoregressive Copula-Based Time Series Models (MAGMAR-Copulas)

Copula-based time series models can model univariate and stationary time series in a flexible way by decomposing the joint distribution of consecutive observations into a copula and the stationary distribution. Implicitly this approach…

统计方法学 · 统计学 2026-03-24 Sven Pappert

Bayesian Functional Analysis for Untargeted Metabolomics Data with Matching Uncertainty and Small Sample Sizes

Untargeted metabolomics based on liquid chromatography-mass spectrometry technology is quickly gaining widespread application given its ability to depict the global metabolic pattern in biological samples. However, the data is noisy and…

统计方法学 · 统计学 2026-03-24 Guoxuan Ma , Jian Kang , Tianwei Yu