统计方法学 — Scifaro

SpeedCP: Fast Kernel-based Conditional Conformal Prediction

Conformal prediction provides distribution-free prediction sets with finite-sample conditional guarantees. We build upon the RKHS-based framework of Gibbs et al. (2023), which leverages families of covariate shifts to provide approximate…

统计方法学 · 统计学 2026-05-29 Yating Liu , Yeo Jin Jung , Zixuan Wu , So Won Jeong , Claire Donnat

Optimal Stopping for Sequential Bayesian Experimental Design

Sequential Bayesian experimental design typically assumes that the number of experiments is fixed before data collection begins. In practical campaigns, however, experimentation may need to terminate early because additional measurements…

统计方法学 · 统计学 2026-05-29 Chen Cheng , Xun Huan

Position: Stop Chasing the C-index when Evaluating Survival Analysis Models

The current state of evaluation in survival analysis is plagued by the persistent use of evaluation metrics in ways that are misaligned with the stated modeling objective. In addition, many such evaluations are based on censoring…

统计方法学 · 统计学 2026-05-29 Christian Marius Lillelund , Shi-ang Qi , Russell Greiner , Christian Fischer Pedersen

rd2d: Causal Inference in Boundary Discontinuity Designs

Boundary Discontinuity (BD) designs are used in empirical research to learn about causal treatment effects along a continuous assignment boundary defined by a bivariate score. These designs are also known as multi-score regression…

统计方法学 · 统计学 2026-05-29 Matias D. Cattaneo , Rocio Titiunik , Ruiqi Rae Yu

Robust Principal Components by Casewise and Cellwise Weighting

Principal component analysis (PCA) is a fundamental tool for analyzing multivariate data. Here the focus is on dimension reduction to the principal subspace, characterized by its projection matrix. The classical principal subspace can be…

统计方法学 · 统计学 2026-05-29 Fabio Centofanti , Mia Hubert , Peter J. Rousseeuw

Bayesian Structured Mediation Analysis With Unobserved Confounders

We explore methods to reduce the impact of unobserved confounders on the causal mediation analysis of high-dimensional mediators with spatially smooth structures, such as brain imaging data. The key approach is to incorporate the latent…

统计方法学 · 统计学 2026-05-29 Yuliang Xu , Shu Yang , Jian Kang

Bayesian modeling of multi-species labeling errors in ecological studies

Ecological and conservation studies monitoring bird communities typically rely on species classification based on bird vocalizations. Historically, this has been based on expert volunteers going into the field and making lists of the bird…

统计方法学 · 统计学 2026-05-29 Haoxuan Wang , Patrik Lauha , David B. Dunson

Parametric Bootstrap for Fixed Edge-Probability Network Models

This paper studies parametric bootstrap methods for network data, with the goal of quantifying the uncertainty of network statistics of interest. While existing network resampling methods primarily focus on count statistics under…

统计方法学 · 统计学 2026-05-29 Zhixuan Shao , Can M. Le

Second-level global sensitivity analysis of numerical simulators with application to an accident scenario in a sodium-cooled fast reactor

Numerical simulators are widely used to model physical phenomena and global sensitivity analysis (GSA) aims at studying the global impact of the input uncertainties on the simulator output. To perform GSA, statistical tools based on…

统计方法学 · 统计学 2026-05-29 Anouar Meynaoui , Amandine Marrel , Béatrice Laurent

Beyond Exchangeability: Distribution-Shift-Aware Integration of External Control Data in Randomized Trials

Randomized controlled trials (RCTs) are the gold standard for evaluating causal effects but are often costly and difficult to scale; consequently, they are frequently augmented with auxiliary external controls in many applications. Prior…

统计方法学 · 统计学 2026-05-28 Jiawei Shan , Yiteng Tu , Guanbo Wang , Chao Ying , Jiwei Zhao

Adaptive clinical trials based on design-optimal e-values with automatic curtailment: An application to single-arm trials with binary data

The e-value is gaining traction as a robust alternative to p-values and Bayes factors for quantifying statistical evidence. e-values are a promising method for adaptive clinical trials due to their anytime-validity: e-values ensure type I…

统计方法学 · 统计学 2026-05-28 Stef Baas , Judith ter Schure , Joost van Rosmalen

Sequential generalized kernel equating: Providing comparable scores across multiple test forms with nonequivalent groups and differently measured covariates

Test equating using covariates may be applied to provide comparable scores from multiple test forms when no anchor items are available. However, its performance may be compromised if some of the covariates themselves are measured using…

统计方法学 · 统计学 2026-05-28 Michaela Vařejková , Patrícia Martinková , Eva Potužníková

The Modified Egger Intercept Tests for Detecting Horizontal Pleiotropy in Two-Sample Summary-Data Mendelian Randomization

The Egger intercept (EI) test is a widely used tool to detect horizontal pleiotropy in two-sample summary-data Mendelian randomization. A significant EI test suggests that either the average pleiotropic effect differs from zero (i.e.,…

统计方法学 · 统计学 2026-05-28 Yilei Ma , Youpeng Su , Xin Liu , Xuanye Cui , Ping Yin , Peng Wang

Identifying Direct Causal Effects in Latent Factor Models by Accounting for Unidentified Parents

We consider linear structural equation models with explicitly modelled latent variables. In such models, observed and latent variables solve linear equations including stochastic noise terms. The goal of our work is to identify the direct…

统计方法学 · 统计学 2026-05-28 Tom Hochsprung , Nils Sturma , Jakob Runge , Mathias Drton , Andreas Gerhardus

A computationally-tractable measure of global sensitivity for sampling-based Bayesian inference

Bayesian inference can often be sensitive to the choice of hyperparameters of the prior or likelihood, yet defining and quantifying this sensitivity in a principled and computationally feasible way remains challenging in practice.…

统计方法学 · 统计学 2026-05-28 Arina Odnoblyudova , Charita Dellaporta , François-Xavier Briol

Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors

Knowledge distillation is a powerful method for model compression, enabling the efficient deployment of complex deep learning models (teachers), including large language models. However, its underlying statistical mechanisms remain unclear,…

统计方法学 · 统计学 2026-05-28 Luyang Fang , Yongkai Chen , Jiazhang Cai , Ping Ma , Wenxuan Zhong

A Parameterization-Invariant DIC

The classic Deviance Information Criterion (DIC) is not invariant to reparameterization and can have a negative and unstable effective number of parameters. The reason for the effective number of parameters being negative is actually that…

统计方法学 · 统计学 2026-05-28 Xingyao Xiao , Sophia Rabe-Hesketh

Improving Power in Randomized Controlled Trials with Time-to-Event Endpoints: A Risk-Free Approach

Leveraging external or historical data to improve the efficiency of randomized clinical trials without introducing bias or inflating the Type I error rate remains challenging. Recent work on externally trained prognostic scores, such as…

统计方法学 · 统计学 2026-05-28 Junyi Zhou , Qing Liu , May Mo , Amy Xia

BOOST: Power-Optimal Strong-FWER Testing for Block-Structured Multiplicity

Structured multiple-testing problems (gatekeeping trials, dose-finding, multi-tissue eQTL mapping, bundled-challenger A/B experiments) organize hypotheses into design-imposed blocks and demand strong family-wise error rate (FWER) control…

统计方法学 · 统计学 2026-05-28 Prasanjit Dubey , Xiaoming Huo

Implementing the principal stratum strategy for intercurrent events with survival outcomes: a tutorial

The International Council for Harmonization (ICH) E9 (R1) addendum provides the estimand framework to formulate treatment effects in a clinical trial. One of the attributes of an estimand the framework describes is intercurrent events.…

统计方法学 · 统计学 2026-05-28 Xiaoxiao Zhou , Joyce Chen , Pallavi Mishra-Kalyani , Xiaoxue Li , Yuan Li Shen , Shu Wang , Susan Halabi , Fan Li