Statistics — Scifaro

A Parameterization-Invariant DIC

The classic Deviance Information Criterion (DIC) is not invariant to reparameterization and can have a negative and unstable effective number of parameters. The reason for the effective number of parameters being negative is actually that…

Methodology · Statistics 2026-05-28 Xingyao Xiao , Sophia Rabe-Hesketh

Learning to target with network interference

This paper studies adaptive targeting under network interference in a bandit setting, where treatments applied to one individual may affect others through spillover effects. We consider a linear model in a sparse regime, where each…

Machine Learning · Statistics 2026-05-28 Xiaomeng Wang , Hamsa Bastani , Osbert Bastani , Zhimei Ren

Day-Ahead Electricity Price Forecasting Using a Multivariate Group Lasso Method

Electricity price signals in modern power systems exhibit complex dependence structures that render forecasting inherently challenging. Our analysis of real-world pricing signals from the California Independent System Operator (CAISO)…

Applications · Statistics 2026-05-28 Keyi Wang , Jiaxiang Ji , Mahan Mansouri , Ahmed Aziz Ezzat

Soft Specialists: $\alpha$-R\'enyi Ensembles for Uncertainty-Aware LLM Post-Training

Existing training approaches for large language models learn a single set of parameters, based on large volumes of data, which is typically heterogeneous, conflicting and often outright contradictory. As a result, the model is forced to…

Machine Learning · Statistics 2026-05-28 Paula Cordero-Encinar , Georgy Tyukin , Andrew B. Duncan

Improving Power in Randomized Controlled Trials with Time-to-Event Endpoints: A Risk-Free Approach

Leveraging external or historical data to improve the efficiency of randomized clinical trials without introducing bias or inflating the Type I error rate remains challenging. Recent work on externally trained prognostic scores, such as…

Methodology · Statistics 2026-05-28 Junyi Zhou , Qing Liu , May Mo , Amy Xia

Likelihood-Free Inference for Multivariate Generalized Pareto Models

Likelihood-based inference for multivariate extreme-value models is often unreliable or infeasible when likelihoods are intractable or supports are discrete. This challenge is particularly acute for multivariate discrete generalized Pareto…

Applications · Statistics 2026-05-28 Samira Aka , Marie Kratz , Philippe Naveau

Unsupervised Identification and Removal of Spurious Correlations During Fine-Tuning

Fine-tuning a pretrained language model on a curated dataset can produce spurious correlations between the fine-tuning task and unintended latent factors -- such as misaligned personas or political slant -- that the curation procedure has…

Machine Learning · Statistics 2026-05-28 Ciarán M. Gilligan-Lee , Joseph Egan , Yuchen Zhu , Michael O'Riordan

Evolving and Detecting Multi-Turn Deception using Geometric Signatures

Safety defenses for large language models (LLMs) are typically trained and evaluated on single-turn prompts, yet real attacks often unfold as indirect, multi-turn probing. To defend against this more nuanced form of deception, we present a…

Machine Learning · Statistics 2026-05-28 Surender Suresh Kumar , Mary L. Cummings

BOOST: Power-Optimal Strong-FWER Testing for Block-Structured Multiplicity

Structured multiple-testing problems (gatekeeping trials, dose-finding, multi-tissue eQTL mapping, bundled-challenger A/B experiments) organize hypotheses into design-imposed blocks and demand strong family-wise error rate (FWER) control…

Methodology · Statistics 2026-05-28 Prasanjit Dubey , Xiaoming Huo

Implementing the principal stratum strategy for intercurrent events with survival outcomes: a tutorial

The International Council for Harmonization (ICH) E9 (R1) addendum provides the estimand framework to formulate treatment effects in a clinical trial. One of the attributes of an estimand the framework describes is intercurrent events.…

Methodology · Statistics 2026-05-28 Xiaoxiao Zhou , Joyce Chen , Pallavi Mishra-Kalyani , Xiaoxue Li , Yuan Li Shen , Shu Wang , Susan Halabi , Fan Li

Bayesian Imputation for Unplayed Games in Round-Robin Chess Tournaments: Application to Grand Chess Tour, Bucharest 2026

When a player withdraws mid-tournament from a round-robin chess event, organizers face a fundamental problem: how should scores be assigned for games that were never played? Current FIDE guidelines specify annulment if withdrawal occurs…

Methodology · Statistics 2026-05-28 Ravi Varadhan

Why pyrotechnics markets keep killing:a simple geometric argument for redesign

Fires and explosions in pyrotechnics retail markets recur worldwide with predictable regularity, killing dozens to hundreds of people in single events. This paper argues that the global topology of the market is the dominant determinant of…

Applications · Statistics 2026-05-28 Carlos M. Hernandez-Suarez , Alonso Sanchez-Maldonado , Carlos A. Robles-Hernandez

Purely analytic composites: Relative variance contributions of indicators corresponding to a priori indicator weights

Composites are often created to facilitate the work of decision-makers. Therefore, practical or theoretical considerations may lead to a priori weights of the indicators forming a composite. Composites that are created a weighted aggregates…

Applications · Statistics 2026-05-28 Andre Beauducel , Ned Kock

Accelerating Reinforcement Learning Training Using Simulation Surrogate Models

High-fidelity simulation models are widely used to analyze complex stochastic systems, but their high computational cost motivates the development of cheaper surrogate models that approximate the simulation model's input-output…

Machine Learning · Statistics 2026-05-28 Mohammadmahdi Ghasemloo , David J. Eckman , Yaxian Li

Semiparametrically Efficient Inference for Kernel Measures of Noise Heterogeneity

We develop semiparametrically efficient inference for kernel measures of noise heterogeneity in additive noise models. In many applications, the regression function is estimated using flexible machine learning methods. Downstream procedures…

Machine Learning · Statistics 2026-05-28 Jakub Wornbard , Zikai Shen , Dimitri Meunier , Arthur Gretton

Identifiable Bayesian Deep Generative Copulas with Unknown Layer Widths for Data with Arbitrary Marginal Distributions

Deep generative models offer powerful tools for multivariate data analysis, but their black-box architectures are often unidentified and difficult to interpret. We introduce the Deep Discrete Encoder (DDE) Copula, an identifiable and…

Machine Learning · Statistics 2026-05-28 Joseph Feldman , Yuqi Gu

Model--based clustering for spherical and hyper--spherical data using elliptically symmetric distributions

Model--based clustering for directional data data has attracted a lot of interest, but most methods utilize rotationally symmetric distributions. This paper suggests the use of elliptically symmetric distributions, namely the elliptically…

Methodology · Statistics 2026-05-28 Theodoros Perdikis , Nader Alharbi , Michail Tsagris

Iterative Causal Discovery: Per-Edge Impossibility Certificates, Tier-Aware Oracle Queries, and the $1+K$ Lower Bound

Causal-discovery algorithms return a directed graph, yet provide no principled means of distinguishing edge directions identified by the data from those assigned without an identifying assumption. Under the standard Markov and faithfulness…

Machine Learning · Statistics 2026-05-28 Eichi Uehara

Calibrated Inference for the Conditional Average Treatment Effect in the Few-Placebo Regime via Gaussian Processes

Estimating how much an intervention helps a given individual the conditional average treatment effect (CATE) is increasingly central to decision-making in medicine, economics, and policy, where an estimate is most useful when accompanied by…

Machine Learning · Statistics 2026-05-28 Eichi Uehara

When prompt perturbations break your A/B test: A valid statistical test for generative surveying

Generative surveying -- where collections of LLM-based personas provide feedback on messages -- has emerged as a cheap and scalable alternative to traditional market research. However, LLMs are sensitive to small variations in prompt design…

Methodology · Statistics 2026-05-28 Hayden Helm , Carey Priebe