统计方法学 — Scifaro

Balancing the privacy-utility trade-off: How to draw reliable conclusions from private data

Absolute anonymization, conceived as an irreversible transformation that prevents re-identification and sensitive value disclosure, has proven to be a broken promise. Consequently, modern data protection must shift toward a privacy-utility…

统计方法学 · 统计学 2026-03-16 Raphaël de Fondeville

Consistent and powerful CUSUM change-point test for panel data with changes in variance

This paper investigates change-point of variance in panel data models with time series of $\alpha$-mixing. Based on the cumulative sum (CUSUM) method and the individual differences, we construct a CUSUM test for panel data models to detect…

统计方法学 · 统计学 2026-03-16 Wenzhi Yang , Yueting Xu , Xiaoping Shi , Qiong Li

Inference for function-on-function regression: central limit theorem and residual bootstrap

We investigate asymptotic inference in a linear regression model where both response and regressors are functions, using an estimator based on functional principal components analysis. Although this approach is widely used in functional…

统计方法学 · 统计学 2026-03-16 Hyemin Yeon

Variational Bayes and Truncation approximations for Enriched Dirichlet process mixtures

A common impediment in conducting inference for Bayesian nonparametric models is either the need for complex MCMC algorithms and/or computational run-time for large datasets. We propose solutions here for Enriched Dirichlet process mixtures…

统计方法学 · 统计学 2026-03-16 Somnath Bhadra , Michael J. Daniels

Bayesian Covariate-Varying Interaction Analysis for Multivariate Count Data: Application to Microbiome Studies

Understanding covariate-varying interdependencies among features is of great interest in various applications. Motivated by microbiome studies where microbial abundances and interactions vary with environmental factors, we develop a…

统计方法学 · 统计学 2026-03-16 Shuangjie Zhang , Michael L. Patnode , Juhee Lee

Bayesian Conservative Policy Optimization (BCPO): A Novel Uncertainty-Calibrated Offline Reinforcement Learning with Credible Lower Bounds

Offline reinforcement learning (RL) aims to learn decision policies from a fixed batch of logged transitions, without additional environment interaction. Despite remarkable empirical progress, offline RL remains fragile under distribution…

统计方法学 · 统计学 2026-03-16 Debashis Chatterjee

Robust Sequential Hypothesis Testing with Generalized Estimating Equations for Incomplete Clustered and Longitudinal Data

Existing sequential generalized estimating equation methodology for longitudinal and group-correlated data focuses on narrow hypotheses concerning treatment efficacy and often makes modeling assumptions that impede the desirable robustness…

统计方法学 · 统计学 2026-03-16 Nathan T. Provost , Abdus S. Wahed

Conformal novelty detection with false discovery rate control at the boundary

Conformal novelty detection is a classical machine learning task for which uncertainty quantification is essential for providing reliable results. Recent work has shown that the BH procedure applied to conformal p-values controls the false…

统计方法学 · 统计学 2026-03-16 Zijun Gao , Etienne Roquain , Daniel Xiang

Empirical Bayes learning from selectively reported confidence intervals

We develop a statistical framework for empirical Bayes learning from selectively reported confidence intervals, and apply it to provide context for interpreting results published in MEDLINE abstracts. We use a collection of 326,060 z-scores…

统计方法学 · 统计学 2026-03-16 Hunter Chen , Junming Guan , Erik van Zwet , Nikolaos Ignatiadis

Defensive Model Expansion for Robust Bayesian Inference

Some applied researchers hesitate to use nonparametric methods, worrying that they will lose power in small samples or overfit the data when simpler models are sufficient. We argue that at least some of these concerns are unfounded when…

统计方法学 · 统计学 2026-03-16 Antonio R. Linero

Speeding up the ordered allocation sampler

The ordered allocation sampler is a Gibbs sampler designed to explore the posterior distribution in nonparametric mixture models. It encompasses both infinite mixtures and finite mixtures with random number of components, and it has be…

统计方法学 · 统计学 2026-03-16 Maria F. Gil-Leyva , Fidel Selva , Pierpaolo De Blasi

Left-truncated discrete lifespans: The AFiD enterprise panel

Our model for the lifespan of an enterprise is the geometric distribution. We do not formulate a model for enterprise foundation, but assume that foundations and lifespans are independent. We aim to fit the model to information about…

统计方法学 · 统计学 2026-03-16 Eric Scholz , Rafael Weißbach

Mesoscale two-sample testing for networks

Networks arise naturally in many scientific fields as a representation of pairwise connections. Statistical network analysis has most often considered a single large network, but it is common in a number of applications to observe multiple…

统计方法学 · 统计学 2026-03-16 Peter W. MacDonald , Elizaveta Levina , Ji Zhu

Bayesian Inference of Reproduction Number from Epidemiological and Genetic Data Using Particle MCMC

Inference of the reproduction number through time is of vital importance during an epidemic outbreak. Typically, epidemiologists tackle this using observed prevalence or incidence data. However, prevalence and incidence data alone is often…

统计方法学 · 统计学 2026-03-16 Alicia Gill , Jere Koskela , Xavier Didelot , Richard G. Everitt

Credible Intervals for Probability of Failure with Gaussian Processes

Estimating the probability of failure for expensive simulations is a central task in reliability analysis for structural design, power grid design, and safety certification, among other areas. This work derives credible intervals on the…

统计方法学 · 统计学 2026-03-16 Aleksei G. Sorokin , Vishwas Rao

When Respondents Don't Care Anymore: Identifying the Onset of Careless Responding

Questionnaires in the behavioral sciences tend to be lengthy. However, literature suggests that survey length is a contributing factor to careless responding, with longer questionnaires yielding higher probability that participants start…

统计方法学 · 统计学 2026-03-16 Max Welz , Andreas Alfons

Data-Driven Influence Functions for Optimization-Based Causal Inference

We study a constructive algorithm that approximates Gateaux derivatives for statistical functionals by finite differencing, with a focus on functionals that arise in causal inference. We study the case where probability distributions are…

统计方法学 · 统计学 2026-03-16 Michael I. Jordan , Yixin Wang , Angela Zhou

Bayesian Model Calibration with Integrated Discrepancy: Addressing Inexact Dislocation Dynamics Models

In this work, a novel approach to Bayesian model calibration routines is developed which reinterprets the traditional definition of model discrepancy as defined by Kennedy and O'Hagan (KOH). The novelty lies in the integration of…

统计方法学 · 统计学 2026-03-13 Liam Myhill , Enrique Martinez Saez , Sez Russcher

Distributionally balanced sampling designs

We propose Distributionally Balanced Designs (DBD), a new class of probability sampling designs that target representativeness at the level of the full auxiliary distribution rather than selected moments. In disciplines such as ecology,…

统计方法学 · 统计学 2026-03-13 Anton Grafström , Wilmer Prentius

Causal Influence Maximization with Steady-State Guarantees

Influence maximization in networks is a central problem in machine learning and causal inference, where an intervention on a subset of individuals triggers a diffusion process through the network. Existing approaches typically optimize…

统计方法学 · 统计学 2026-03-13 Renjie Cao , Zhuoxin Yan , Xinyan Su , Zhiheng Zhang