统计方法学 — Scifaro

Confounder selection via iterative graph expansion

Confounder selection, namely choosing a set of covariates to control for confounding between a treatment and an outcome, is arguably the most important step in the design of an observational study. Previous methods, such as Pearl's…

统计方法学 · 统计学 2026-03-24 F. Richard Guo , Qingyuan Zhao

Homogeneity and Sub-homogeneity Pursuit: Iterative Complement Clustering PCA

Principal component analysis (PCA), the most popular dimension-reduction technique, has been used to analyze high-dimensional data in many areas. It discovers the homogeneity within the data and creates a reduced feature space to capture as…

统计方法学 · 统计学 2026-03-24 Daning Bi , Le Chang , Yanrong Yang

AR-sieve Bootstrap for High-dimensional Time Series

This paper proposes a new AR-sieve bootstrap approach to high-dimensional time series. The major challenge of classical bootstrap methods on high-dimensional time series is two-fold: curse of dimensionality and temporal dependence. To…

统计方法学 · 统计学 2026-03-24 Daning Bi , Han Lin Shang , Yanrong Yang , Huanjun Zhu

Combining chains of Bayesian models with Markov melding

A challenge for practitioners of Bayesian inference is specifying a model that incorporates multiple relevant, heterogeneous data sets. It may be easier to instead specify distinct submodels for each source of data, then join the submodels…

统计方法学 · 统计学 2026-03-24 Andrew A. Manderson , Robert J. B. Goudie

Posterior inference via Hill's prediction model

This paper is concerned with the construction of prior free posterior distributions which rely on the use of one step ahead predictive distribution functions. These are typically more straightforward to motivate than prior distributions.…

统计方法学 · 统计学 2026-03-23 Pier Giovanni Bissiri , Chris Holmes , Stephen G. Walker

Approximate posterior recalibration

Bayesian inference is often implemented using approximations, which can yield interval estimates that are too narrow, not fully capturing the uncertainty in the posterior distribution. We address the question of how to adjust these…

统计方法学 · 统计学 2026-03-23 Tiffany Cai , Philip Greengard , Ben Goodrich , Andrew Gelman

Q-approximation of operating characteristics of clinical trial designs

Designing clinical trials requires evaluating multiple operating characteristics (OCs), such as the likelihood of an early stopping decision, the probability of detecting a treatment effect, and the Type I error rate. In most cases, these…

统计方法学 · 统计学 2026-03-23 Susanna Gentile , Daniel E. Schwartz , Riddhiman Saha , Lorenzo Trippa

On the Calibration of Bayesian Success Criteria and Operating Characteristics for Clinical Trials

Recently, the U.S. Food and Drug Administration (FDA) released draft guidance \citep{FDA2026} signaling a paradigm shift that facilitates the use of Bayesian methodology as the primary analysis and decision framework for drug approval. The…

统计方法学 · 统计学 2026-03-23 Peng Yang , Li Wang , Ying Yuan

Scalable and Robust Spatial Prediction via Multi-Resolution Ensembles of Predictive Processes

Gaussian processes provide a flexible framework for spatial prediction, but their computational cost limits applicability to large-scale data with large sample size $n$. Predictive processes (PPs), a popular low-rank approximation, mitigate…

统计方法学 · 统计学 2026-03-23 Nicolas Bianco , Nadja Klein

Cancer Survival Rates Are Misleading

Five-year cancer survival rates are widely reported and often interpreted to mean that early detection saves lives, that a late fatal diagnosis would have been prevented by earlier detection, and that increasing survival over time proves…

统计方法学 · 统计学 2026-03-23 Allen B. Downey

Estimation of Multivariate Functional Principal Components from Sparse Functional Data

Traditional Functional Principal Component Analysis typically focuses on densely observed univariate functional data, yet many applications, particularly in longitudinal studies, involve multivariate functional data observed sparsely and…

统计方法学 · 统计学 2026-03-23 Uche Mbaka , Michelle Carey

Objective Model Prior Probabilities in Variable Selection

For many years it was routine to use equal model prior probabilities in Bayesian model uncertainty analysis. At least twenty years ago it became clear that this was problematic, leading to support of much too large models in the…

统计方法学 · 统计学 2026-03-23 James Berger , Gonzalo García-Donato , Elías Moreno , Luis Pericchi

Learning to Bet for Horizon-Aware Anytime-Valid Testing

We develop horizon-aware anytime-valid tests and confidence sequences for bounded means under a strict deadline $N$. Using the betting/e-process framework, we cast horizon-aware betting as a finite-horizon optimal control problem with state…

统计方法学 · 统计学 2026-03-23 Ege Onur Taga , Samet Oymak , Shubhanshu Shekhar

Regression Adjustments for Double Randomization in Two-Sided Marketplaces

Multiple randomization designs (MRDs) are a class of experimental designs used to handle interference in two-sided marketplaces. We investigate regression adjustment strategies for estimating total, spillover, and direct effects in MRDs. We…

统计方法学 · 统计学 2026-03-23 Timothy Sudijono , Lihua Lei , Lorenzo Masoero , Suhas Vijaykumar , Guido Imbens , James McQueen

Coordinate Descent Algorithm for Least Absolute Deviations Regression

Least Absolute Deviations (LAD) regression provides a robust alternative to ordinary least squares by minimizing the sum of absolute residuals. However, its widespread use has been limited by the computational cost of existing solvers,…

统计方法学 · 统计学 2026-03-23 Zehaan Naik , Debasis Kundu

Transfer learning for high-dimensional Factor-augmented sparse linear model

In this paper, we study transfer learning for high-dimensional factor-augmented sparse linear models, motivated by applications in economics and finance where strongly correlated predictors and latent factor structures pose major challenges…

统计方法学 · 统计学 2026-03-23 Bo Fu , Dandan Jiang

An Order of Magnitude Time Complexity Reduction for Gaussian Graphical Model Posterior Sampling Using a Reverse Telescoping Block Decomposition

We consider the problem of fully Bayesian posterior estimation and uncertainty quantification in undirected Gaussian graphical models via Markov chain Monte Carlo (MCMC) under recently-developed element-wise graphical priors, such as the…

统计方法学 · 统计学 2026-03-23 Zejin Gao , Ksheera Sagar , Anindya Bhadra

Optimal two-phase sampling designs for generalized raking estimators with multiple parameters of interest

Large observational datasets, including those derived from electronic health records, are a valuable resource for medical research but are often affected by missingness, measurement error, and misclassification. Two-phase sampling with…

统计方法学 · 统计学 2026-03-23 Jasper B. Yang , Bryan E. Shepherd , Thomas Lumley , Pamela A. Shaw

A proxy-based approach for unmeasured confounding in electronic health records research

Electronic health records (EHR) are widely used to study clinical decisions, yet unmeasured confounding remains a persistent challenge. Proxy variables offer a potential solution. In EHR data, clinicians already record many such…

统计方法学 · 统计学 2026-03-23 Haley Colgate Kottler , Amy Cochran

Statistical Inference for Quasi-Infinitely Divisible Distributions via Fourier Methods

This study focuses on statistical inference for the class of quasi-infinitely divisible (QID) distributions, which was recently introduced by Lindner, Pan and Sato (2018). The paper presents a Fourier approach, based on the analogue of the…

统计方法学 · 统计学 2026-03-23 Vladimir Panov , Anton Ryabchenko