Statistical Methodology - Scifaro

Wild Bootstrap Inference for Non-Negative Matrix Factorization with Random Effects

Non-negative matrix factorization (NMF) is widely used for parts-based representations, yet formal inference for covariate effects is rarely available when the basis is learned under non-negativity. We introduce non-negative matrix…

Statistical Methodology · Statistics 2026-03-03 Kenichi Satoh

Wrapped flat-top kernel density estimation with circular data

Kernel density estimators with circular data have been studied extensively for decades, as they allow flexible estimations even when the shape of the underlying density is complex. Many recent studies have examined bias correction methods;…

Statistical Methodology · Statistics 2026-03-03 Yasuhito Tsuruta

Integration of Individual Participant and Aggregate Data Under Dataset Shift: Summary Statistic Comparison and Scalable Computation

Integrated IPD-AD analysis, which combines individual participant data (IPD) with aggregate data (AD), is increasingly recognized as an effective strategy for generating more reliable and generalizable inferences from heterogeneous studies.…

Statistical Methodology · Statistics 2026-03-03 Ming-Yueh Huang, Jing Qin, Chiung-Yu Huang

Robust measures of dispersion for circular data with an anomaly detection rule

Circular variables that represent directions or periodic observations arise in many fields, such as biology and environmental sciences. An important issue when dealing with circular data is how to estimate their dispersion robustly,…

Statistical Methodology · Statistics 2026-03-03 Houyem Demni, Mia Hubert, Giovanni C. Porzio, Peter J. Rousseeuw

Robust Weighted Triangulation of Causal Effects Under Model Uncertainty

A fundamental challenge in causal inference with observational data is correct specification of a causal model. When there is model uncertainty, analysts may seek to use estimates from multiple candidate models that rely on distinct, and…

Statistical Methodology · Statistics 2026-03-03 Rohit Bhattacharya, Ina Ocelli, Ted Westling

Interpreting Net Survival: What We Estimate Versus What We Think We Estimate

Net survival is conventionally defined as "survival if cancer were the only possible cause of death", an estimand corresponding to cancer-specific mortality alone. The Pohar Perme estimator targets this by removing general population…

Statistical Methodology · Statistics 2026-03-03 Matthew J. Smith

Beyond False Discovery Rate: A Stepdown Group SLOPE Approach for Grouped Variable Selection

High-dimensional feature selection is routinely required to balance statistical power with strict control of multiple-error metrics such as the k-Family-Wise Error Rate (k-FWER) and the False Discovery Proportion (FDP), yet some existing…

Statistical Methodology · Statistics 2026-03-03 Xuelin Zhang, Jingxuan Liang, Xinyue Liu, Hong Chen, Biqin Song

Laplace Variational Inference for Bayesian Envelope Models

Envelope models provide a sufficient dimension reduction framework for multivariate regression analysis. Bayesian inference for these models has been developed primarily using Markov chain Monte Carlo (MCMC) methods. Specifically, Gibbs…

Statistical Methodology · Statistics 2026-03-03 Seunghyeon Kim, Kwangmin Lee, Yeonhee Park

Detecting Distributional Differences in Spatially Correlated Multivariate Data via Kernel-Smoothed Rank-Based Empirical Copula Tests

Comparing multivariate yield quality distributions across spatially referenced agricultural fields is complicated by two pervasive features: non-normality and spatial autocorrelation. Classical procedures such as ANOVA, MANOVA, and standard…

Statistical Methodology · Statistics 2026-03-03 Marco Mandap

Hidden in Plain Sight: How Non-Collapsibility Biases Treatment Effects in (Network) Meta-Analysis

Network meta-analysis (NMA) is widely used to compare multiple interventions simultaneously by synthesizing direct and indirect evidence. The general fixed or random effects contrast-based NMA model can be applied to different outcomes and…

Statistical Methodology · Statistics 2026-03-03 Harlan Campbell, Jeroen P. Jansen

Robust Power and Sample Size Calculations in Quasi-likelihood Models: Methods and Practice

Accurate power and sample size (PSS) calculations are essential for designing studies that use quasi-likelihood (QL) models, which extend generalized linear models (GLMs) to settings where the full distribution of the outcome is not…

Statistical Methodology · Statistics 2026-03-03 Shijie Yuan, Amy Cochran, Paul Rathouz

Sensitivity Analysis for False Discovery Rate Estimation with Published p-Values

There is recent interest in estimating the false discovery rate (FDR) with published p-values. However, there is little formal research that addresses the manner and extent to which the presumed selection, or publication, bias model impacts…

Statistical Methodology · Statistics 2026-03-03 Tianyu Cao, Sangyoon Yi, Joshua Habiger

Synthetic Priors

Bayesian inference in generalized linear models requires a prior on the coefficient vector $\beta$. Practitioners naturally reason about response probabilities at specific covariate values, not about abstract log-odds parameters. We develop…

Statistical Methodology · Statistics 2026-03-03 Nick Polson, Vadim Sokolov

Causal Inference with MNAR Self-Masking Confounders: A Stratified Delta-Imputed Propensity Estimation Method

In observational studies, causal inference becomes difficult when confounders are missing-not-at-random (MNAR), particularly where the missingness depends on the confounder's own unreported value (self-masking). Existing methods for…

Statistical Methodology · Statistics 2026-03-03 Md. Niamul Islam Sium, Mohammad Hridoy Patwary

Fast distance computation of multivariate distributions via nonparanormal transport

With the increasing availability of data objects in the form of probability distributions, there is a growing need for statistical methods tailored to distributional data. Distance measures, especially the pairwise distance matrix between…

Statistical Methodology · Statistics 2026-03-03 Edward Shao, Junyoung Park, Naresh Punjabi, Hui Jiang, Irina Gaynanova

CliPS -- How to identify cluster distributions in Bayesian mixture models

We propose the CliPS procedure when fitting Bayesian mixture models in the context of model-based clustering to identify the cluster distributions while simultaneously assessing the suitability of a cluster solution and validating the…

Statistical Methodology · Statistics 2026-03-03 Gertraud Malsiner-Walli, Sylvia Frühwirth-Schnatter, Bettina Grün

Rejoinder to the discussants of the two JASA articles 'Frequentist Model Averaging' and 'The Focused Information Criterion', by Nils Lid Hjort and Gerda Claeskens

We are honoured to have our work read and discussed at such a thorough level by several experts. Words of appreciation and encouragement are gratefully received, while the many supplementary comments, thoughtful reminders, new perspectives…

Statistical Methodology · Statistics 2026-03-03 Nils Lid Hjort, Gerda Claeskens

Improved Conditional Logistic Regression using Information in Concordant Pairs with Software

We develop an improvement to conditional logistic regression (CLR) in the setting where the parameter of interest is the additive effect of binary treatment effect on log-odds of the positive level in the binary response. Our improvement is…

Statistical Methodology · Statistics 2026-03-03 Jacob Tennenbaum, Adam Kapelner

On the bias of the Hoover index estimator: Results for the gamma distribution

The Hoover index is a widely used measure of inequality with an intuitive interpretation, yet little is known about the finite-sample properties of its empirical estimator. In this paper, we derive a simple expression for the expected value…

Statistical Methodology · Statistics 2026-03-03 Roberto Vila, Helton Saulo

Beyond Maximum Likelihood: Variational Inequality Estimation for Generalized Linear Models

Generalized linear models (GLMs) are fundamental tools for statistical modeling, with maximum likelihood estimation (MLE) serving as the classical approach for parameter inference. While MLE performs well for canonical GLMs, it can become…

Statistical Methodology · Statistics 2026-03-03 Linglingzhi Zhu, Jonghyeok Lee, Yao Xie