统计方法学 — Scifaro

Adaptive procedures for boundary FDR control

A cornerstone of the multiple testing literature is the Benjamini-Hochberg (BH) procedure, which guarantees control of the FDR when $p$-values are independent or positively dependent. While BH controls the average quality of rejections, it…

统计方法学 · 统计学 2026-03-31 Sarah Mostow , Daniel Xiang

A Bayesian Functional Concurrent Zero-Inflated Dirichlet-Multinomial Regression Model with Application to Infant Microbiome

The infant microbiome undergoes rapid changes in composition over time and is associated with long-term risks of conditions such as immune strength, allergy, asthma, and other health outcomes. Modeling the associations between exposures or…

统计方法学 · 统计学 2026-03-31 Brody Erlandson , Ander Wilson , Matthew D. Koslovsky

The exact amount of t-ness that the normal model can tolerate

Suppose that the normal model is used for data $Y_1,\ldots,Y_n$, but that the true distribution is a t-distribution with location and scale parameters $\xi$ and $\sigma$ and $m$ degrees of freedom. The normal model corresponds to…

统计方法学 · 统计学 2026-03-31 Nils Lid Hjort

Age-Specific Logistic Regression with Complex Event Time Data

In attempt to advance the current practice for assessing and predicting the primary ovarian insufficiency (POI) risk in female childhood cancer survivors, we propose two estimating function based approaches for age-specific logistic…

统计方法学 · 统计学 2026-03-31 Haoxuan , Zhou , X. Joan Hu , Yi Xiong , Yan Yuan

Fast and Scalable Cellwise-Robust Ensembles for High-Dimensional Data

The analysis of high-dimensional data, common in fields such as genomics, is complicated by the presence of cellwise contamination, where individual cells rather than entire rows are corrupted. This contamination poses a significant…

统计方法学 · 统计学 2026-03-31 Anthony Christidis , Jeyshinee Pyneeandee , Gabriela Cohen-Freue

Gimbal Regression: Orientation-Adaptive Local Linear Regression under Spatial Heterogeneity

Local regression is widely used to explore spatial heterogeneity, but anisotropic or effectively low-dimensional neighborhoods can produce ill-conditioned local solves, causing coefficient variation driven by numerical artifacts rather than…

统计方法学 · 统计学 2026-03-31 Yuichiro Otani

A Calibration Framework for Inference with Partially Observed Data

Missing data is an universal problem in statistics. We develop a unified framework for estimating parameters defined by general estimating equations under a missing-at-random (MAR) mechanism, based on generalized entropy calibration…

统计方法学 · 统计学 2026-03-31 Mst Moushumi Pervin , Hengfang Wang , Jae Kwang Kim

A Doubled Adjacency Spectral Embedding Approach to Graph Clustering

Spectral clustering is a popular tool in network data analysis, with applications in a variety of scientific application areas. However, many studies have shown that classical spectral clustering does not perform well on certain network…

统计方法学 · 统计学 2026-03-31 Sinyoung Park , Matthew Nunes , Sandipan Roy

Difference-in-differences with stochastic policy shifts of a continuous treatment

Treatment effects of stochastic policy shifts quantify differences in outcomes across counterfactual scenarios with varying treatment distributions. Stochastic policy shifts may be of interest in settings where it is unrealistic or…

统计方法学 · 统计学 2026-03-31 Michael Jetsupphasuk , Chenwei Fang , Didong Li , Michael G. Hudgens

Omnibus goodness-of-fit tests for univariate continuous distributions based on trigonometric moments

We propose a new omnibus goodness-of-fit test based on trigonometric moments of probability-integral-transformed data. The test builds on the framework of the LK test introduced by Langholz and Kronmal [J. Amer. Statist. Assoc. 86 (1991),…

统计方法学 · 统计学 2026-03-31 Alain Desgagné , Frédéric Ouimet

Cross-World Assumption and Refining Prediction Intervals for Individual Treatment Effects

While average treatment effects (ATE) and conditional average treatment effects (CATE) provide valuable population- and subgroup-level summaries, they fail to capture uncertainty at the individual level. For high-stakes decision-making,…

统计方法学 · 统计学 2026-03-31 Juraj Bodik , Yaxuan Huang , Bin Yu

Bayesian Modeling for Aggregated Relational Data: A Unified Perspective

Aggregated relational data is widely collected to study social networks, in fields such as sociology, public health and economics. Many of the successes of ARD inference have been driven by increasingly complex Bayesian models, which…

统计方法学 · 统计学 2026-03-31 Owen G. Ward , Anna L. Smith , Tian Zheng

Fast Penalized Generalized Estimating Equations for Large Longitudinal Functional Datasets

Longitudinal binary or count functional data are common in neuroscience, but are often too large to analyze with existing functional regression methods. We propose one-step penalized generalized estimating equations that supports…

统计方法学 · 统计学 2026-03-31 Gabriel Loewinger , Alex W. Levis , Erjia Cui , Francisco Pereira

Exploration, Confirmation, and Replication in the Same Observational Study: A Two Team Cross-Screening Approach to Studying the Effect of Unwanted Pregnancy on Mothers' Later Life Outcomes

The long term consequences of unwanted pregnancies carried to term on mothers have not been much explored. We use data from the Wisconsin Longitudinal Study (WLS) and propose a novel approach, namely two team cross-screening, to study the…

统计方法学 · 统计学 2026-03-31 Samrat Roy , Marina Bogomolov , Ruth Heller , Amy M. Claridge , Tishra Beeson , Dylan S. Small

bayesNMF: Fast Bayesian Poisson NMF with Automatically Learned Rank Applied to Mutational Signatures

Bayesian Poisson Non-Negative Matrix Factorization (NMF) is widely used to model count data, including in cancer mutational signature analysis. However, standard Gibbs samplers rely on computationally expensive Poisson augmentation, and…

统计方法学 · 统计学 2026-03-31 Jenna M. Landy , Nishanth Basava , Giovanni Parmigiani

A framework for joint assessment of a terminal event and a score existing only in the absence of the terminal event

Analysis of data from randomized controlled trials in vulnerable populations requires special attention when assessing treatment effect by a score measuring, e.g., disease stage or activity together with onset of prevalent terminal events.…

统计方法学 · 统计学 2026-03-31 Klaus Kähler Holst , Andreas Nordland , Julie Funch Furberg , Lars Holm Damgaard , Christian Bressen Pipper

Finite mixture representations of zero-and-$N$-inflated distributions for count-compositional data

We provide novel probabilistic portrayals of two multivariate models designed to handle zero-inflation in count-compositional data. We develop a new unifying framework that represents both as finite mixture distributions. One of these…

统计方法学 · 统计学 2026-03-31 André F. B. Menezes , Andrew C. Parnell , Keefe Murphy

Combining BART and Principal Stratification to estimate the effect of intermediate variables on primary outcomes with application to estimating the effect of family planning on employment in Nigeria and Senegal

There is interest in learning about the causal effects of modern contraceptive use on empowerment outcomes. Data on this question often come from family planning (FP) programs that increase access to FP and facilitate contraceptive use…

统计方法学 · 统计学 2026-03-31 Lucas Godoy Garraza , Ilene Speizer , Leontine Alkema

Metric Oja Depth, New Statistical Tool for Estimating the Most Central Objects

The Oja depth (simplicial volume depth) is one of the classical statistical techniques for measuring the central tendency of data in multivariate space. Despite the widespread emergence of object data like images, texts, matrices or graphs,…

统计方法学 · 统计学 2026-03-31 Vida Zamanifarizhandi , Joni Virta

Convex estimation of Gaussian graphical regression models with covariates

Gaussian graphical models (GGMs) are widely used to recover the conditional independence structure among random variables. Recent work has sought to incorporate auxiliary covariates to improve estimation, particularly in applications such…

统计方法学 · 统计学 2026-03-31 Ruobin Liu , Guo Yu