统计方法学 — Scifaro

Log-Laplace Nuggets for Fully Bayesian Fitting of Spatial Extremes Models to Threshold Exceedances

Flexible random scale-mixture models provide a framework for capturing a broad range of extremal dependence structures. However, likelihood-based inference under the peaks-over-threshold setting is often computationally infeasible, due to…

统计方法学 · 统计学 2026-04-10 Muyang Shi , Likun Zhang , Benjamin A. Shaby

Sample-split REGression SREG: A robust estimator for high-dimensional survey data

Model-assisted regression estimation is fundamental in survey sampling for incorporating auxiliary information. However, when the auxiliary dimension grows with the sample size, the standard Generalized regression (GREG) estimator can…

统计方法学 · 统计学 2026-04-10 Yonghyun Kwon , Shu Yang , Jae Kwang Kim

From Ground Truth to Measurement: A Statistical Framework for Human Labeling

Supervised machine learning assumes that labeled data provide accurate measurements of the concepts models are meant to learn. Yet in practice, human labeling introduces systematic variation arising from ambiguous items, divergent…

统计方法学 · 统计学 2026-04-10 Robert Chew , Stephanie Eckman , Christoph Kern , Frauke Kreuter

Climate-Aware Copula Models for Sovereign Rating Migration Risk

This paper develops a copula-based time-series framework for modelling sovereign credit rating activity and its dependence dynamics, with extensions incorporating climate risk. We introduce a mixed-difference transformation that maps…

统计方法学 · 统计学 2026-04-10 Marina Palaisti

Robust Mendelian Randomization Estimation using Weighted Quantile Regression

In Mendelian randomization (MR) studies, genetic variants are used as instrumental variables (IVs) to investigate causal relationships between exposures and outcomes based on observational data. However, numerous genetic studies have shown…

统计方法学 · 统计学 2026-04-10 Julien St-Pierre , Archer Y. Yang , Mireille E. Schnitzer , Marc-André Legault

A covariate-dependent Cholesky decomposition for high-dimensional covariance regression

Estimation of covariance matrices is a fundamental problem in multivariate statistics. Recently, growing efforts have focused on incorporating covariate effects into these matrices, facilitating subject-specific estimation. Despite these…

统计方法学 · 统计学 2026-04-10 Rakheon Kim , Emma Jingfei Zhang

Langevin-Gradient Rerandomization

Rerandomization is an experimental design technique that repeatedly randomizes treatment assignments until covariates are balanced between treatment groups. Rerandomization in the design stage of an experiment can lead to many asymptotic…

统计方法学 · 统计学 2026-04-10 Antônio Carlos Herling Ribeiro Junior

Regularized estimation for highly multivariate spatial Gaussian random fields

Estimating covariance parameters for multivariate spatial Gaussian random fields is computationally challenging, as the number of parameters grows rapidly with the number of variables, and likelihood evaluation requires operations of order…

统计方法学 · 统计学 2026-04-10 Francisco Cuevas-Pacheco , Gabriel Riffo , Xavier Emery

Eliciting core spatial association from spatial time series: a random matrix approach

Spatial time series (STS) data are fundamental to climate science, yet conventional approaches often conflate temporal co-evolution with genuine spatial dependence, obscuring subtle but critical climatic anomalies. We introduce a Random…

统计方法学 · 统计学 2026-04-10 Madhuchhanda Bhattacharjee , Arup Bose

Virtual Dummies: Enabling Scalable FDR-Controlled Variable Selection via Sequential Sampling of Null Features

High-dimensional variable selection, particularly in genomics, requires error-controlling procedures that scale to millions of predictors. The Terminating-Random Experiments (T-Rex) selector achieves false discovery rate (FDR) control by…

统计方法学 · 统计学 2026-04-10 Taulant Koka , Jasin Machkour , Daniel P. Palomar , Michael Muma

Poisson-response Tensor-on-Tensor Regression and Applications

We introduce Poisson-response tensor-on-tensor regression (PToTR), a novel regression framework designed to handle tensor responses composed element-wise of random Poisson-distributed counts. Tensors, or multi-dimensional arrays, composed…

统计方法学 · 统计学 2026-04-10 Carlos Llosa-Vite , Daniel M. Dunlavy

Directional-Shift Dirichlet ARMA Models for Compositional Time Series with Structural Break Intervention

Compositional time series frequently exhibit structural breaks due to external shocks, policy changes, or market disruptions. Standard methods either ignore such breaks or handle them through fixed effects that cannot extrapolate beyond the…

统计方法学 · 统计学 2026-04-10 Harrison Katz

Bayesian nonparametric models for zero-inflated count-compositional data using ensembles of regression trees

Count-compositional data arise in many different fields, including high-throughput sequencing experiments, ecological surveys, and palaeoclimate studies, where a common, important goal is to understand how covariates relate to the observed…

统计方法学 · 统计学 2026-04-10 André F. B. Menezes , Andrew C. Parnell , Keefe Murphy

Exact two-stage finite-mixture representations for species sampling processes

Discrete random probability measures are central to Bayesian inference, particularly as priors for mixture modeling and clustering. A broad and unifying class is that of proper species sampling processes (SSPs), encompassing many Bayesian…

统计方法学 · 统计学 2026-04-10 Ramsés H. Mena , Christos Merkatas , Theodoros Nicoleris , Carlos E. Rodríguez

Assessing whether two patient populations exhibit comparable event dynamics is essential for evaluating treatment equivalence, pooling data across cohorts, or comparing clinical pathways across hospitals or strategies. We introduce a…

统计方法学 · 统计学 2026-04-10 Zoe Kristin Lange , Maryam Farhadizadeh , Holger Dette , Nadine Binder

Inference on multiple quantiles in regression models by a rank-score approach

This paper tackles the challenge of performing multiple quantile regressions across different quantile levels and the associated problem of controlling the familywise error rate, an issue that is generally overlooked in practice. We propose…

统计方法学 · 统计学 2026-04-10 Riccardo De Santis , Anna Vesely , Angela Andreella

Covariate Adjustment Cannot Hurt: Treatment Effect Estimation under Interference with Low-Order Outcome Interactions

In randomized experiments, covariates are often used to reduce variance and improve the precision of treatment effect estimates. However, in many real-world settings, interference between units, where one unit's treatment affects another's…

统计方法学 · 统计学 2026-04-10 Xinyi Wang , Shuangning Li

Optimal Debiased Inference on Privatized Data via Indirect Estimation and Parametric Bootstrap

We design a debiased parametric bootstrap framework for statistical inference from differentially private data. Existing usage of the parametric bootstrap on privatized data ignored or avoided handling possible biases introduced by the…

统计方法学 · 统计学 2026-04-10 Zhanyu Wang , Arin Chang , Jordan Awan

Impact of Label Noise from Large Language Models Generated Annotations on Evaluation of Diagnostic Model Performance

Large language models (LLMs) are increasingly used to generate labels from radiology reports to enable large-scale AI evaluation. However, label noise from LLMs can introduce bias into performance estimates, especially under varying disease…

统计方法学 · 统计学 2026-04-10 Mohammadreza Chavoshi , Hari Trivedi , Janice Newsome , Aawez Mansuri , Chiratidzo Rudado Sanyika , Rohan Satya Isaac , Frank Li , Theo Dapamede , Judy Gichoya

Propensity Score Methods for Local Test Score Equating: Stratification and Inverse Probability Weighting

In test equating, ensuring score comparability across different test forms is crucial but particularly challenging when test groups are non-equivalent and no anchor test is available. Local test equating aims to satisfy Lord's equity…

统计方法学 · 统计学 2026-04-10 Gabriel Wallin , Marie Wiberg