统计方法学 — Scifaro

Model-agnostic information transfer and fusion for classification with label noise

Label noise presents a fundamental challenge in modern machine learning, especially when large-scale datasets are generated via automated processes. An increasingly common and important data paradigm, particularly in domains like medical…

统计方法学 · 统计学 2026-04-29 Zhu Guojun , Zhang Sanguo , Ren Mingyang

Bayesian Environment Invariant Regression

The availability of data from multiple heterogeneous environments has motivated methods that remain reliable under distributional shifts. When the joint distribution of response and predictors varies across environments, the response may…

统计方法学 · 统计学 2026-04-29 Ruqian Zhang , Juan Shen , Yijiao Zhang

Testing linear combinations of multiple variance components

We test the hypothesis that simulataneous linear contrasts of multiple variance components equal zero in a Gaussian variance components model via a parametric bootstrap. Applications include but are not limited to nested and crossed…

统计方法学 · 统计学 2026-04-29 Alex Stringer , Jeffrey Negrea

Detecting Changes in Production Frontiers

We study the problem of estimating locations in time at which the level of technology in an economy changes when given a sequence of time ordered inputs and outputs. We approach the problem through the lens of nonparametric frontier…

统计方法学 · 统计学 2026-04-29 Shakeel Gavioli-Akilagun , Yining Chen , Flavio Ziegelmann

Post-Hoc Inference of Cross-Classified Statistics from Hierarchical Bayes Survey Weights

Tam [2026] shows that combining Bethel multivariate allocation with Hierarchical Bayes (HB) small area models can substantially reduce survey sample sizes while maintaining domain-level precision and near-nominal coverage of posterior…

统计方法学 · 统计学 2026-04-29 Siu-Ming Tam

Variable Fusion and Selection via a Spike-and-Slab Approach with Nonlocal Priors

Variable fusion in linear regression models is a statistical method that identifies covariates making similar contributions to the response variable and imposes the same coefficient values on them. Many methods for variable fusion also…

统计方法学 · 统计学 2026-04-29 Junya Miyake , Akira Okazaki , Shuichi Kawano

Bayesian integration G-formula for platform SMART designs allowing for adding new treatments

Dynamic treatment regimes (DTRs) are sequences of decision rules to guide treatment assignments in response to a patient's evolving, time-varying disease status. Sequential multiple assignment randomized trials (SMARTs) are considered the…

统计方法学 · 统计学 2026-04-29 Xinru Wang , Meghna Bose , Bibhas Chakraborty , Robert Mahar

Generalized Local Polynomial Regression with Decomposed Context-Aware Kernels

Local Polynomial Regression (LPR) is a powerful tool for nonparametric smoothing, yet it traditionally suffers from a "Euclidean tautology": the variables used to define the local neighborhood are identical to those used in the polynomial…

统计方法学 · 统计学 2026-04-29 Yaniv Shulman

Functional Autoregression Without Truncation: A Continuous-Regularization Approach

Functional autoregressive models of order one (FAR(1)) are predominantly estimated by projecting curves onto leading functional principal components and fitting a vector autoregression in score space, requiring a discrete truncation level…

统计方法学 · 统计学 2026-04-29 Yao Zhao

Tail allocation for conformal prediction intervals

We study split-conformal prediction for regression when the reported prediction set must be a single interval, at target marginal coverage $1-\alpha$, where $\alpha$ is the nominal miscoverage level. Under this reporting constraint, the…

统计方法学 · 统计学 2026-04-29 Tianying Wang

Online Learning for Autoregressive Multilayer Stochastic Block Models under Stationarity and Non-Stationarity

Dynamic multilayer networks arise in many applications where multiple types of relations among a common set of nodes evolve over time. Existing approaches often assume temporal independence, focus on single-layer networks or impose…

统计方法学 · 统计学 2026-04-29 Fan Wang , Haotian Xu , Yi Yu

Fractionally Supervised Classification with Maxima Nominated Samples

Fractionally supervised classification (FSC) offers a flexible framework for combining labeled and unlabeled data in model-based classification, but existing formulations assume simple random sampling. In many applications, however, the…

统计方法学 · 统计学 2026-04-29 Mohammad Jafari Jozani , Jingyu Wang

Conflict Forecasting via Conformal Prediction for Markov Processes

Whether or not a country is at war, or experiencing escalating or deescalating levels of conflict, has massive ramifications on a country's national and foreign policy. Given a country's history of conflict, or lack thereof, future…

统计方法学 · 统计学 2026-04-29 Aditya Basarkar , Emmett B. Kendall , David Randahl , Jonathan P. Williams , Gudmund H. Hermansen

Density-valued VAR Models with Latent Factors

We propose a density-valued vector autoregressive model with latent factors for multivariate time series of density functions. Motivated by weekly regional distributions of SARS-CoV-2 cycle threshold (Ct) values in Brazil, we study their…

统计方法学 · 统计学 2026-04-29 Yasumasa Matsuda , Michel F. C. Haddad

Evolving Longitudinal Patient Histories and Re-enrollment in Master Protocol Trials

A master protocol trial uses a single overarching protocol to test multiple therapies, often across several diseases or subtypes. Although such trials offer considerable flexibility and efficiency, their constrained and non-uniform…

统计方法学 · 统计学 2026-04-29 Shiyu Wan , Yuhan Qian , Yanyao Yi , Nicole Mayer-Hamblett , Patrick J. Heagerty , Ting Ye

Rectified Fisher-Bingham Model for Compositional Data with Zeros

This paper introduces a rectified and renormalized Fisher-Bingham model for compositional data with zeros, motivated in part by the presence of zeros in microbiota studies. The approach represents compositions through a square-root…

统计方法学 · 统计学 2026-04-29 Eugene Han , Marahi Perez-Tamayo , Hannah D. Holscher , Ruoqing Zhu

Network-aware IV Regression for Causal Node Discovery and Estimation

Estimating causal effects from high-dimensional, structured exposures is a fundamental challenge in modern applications ranging from neuroscience and finance to environmental science. While the literature has addressed high-dimensional…

统计方法学 · 统计学 2026-04-29 Samhita Pal , Dhrubajyoti Ghosh

Finite Mixture Modeling with Riemannian Gaussian Distributions on Hyperbolic Space

Hyperbolic space is increasingly used for hierarchical, tree-like, and network-structured data, but likelihood-based density modeling on hyperbolic space remains relatively limited. This paper develops finite mixture modeling with isotropic…

统计方法学 · 统计学 2026-04-29 Kisung You

Large-Sample Bayesian Approximations for Privatized Data

The increased use of differential privacy (DP) has allowed the sharing of large amounts of data while reducing the risk of disclosure of sensitive information at the individual level. However, the noise introduced by DP methods makes…

统计方法学 · 统计学 2026-04-29 Jordan Awan , Xi Chen , Roberto Molinari

A Robust Framework for Two-Sample Mendelian Randomization under Population Heterogeneity

Mendelian randomization is a powerful tool for causal inference in observational studies. The two-sample summary-data design, which estimates genetic associations with exposures and outcomes in separate cohorts, is the most widely used…

统计方法学 · 统计学 2026-04-29 Dingke Tang , Xuming He , Shu Yang