Statistics Methodology - Scifaro

Joint modelling of time-dependent biomarker variability and time-to-event outcomes, a two-step approach

Increasing evidence suggests that variability in longitudinal biomarkers, in addition to their mean trajectory, carries prognostic information for time-to-event outcomes. However, standard joint models typically capture only the expected…

Statistics Methodology · Statistics 2026-05-08 Felix Boakye Oppong, Dimitris Rizopoulos, Thierry Gorlia, Nicole Erler

Estimation of treatment effects in presence of differential use of post-randomization concomitant medication with time-to-event outcomes

In placebo-controlled randomized trials, the post-randomization use of concomitant medications may be higher in the placebo arm than in the treatment arm. This may dilute the full benefits of the randomized drug as estimated by the…

Statistics Methodology · Statistics 2026-05-08 Helene C. W. Rytgaard, Edwin Fong, Jens M. Tarp, Thomas A. Gerds, Mark J. van der Laan, Henrik Ravn

Detecting Changes in Causal Dependence with Kernels and Copulas

We propose a framework for determining whether the causal dependence of an outcome $Y$ on a covariate $X$ changes at a given time point, given confounders $\boldsymbol{Z}$. For instance, in financial markets, the effect of a market…

Statistics Methodology · Statistics 2026-05-08 Shakeel Gavioli-Akilagun, Kieran Wood, Francesco Quinzan

UD-DML: Uniform Design Subsampling for Double Machine Learning over Massive Data

Double machine learning (DML) delivers valid inference on low-dimensional causal parameters while permitting flexible nuisance estimation, but its computational cost becomes prohibitive once cross-fitted learners must be trained on massive…

Statistics Methodology · Statistics 2026-05-08 Yuanke Qu, Xiaoya Xu, Hengtao Zhang

Generative AI-Based Monte Carlo Simulation for Method Evaluation Using Synthetic Multilevel Data

The role of AI-generated synthetic data has recently been expanded to support realistic Monte Carlo simulations. However, guidance is limited on generating data with multilevel structures and designing simulations based on such data. This…

Statistics Methodology · Statistics 2026-05-08 Youmi Suk, Chenguang Pan, Weixuan Xiao

A Stein Characterization-type Omnibus Tests for the Discrete Pareto Distribution

The discrete Pareto (or Zeta, Zipf) distribution arises naturally in modeling rank-frequency data across diverse fields such as linguistics, demography, biology, and computer science. Despite its widespread applicability, goodness-of-fit…

Statistics Methodology · Statistics 2026-05-08 Deepesh Bhati, Bruno Ebner, Sakshi Khandelwal

Latent Impact and Differential Item Functioning Analysis for Asymmetric IRT Models

Differential item functioning (DIF) arises alongside latent population heterogeneity in many applications, and both must be accounted for when assessing measurement invariance. In many practical settings, however, the comparison groups are…

Statistics Methodology · Statistics 2026-05-08 Gabriel Wallin, Qi Huang

Socio-Conformal Calibration in Complex Survey Data: Marginal Validity Is Not Enough for Subgroup Reliability

Machine-learning systems used in survey-based social measurement require uncertainty estimates that are reliable across population subgroups, not merely valid in aggregate. We study ordinal conformal prediction for five-level AI-attitude…

Statistics Methodology · Statistics 2026-05-08 Amir Rafe, Subasish Das

Spectral Collapsed Gibbs Sampler for Bayesian Sparse Regression

Sparse regression based on global-local shrinkage priors is increasingly used for Bayesian modeling of modern high-dimensional data, but scaling up the Gibbs sampler for posterior inference remains a challenge. While much effort has gone…

Statistics Methodology · Statistics 2026-05-08 Andrew Chin, Xiyu Ding, Akihiko Nishimura

A renormalization-group inspired lattice-based framework for piecewise generalized linear models

We formally introduce a class of models inspired by renormalization group (RG) theory, built on additive hierarchical expansions analogous to those appearing in functional ANOVA and mixed-effects models. Like ReLU convolutional neural…

Statistics Methodology · Statistics 2026-05-08 Joshua C. Chang

Model Form Identification in High-Dimensional Functional Linear Regressions

High-dimensional functional data are becoming increasingly common in fields such as environmental monitoring and neuroimaging. This paper studies high-dimensional functional linear regression models that relate a scalar response to…

Statistics Methodology · Statistics 2026-05-08 Xingche Guo, Yehua Li, Pang Du

Causal Effect Estimation on Restricted Mean Survival Time in Case-Cohort Studies via a Matching Design

In large observational studies, the case-cohort design is commonly used to reduce the cost associated with covariate measurement. For survival outcomes, literature has suggested that the restricted mean survival time (RMST) be a more…

Statistics Methodology · Statistics 2026-05-08 Andy Ni, Wei-En Lu, Bo Lu

Bayesian Region Selection and Prediction in Poisson Regression with Spatially Dependent Global-Local Shrinkage Prior

High-dimensional spatially correlated covariates are common in regression models encountered in environmental sciences and other fields. In such models, the regression coefficients often exhibit a sparse structure with spatial dependence.…

Statistics Methodology · Statistics 2026-05-08 Zihan Zhu, Xueying Tang, Shuang Zhou

Multilevel Regression Modeling of Covariance Matrix Outcomes

Covariance matrix outcomes arise naturally in neuroimaging experiments to study brain functional connectivity. It is also of interest to understand how brain network organization varies with subject-level covariates. Existing covariance…

Statistics Methodology · Statistics 2026-05-08 Michelle Murphy Green, Xi Luo, Brian S. Caffo, Yi Zhao

Bayesian inference of sparsity in stable vector autoregressive processes

Advances in sensing technology have made it possible to collect large volumes of high-dimensional time-series data. In fields like genetics and neuroscience, key questions concern whether directed relationships between variables can be…

Statistics Methodology · Statistics 2026-05-08 Sarah E. Heaps, Ian H. Jermyn, Yujiang Wang, Darren J. Wilkinson

Heterogeneous Judge-Aware Ranking with Sensitivity, Disagreement, and Confidence

Pairwise comparisons from multiple judges are central to large language model evaluation and preference modeling, yet standard ranking pipelines often pool judgments into a single score vector, treating systematic judge disagreement as…

Statistics Methodology · Statistics 2026-05-08 Shibo Yu, Yingzhou Wang, Yan Chen, Guodong Li, Jin-Hong Du

Penalized KLIC Model Selection for the Generalized Method of Moments in Longitudinal Data with Time-Dependent Covariates

Model selection plays an important role in longitudinal data analysis, especially when models are estimated using the generalized method of moments (GMM) in the presence of time-dependent covariates. In this setting, the number of valid…

Statistics Methodology · Statistics 2026-05-08 Mahmud Hasan, Mathias Nthiani Muia, Mous-Abou Hamadou, Niloofar Ramezani

Structure Learning for Directed Trees with Zero-Inflated Compositional Nodes

Compositional data, which are vectors of proportions constrained to the probability simplex, arise frequently in modern scientific applications, including microbiome relative abundances across body sites and cell-type mixture weights…

Statistics Methodology · Statistics 2026-05-08 Shuangjie Zhang, Bani K. Mallick, Yang Ni

A Novel Exact Inference Approach for Log-Logistic Reliability Functions with Applications to Time-to-Event Data

The log-logistic distribution is a flexible distribution that can model a wide range of failure patterns in electrical, electronic, and mechanical engineering and is often used in reliability inference. However, the inference of the…

Statistics Methodology · Statistics 2026-05-08 Bowen Liu, Malwane M. A. Ananda, Sam Weerahandi

CBARA: Covariate-Balanced-and-Adjusted Response-Adaptive Randomization

We propose the covariate-balanced-and-adjusted response-adaptive randomization (CBARA) procedure for adaptive design in clinical trials, which integrates the complementary strengths of covariate-adjusted response-adaptive randomization…

Statistics Methodology · Statistics 2026-05-08 Hengjia Fang, Wei Ma