统计方法学 — Scifaro

Different Statistical Perspectives for Understanding Generalisation in Graph Neural Networks

Graph Neural Networks (GNN) are currently the most popular approach for learning and prediction on graph-structured data and are deployed in various fields, from social network analysis to drug discovery. However, there is limited…

统计方法学 · 统计学 2026-05-26 Nil Ayday , Mahalakshmi Sabanayagam , Debarghya Ghoshdastidar

Rank-Based Tests for Mutual Independence of High-Dimensional Random Vectors via $L_q$ Norm

We consider the problem of testing mutual independence among the components of a high-dimensional random vector. Building on the rank-based max-sum framework, we introduce fixed finite-$L_q$ power-sum statistics under three general classes…

统计方法学 · 统计学 2026-05-26 Ping Zhao , Hongfei Wang , Long Feng

Information-Theoretic Reliability is Robust to Analytic Choice: A 24-Specification Multiverse on Public Cognitive Test-Retest Data

Background. The reliability paradox describes the empirical observation that cognitive tasks producing robust group-level effects often yield poor between-individual reliability. Existing approaches rely predominantly on the intraclass…

统计方法学 · 统计学 2026-05-26 Maria Westrin

Optimal Estimation of Discrete Multiview Distributions under Heteroskedastic Multinomial Sampling

Multiview latent-variable models provide a fundamental framework for discrete data analysis, with applications to latent structure models, topic models, and mixtures of product distributions. In the discrete setting, the joint distribution…

统计方法学 · 统计学 2026-05-26 Runshi Tang , Julien Chhor , Olga Klopp , Alexandre B. Tsybakov , Anru R. Zhang

Deep Regression for Repeated Measurements under Covariate Shift

This paper studies nonparametric regression with repeated measurements when the response in the target domain is unobservable or costly to collect. We adopt a transfer learning framework that leverages a source domain with observable…

统计方法学 · 统计学 2026-05-26 Yingxuan Wang , Xiangyu Xing , Wangli Xu

Distributional Conformal Prediction for Markov Processes

We introduce the Markov Distributional Conformal Prediction (MDCP) method that extends the distributional conformal prediction (previously developed for regression) to the setting of a strictly stationary Markov process. Instead of relying…

统计方法学 · 统计学 2026-05-26 Dehao Dai , Kejin Wu , Dimitris N. Politis

Adaptable High-Dimensional Change Point Detection via Ridge Regularization

We study the problem of detecting multiple change points in the mean vectors of an independent sequence of high-dimensional observations. We propose a family of ridge-regularized CUSUM statistics built upon the adaptable ridge-regularized…

统计方法学 · 统计学 2026-05-26 Haoran Li , Haotian Xu

Spiking the training data to correct for test set contamination

The literature on test set contamination largely focuses on detection, but the correction of contaminated test scores is underexplored. Our core proposal is to spike the training data by intentionally contaminating some test examples at…

统计方法学 · 统计学 2026-05-26 Johnny Tian-Zheng Wei , Jerry Li , Ameya Godbole , Robin Jia

Shared hidden-factor information framework for multiple behavioral tasks

Understanding cognitive processes in major depressive disorder (MDD) often relies on behavioral tasks, which are typically analyzed separately, overlooking potential correlations and shared latent structure. To address this limitation, we…

统计方法学 · 统计学 2026-05-26 Yuan Bian , Yuanjia Wang , Xingche Guo

Bayesian Conformal-Projective Prediction

We propose a general robust prediction framework, termed conformal-projective prediction (CPP), that integrates Bayesian predictive modeling with ideas from conformal prediction. Rather than assessing conformity through residual-based…

统计方法学 · 统计学 2026-05-26 Arkaprava Roy , Malay Ghosh

Synthetic Heterogeneous-Effects LASSO: A Fixed-effects Estimation Approach for High-dimensional Mixed-effects Models

This paper studies variable selection and post-selection inference for high-dimensional clustered data using marginal-model-based procedures. We show that, when covariates are heterogeneously distributed across clusters, marginal-model…

统计方法学 · 统计学 2026-05-26 Shangyuan Ye , Cong Zhang , Ying Chen , Ye Liang , Guanbo Wang

Using the target trial framework for combining information: external comparator analyses and other applications

We describe how the target trial framework can be used to plan and report analyses that attempt to answer causal questions by combining information from multiple, diverse sources. Such analyses may involve comparisons of treatments…

统计方法学 · 统计学 2026-05-26 Lawson Ung , Miguel A. Hernán , Issa J. Dahabreh

Post-Processing Posterior Predictive P-values

This article addresses issues of model criticism and model comparison in Bayesian contexts, and focusses on the use of the so-called posterior predictive p-values (ppp values). These involve a general discrepancy or conflict measure and…

统计方法学 · 统计学 2026-05-26 Nils Lid Hjort , Fredrik A. Dahl , Gunnhildur Högnadóttir Steinbakk

Modified treatment policies that depend on the natural history of treatment

Longitudinal modified treatment policies (LMTP) are a class of interventions that allow the definition, identification, and estimation of causal effects in general settings, such as with continuous or multivariate exposures, treatment…

统计方法学 · 统计学 2026-05-26 Iván Díaz , Nicholas T. Williams , Paweł Morzywołek , Kara E. Rudolph

PCA score regression: the art of losing power

The regression of principal component scores (RPCS) on covariates is a widely used analytic approach to detect and test for associations between functional measurements and study participant characteristics. Here we show that: (1) RPCS…

统计方法学 · 统计学 2026-05-26 Yu Lu , Nidhi Pai , Erjia Cui , Ciprian Crainiceanu

Variance-Reduced Manifold Sampling via Polynomial-Maximization Density Estimation

Uniform sampling on implicitly defined manifolds is a core primitive in motion planning, constrained simulation, and probabilistic machine learning. MASEM addresses this problem by entropy-maximizing resampling, but its resampling weights…

统计方法学 · 统计学 2026-05-26 Serhii Zabolotnii

Polynomial Maximization Method with Fractional Polynomial Basis: A Frequentist Bridge to Bayesian Fractional Polynomials

Fractional polynomials are widely used for dose-response modelling, and recent Bayesian fractional polynomial work has renewed interest in this finite model class. We propose PMM-FP, a frequentist extension of Kunchenko's polynomial…

统计方法学 · 统计学 2026-05-26 Serhii Zabolotnii

Learning Preferences from Conjoint Data: A Structural Deep Learning Approach

Conjoint experiments randomize multidimensional profiles, offering a powerful design for recovering structural preference parameters -- including marginal rates of substitution, willingness to pay, and the distribution of preferences across…

统计方法学 · 统计学 2026-05-26 Avidit Acharya , Jens Hainmueller , Yiqing Xu

Estimating Dynamic Marginal Policy Effects under Sequential Unconfoundedness

We develop methods for estimating how infinitesimal policy changes affect long-term outcomes in dynamic systems. We show that dynamic marginal policy effects (MPEs) can be identified via tractable reduced-form expressions, and can be…

统计方法学 · 统计学 2026-05-26 I-han Lai , Stefan Wager

Refined Inference for Asymptotically Linear Estimators with Non-Negligible Second-Order Remainders

Asymptotically linear estimators in semiparametric models are usually studied through a von Mises expansion in which first-order inference is based on the influence-function variance. This reduction is valid only when the second-order…

统计方法学 · 统计学 2026-05-26 Lin Li