统计方法学 — Scifaro

Modeling and forecasting subnational age distribution of death counts

Existing mortality forecasting methods focus on age-specific mortality rates, which lie in an unconstrained space and overlook the distributional nature of life-table death counts. Few studies have developed and compared forecasting methods…

统计方法学 · 统计学 2026-04-23 Han Lin Shang , Cristian F. Jiménez-Varón

Large multi-response linear regression estimation based on low-rank pre-smoothing

Pre-smoothing is a technique aimed at increasing the signal-to-noise ratio in data to improve subsequent estimation and model selection in regression problems. However, pre-smoothing has thus far been limited to the univariate response…

统计方法学 · 统计学 2026-04-23 Xinle Tian , Alex Gibberd , Matthew Nunes , Sandipan Roy

Identification strategies for combining an experimental study with external data

There is increasing interest in combining information from experimental studies, including randomized and single-group trials, with information from external experimental or observational data sources. Such efforts are usually motivated by…

统计方法学 · 统计学 2026-04-23 Lawson Ung , Guanbo Wang , Sebastien Haneuse , Miguel A. Hernán , Issa J. Dahabreh

Estimating the Number of Components in Finite Mixture Models via Variational Approximation

This work introduces a new method for selecting the number of components in finite mixture models (FMMs) using variational Bayes, inspired by the large-sample properties of the Evidence Lower Bound (ELBO) derived from mean-field (MF)…

统计方法学 · 统计学 2026-04-23 Chenyang Wang , Yun Yang

Regularized Exponentially Tilted Empirical Likelihood for Bayesian Inference

Bayesian inference with empirical likelihood faces a challenge as the posterior domain is a proper subset of the original parameter space due to the convex hull constraint. We propose a regularized exponentially tilted empirical likelihood…

统计方法学 · 统计学 2026-04-23 Eunseop Kim , Steven N. MacEachern , Mario Peruggia

A Goodness-of-Fit Test for Mixed-Effects Logistic Regression

Mixed-effects logistic regression is widely used for binary outcomes in hierarchical data, yet formal goodness-of-fit tests remain limited to random-intercept models and do not address sparse cluster settings. We extend a grouping-based…

统计方法学 · 统计学 2026-04-22 Ariel Linden

PRADAS: PRior-Assisted DAta Splitting for False Discovery Rate Control

In the FDR-controlling literature, mirror statistics offer a flexible alternative to $p$-value based procedures. When prior information is available, however, it is unclear how to incorporate mirror statistics in a principled way, and the…

统计方法学 · 统计学 2026-04-22 Yuanchuan Guo , Buyu Lin , Jun S. Liu

A Nonparametric Goodness-of-Fit Test for High-Dimensional Generalized Gaussian Distributions via Nearest-Neighbor Graphs

The multivariate generalised Gaussian distribution (MGGD) is commonly used to model high-dimensional vectors with non-Gaussian radial behaviour, ranging from sharp-peaked to heavy-tailed profiles. However, because many classical…

统计方法学 · 统计学 2026-04-22 Mehmet Sıddık Çadırcı , Yener Ünal

Random Reward Phase-Type Distributions with Applications in Latent Severity Modeling

This paper proposes an extension to discrete Phase-Type distributions (DPH) by introducing random rewards. These allow for modeling a system in which a visit to a certain state does not emit a deterministic reward. Instead, the rewards…

统计方法学 · 统计学 2026-04-22 Simon Pauli , Andreas Futschik

Multiscale Cochran-Mantel-Haenszel Scanning for Conditional Dependency

We propose a nonparametric approach to testing conditional independence and estimating conditional association, generalizing the Cochran-Mantel-Haenszel (CMH) test and odds-ratio estimator to continuous sample spaces. It leverages a…

统计方法学 · 统计学 2026-04-22 Gyeonghun Kang , Jialiang Mao , Li Ma

Transfer Learning for Degree-Corrected Mixed Membership Network Models

Statistical analysis of network data has attracted considerable attention in recent years, due to the rapid advancement of well-trained network models and the accessibility of large public network datasets. In this article, we propose a…

统计方法学 · 统计学 2026-04-22 Yong He , Kangxiang Qin , Haoran Tang

The General Formulation of Loss-Based Priors for Parameter Spaces

Loss-based priors assign probability mass to parameter values according to the inferential loss incurred when they are excluded from the parameter space, and provide a general solution for discrete parameters. Extending this idea to…

统计方法学 · 统计学 2026-04-22 Cristiano Villa

Overstuffed sandwiches and separation anxiety: finite-sample variance estimation for penalized GEE with near-separated binary data

Penalized generalized estimating equations (PGEE) stabilize point estimation for longitudinal binary data under near-separation, but inference still depends on how the sandwich variance is corrected. Existing corrections for PGEE can…

统计方法学 · 统计学 2026-04-22 Awan Afiaz , M. Shafiqur Rahman

Locally parametric nonparametric density estimation

This paper develops a nonparametric density estimator with parametric overtones. Suppose $f(x,\theta)$ is some family of densities, indexed by a vector of parameters $\theta$. We define a local kernel smoothed likelihood function which for…

统计方法学 · 统计学 2026-04-22 Nils Lid Hjort , M. C. Jones

How to quantify direct correlations between variables

Analyzing correlation between variables is often both the tool and the goal of modern science. A crucial question is whether the correlation between two variables is a direct correlation or only an indirect correlation through a confounder.…

统计方法学 · 统计学 2026-04-22 Shengjun Wu , Jeffery Wu

Stable Transport Meta-Analysis for Heterogeneous Cardiovascular Trials: A Nuisance-Anchor Framework with a Sign-Stability Diagnostic

Random-effects meta-analysis summarizes heterogeneous trials by estimating an average effect over the observed evidence base, which may not represent the clinically relevant target population. In cardiovascular medicine, treatment effects…

统计方法学 · 统计学 2026-04-22 Ibrahim Halil Tanboga

Deep Ranking with Heterogeneous Effects

Classical latent-score ranking models often fail to distinguish objects' intrinsic scores from contextual effects, which are typically nonlinear and can dominate the observed outcomes. To address this, we introduce a semiparametric ranking…

统计方法学 · 统计学 2026-04-22 Yuanhang Luo , Shuxing Fang , Ruijian Han , Yiming Xu

Nonparametric Identification and Estimation of Causal Effects on Latent Outcomes

How should researchers conduct causal inference when the outcome of interest is latent and measured imperfectly by multiple indicators? We develop a general nonparametric framework for identifying and estimating average treatment effects on…

统计方法学 · 统计学 2026-04-22 Jiawei Fu , Donald P. Green

Neyman-Pearson multiclass classification under label noise via empirical likelihood

In many classification problems, misclassification costs are highly asymmetric, while training labels are often corrupted due to measurement error, annotator variability, or adversarial noise. The Neyman-Pearson multiclass classification…

统计方法学 · 统计学 2026-04-22 Qiong Zhang , Qinglong Tian , Pengfei Li

Heterogeneous readmission prediction with hierarchical effect decomposition and regularization

Accurately predicting hospital readmission risks using electronic health records (EHRs) is critical for effective patient management and healthcare resource allocation. Patient populations in health systems are highly heterogeneous across…

统计方法学 · 统计学 2026-04-22 Ziren Jiang , Lingfeng Huo , Jue Hou , Mary Vaughan-Sarrazin , Maureen A. Smith , Jared D. Huling