统计方法学 — Scifaro

Semiparametric Regression for Misclassified Competing Risks Data

The analysis of competing risks data is often complicated by misclassification of the cause of failure. This issue can lead to seriously biased estimates and invalid conclusions. One way to deal with such misclassification is to use a…

统计方法学 · 统计学 2026-05-19 Theofanis Balanos , Constantin T. Yiannoutsos , Felix M. Pabon-Rodriguez , Hongmei Nan , Giorgos Bakoyannis

Tests for constancy of model parameters Over time

Suppose that a sequence of data points follows a distribution of a certain parametric form, but that one or more of the underlying parameters may change over time. This paper addresses various natural questions in such a framework. We…

统计方法学 · 统计学 2026-05-19 Nils Lid Hjort , Alex J. Koning

Piece-wise linear isotonic regression

Isotonic regression provides a flexible, tuning-free approach to estimating monotonic functions without imposing global curvature constraints, yet the estimated regression function is inherently a step function. This paper addresses a key…

统计方法学 · 统计学 2026-05-19 Timo Kuosmanen , Juan F. Monge , José L. Ruiz , Xun Zhou

Parametrically Adaptive Transition Polynomial: a Signed-Parity Continuous-alpha Extension of Kunchenko Stochastic Polynomials

Kunchenko's method of polynomial maximization provides a semiparametric apparatus for parameter estimation under non-Gaussian errors, but its classical power basis relies on finite higher-order integer moments. This paper introduces the…

统计方法学 · 统计学 2026-05-19 Serhii Zabolotnii

A Bayesian Longitudinal Spatial Normative Model for Individualized Brain Deviation Mapping

Normative modeling enables individualized characterization of structural brain deviations by evaluating subjects against a reference population rather than a group average. Most existing implementations treat brain regions independently and…

统计方法学 · 统计学 2026-05-19 J. T. Korley

Sample size and power calculations for causal inference with time-to-event outcomes

This paper develops power and sample size formulas for causal inference with time-to-event outcomes. The target estimand is the marginal hazard ratio: the coefficient of a marginal structural Cox proportional hazard model with treatment as…

统计方法学 · 统计学 2026-05-19 Chengxin Yang , Bo Liu , Fan Li

Transporting treatment effects by calibrating large-scale observational outcomes

A high-quality experimental dataset is often much smaller than a corresponding observational dataset. When this holds with possibly biased measurements of the outcome of interest in the latter, we propose an estimation and inference…

统计方法学 · 统计学 2026-05-19 Harrison H Li

Addressing Confounding by Indication Through (Un)Measured Centre Characteristics in Learn-As-you-GO(LAGO) Trials

The Learn-As-you-Go (LAGO) design is an adaptive clinical trial design that allows modifications to multicomponent intervention packages across stages. Centers participate in more than one stage, as is common in large-scale implementation…

统计方法学 · 统计学 2026-05-19 Minh Thu Bui , Christopher T. Longenecker , Ante Bing , Donna Spiegelman , Allison R. Webel , Hayden B. Bosworth , Judith J. Lok

Predictive Volatility of Machine Learning in Micro-Samples: A Regularised Assessment of Regional Poverty

Identifying the structural drivers of poverty in regional datasets is frequently hindered by small sample sizes and high multidimensional collinearity, which can result in unstable and misleading policy advice. This paper evaluates the…

统计方法学 · 统计学 2026-05-19 A. H. Jamaluddin , A. T. R. Dani , N. I. Mahat , V. Ratnasari , S. S. M. Fauzi

Machine learning methods for finite population parameter estimation in survey sampling

This pedagogical review examines the use of machine learning methods in finite-population inference for survey sampling, with an emphasis on design-based validity and statistical inference. While flexible prediction tools offer substantial…

统计方法学 · 统计学 2026-05-19 Mehdi Dagdoug , David Haziza

Weak-Form Recovery of Stochastic Generators and Dynamical Invariants

Spectral gaps, Kramers escape rates, and position-dependent relaxation timescales are dynamical invariants encoded in the infinitesimal generator $\Lop$ of a stochastic flow. We show that weak projection of the governing It\^{o} SDE onto…

统计方法学 · 统计学 2026-05-19 Eshwar R A , Gajanan V. Honnavar

Global structure of the time delay likelihood

We identify a fundamental pathology in the likelihood for time delay inference which challenges standard inference methods. By analysing the likelihood for time delay inference with Gaussian process light curve models, we show that it…

统计方法学 · 统计学 2026-05-19 Namu Kroupa , Will Handley

Approximate Likelihood-Based Inference for Spatial Generalized Linear Mixed Models

We study maximum likelihood estimation for spatial generalized linear mixed models with Gaussian process approximations using a stochastic Newton-Raphson algorithm. We consider two Gaussian Process approximations in this context: spectral…

统计方法学 · 统计学 2026-05-19 Samuel I. Watson , Yixin Wang , Emanuele Giorgi

Exact inference via quasi-conjugacy in two-parameter Poisson-Dirichlet hidden Markov models

We introduce a nonparametric model for inferring time-evolving, unobserved probability distributions from discrete-time data consisting of unlabelled partitions. The latent process is a two-parameter Poisson-Dirichlet diffusion, and…

统计方法学 · 统计学 2026-05-19 Marco Dalla Pria , Matteo Ruggiero , Dario Spanò

A tree-based kernel for densities and its applications in clustering DNase-seq profiles

Modeling multiple sampling densities within a hierarchical framework enables borrowing of information across samples. These density random effects can act as kernels in latent variable models to represent exchangeable subgroups or clusters.…

统计方法学 · 统计学 2026-05-19 Yuliang Xu , Kaixuan Luo , Li Ma

Advancing clustering methods in physics education research: A case for mixture models

Clustering methods are often used in physics education research (PER) to identify subgroups of individuals within a population who share similar response patterns or characteristics. K-means (or k-modes, for categorical data) is one of the…

统计方法学 · 统计学 2026-05-19 Minghui Wang , Meagan Sundstrom , Karen Nylund-Gibson , Marsha Ing

Reliable fairness auditing with semi-supervised inference

Machine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating…

统计方法学 · 统计学 2026-05-19 Jianhui Gao , Jessica Gronsbell

Multivariate Poisson intensity estimation via low-rank tensor decomposition

In this work, we propose new matrix- and tensor-based methodologies for estimating multivariate intensity functions of inhomogeneous point processes. By viewing multivariate intensity functions as infinite-dimensional matrices or tensors…

统计方法学 · 统计学 2026-05-19 Haotian Xu , Carlos Misael Madrid Padilla , Oscar Hernan Madrid Padilla , Daren Wang

Sample size and power calculations for causal inference of observational studies

This paper investigates the theoretical foundation and develops analytical formulas for sample size and power calculations for causal inference with observational data. By analyzing the variance of an inverse probability weighting estimator…

统计方法学 · 统计学 2026-05-19 Bo Liu , Chengxin Yang , Fan Li

Family-wise Error Rate Control with E-values

The closure principle is a standard tool for achieving strong family-wise error rate (FWER) control in multiple testing problems. We develop an e-value-based closed testing framework that inherits nice properties of e-values, which are…

统计方法学 · 统计学 2026-05-19 Will Hartog , Lihua Lei