Statistical Methodology - Scifaro

Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

We study regression problems with distribution-valued responses and mixed distributional and Euclidean predictors. Under quadratic cost, the negative gradient of the Kantorovich potential represents, at each source location, the displacement…

Statistical Methodology · Statistics 2026-03-19 Kaheon Kim, Changbo Zhu

Robust Regression with Student's T: The Role of Degrees of Freedom

Linear regression estimators are known to be sensitive to outliers, and one alternative to obtain a robust and efficient estimator of the regression parameter is to model the error with Student's $t$ distribution. In this article, we…

Statistical Methodology · Statistics 2026-03-19 Amanda Ng, Shangkai Zhu, Archer Gong Zhang, Nancy Reid

Bounding causal effects with an unknown mixture of informative and non-informative missingness

In experimental and observational data settings, researchers often have limited knowledge of the reasons for missing outcomes. To address this uncertainty, we propose bounds on causal effects for missing outcomes, accommodating the scenario…

Statistical Methodology · Statistics 2026-03-19 Max Rubinstein, Denis Agniel, Larry Han, Marcela Horvitz-Lennon, Sharon-Lise Normand

High-dimensional Statistical Inference and Variable Selection Using Sufficient Dimension Association

Simultaneous variable selection and statistical inference is challenging in high-dimensional data analysis. Most existing post-selection inference methods require explicitly specified regression models, which are often linear, as well as…

Statistical Methodology · Statistics 2026-03-19 Shangyuan Ye, Shauna Rakshe, Ye Liang

Localized Sparse Principal Component Analysis of Multivariate Time Series in Frequency Domain

Principal component analysis has been a main tool in multivariate analysis for estimating a low dimensional linear subspace that explains most of the variability in the data. However, in high-dimensional regimes, naive estimates of the…

Statistical Methodology · Statistics 2026-03-19 Jamshid Namdari, Amita Manatunga, Fabio Ferrarelli, Robert Krafty

A marginalized three-part interrupted time series regression model for proportional data

Interrupted time series (ITS) analysis, which accounts for the temporal dependence of outcomes, is often used to evaluate the effectiveness of a health policy intervention. When the outcome of interest is a percentage or percentile, the data can be…

Statistical Methodology · Statistics 2026-03-19 Shangyuan Ye, Maricela Cruz, Ziyou Wang, Yun Yu

Regression and Dimension Reduction for Multivariate Mixed-Type Data via Semiparametric Gaussian Copula

Clinical and epidemiological studies encode participant information in multivariate vectors with mixed type variables on continuous, truncated, ordinal, and binary scales. Semiparametric Gaussian Copula (SGC) assumes that observed data is…

Statistical Methodology · Statistics 2026-03-19 Debangan Dey, Vadim Zipunnikov

Spatial Causal Tensor Completion for Multiple Exposures and Outcomes: An Application to the Health Effects of PFAS Pollution

Per- and polyfluoroalkyl substances (PFAS) are typically encountered as mixtures of distinct chemicals with distinct effects on multiple health outcomes. Estimating joint causal effects using spatially-dependent observed data is…

Statistical Methodology · Statistics 2026-03-18 Xiaodan Zhou, Brian J Reich, Shu Yang

Estimation and Hypothesis Testing of Fixed Effects Models-Based Uncertainty for Factor Designs

To analyze the uncertain data frequently encountered in practice, this paper proposes novel fixed-effects models that incorporate an uncertain measure to investigate variables of interest and nuisance variables in factor designs. First, an…

Statistical Methodology · Statistics 2026-03-18 Fan Zhang, Zhiming Li

A nonparametric approach to understand multivariate quantile dynamics in financial time series

Over the last decade, nonparametric methods have gained increasing attention for modeling complex data structures due to their flexibility and minimal structural assumptions. In this paper, we study a general multivariate nonparametric…

Statistical Methodology · Statistics 2026-03-18 Kunal Rai, Archi Roy, Itai Dattner, Soudeep Deb

A flexible wrapped Lindley-type distribution for angular data modelling

Flexible distributions for modelling angular data have received considerable attention in recent years, with ongoing work extending existing circular models to provide greater flexibility in capturing diverse angular behaviours. In this…

Statistical Methodology · Statistics 2026-03-18 Johan Ferreira, Delene van Wyk-de Ridder, Janet van Niekerk

Power Analysis for Prediction-Powered Inference

Modern studies increasingly leverage outcomes predicted by machine learning and artificial intelligence (AI/ML) models, and recent work, such as prediction-powered inference (PPI), has developed valid downstream statistical inference…

Statistical Methodology · Statistics 2026-03-18 Yiqun T. Chen, Moran Guo, Shengy Li

Time Partitioning in Target Trial Emulation

In target trial emulation, time partitioning enables researchers to handle time-varying confounders and immortal time bias with appropriate methods. Based on two clinical scenarios, this study aimed to explore issues related to time…

Statistical Methodology · Statistics 2026-03-18 Harold Tankpinou Zoumenou, Simon Ferreira, Charles Assaad, Nathanael Lapidus, Daria Bystrova, Benjamin Glemain

Bayesian Inference in Epidemic Modelling: A Beginner's Guide

This lecture note provides a self-contained introduction to Bayesian inference and Markov Chain Monte Carlo (MCMC) methods for parameter estimation in epidemic models. Using the classical Susceptible-Infectious-Recovered (SIR) compartmental…

Statistical Methodology · Statistics 2026-03-18 Augustine Okolie

Rank-based methods for estimating landmark win probability in longitudinal randomized controlled trials with missing data

The primary analysis for longitudinal randomized controlled trials (RCTs) often compares treatment groups at the last timepoint, referred to as the landmark time. Assuming data are normally distributed and missing at random, the mixed model…

Statistical Methodology · Statistics 2026-03-18 Guangyong Zou, Shi-Fang Qui, Joshua Zou, Emma Davies Smith, Yun-Hee Choi, Yuhan Bi

Differential gene expression analysis via two-component mixture models with a semiparametric skew-normal scale mixture alternative

Two-component mixture models are particularly useful for identifying differentially expressed genes, but their performance can deteriorate markedly when the alternative distribution departs from parametric assumptions or symmetry. We…

Statistical Methodology · Statistics 2026-03-18 Sangkon Oh, Geoffrey J. McLachlan

M-estimation under Two-Phase Multiwave Sampling with Applications to Prediction-Powered Inference

In two-phase multiwave sampling, inexpensive measurements are collected on a large sample and expensive, more informative measurements are adaptively obtained on subsets of units across multiple waves. Adaptively collecting the expensive…

Statistical Methodology · Statistics 2026-03-18 Dan M. Kluger, Stephen Bates

A Novel Multiple Imputation Approach For Parameter Estimation in Observation-Driven Time Series Models With Missing Data

Handling missing data in time series is a complex problem due to the presence of temporal dependence. General-purpose imputation methods, while widely used, often distort key statistical properties of the data, such as variance and…

Statistical Methodology · Statistics 2026-03-18 Guilherme Pumi, Taiane Schaedler Prass, Douglas Krauthein Verdum

Confidence Intervals for Extinction Risk: Validating Population Viability Analysis with Limited Data

Quantitative assessment of extinction risk requires confidence intervals (CIs) that remain informative with limited data. Their usefulness has long been debated because short observation spans can make uncertainty so large that population…

Statistical Methodology · Statistics 2026-03-18 Hiroshi Hakoyama

On the interplay between prior weight and variance of the robustification component in Robust Mixture Prior Bayesian Dynamic Borrowing approach

The Robust Mixture Prior (RMP) is a popular Bayesian dynamic borrowing method that combines an informative historical distribution with a less informative component (referred to as the robustification component) in a mixture prior to enhance the…

Statistical Methodology · Statistics 2026-03-18 Marco Ratta, Gaelle Saint-Hilary, Mauro Gasparini, Pavel Mozgunov