统计方法学 — Scifaro

Is There an AI Bubble? Robust Date-Stamping for Periods of Exuberance

The recent surge in valuations among AI related firms has renewed concerns that markets may be entering a new phase of speculative exuberance, especially in the technology and semiconductor sectors at the center of the AI investment wave.…

统计方法学 · 统计学 2026-05-12 Abir Sarkar , Martin T. Wells

A Fast, Closed-Form Bandwidth Selector for the Beta Kernel Density Estimator

The Beta kernel estimator offers a theoretically superior alternative to the Gaussian kernel for unit interval data, eliminating boundary bias without requiring reflection or transformation. However, its adoption remains limited by the lack…

统计方法学 · 统计学 2026-05-12 Johan Hallberg Szabadváry

Deriving Complete Constraints in Hidden Variable Models

Hidden variable graphical models can sometimes imply constraints on the observable distribution that are more complex than simple conditional independence relations. These observable constraints can falsify assumptions of the model that…

统计方法学 · 统计学 2026-05-12 Michael C. Sachs , Erin E. Gabriel , Robin J. Evans , Arvid Sjölander

On the use of cross-fitting in causal machine learning with correlated units

In causal machine learning, the fitting and evaluation of nuisance models are often performed on separate partitions, or folds, of the observed data. This technique, called cross-fitting, eliminates bias introduced by the use of black-box…

统计方法学 · 统计学 2026-05-12 Salvador V. Balkus , Hasan Laith , Nima S. Hejazi

White noise testing for functional time series via functional quantile autocorrelation

We introduce a novel class of nonlinear tests for serial dependence in functional time series, grounded in the functional quantile autocorrelation framework. Unlike traditional approaches based on the classical autocovariance kernel, the…

统计方法学 · 统计学 2026-05-12 Ángel López-Oriona , Ying Sun , Hanlin Shang

Sequential Randomization Tests Using e-values: Applications for trial monitoring

Sequential monitoring of randomized trials traditionally relies on parametric assumptions or asymptotic approximations. We discuss a family of nonparametric sequential tests - collectively called e-RT - for binary, event-only, and…

统计方法学 · 统计学 2026-05-12 Fernando G Zampieri

Comparing Two Proxy Methods for Causal Identification

Identifying causal effects in the presence of unmeasured variables is a fundamental challenge in causal inference, for which proxy variable methods have emerged as a powerful solution. We contrast two major approaches in this framework: (1)…

统计方法学 · 统计学 2026-05-12 Helen Guo , Elizabeth L. Ogburn , Ilya Shpitser

State-Space Representation of INGARCH Models and Their Application in Insurance

Integer-valued generalized autoregressive conditional heteroskedastic (INGARCH) models are a popular framework for modeling serial dependence in count time-series. While convenient for modeling, prediction, and estimation, INGARCH models…

统计方法学 · 统计学 2026-05-12 Jae Youn Ahn , Hong Beng Lim , Mario V. Wüthrich

Valid Inference when Testing Violations of Parallel Trends for Difference-in-Differences

The difference-in-differences (DID) research design is a key identification strategy which allows researchers to estimate causal effects under the parallel trends assumption. While the parallel trends assumption is counterfactual and cannot…

统计方法学 · 统计学 2026-05-12 Jonas M. Mikhaeil , Christopher Harshaw

Causal Effect Estimation with TMLE: Handling Missing Data and Near-Violations of Positivity

We evaluate the performance of targeted maximum likelihood estimation (TMLE) for estimating the average treatment effect in missing data scenarios under varying levels of positivity violations. We employ model- and design-based simulations,…

统计方法学 · 统计学 2026-05-12 Christoph Wiederkehr , Christian Heumann , Michael Schomaker

Robust and Efficient Semiparametric Inference for the Stepped Wedge Design

Stepped wedge designs (SWDs) are increasingly used to evaluate longitudinal cluster-level interventions but pose substantial challenges for valid inference. Because crossover times are randomized, intervention effects are intrinsically…

统计方法学 · 统计学 2026-05-12 Fan Xia , K. C. Gary Chan , Emily Voldal , Avi Kenny , Patrick J. Heagerty , James P. Hughes

Untangling Sample and Population Level Estimands in Bayesian Causal Computation

Model-based Bayesian inference for sample and population-level causal estimands has been growing in popularity. This literature routinely emphasizes clear specification of the target estimand, however blind implementation of standard…

统计方法学 · 统计学 2026-05-12 Arman Oganisian

Hawkes Processes with Variable Length Memory: Existence, Inference and Application to Neuronal Activity

Multivariate Hawkes processes are past-dependant point processes originally introduced to model excitation effects, later extended to a nonlinear framework to account for the opposite effect, known as inhibition. Motivated by applications…

统计方法学 · 统计学 2026-05-12 Sacha Quayle , Anna Bonnet , Maxime Sangnier

Prediction of linear fractional stable motions using codifference, with application to non-Gaussian rough volatility

The linear fractional stable motion (LFSM) extends the fractional Brownian motion (fBm) by considering $\alpha$-stable increments. We propose a method to forecast future increments of the LFSM from past discrete-time observations, using the…

统计方法学 · 统计学 2026-05-12 Matthieu Garcin , Karl Sawaya , Thomas Valade

Generalized optimal parameter-transfer learning through Mallows-type model averaging

In many economic applications, multiple source datasets are available, but their effective combination is challenging due to heterogeneity across datasets. To address this problem, we study a parameter-transfer framework that shares only…

统计方法学 · 统计学 2026-05-12 Fen Jiang , Wenhui Li , Xinyu Zhang

gcor: A Python Implementation of Categorical Gini Correlation and Its Inference

Categorical Gini Correlation (CGC), introduced by Dang et al. (2020), is a novel dependence measure designed to quantify the association between a numerical variable and a categorical variable. It has appealing properties compared to…

统计方法学 · 统计学 2026-05-12 Sameera Hewage

Valid F-screening in linear regression

Suppose that a data analyst wishes to report the results of a least squares linear regression only if the overall null hypothesis, $H_0^{1:p}: \beta_1= \beta_2 = \ldots = \beta_p=0$, is rejected. This practice, which we refer to as…

统计方法学 · 统计学 2026-05-12 Olivia McGough , Daniela Witten , Daniel Kessler

Proximal Inference for Indirect and Intervening Effects in Population Interventions

Unmeasured confounding, unethical exposure, and ill-defined interventions pose significant challenges to evaluating policy-relevant mediation estimands in medicine and public health. In observational studies involving harmful exposures, the…

统计方法学 · 统计学 2026-05-12 Yang Bai , Yifan Cui , Baoluo Sun

Multivariable Behavioral Change Modeling of Epidemics in the Presence of Undetected Infections

Epidemic models are invaluable tools to understand and implement strategies to control the spread of infectious diseases, as well as to inform public health policies and resource allocation. However, current modeling approaches have…

统计方法学 · 统计学 2026-05-12 Caitlin Ward , Rob Deardon , Alexandra M. Schmidt

Efficient nonparametric estimation with difference-in-differences in the presence of network dependence and interference

Differences-in-differences (DiD) is a causal inference method for observational longitudinal data that assumes parallel expected potential outcome trajectories between treatment groups under the counterfactual scenario where all units…

统计方法学 · 统计学 2026-05-12 Michael Jetsupphasuk , Didong Li , Michael G. Hudgens