Statistics — Scifaro

Generalized linear models with spatial dependence and a functional covariate

We extend generalized functional linear models under independence to a situation in which a functional covariate is related to a scalar response variable that exhibits spatial dependence-a complex yet prevalent phenomenon. For estimation,…

Methodology · Statistics 2026-05-22 Sooran Kim , Mark S. Kaiser , Xiongtao Dai

From Isotonic to Lipschitz Regression: A New Interpolative Perspective on Shape-restricted Estimation

This manuscript bridges nonparametric smoothness-based and shape-restricted estimation, which may appear as two disjoint paradigms in the field. The proposed approach is motivated by a conceptually simple observation: every Lipschitz…

Methodology · Statistics 2026-05-22 Kenta Takatsu , Tianyu Zhang , Arun Kumar Kuchibhotla

Robust and Efficient Estimation for a Discrete Distribution Using L2 Optimization

This paper proposes a novel method to estimate the rate parameter of the Poisson distribution. The proposed method employs the Cramer-von Mises type optimization which has been commonly used in estimating parameters of continuous…

Computation · Statistics 2026-05-22 Jiwoong Kim

Assessing the impact of tourist attractions through the integration of causal inference and demand-side economic analysis: A case study of the Sensoria experience museum in Holzminden, Germany

This research note investigates the impact of the experience museum Sensoria, opened in September 2024 in Holzminden, Germany, on local tourism demand and related direct and indirect effects. To this end, the study employs a novel approach…

Applications · Statistics 2026-05-21 Thomas Wieland

TCARD: Nearly Balanced Two-Level Designs with Treatment Cardinality Constraints with an Application to LLM Prompt Engineering

Modern experimental designs often face the so-called treatment cardinality constraint, which is the constraint on the number of included factors in each treatment. Experiments with such constraints are commonly encountered in engineering…

Methodology · Statistics 2026-05-21 Kexin Xie , Ryan Lekivetz , Xinwei Deng

Memorisation, convergence and generalisation in generative models

Generative neural networks learn how to produce highly realistic images from a large, but finite number of examples - or do they simply memorise their training set? To settle this question, Kadkhodaie, Guth, Simoncelli and Mallat (ICLR '24)…

Machine Learning · Statistics 2026-05-21 Antoine Maillard , Sebastian Goldt

Clustering Craters on the Moon with Dysfunctional Families

Summaries of craters on terrestrial bodies, such as the number and size distribution, are essential for understanding the history of the Solar System. Identifying craters, however, has not been automated and thus relies on expert…

Methodology · Statistics 2026-05-21 Nathan Weed , Emily Castleton , Dave Osthus , Brian Weaver , Richard L. Warr

Semiparametric Efficient Bilevel Gradient Estimation

Functional bilevel methods estimate a lower-level function and plug it into a hypergradient, but this plug-in gradient can retain first-order bias when the lower-level problem is learned nonparametrically. To remove this bias, we develop a…

Machine Learning · Statistics 2026-05-21 Fares El Khoury , Houssam Zenati , Nathan Kallus , Michael Arbel , Aurélien Bibaut

Bitcoin's Power Law: Weak Structure, Strong Forecasts

Bitcoin's price has been described as following a power law (PL) in time, $P \sim t^{\beta}$ with $\hat\beta \approx 5.7$ over 2010-2026. We test this claim using the Clauset-Shalizi-Newman protocol applied to Bitcoin's tail-relevant…

Applications · Statistics 2026-05-21 Carlos Baquero , Raquel Menezes

The Bayesian Gaussian Process Latent Variable Model for Spatio-Temporal Stream Networks

A variational inference-based framework for training a multi-output Gaussian process latent variable model, specifically tailored to the tails-up spatio-temporal stream network, is developed. Training, given a censored observational data…

Methodology · Statistics 2026-05-21 Marno Basson , Tobias M. Louw , Theresa R. Smith

How does limma-trend work? An empirical partially Bayes perspective

In high-throughput biology, it is common to fit thousands of linear regressions -- one per gene, protein, or other unit -- with very few samples per unit. Limma-trend, one of the most widely used methods in this setting, improves power by…

Methodology · Statistics 2026-05-21 Sagnik Nandy , Wanyi Ling , Nikolaos Ignatiadis

Large-Step Training Dynamics of a Two-Factor Linear Transformer Model

Gradient-flow analyses show that simplified linear transformers can learn the in-context linear-regression algorithm, but they do not explain the finite-step behavior of gradient descent at large learning rates. Motivated by empirical work…

Machine Learning · Statistics 2026-05-21 Krishnakumar Balasubramanian

A continuous-time Markov chain framework for population size estimation from multi-list data: accounting for absorbing lists and asymmetric interactions

We introduce a continuous-time Markov chain framework for estimating population size from multi-list data, which allows directional interactions to be modelled and can accommodate absorbing lists, such as death records, or more general data…

Methodology · Statistics 2026-05-21 Ophélie Schaller , Andrew Titman , Rachel McCrea

Theoretical guidelines for annealed Langevin dynamics in compositional simulation-based inference

Compositional score-based approaches to simulation-based inference (SBI) approximate the posterior over a shared parameter given $n$ independent observations by aggregating individually learned posterior scores: currently, there are two…

Machine Learning · Statistics 2026-05-21 Camille Touron , Gabriel V. Cardoso , Julyan Arbel , Pedro L. C. Rodrigues

Federated LoRA Fine-Tuning for LLMs via Collaborative Alignment

Low-rank adaptation (LoRA) has emerged as a powerful tool for parameter-efficient fine-tuning of large language models (LLMs). This paper studies LoRA under a federated learning setting, enabling collaborative fine-tuning across clients…

Machine Learning · Statistics 2026-05-21 Shuaida He , Liwen Chen , Long Feng

Laplace Approximations for Mixed-Effects and Gaussian Process Quantile Regression

Laplace approximations are a standard tool for computationally efficient inference in latent Gaussian models, but they fail for quantile regression with the asymmetric Laplace likelihood because the observed Hessian vanishes almost…

Methodology · Statistics 2026-05-21 Andrea Nava , Fabio Sigrist

A Rigorous, Tractable Measure of Model Complexity

An accurate assessment of a model's complexity is crucial for topics such as interpretation, generalization, and model selection. However, most existing complexity measures either rely on heuristic assumptions or are computationally…

Machine Learning · Statistics 2026-05-21 Oskar Allerbo , Thomas B. Schön

An Introduction to Copulas: a Complement

For many years I have taught an advanced statistical inference course for master's students using the text of Casella and Berger (2002). The book gives a comprehensive treatment of the core topics at a level that avoids measure theory while…

Other Statistics · Statistics 2026-05-21 Werner G. Müller

Conditioning Gaussian Processes on Almost Anything

Gaussian processes (GPs) offer a principled probabilistic model over functions, but exact inference is restricted to the linear-Gaussian regime. We establish an explicit equivalence between GPs and a class of linear diffusion models,…

Machine Learning · Statistics 2026-05-21 Henry Moss , Lachlan Astfalck , Thomas Cowperthwaite , Colin Doumont , Sam Willis , Philipp Hennig , Christopher Nemeth , Andrew Zammit-Mangion

Particle filtering methods for partially observed branching processes

This paper focuses on the estimation of partially observed branching processes. First, the estimators from a frequentist perspective proposed in the literature are reviewed. The main objective of this paper is to present computational tools…

Computation · Statistics 2026-05-21 Miguel González , Inés M. del Puerto , Manuel Serrano-Pastor