Statistics — Scifaro

Joint Object Tracking and Intent Recognition

This paper presents a Bayesian framework for inferring the posterior of the augmented state of a target, incorporating its underlying goal or intent, such as any intermediate waypoints and/or the final destination. Thus, it is for joint…

Applications · Statistics 2026-05-25 Jiaming Liang , Bashar I. Ahmad , Simon Godsill

Faster Hamiltonian Monte Carlo by Learning Leapfrog Scale: a self-calibrated randomized solution

We introduce a Hamiltonian Monte Carlo (HMC) methodology based on a randomized selection of integration times, referred to as eHMC, where "e" stands for empirical. The approach relies on an offline calibration phase that leverages…

Computation · Statistics 2026-05-25 Changye Wu , Pierre Pudlo , Christian P. Robert , Julien Stoehr

Comparison of probabilistic nowcasts and forecasts of SARS-CoV-2 variant proportions made by hierarchical multinomial linear regression models

Nowcasting and forecasting of infectious diseases have become increasingly important since the SARS-CoV-2 pandemic. In particular, methods for modeling the composition of circulating variants at a given time have seen more use in part due…

Applications · Statistics 2026-05-22 Isaac MacArthur , Thomas Robacker , Evan L. Ray , Benjamin W. Rogers , Nicholas G. Reich , Maryclare Griffin

Positive-definiteness in separable priors: effects on prior interpretability and inference

A popular class of priors for symmetric positive-definite matrices assumes independent entries and adds a truncation to ensure positive-definiteness. While conceptually simple and often computationally convenient, unless done carefully this…

Methodology · Statistics 2026-05-22 Jack Storror Carter , David Rossell

A new class of functional conditional autoregressive models

We introduce a new class of conditional autoregressive models for spatially dependent functional data, formulated through conditional means given neighboring functional observations and characterized by a covariance operator and a spatial…

Methodology · Statistics 2026-05-22 Sooran Kim

A Martingale Kernel Independence Test

The Hilbert-Schmidt Independence Criterion (HSIC) and its joint-independence extension $d\mathrm{HSIC}$ are degenerate $V$-statistics whose data-dependent weighted-$\chi^2$ null limits force a permutation calibration that multiplies the…

Machine Learning · Statistics 2026-05-22 Felix Laumann , Zhaolu Liu , Mauricio Barahona

Do Not Trust The Auctioneer: Learning to Bid in Feedback-Manipulated Auctions

Shilling is the use of artificial bids to make competition appear stronger and push prices upward. We study repeated first-price auctions in which shilling affects feedback but not allocation: the learner wins or loses against the real…

Machine Learning · Statistics 2026-05-22 Luigi Foscari , Matilde Tullii , Vianney Perchet

From Volterra Series to Kunchenko Stochastic Polynomials: Half a Century of Non-Gaussian Estimation Methodology

This paper reconstructs the half-century evolution of the scientific school founded by Yuriy P. Kunchenko (1939--2006) as the development of a semiparametric methodology for non-Gaussian estimation. Starting with Kunchenko's 1972/1973…

Methodology · Statistics 2026-05-22 Serhii Zabolotnii

Departure from Regularity: Degree Heterogeneity and Eigengap as the Structural Drivers of ASE-LSE Latent Subspace Disagreement

Two of the most widely used methods for analysing graph data, Adjacency Spectral Embedding and Laplacian Spectral Embedding, often produce different results when applied to the same network. Yet the structural reasons behind this…

Machine Learning · Statistics 2026-05-22 Minh Triet Pham , Ian Gallagher

Chained Markov melding using divide and conquer sequential Monte Carlo

Specifying a full Bayesian model that integrates multiple data sources can be challenging. One natural approach is to specify each individual model separately and join them afterwards. This is the approach adopted in Markov melding.…

Methodology · Statistics 2026-05-22 Yixuan Liu , Robert J. B. Goudie

moveEZ: An R Package for Animated Biplots

The moveEZ (pronounced move easy) R package provides tools for constructing animated PCA biplots that reveal how multivariate structure evolves across the ordered levels of a categorical variable. Built as an extension to the biplotEZ…

Computation · Statistics 2026-05-22 Raeesa Ganey , Johané Nienkemper-Swanepoel

Bayesian Nonparametrics: Principles and Practice

This extended preface [to the Book `Bayesian Nonparametrics', Cambridge University Press, 2010, by NL Hjort, CC Holmes, P Mueller, SG Walker] is meant to explain why you are right to be curious about Bayesian nonparametrics -- why you may…

Methodology · Statistics 2026-05-22 Nils Lid Hjort , Chris Holmes , Peter Mueller , Stephen G. Walker

A critical comparison of handling zeros in high-dimensional compositional count data

The growing use of high-throughput sequencing (HTS) has enabled the large-scale production of compositional count data, driving progress in microbiome research. However, such count data are often high-dimensional, over-dispersed, and…

Other Statistics · Statistics 2026-05-22 Wenqi Tang , Kamila Fačevicová , Klaus Nordhausen , Sara Taskinen

From Betting to Empirical Bernstein LIL

This is a verbatim copy of a technical report I wrote in 2017-2018 to obtain the law of the iterated logarithm using the guarantee on the wealth of an online betting strategy.

Machine Learning · Statistics 2026-05-22 Francesco Orabona

Two-stage Ensemble Clustering of Functional Data Using Random Projections

We propose a computationally simple framework for clustering functional data based on Gaussian-process-generated random projections. In this approach, each curve is first projected onto a large collection of independent Gaussian process…

Methodology · Statistics 2026-05-22 Sourav Chakrabarty , Anirvan Chakraborty , Shyamal K. De

A Mixed Self-Exciting Process to Model Epileptic Seizures

Epilepsy is a neurological disorder characterized by recurrent seizures affecting more than 70 million people worldwide. Often, an individual with epilepsy is more likely to experience subsequent seizures following an initial seizure, a…

Methodology · Statistics 2026-05-22 Karen Kanaster , Giovani L. Silva , Peter Mueller , Jacob Pellinen , Elizabeth Juarez-Colunga

Eigen for Statistical and Machine Learning Computing: A Lightweight C++ Tutorial with Python Bindings

This note provides a lightweight tutorial on using Eigen, a C++ template library for linear algebra, to implement statistical and machine learning algorithms. The emphasis is practical rather than methodological: we show how common matrix…

Computation · Statistics 2026-05-22 Seyoung Lee , Kwan-Young Bak

Testing for Serial Independence via Auto Hilbert-Schmidt Independence Criterion

We develop a Hilbert--Schmidt independence criterion (HSIC)-based framework for testing serial independence in strictly stationary time series. The proposed auto Hilbert--Schmidt independence criterion (AutoHSIC) measures dependence between…

Methodology · Statistics 2026-05-22 Muyi Li , Yuqing Xu , Zhou Zhou

Uniform-in-Time Weak Propagation-of-Chaos in Shallow Neural Networks

We consider one-hidden layer neural networks trained in the feature-learning regime using gradient descent, and relate the output of the finite-width network $f_{\hat{\rho}_t^m}$ to its infinite-width counterpart $f_{\rho_t^{MF}}$, which…

Machine Learning · Statistics 2026-05-22 Margalit Glasgow , Joan Bruna

Selecting Informative Conformal Prediction Sets with an Optimized FCR-Controlled Approach

Conformal methods provide prediction sets for outcomes with confidence guarantees. We study their use in a selective inference setting, where inference is performed only when the prediction set is informative. The analyst may consider as…

Methodology · Statistics 2026-05-22 Israela Solomon , Etienne Roquain , Saharon Rosset , Ruth Heller