Statistics Theory - Scifaro

Conditional validity and a fast approximation formula of full conformal prediction sets

Prediction sets based on full conformal prediction have seen an increasing interest in statistical learning due to their universal marginal coverage guarantees. However, practitioners have refrained from using it in applications for two…

Statistics Theory · Mathematics 2025-08-08 Nicolai Amann

Kaminsky Type Functional Equations and Bivariate Residual Lifetimes Distributions

This paper considers generalizations of the functional equations that characterize the lack-of-memory properties at univariate and bivariate levels. Specifically, we extend the univariate functional equation introduced by Kaminsky (1983)…

Statistics Theory · Mathematics 2025-08-08 Sabrina Mulinacci, Massimo Ricci

General asymptotic representations of indexes based on the functional empirical process and the residual functional empirical process and applications

The objective of this paper is to establish a general asymptotic representation (GAR) for a wide range of statistics, employing two fundamental processes: the functional empirical process (fep) and the residual functional…

Statistics Theory · Mathematics 2025-08-08 Gane Samb Lo, Tchilabalo Abozou Kpanzou, Gandasor Bonyiri Onesiphore Da

Weak Identification in Peer Effects Estimation

It is commonly accepted that some phenomena are social: for example, individuals' smoking habits often correlate with those of their peers. Such correlations can have a variety of explanations, such as direct contagion or shared…

Statistics Theory · Mathematics 2025-08-08 William W. Wang, Ali Jadbabaie

On the asymptotic validity of confidence sets for linear functionals of solutions to integral equations

This paper examines the construction of confidence sets for parameters defined as linear functionals of a function of W and X whose conditional mean given Z and X equals the conditional mean of another variable Y given Z and X. Many…

Statistics Theory · Mathematics 2025-08-08 Ezequiel Smucler, James M. Robins, Andrea Rotnitzky

The many routes to the ubiquitous Bradley-Terry model

The rating of items based on pairwise comparisons has been a topic of statistical investigation for many decades. Numerous approaches have been proposed. One of the best known is the Bradley-Terry model. This paper seeks to assemble and…

Statistics Theory · Mathematics 2025-08-08 Ian Hamilton, Nick Tawn, David Firth

Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions

We develop a class of non-Gaussian translation processes that extend classical stochastic differential equations (SDEs) by prescribing arbitrary absolutely continuous marginal distributions. Our approach uses a copula-based transformation…

Statistics Theory · Mathematics 2025-08-06 Robert Richardson, H. Dennis Tolley, Kenneth Kuttler

Identifiability and Estimation in High-Dimensional Nonparametric Latent Structure Models

This paper studies the problems of identifiability and estimation in high-dimensional nonparametric latent structure models. We introduce an identifiability theorem that generalizes existing conditions, establishing a unified framework…

Statistics Theory · Mathematics 2025-08-06 Yichen Lyu, Pengkun Yang

Estimation and variable selection in high dimension in nonlinear mixed-effects models

We consider nonlinear mixed-effects models including high-dimensional covariates to model the variability of individual parameters. The objective is to identify relevant covariates among a large set under a sparsity assumption and to estimate model…

Statistics Theory · Mathematics 2025-08-06 Antoine Caillebotte, Estelle Kuhn, Sarah Lemler

Estimation and variable selection in high dimension in a causal joint model of survival times and longitudinal outcomes with random effects

We consider a joint survival and mixed-effects model to explain the survival time from longitudinal data and high-dimensional covariates in a population. The longitudinal data is modeled using a nonlinear mixed-effects model to account for…

Statistics Theory · Mathematics 2025-08-06 Antoine Caillebotte, Estelle Kuhn, Sarah Lemler

Variational Bernstein-von Mises theorem with increasing parameter dimension

Variational Bayes (VB) provides a computationally efficient alternative to Markov Chain Monte Carlo, especially for high-dimensional and large-scale inference. However, existing theory on VB primarily focuses on fixed-dimensional settings…

Statistics Theory · Mathematics 2025-08-05 Jiawei Yan, Peirong Xu, Tao Wang

Estimation of Algebraic Sets: Extending PCA Beyond Linearity

An algebraic set is defined as the zero locus of a system of real polynomial equations. In this paper we address the problem of recovering an unknown algebraic set $\mathcal{A}$ from noisy observations of latent points lying on…

Statistics Theory · Mathematics 2025-08-05 Alberto González-Sanz, Gilles Mordant, Álvaro Samperio, Bodhisattva Sen

From Thomas Bayes to Big Data: On the feasibility of being a subjective Bayesian

We argue that the Bayesian paradigm, of a prior which represents the beliefs of the statistician before observing the data, is not feasible in ultra-high-dimensional models. We claim that natural priors that represent the a priori beliefs…

Statistics Theory · Mathematics 2025-08-05 Ya'acov Ritov

Likelihood Functions with Parameter-Dependent Support: A Survey of the Cramér-Rao-Leibniz Lower Bound

Parameter estimation is a fundamental problem in science and engineering. In many safety-critical applications, one is not only interested in a point estimator, but also the uncertainty bound that can self-assess the accuracy of the…

Statistics Theory · Mathematics 2025-08-05 Qin Lu, Yaakov Bar-Shalom, Peter Willett

Asymptotic guarantees for Bayesian phylogenetic tree reconstruction

We derive tractable criteria for the consistency of Bayesian tree reconstruction procedures, which constitute a central class of algorithms for inferring common ancestry among DNA sequence samples in phylogenetics. Our results encompass…

Statistics Theory · Mathematics 2025-08-05 Alisa Kirichenko, Luke J. Kelly, Jere Koskela

Consistent DAG selection for Bayesian causal discovery under general error distributions

We consider the problem of learning the underlying causal structure among a set of variables, which are assumed to follow a Bayesian network or, more specifically, a linear recursive structural equation model (SEM) with the associated…

Statistics Theory · Mathematics 2025-08-05 Anamitra Chaudhuri, Anirban Bhattacharya, Yang Ni

Functional independent component analysis by choice of norm: a framework for near-perfect classification

We develop a theory for functional independent component analysis in an infinite-dimensional framework using Sobolev spaces that accommodate smoother functions. The notion of penalized kurtosis is introduced motivated by Silverman's method…

Statistics Theory · Mathematics 2025-08-05 Marc Vidal, Marc Leman, Ana M. Aguilera

Revisiting Step-Size Assumptions in Stochastic Approximation

Many machine learning and optimization algorithms are built upon the framework of stochastic approximation (SA), for which the selection of step-size (or learning rate) $\{\alpha_n\}$ is crucial for success. An essential condition for…

Statistics Theory · Mathematics 2025-08-05 Caio Kalil Lauand, Sean Meyn

Estimation of on- and off-time distributions in a dynamic Erdős-Rényi random graph

In this paper we consider a dynamic Erdős-Rényi graph in which edges, according to an alternating renewal process, change from present to absent and vice versa. The objective is to estimate the on- and off-time distributions while…

Statistics Theory · Mathematics 2025-08-05 Michel Mandjes, Jiesen Wang

Distribution-free inference with hierarchical data

This paper studies distribution-free inference in settings where the data set has a hierarchical structure -- for example, groups of observations, or repeated measurements. In such settings, standard notions of exchangeability may not hold.…

Statistics Theory · Mathematics 2025-08-05 Yonghoon Lee, Rina Foygel Barber, Rebecca Willett