统计理论 — Scifaro

Penalized spline estimation of principal components for sparse functional data: rates of convergence

This paper gives a comprehensive treatment of the convergence rates of penalized spline estimators for simultaneously estimating several leading principal component functions, when the functional data is sparsely observed. The penalized…

统计理论 · 数学 2024-02-09 Shiyuan He , Jianhua Z. Huang , Kejun He

Minimax optimal density estimation using a shallow generative model with a one-dimensional latent variable

A deep generative model yields an implicit estimator for the unknown distribution or density function of the observation. This paper investigates some statistical properties of the implicit density estimator pursued by VAE-type methods from…

统计理论 · 数学 2024-02-09 Hyeok Kyu Kwon , Minwoo Chae

Catoni-style confidence sequences for heavy-tailed mean estimation

A confidence sequence (CS) is a sequence of confidence intervals that is valid at arbitrary data-dependent stopping times. These are useful in applications like A/B testing, multi-armed bandits, off-policy evaluation, election auditing,…

统计理论 · 数学 2024-02-09 Hongjian Wang , Aaditya Ramdas

From fuzzy information to community detection: an approach to social networks analysis with soft information

On the basis of network analysis, and within the context of modeling imprecision or vague information with fuzzy sets, we propose an innovative way to analyze, aggregate and apply this uncertain knowledge into community detection of…

统计理论 · 数学 2024-02-08 Inmaculada Gutiérrez , Daniel Gómez , Javier Castro , Rosa Espínola

Calculating the interaction index of a fuzzy measure: a polynomial approach based on sampling

In this paper we address the problem of fuzzy measures index calculation. On the basis of fuzzy sets, Murofushi and Soneda proposed an interaction index to deal with the relations between two individuals. This index was later extended in a…

统计理论 · 数学 2024-02-08 Inmaculada Gutiérrez , Javier Castro , Daniel Gómez , Rosa Espínola

Improvements in the estimation of the Weibull tail coefficient -- a comparative study

The Weibull tail-coefficient (WTC) plays a crucial role in extreme value statistics when dealing with Weibull-type tails. Several distributions, such as normal, Gamma, Weibull, and Logistic distributions, exhibit this type of tail…

统计理论 · 数学 2024-02-08 Lígia Henriques-Rodrigues , Frederico Caeiro , M. Ivette Gomes

Prediction with eventual almost sure guarantees

We study the problem of sequentially predicting properties of a probabilistic model and its next outcome over an infinite horizon, with the goal of ensuring that the predictions incur only finitely many errors with probability 1. We…

统计理论 · 数学 2024-02-08 Changlong Wu , Narayana Santhanam

nlstac: Non-Gradient Separable Nonlinear Least Squares Fitting

A new package for nonlinear least squares fitting is introduced in this paper. This package implements a recently developed algorithm that, for certain types of nonlinear curve fitting, reduces the number of nonlinear parameters to be…

统计理论 · 数学 2024-02-07 J. A. F. Torvisco , R. Benítez , M. R. Arias , J. Cabello Sánchez

Random features models: a way to study the success of naive imputation

Constant (naive) imputation is still widely used in practice as this is a first easy-to-use technique to deal with missing data. Yet, this simple method could be expected to induce a large bias for prediction purposes, as the imputed input…

统计理论 · 数学 2024-02-07 Alexis Ayme , Claire Boyer , Aymeric Dieuleveut , Erwan Scornet

Asymptotic properties of Vecchia approximation for Gaussian processes

Vecchia approximation has been widely used to accurately scale Gaussian-process (GP) inference to large datasets, by expressing the joint density as a product of conditional densities with small conditioning sets. We study fixed-domain…

统计理论 · 数学 2024-02-07 Myeongjong Kang , Florian Schäfer , Joseph Guinness , Matthias Katzfuss

Local differential privacy in survival analysis using private failure indicators

We consider the estimation of the cumulative hazard function, and equivalently the distribution function, with censored data under a setup that preserves the privacy of the survival database. This is done through a $\alpha$-locally…

统计理论 · 数学 2024-02-07 Egea Maxime , Escobar-Bach Mikael

Estimation of sparse linear regression coefficients under $L$-subexponential covariates

We tackle estimating sparse coefficients in a linear regression when the covariates are sampled from an $L$-subexponential random vector. This vector belongs to a class of distributions that exhibit heavier tails than Gaussian random…

统计理论 · 数学 2024-02-07 Takeyuki Sasai

Nonsense associations in Markov random fields with pairwise dependence

Yule (1926) identified the issue of "nonsense correlations" in time series data, where dependence within each of two random vectors causes overdispersion -- i.e. variance inflation -- for measures of dependence between the two. During the…

统计理论 · 数学 2024-02-06 Sohom Bhattacharya , Rajarshi Mukherjee , Elizabeth Ogburn

Inverse regression for spatially distributed functional data

Spatially distributed functional data are prevalent in many statistical applications such as meteorology, energy forecasting, census data, disease mapping, and neurological studies. Given their complex and high-dimensional nature,…

统计理论 · 数学 2024-02-06 Suneel Babu Chatla , Ruiqi Liu

Right-censored models by the expectile method

Based on the expectile loss function and the adaptive LASSO penalty, the paper proposes and studies the estimation methods for the accelerated failure time (AFT) model. In this approach, we need to estimate the survival function of the…

统计理论 · 数学 2024-02-06 Gabriela Ciuperca

Heavy-tailed $p$-value combinations from the perspective of extreme value theory

Handling multiplicity without losing much power has been a persistent challenge in various fields that often face the necessity of managing numerous statistical tests simultaneously. Recently, $p$-value combination methods based on…

统计理论 · 数学 2024-02-06 Yeonwoo Rho

Stochastic ordering of extreme order statistics in Archimax copula

An extension of Archimax copula class in more than two random variables ( Multivariate ) was introduced in (J\'agr 2011) for describing dependency structures among random variables in higher dimension, and some properties of Archimax copula…

统计理论 · 数学 2024-02-06 Sarikul Islam , Nitin Gupta

A new approach for imprecise probabilities

This paper introduces a novel concept of interval probability measures that enables the representation of imprecise probabilities, or uncertainty, in a natural and coherent manner. Within an algebra of sets, we introduce a notion of weak…

统计理论 · 数学 2024-02-06 Marcello Basili , Luca Pratelli

Multiple sequences Prophet Inequality Under Observation Constraints

In our problem, we are given access to a number of sequences of nonnegative i.i.d. random variables, whose realizations are observed sequentially. All sequences are of the same finite length. The goal is to pick one element from each…

统计理论 · 数学 2024-02-06 Aristomenis Tsopelakos , Olgica Milenkovic

Extremal behaviour and convergence rates for sample--based geometric quantiles and half space depths

We consider the empirical versions of geometric quantile and halfspace depth, and study their extremal behaviour as a function of the sample size. The objective of this study is to establish connection between the rates of convergence and…

统计理论 · 数学 2024-02-06 Sibsankar Singha , Marie Kratz , Sreekar Vadlamani