统计理论 — Scifaro

Kernel Stein Discrepancy on Lie Groups: Theory and Applications

Distributional approximation is a fundamental problem in machine learning with numerous applications across all fields of science and engineering and beyond. The key challenge in most approximation methods is the need to tackle the…

统计理论 · 数学 2024-09-19 Xiaoda Qu , Xiran Fan , Baba C. Vemuri

On the Statistical Complexity of Sample Amplification

The ``sample amplification'' problem formalizes the following question: Given $n$ i.i.d. samples drawn from an unknown distribution $P$, when is it possible to produce a larger set of $n+m$ samples which cannot be distinguished from $n+m$…

统计理论 · 数学 2024-09-19 Brian Axelrod , Shivam Garg , Yanjun Han , Vatsal Sharan , Gregory Valiant

Functional Adaptive Huber Linear Regression

Robust estimation has played an important role in statistical and machine learning. However, its applications to functional linear regression are still under-developed. In this paper, we focus on Huber's loss with a diverging robustness…

统计理论 · 数学 2024-09-18 Ling Peng , Xiaohui Liu , Heng Lian

Learning with Sparsely Permuted Data: A Robust Bayesian Approach

Data dispersed across multiple files are commonly integrated through probabilistic linkage methods, where even minimal error rates in record matching can significantly contaminate subsequent statistical analyses. In regression problems, we…

统计理论 · 数学 2024-09-18 Abhisek Chakraborty , Saptati Datta

Variance Residual Life Ageing Intensity Function

Quantitative measurement of ageing across systems and components is crucial for accurately assessing reliability and predicting failure probabilities. This measurement supports effective maintenance scheduling, performance optimisation, and…

统计理论 · 数学 2024-09-18 Ashutosh Singh

On the maximal correlation coefficient for the bivariate Marshall Olkin distribution

We prove a formula for the maximal correlation coefficient of the bivariate Marshall Olkin distribution that was conjectured in Lin, Lai, and Govindaraju (2016, Stat. Methodol., 29:1-9). The formula is applied to obtain a new proof for a…

统计理论 · 数学 2024-09-18 Axel Bücher , Torben Staud

Bayesian inference of covariate-parameter relationships for population modelling

We consider population modelling using parametrised ordinary differential equation initial value problems (ODE-IVPs). For each individual drawn randomly from the unknown population distribution, the corresponding parameters for the ODE-IVP…

统计理论 · 数学 2024-09-18 Han Cheng Lie

False discovery proportion envelopes with m-consistency

We provide new non-asymptotic false discovery proportion (FDP) confidence envelopes in several multiple testing settings relevant for modern high dimensional-data methods. We revisit the multiple testing scenarios considered in the recent…

统计理论 · 数学 2024-09-18 Iqraa Meah , Gilles Blanchard , Etienne Roquain

Censoring heavy-tail count distributions for parameter estimation with an application to stable distributions

A new approach based on censoring and moment criterion is introduced for parameter estimation of count distributions when the probability generating function is available even though a closed form of the probability mass function and/or…

统计理论 · 数学 2024-09-18 Antonio Di Noia , Marzia Marcheselli , Caterina Pisani , Luca Pratelli

Testing for independence in high dimensions based on empirical copulas

Testing for pairwise independence for the case where the number of variables may be of the same size or even larger than the sample size has received increasing attention in the recent years. We contribute to this branch of the literature…

统计理论 · 数学 2024-09-18 Axel Bücher , Cambyse Pakzad

Mean Residual Life Ageing Intensity Function

The ageing intensity function is a powerful analytical tool that provides valuable insights into the ageing process across diverse domains such as reliability engineering, actuarial science, and healthcare. Its applications continue to…

统计理论 · 数学 2024-09-17 Ashutosh Singh , Ishapathik Das , Asok Kumar Nanda , Sumen Sen

Consistent complete independence test in high dimensions based on Chatterjee correlation coefficient

In this article, we consider the complete independence test of high-dimensional data. Based on Chatterjee coefficient, we pioneer the development of quadratic test and extreme value test which possess good testing performance for…

统计理论 · 数学 2024-09-17 Liqi Xia , Ruiyuan Cao , Jiang Du , Jun Dai

Extending the Gini Index to Higher Dimensions via Whitening Processes

Measuring the degree of inequality expressed by a multivariate statistical distribution is a challenging problem, which appears in many fields of science and engineering. In this paper, we propose to extend the well known univariate Gini…

统计理论 · 数学 2024-09-17 Gennaro Auricchio , Paolo Giudici , Giuseppe Toscani

Privately Learning Smooth Distributions on the Hypercube by Projections

Fueled by the ever-increasing need for statistics that guarantee the privacy of their training sets, this article studies the centrally-private estimation of Sobolev-smooth densities of probability over the hypercube in dimension d. The…

统计理论 · 数学 2024-09-17 Clément Lalanne , Sébastien Gadat

Bounding the probability of causality under ordinal outcomes

The probability of causation (PC) is often used in liability assessments. In a legal context, for example, where a patient suffered the side effect after taking a medication and sued the pharmaceutical company as a result, the value of the…

统计理论 · 数学 2024-09-17 Hanmei Sun , Chengfeng Shi , Qiang Zhao

On Admissibility in Bipartite Incidence Graph Sampling

In bipartite incidence graph sampling, the target study units may be formed as connected population elements, which are distinct to the units of sampling and there may exist generally more than one way by which a given study unit can be…

统计理论 · 数学 2024-09-17 Pedro García-Segador , Li-Chun Zhang

Higher-Order Graphon Theory: Fluctuations, Degeneracies, and Inference

Exchangeable random graphs, which include some of the most widely studied network models, have emerged as the mainstay of statistical network analysis in recent years. Graphons, which are the central objects in graph limit theory, provide a…

统计理论 · 数学 2024-09-17 Anirban Chatterjee , Soham Dan , Bhaswar B. Bhattacharya

Asymptotics of predictive distributions driven by sample means and variances

Let $\alpha_n(\cdot)=P\bigl(X_{n+1}\in\cdot\mid X_1,\ldots,X_n\bigr)$ be the predictive distributions of a sequence $(X_1,X_2,\ldots)$ of $p$-dimensional random vectors. Suppose $$\alpha_n= \mathcal{N} _p (M_n,Q_n)$$ where…

统计理论 · 数学 2024-09-17 Samuele Garelli , Fabrizio Leisen , Luca Pratelli , Pietro Rigo

Tensor Time Series Imputation through Tensor Factor Modelling

We propose tensor time series imputation when the missing pattern in the tensor data can be general, as long as any two data positions along a tensor fibre are both observed for enough time points. The method is based on a tensor time…

统计理论 · 数学 2024-09-17 Zetai Cen , Clifford Lam

Analysis of the rSVDdpd Algorithm: A Robust Singular Value Decomposition Method using Density Power Divergence

The traditional method of computing singular value decomposition (SVD) of a data matrix is based on a least squares principle, thus, is very sensitive to the presence of outliers. Hence the resulting inferences across different applications…

统计理论 · 数学 2024-09-17 Subhrajyoty Roy , Abhik Ghosh , Ayanendranath Basu