统计理论 — Scifaro

Multiple testing under negative dependence

The multiple testing literature has primarily dealt with three types of dependence assumptions between p-values: independence, positive regression dependence, and arbitrary dependence. In this paper, we provide what we believe are the first…

统计理论 · 数学 2024-05-10 Ziyu Chi , Aaditya Ramdas , Ruodu Wang

On the Computational Complexity of Metropolis-Adjusted Langevin Algorithms for Bayesian Posterior Sampling

In this paper, we examine the computational complexity of sampling from a Bayesian posterior (or pseudo-posterior) using the Metropolis-adjusted Langevin algorithm (MALA). MALA first employs a discrete-time Langevin SDE to propose a new…

统计理论 · 数学 2024-05-10 Rong Tang , Yun Yang

Uncertainty quantification in metric spaces

This paper introduces a novel uncertainty quantification framework for regression models where the response takes values in a separable metric space, and the predictors are in a Euclidean space. The proposed algorithms can efficiently…

统计理论 · 数学 2024-05-09 Gábor Lugosi , Marcos Matabuena

Covariance estimation using h-statistics in Monte Carlo and multilevel Monte Carlo methods

We present novel Monte Carlo (MC) and multilevel Monte Carlo (MLMC) methods to determine the unbiased covariance of random variables using h-statistics. The advantage of this procedure lies in the unbiased construction of the estimator's…

统计理论 · 数学 2024-05-09 Sharana Kumar Shivanand

From dense to sparse design: Optimal rates under the supremum norm for estimating the mean function in functional data analysis

We derive optimal rates of convergence in the supremum norm for estimating the H\"older-smooth mean function of a stochastic process which is repeatedly and discretely observed with additional errors at fixed, multivariate, synchronous…

统计理论 · 数学 2024-05-09 Max Berger , Philipp Hermann , Hajo Holzmann

Entropic covariance models

In covariance matrix estimation, one of the challenges lies in finding a suitable model and an efficient estimation method. Two commonly used modelling approaches in the literature involve imposing linear restrictions on the covariance…

统计理论 · 数学 2024-05-09 Piotr Zwiernik

The Projected Covariance Measure for assumption-lean variable significance testing

Testing the significance of a variable or group of variables $X$ for predicting a response $Y$, given additional covariates $Z$, is a ubiquitous task in statistics. A simple but common approach is to specify a linear model, and then test…

统计理论 · 数学 2024-05-08 Anton Rask Lundborg , Ilmun Kim , Rajen D. Shah , Richard J. Samworth

Superconsistency of Tests in High Dimensions

To assess whether there is some signal in a big database, aggregate tests for the global null hypothesis of no effect are routinely applied in practice before more specialized analysis is carried out. Although a plethora of aggregate tests…

统计理论 · 数学 2024-05-08 Anders Bredahl Kock , David Preinerstorfer

Precision-based designs for sequential randomized experiments

In this paper, we consider an experimental setting where units enter the experiment sequentially. Our goal is to form stopping rules which lead to estimators of treatment effects with a given precision. We propose a fixed-width confidence…

统计理论 · 数学 2024-05-07 Mattias Nordin , Mårten Schultzberg

Multimatrix variate distributions

A new family of distributions indexed by the class of matrix variate contoured elliptically distribution is proposed as an extension of some bimatrix variate distributions. The termed \emph{multimatrix variate distributions} open new…

统计理论 · 数学 2024-05-07 José A. Díaz-García , Francisco J. Caro-Lopera

Causal Inference with High-dimensional Discrete Covariates

When estimating causal effects from observational studies, researchers often need to adjust for many covariates to deconfound the non-causal relationship between exposure and outcome, among which many covariates are discrete. The behavior…

统计理论 · 数学 2024-05-07 Zhenghao Zeng , Sivaraman Balakrishnan , Yanjun Han , Edward H. Kennedy

Finding Super-spreaders in Network Cascades

Suppose that a cascade (e.g., an epidemic) spreads on an unknown graph, and only the infection times of vertices are observed. What can be learned about the graph from the infection times caused by multiple distinct cascades? Most of the…

统计理论 · 数学 2024-05-07 Elchanan Mossel , Anirudh Sridhar

Reversibility of elliptical slice sampling revisited

We extend elliptical slice sampling, a Markov chain transition kernel suggested in Murray, Adams and MacKay 2010, to infinite-dimensional separable Hilbert spaces and discuss its well-definedness. We point to a regularity requirement,…

统计理论 · 数学 2024-05-07 Mareike Hasenpflug , Viacheslav Telezhnikov , Daniel Rudolf

Seeded graph matching for the correlated Gaussian Wigner model via the projected power method

In the \emph{graph matching} problem we observe two graphs $G,H$ and the goal is to find an assignment (or matching) between their vertices such that some measure of edge agreement is maximized. We assume in this work that the observed pair…

统计理论 · 数学 2024-05-07 Ernesto Araya , Guillaume Braun , Hemant Tyagi

Nonparametric classification with missing data

We introduce a new nonparametric framework for classification problems in the presence of missing data. The key aspect of our framework is that the regression function decomposes into an anova-type sum of orthogonal functions, of which some…

统计理论 · 数学 2024-05-06 Torben Sell , Thomas B. Berrett , Timothy I. Cannings

Adaptive deep learning for nonlinear time series models

In this paper, we develop a general theory for adaptive nonparametric estimation of the mean function of a non-stationary and nonlinear time series model using deep neural networks (DNNs). We first consider two types of DNN estimators,…

统计理论 · 数学 2024-05-06 Daisuke Kurisu , Riku Fukami , Yuta Koike

Asymptotic Results for Penalized Quasi-Likelihood Estimation in Generalized Linear Mixed Models

Generalized Linear Mixed Models (GLMMs) are widely used for analysing clustered data. One well-established method of overcoming the integral in the marginal likelihood function for GLMMs is penalized quasi-likelihood (PQL) estimation,…

统计理论 · 数学 2024-05-03 Xu Ning , Francis Hui , Alan Welsh

On Ridge Estimation in High-dimensional Rotationally Sparse Linear Regression

Recently, deep neural networks have been found to nearly interpolate training data but still generalize well in various applications. To help understand such a phenomenon, it has been of interest to analyze the ridge estimator and its…

统计理论 · 数学 2024-05-03 Libin Liang , Zhiqiang Tan

Deriving Lehmer and H\"older means as maximum weighted likelihood estimates for the multivariate exponential family

The links between the mean families of Lehmer and H\"older and the weighted maximum likelihood estimator have recently been established in the case of a regular univariate exponential family. In this article, we will extend the outcomes…

统计理论 · 数学 2024-05-03 Djemel Ziou , Issam Fakir

Nearly Optimum Properties of Certain Multi-Decision Sequential Rules for General Non-i.i.d. Stochastic Models

Dedicated to the memory of Professor Tze Leung Lai, this paper introduces three multi-hypothesis sequential tests. These tests are derived from one-sided versions of the sequential probability ratio test and its modifications. They are…

统计理论 · 数学 2024-05-03 Alexander G. Tartakovsky