English
Related papers

Related papers: Dimension-agnostic inference using cross U-statist…

200 papers

Recently, Tibshirani et al. (2016) proposed a method for making inferences about parameters defined by model selection, in a typical regression setting with normally distributed errors. Here, we study the large sample properties of this…

Statistics Theory · Mathematics 2017-08-10 Ryan J. Tibshirani , Alessandro Rinaldo , Robert Tibshirani , Larry Wasserman

In this paper, we study inference for high-dimensional data characterized by small sample sizes relative to the dimension of the data. In particular, we provide an infinite-dimensional framework to study statistical models that involve…

Statistics Theory · Mathematics 2010-02-25 Jim Kuelbs , Anand N. Vidyashankar

Change point testing for high-dimensional data has attracted a lot of attention in statistics and machine learning owing to the emergence of high-dimensional data with structural breaks from many fields. In practice, when the dimension is…

Methodology · Statistics 2023-12-05 Hanjia Gao , Runmin Wang , Xiaofeng Shao

This paper studies inference for the mean vector of a high-dimensional $U$-statistic. In the era of Big Data, the dimension $d$ of the $U$-statistic and the sample size $n$ of the observations tend to be both large, and the computation of…

Statistics Theory · Mathematics 2019-01-29 Xiaohui Chen , Kengo Kato

Asymptotic methods for hypothesis testing in high-dimensional data usually require the dimension of the observations to increase to infinity, often with an additional condition on its rate of increase compared to the sample size. On the…

Statistics Theory · Mathematics 2024-03-26 Joydeep Chowdhury , Subhajit Dutta , Marc G. Genton

Many high-dimensional hypothesis tests aim to globally examine marginal or low-dimensional features of a high-dimensional joint distribution, such as testing of mean vectors, covariance matrices and regression coefficients. This paper…

Statistics Theory · Mathematics 2020-02-04 Yinqiu He , Gongjun Xu , Chong Wu , Wei Pan

In repeated Measure Designs with multiple groups, the primary purpose is to compare different groups in various aspects. For several reasons, the number of measurements and therefore the dimension of the observation vectors can depend on…

Statistics Theory · Mathematics 2022-07-20 Paavo Sattler , Markus Pauly

We prove a convergence theorem for U-statistics of degree two, where the data dimension $d$ is allowed to scale with sample size $n$. We find that the limiting distribution of a U-statistic undergoes a phase transition from the…

Statistics Theory · Mathematics 2023-07-04 Kevin H. Huang , Xing Liu , Andrew B. Duncan , Axel Gandy

Consider $d$ dependent change point tests, each based on a CUSUM-statistic. We provide an asymptotic theory that allows us to deal with the maximum over all test statistics as both the sample size $n$ and $d$ tend to infinity. We achieve…

Statistics Theory · Mathematics 2017-12-07 Moritz Jirak

In this article, we propose a class of $L_q$-norm based U-statistics for a family of global testing problems related to high-dimensional data. This includes testing of mean vector and its spatial sign, simultaneous testing of linear model…

Statistics Theory · Mathematics 2023-03-16 Yangfan Zhang , Runmin Wang , Xiaofeng Shao

Existing two-sample testing techniques, particularly those based on choosing a kernel for the Maximum Mean Discrepancy (MMD), often assume equal sample sizes from the two distributions. Applying these methods in practice can require…

Machine Learning · Statistics 2025-12-17 Aaron Wei , Milad Jalali , Danica J. Sutherland

We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension $D$ of the conditioning variable is larger than the sample size $n$, estimation and inference is…

Machine Learning · Computer Science 2019-06-19 Khashayar Khosravi , Greg Lewis , Vasilis Syrgkanis

Traditional statistical inference considers relatively small data sets and the corresponding theoretical analysis focuses on the asymptotic behavior of a statistical estimator when the number of samples approaches infinity. However, many…

Methodology · Statistics 2013-01-03 Jon Wellner , Tong Zhang

U-statistics are widely used in fields such as economics, machine learning, and statistics. However, while they enjoy desirable statistical properties, they have an obvious drawback in that the computation becomes impractical as the data…

Statistics Theory · Mathematics 2020-08-12 Xiangshun Kong , Wei Zheng

Clustering methods are a valuable tool for the identification of patterns in high dimensional data with applications in many scientific problems. However, quantifying uncertainty in clustering is a challenging problem, particularly when…

Methodology · Statistics 2018-06-01 Marcio Valk , Gabriela Bettella Cybis

This paper studies the Gaussian and bootstrap approximations for the probabilities of a non-degenerate U-statistic belonging to the hyperrectangles in $\mathbb{R}^d$ when the dimension $d$ is large. A two-step Gaussian approximation…

Statistics Theory · Mathematics 2017-07-11 Xiaohui Chen

We establish a strong Gaussian approximation for high-dimensional non-degenerate U-statistics with diverging dimension. Under mild assumptions, we construct, on a sufficiently rich probability space, a Gaussian process that uniformly…

Statistics Theory · Mathematics 2026-03-12 Weijia Li , Leheng Cai , Qirui Hu

This paper is concerned with estimation and inference for the location of a change point in the mean of independent high-dimensional data. Our change point location estimator maximizes a new U-statistic based objective function, and its…

Methodology · Statistics 2020-02-12 Runmin Wang , Xiaofeng Shao

Statistical inferences for quadratic functionals of linear regression parameter have found wide applications including signal detection, global testing, inferences of error variance and fraction of variance explained. Classical theory based…

Statistics Theory · Mathematics 2020-06-17 Xiao Guo , Guang Cheng

Many statistical methodologies for high-dimensional data assume the population is normal. Although a few multivariate normality tests have been proposed, to the best of our knowledge, none of them can properly control the type I error when…

Methodology · Statistics 2021-05-04 Hao Chen , Yin Xia
‹ Prev 1 2 3 10 Next ›