English
Related papers

Related papers: On optimal spatial subsample size for variance est…

200 papers

For high volume data streams and large data warehouses, sampling is used for efficient approximate answers to aggregate queries over selected subsets. Mathematically, we are dealing with a set of weighted items and want to support queries…

Data Structures and Algorithms · Computer Science 2007-05-23 Mario Szegedy , Mikkel Thorup

In this paper, we investigate the randomized algorithms for block matrix multiplication from random sampling perspective. Based on the A-optimal design criterion, the optimal sampling probabilities and sampling block sizes are obtained. To…

Numerical Analysis · Mathematics 2021-05-12 Chengmei Niu , Hanyu Li

We establish a general theory of optimality for block bootstrap distribution estimation for sample quantiles under a mild strong mixing assumption. In contrast to existing results, we study the block bootstrap for varying numbers of blocks.…

Statistics Theory · Mathematics 2017-10-10 Todd A. Kuffner , Stephen M. S. Lee , G. Alastair Young

The Birnbaum-Saunders distribution has been widely applied in several areas of science and although several methodologies related to this distribution have been proposed, the problem of determining the optimal sample size for estimating its…

Methodology · Statistics 2020-07-27 Eliardo G. Costa , Manoel Santos-Neto

The block maximum method, which is widely used in extreme value analysis, uses a generalized extreme value distribution to approximate that of the maximum of m observations. The quality of this approximation depends on the value of m and…

Methodology · Statistics 2026-05-14 Léo R. Belzile , Anthony C. Davison

We introduce a very general method for sparse and large-scale variable selection. The large-scale regression settings is such that both the number of parameters and the number of samples are extremely large. The proposed method is based on…

Statistics Theory · Mathematics 2019-07-31 Jelena Bradic

We consider the minimization of a sum of an expectation-valued coordinate-wise $L_i$-smooth nonconvex function and a nonsmooth block-separable convex regularizer. We propose an asynchronous variance-reduced algorithm, where in each…

Optimization and Control · Mathematics 2020-02-20 Jinlong Lei , Uday V. Shanbhag

Subsampling methods aim to select a subsample as a surrogate for the observed sample. As a powerful technique for large-scale data analysis, various subsampling methods are developed for more effective coefficient estimation and model…

Methodology · Statistics 2021-05-05 Tao Li , Cheng Meng

An optimal mesh size of the sampling region can help to reduce computational burden in practical applications. In this work, we investigate optimal choices of mesh sizes for the identifications of medium obstacles from either the far-field…

Numerical Analysis · Mathematics 2018-10-22 Habib Ammari , Yat Tin Chow , Keji Liu

We consider the least-squares regression problem and provide a detailed asymptotic analysis of the performance of averaged constant-step-size stochastic gradient descent (a.k.a. least-mean-squares). In the strongly-convex case, we provide…

Machine Learning · Computer Science 2014-12-02 Alexandre Défossez , Francis Bach

For optimization on large-scale data, exactly calculating its solution may be computationally difficulty because of the large size of the data. In this paper we consider subsampled optimization for fast approximating the exact solution. In…

Machine Learning · Statistics 2018-04-11 Rong Zhu , Jiming Jiang

A theory of superefficiency and adaptation is developed under flexible performance measures which give a multiresolution view of risk and bridge the gap between pointwise and global estimation. This theory provides a useful benchmark for…

Statistics Theory · Mathematics 2007-06-13 T. Tony Cai , Mark G. Low

In this paper, we propose a stochastic optimization method that adaptively controls the sample size used in the computation of gradient approximations. Unlike other variance reduction techniques that either require additional storage or the…

Optimization and Control · Mathematics 2017-11-01 Raghu Bollapragada , Richard Byrd , Jorge Nocedal

We investigate the accuracy of two general non-parametric methods for estimating optimal block lengths for block bootstraps with time series - the first proposed in the seminal paper of Hall, Horowitz and Jing (Biometrika 82 (1995) 561-574)…

Statistics Theory · Mathematics 2014-03-14 Daniel J. Nordman , Soumendra N. Lahiri

We consider batch size selection for a general class of multivariate batch means variance estimators, which are computationally viable for high-dimensional Markov chain Monte Carlo simulations. We derive the asymptotic mean squared error…

Statistics Theory · Mathematics 2019-07-18 Ying Liu , Dootika Vats , James M. Flegal

The bootstrap is a widely used procedure for statistical inference because of its simplicity and attractive statistical properties. However, the vanilla version of bootstrap is no longer feasible computationally for many modern massive…

Methodology · Statistics 2023-02-16 Yingying Ma , Chenlei Leng , Hansheng Wang

Overparametrization often helps improve the generalization performance. This paper presents a dual view of overparametrization suggesting that downsampling may also help generalize. Focusing on the proportional regime $m\asymp n \asymp p$,…

Statistics Theory · Mathematics 2023-10-17 Xin Chen , Yicheng Zeng , Siyue Yang , Qiang Sun

We consider the problem of approximating an unknown function from point evaluations. This problem is a crucial subproblem in many modern (nonlinear) approximation schemes. When obtaining these point evaluations is costly, minimising the…

Numerical Analysis · Mathematics 2025-12-03 Philipp Trunschke , Anthony Nouy

Block-based resampling estimators have been intensively investigated for weakly dependent time processes, which has helped to inform implementation (e.g., best block sizes). However, little is known about resampling performance and block…

Statistics Theory · Mathematics 2022-08-04 Qihao Zhang , Soumendra N. Lahiri , Daniel J. Nordman

We study the probabilistic sampling of a random variable, in which the variable is sampled only if it falls outside a given set, which is called the silence set. This helps us to understand optimal event-based sampling for the special case…

Optimization and Control · Mathematics 2023-03-17 Maben Rabi , Junfeng Wu , Vyoma Singh , Karl Henrik Johansson
‹ Prev 1 2 3 10 Next ›