Related papers: A Cheap Bootstrap Method for Fast Inference

Bootstrap in High Dimension with Low Computation

The bootstrap is a popular data-driven method to quantify statistical uncertainty, but for modern high-dimensional problems, it could suffer from huge computational costs due to the need to repeatedly generate resamples and refit models. We…

Methodology · Statistics 2023-06-21 Henry Lam, Zhenyuan Liu

Optimal Subsampling Bootstrap for Massive Data

The bootstrap is a widely used procedure for statistical inference because of its simplicity and attractive statistical properties. However, the vanilla version of bootstrap is no longer feasible computationally for many modern massive…

Methodology · Statistics 2023-02-16 Yingying Ma, Chenlei Leng, Hansheng Wang

Cheap Subsampling bootstrap confidence intervals for fast and robust inference

Bootstrapping is often applied to get confidence limits for semiparametric inference of a target parameter in the presence of nuisance parameters. Bootstrapping with replacement can be computationally expensive and problematic when…

Methodology · Statistics 2025-03-06 Johan Sebastian Ohlendorff, Anders Munch, Kathrine Kold Sørensen, Thomas Alexander Gerds

A subsampled double bootstrap for massive data

The bootstrap is a popular and powerful method for assessing precision of estimators and inferential methods. However, for massive datasets which are increasingly prevalent, the bootstrap becomes prohibitively costly in computation and its…

Methodology · Statistics 2015-08-06 Srijan Sengupta, Stanislav Volgushev, Xiaofeng Shao

Bootstrap Methods in Econometrics

The bootstrap is a method for estimating the distribution of an estimator or test statistic by re-sampling the data or a model estimated from the data. Under conditions that hold in a wide variety of econometric applications, the bootstrap…

Econometrics · Economics 2018-09-12 Joel L. Horowitz

Robust, scalable and fast bootstrap method for analyzing large scale data

In this paper we address the problem of performing statistical inference for large-scale data sets, i.e., Big Data. The volume and dimensionality of the data may be so high that it cannot be processed or stored in a single computing node. We…

Methodology · Statistics 2016-04-20 Shahab Basiri, Esa Ollila, Visa Koivunen

Bootstrapping Through Discrete Convolutional Methods

Bootstrapping was designed to randomly resample data from a fixed sample using Monte Carlo techniques. However, the original sample itself defines a discrete distribution. Convolutional methods are well suited for discrete distributions,…

Methodology · Statistics 2021-07-19 Jared M. Clark, Richard L. Warr

Bootstrapping and Sample Splitting For High-Dimensional, Assumption-Free Inference

Several new methods have been proposed for performing valid inference after model selection. An older method is sample splitting: use part of the data for model selection and part for inference. In this paper we revisit sample splitting…

Statistics Theory · Mathematics 2018-04-04 Alessandro Rinaldo, Larry Wasserman, Max G'Sell, Jing Lei

Simultaneous Inference for Massive Data: Distributed Bootstrap

In this paper, we propose a bootstrap method applied to massive data processed distributedly in a large number of machines. This new method is computationally efficient in that we bootstrap on the master machine without over-resampling,…

Machine Learning · Statistics 2020-02-21 Yang Yu, Shih-Kang Chao, Guang Cheng

Stratified sampling and bootstrapping for approximate Bayesian computation

Approximate Bayesian computation (ABC) is computationally intensive for complex model simulators. To exploit expensive simulations, data-resampling via bootstrapping can be employed to obtain many artificial datasets at little cost.…

Computation · Statistics 2021-07-05 Umberto Picchini, Richard G. Everitt

An Online Bootstrap for Time Series

Resampling methods such as the bootstrap have proven invaluable in the field of machine learning. However, the applicability of traditional bootstrap methods is limited when dealing with large streams of dependent data, such as time series…

Machine Learning · Statistics 2024-02-28 Nicolai Palm, Thomas Nagler

Statistical inference in massive datasets by empirical likelihood

In this paper, we propose a new statistical inference method for massive data sets, which is very simple and efficient by combining divide-and-conquer method and empirical likelihood. Compared with two popular methods (the bag of little…

Methodology · Statistics 2020-04-21 Xuejun Ma, Shaochen Wang, Wang Zhou

Bootstrap inference for the finite population total under complex sampling designs

Bootstrap is a useful tool for making statistical inference, but it may provide erroneous results under complex survey sampling. Most studies about bootstrap-based inference are developed under simple random sampling and stratified random…

Statistics Theory · Mathematics 2019-01-08 Zhonglei Wang, Jae Kwang Kim, Liuhua Peng

Bootstrap for neural model selection

Bootstrap techniques (also called resampling computation techniques) have introduced new advances in modeling and model evaluation. Using resampling methods to construct a series of new samples which are based on the original data set,…

Statistics Theory · Mathematics 2007-06-13 Riadh Kallel, Marie Cottrell, Vincent Vigneron

Finite Sample Valid Inference via Calibrated Bootstrap

While widely used as a general method for uncertainty quantification, the bootstrap method encounters difficulties that raise concerns about its validity in practical applications. This paper introduces a new resampling-based method, termed…

Methodology · Statistics 2024-08-30 Yiran Jiang, Chuanhai Liu, Heping Zhang

Inference by Stochastic Optimization: A Free-Lunch Bootstrap

Assessing sampling uncertainty in extremum estimation can be challenging when the asymptotic variance is not analytically tractable. Bootstrap inference offers a feasible solution but can be computationally costly especially when the model…

Econometrics · Economics 2020-09-15 Jean-Jacques Forneron, Serena Ng

Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty

Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called Orthogonal Bootstrap that reduces the number of…

Methodology · Statistics 2024-05-02 Kaizhao Liu, Jose Blanchet, Lexing Ying, Yiping Lu

A simple recipe for making accurate parametric inference in finite sample

Constructing tests or confidence regions that control the error rates in the long run is probably one of the most important problems in statistics. Yet, the theoretical justification for most methods in statistics is asymptotic. The…

Methodology · Statistics 2019-01-23 Stéphane Guerrier, Mucyo Karemera, Samuel Orso, Maria-Pia Victoria-Feser

Inference with Sequential Monte-Carlo Computation of $p$-values: Fast and Valid Approaches

Hypothesis tests calibrated by (re)sampling methods (such as permutation, rank and bootstrap tests) are useful tools for statistical analysis, at the computational cost of requiring Monte-Carlo sampling for calibration. It is common and…

Methodology · Statistics 2024-09-30 Ivo V. Stoepker, Rui M. Castro

Wild Bootstrap for Counting Process-Based Statistics

The wild bootstrap is a popular resampling method in the context of time-to-event data analyses. Previous works established its large-sample properties for applications to different estimators and test statistics. It can be used to…

Methodology · Statistics 2023-10-27 Marina T. Dietrich, Dennis Dobler, Mathisca C. M. de Gunst