Related papers: Resampling Methods with Imputed Data

The Jackknife Estimation Method

Statistical resampling methods have become feasible for parametric estimation, hypothesis testing, and model validation now that the computer is a ubiquitous tool for statisticians. This essay focuses on the resampling technique for…

Methodology · Statistics 2016-06-03 Avery McIntosh

Analysis of Bootstrap and Subsampling in High-dimensional Regularized Regression

We investigate popular resampling methods for estimating the uncertainty of statistical models, such as subsampling, bootstrap and the jackknife, and their performance in high-dimensional supervised regression tasks. We provide a tight…

Machine Learning · Statistics 2024-11-04 Lucas Clarté , Adrien Vandenbroucque , Guillaume Dalle , Bruno Loureiro , Florent Krzakala , Lenka Zdeborová

Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis with Limited Computational Resources

Modern statistical analysis often encounters datasets with large sizes. For these datasets, conventional estimation methods can hardly be used immediately because practitioners often suffer from limited computational resources. In most…

Methodology · Statistics 2023-04-14 Shuyuan Wu , Xuening Zhu , Hansheng Wang

Bootstrap Inference when Using Multiple Imputation

Many modern estimators require bootstrapping to calculate confidence intervals because either no analytic standard error is available or the distribution of the parameter of interest is non-symmetric. It remains however unclear how to…

Methodology · Statistics 2018-09-13 Michael Schomaker , Christian Heumann

Fast and Reliable Jackknife and Bootstrap Methods for Cluster-Robust Inference

We provide computationally attractive methods to obtain jackknife-based cluster-robust variance matrix estimators (CRVEs) for linear regression models estimated by least squares. We also propose several new variants of the wild cluster…

Econometrics · Economics 2023-02-14 James G. MacKinnon , Morten Ørregaard Nielsen , Matthew D. Webb

Multiple imputation in data that grow over time: A comparison of three strategies

Multiple imputation is a highly recommended technique to deal with missing data, but the application to longitudinal datasets can be done in multiple ways. When a new wave of longitudinal data arrives, we can treat the combined data of…

Methodology · Statistics 2026-05-18 X. M. Kavelaars , S. van Buuren , J. R. van Ginkel

A Cheap Bootstrap Method for Fast Inference

The bootstrap is a versatile inference method that has proven powerful in many statistical problems. However, when applied to modern large-scale models, it could face substantial computation demand from repeated data resampling and model…

Methodology · Statistics 2022-02-02 Henry Lam

Bootstrap inference for the finite population total under complex sampling designs

Bootstrap is a useful tool for making statistical inference, but it may provide erroneous results under complex survey sampling. Most studies about bootstrap-based inference are developed under simple random sampling and stratified random…

Statistics Theory · Mathematics 2019-01-08 Zhonglei Wang , Jae Kwang Kim , Liuhua Peng

Estimation and Inference by Stochastic Optimization

In non-linear estimations, it is common to assess sampling uncertainty by bootstrap inference. For complex models, this can be computationally intensive. This paper combines optimization with resampling: turning stochastic optimization into…

Econometrics · Economics 2022-05-09 Jean-Jacques Forneron

Investigation of Parameter Uncertainty in Clustering Using a Gaussian Mixture Model Via Jackknife, Bootstrap and Weighted Likelihood Bootstrap

Mixture models are a popular tool in model-based clustering. Such a model is often fitted by a procedure that maximizes the likelihood, such as the EM algorithm. At convergence, the maximum likelihood parameter estimates are typically…

Computation · Statistics 2019-07-23 Adrian O'Hagan , Thomas Brendan Murphy , Luca Scrucca , Isobel Claire Gormley

Creating Jackknife and Bootstrap estimates of the covariance matrix for the two-point correlation function

We present correction terms that allow delete-one Jackknife and Bootstrap methods to be used to recover unbiased estimates of the data covariance matrix of the two-point correlation function $\xi\left(\mathbf{r}\right)$. We demonstrate the…

Cosmology and Nongalactic Astrophysics · Physics 2022-06-14 Faizan G. Mohammad , Will J. Percival

Probability and Non-Probability Samples: Improving Regression Modeling by Using Data from Different Sources

Non-probability sampling, for example in the form of online panels, has become a fast and cheap method to collect data. While reliable inference tools are available for classical probability samples, non-probability samples can yield…

Methodology · Statistics 2022-04-05 Gerhard Tutz

An Online Bootstrap for Time Series

Resampling methods such as the bootstrap have proven invaluable in the field of machine learning. However, the applicability of traditional bootstrap methods is limited when dealing with large streams of dependent data, such as time series…

Machine Learning · Statistics 2024-02-28 Nicolai Palm , Thomas Nagler

Bootstrapping and Multiple Imputation Ensemble Approaches for Missing Data

Presence of missing values in a dataset can adversely affect the performance of a classifier. Single and Multiple Imputation are normally performed to fill in the missing values. In this paper, we present several variants of combining…

Machine Learning · Computer Science 2019-10-16 Shehroz S. Khan , Amir Ahmad , Alex Mihailidis

Bootstrap in High Dimension with Low Computation

The bootstrap is a popular data-driven method to quantify statistical uncertainty, but for modern high-dimensional problems, it could suffer from huge computational costs due to the need to repeatedly generate resamples and refit models. We…

Methodology · Statistics 2023-06-21 Henry Lam , Zhenyuan Liu

Bootstrap for neural model selection

Bootstrap techniques (also called resampling computation techniques) have introduced new advances in modeling and model evaluation. Using resampling methods to construct a series of new samples which are based on the original data set,…

Statistics Theory · Mathematics 2007-06-13 Riadh Kallel , Marie Cottrell , Vincent Vigneron

Bootstrap Inference for Multiple Imputation under Uncongeniality and Misspecification

Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin's simple combination rules. These give frequentist valid inferences when the…

Methodology · Statistics 2019-11-28 Jonathan W. Bartlett , Rachael A. Hughes

Gap bootstrap methods for massive data sets with an application to transportation engineering

In this paper we describe two bootstrap methods for massive data sets. Naive applications of common resampling methodology are often impractical for massive data sets due to computational burden and due to complex patterns of inhomogeneity.…

Applications · Statistics 2013-01-14 S. N. Lahiri , C. Spiegelman , J. Appiah , L. Rilett

Compressed sensing with a jackknife and a bootstrap

Compressed sensing proposes to reconstruct more degrees of freedom in a signal than the number of values actually measured. Compressed sensing therefore risks introducing errors -- inserting spurious artifacts or masking the abnormalities…

Image and Video Processing · Electrical Eng. & Systems 2024-04-09 Mark Tygert , Rachel Ward , Jure Zbontar

Finite Sample Valid Inference via Calibrated Bootstrap

While widely used as a general method for uncertainty quantification, the bootstrap method encounters difficulties that raise concerns about its validity in practical applications. This paper introduces a new resampling-based method, termed…

Methodology · Statistics 2024-08-30 Yiran Jiang , Chuanhai Liu , Heping Zhang