Related papers: Cross Validation for Comparing Multiple Density Es…

Consistency of cross validation for comparing regression procedures

Theoretical developments on cross validation (CV) have mainly focused on selecting one among a list of finite-dimensional models (e.g., subset or order selection in linear regression) or selecting a smoothing parameter (e.g., bandwidth for…

Statistics Theory · Mathematics 2008-12-18 Yuhong Yang

Cross-Validation, Risk Estimation, and Model Selection

Cross-validation is a popular non-parametric method for evaluating the accuracy of a predictive rule. The usefulness of cross-validation depends on the task we want to employ it for. In this note, I discuss a simple non-parametric setting,…

Methodology · Statistics 2019-09-27 Stefan Wager

Use of Cross-validation Bayes Factors to Test Equality of Two Densities

We propose a non-parametric, two-sample Bayesian test for checking whether or not two data sets share a common distribution. The test makes use of data splitting ideas and does not require priors for high-dimensional parameter vectors as do…

Methodology · Statistics 2020-03-16 Jeffery Hart , Taeryon Choi , Naveed Merchant

Cross-Validation with Confidence

Cross-validation is one of the most popular model selection methods in statistics and machine learning. Despite its wide applicability, traditional cross validation methods tend to select overfitting models, due to the ignorance of the…

Methodology · Statistics 2017-12-25 Jing Lei

A survey of cross-validation procedures for model selection

Used to estimate the risk of an estimator or to perform model selection, cross-validation is a widespread strategy because of its simplicity and its apparent universality. Many results exist on the model selection performances of…

Statistics Theory · Mathematics 2011-02-01 Sylvain Arlot , Alain Celisse

Cross-validation: what does it estimate and how well does it do it?

Cross-validation is a widely-used technique to estimate prediction error, but its behavior is complex and not fully understood. Ideally, one would like to think that cross-validation estimates the prediction error for the model at hand, fit…

Methodology · Statistics 2024-03-12 Stephen Bates , Trevor Hastie , Robert Tibshirani

Network cross-validation by edge sampling

While many statistical models and methods are now available for network analysis, resampling network data remains a challenging problem. Cross-validation is a useful general tool for model selection and parameter tuning, but is not directly…

Methodology · Statistics 2020-05-04 Tianxi Li , Elizaveta Levina , Ji Zhu

Cross-Validation for Nonlinear Mixed Effects Models

Cross-validation is frequently used for model selection in a variety of applications. However, it is difficult to apply cross-validation to mixed effects models (including nonlinear mixed effects models or NLME models) due to the fact that…

Methodology · Statistics 2013-05-24 Emily Colby , Eric Bair

Cross-validation estimation of covariance parameters under fixed-domain asymptotics

We consider a one-dimensional Gaussian process having exponential covariance function. Under fixed-domain asymptotics, we prove the strong consistency and asymptotic normality of a cross validation estimator of the microergodic covariance…

Statistics Theory · Mathematics 2017-07-26 Francois Bachoc , Agnes Lagnoux , Thi Mong Ngoc Nguyen

Cross-Validation with Antithetic Gaussian Randomization

We introduce a new cross-validation method based on an equicorrelated Gaussian randomization scheme. Our method is well-suited for problems where sample splitting is infeasible, either because the data violate the assumption of independent…

Methodology · Statistics 2026-02-10 Sifan Liu , Snigdha Panigrahi , Jake A. Soloff

Nonparametric inference for ratios of densities via uniformly valid and powerful permutation tests

We propose the density ratio permutation test, a hypothesis test that assesses whether the ratio between two densities is proportional to a known function based on independent samples from each distribution. The test uses an efficient…

Methodology · Statistics 2026-01-14 Alberto Bordino , Thomas B. Berrett

Partitioned Cross-Validation for Divide-and-Conquer Density Estimation

We present an efficient method to estimate cross-validation bandwidth parameters for kernel density estimation in very large datasets where ordinary cross-validation is rendered highly inefficient, both statistically and computationally.…

Methodology · Statistics 2016-09-02 Anirban Bhattacharya , Jeffrey D. Hart

Estimation of density functionals via cross-validation

In density estimation, the mean integrated squared error (MISE) is commonly used as a measure of performance. In that setting, the cross-validation criterion provides an unbiased estimator of the MISE minus the integral of the squared…

Methodology · Statistics 2024-07-30 José E. Chacón , Carlos Tenreiro

Bayesian cross-validation of geostatistical models

The problem of validating or criticising models for georeferenced data is challenging, since the conclusions can vary significantly depending on the locations of the validation set. This work proposes the use of cross-validation techniques…

Computation · Statistics 2018-02-19 Viviana G R Lobo , Thaís C O da Fonseca , Fernando A S Moura

Testing for Common Breaks in a Multiple Equations System

The issue addressed in this paper is that of testing for common breaks across or within equations of a multivariate system. Our framework is very general and allows integrated regressors and trends as well as stationary regressors. The null…

Statistics Theory · Mathematics 2018-01-12 Tatsushi Oka , Pierre Perron

Cross-validation in nonparametric regression with outliers

A popular data-driven method for choosing the bandwidth in standard kernel regression is cross-validation. Even when there are outliers in the data, robust kernel regression can be used to estimate the unknown regression curve [Robust and…

Statistics Theory · Mathematics 2007-06-13 Denis Heng-Yan Leung

A Honest Cross-Validation Estimator for Prediction Performance

Cross-validation is a standard tool for obtaining a honest assessment of the performance of a prediction model. The commonly used version repeatedly splits data, trains the prediction model on the training set, evaluates the model…

Machine Learning · Statistics 2025-10-10 Tianyu Pan , Vincent Z. Yu , Viswanath Devanarayan , Lu Tian

Fast Cross-Validation via Sequential Testing

With the increasing size of today's data sets, finding the right parameter configuration in model selection via cross-validation can be an extremely time-consuming task. In this paper we propose an improved cross-validation procedure which…

Machine Learning · Computer Science 2016-02-05 Tammo Krueger , Danny Panknin , Mikio Braun

Cross-validation Approaches for Multi-study Predictions

We consider prediction in multiple studies with potential differences in the relationships between predictors and outcomes. Our objective is to integrate data from multiple studies to develop prediction models for unseen studies. We propose…

Methodology · Statistics 2024-07-23 Boyu Ren , Prasad Patil , Francesca Dominici , Giovanni Parmigiani , Lorenzo Trippa

Testing equality of spectral densities using randomization techniques

In this paper, we investigate the testing problem that the spectral density matrices of several, not necessarily independent, stationary processes are equal. Based on an $L_2$-type test statistic, we propose a new nonparametric approach,…

Statistics Theory · Mathematics 2015-06-03 Carsten Jentsch , Markus Pauly