English
Related papers

Related papers: Testing for Overfitting

200 papers

The repeated community-wide reuse of test sets in popular benchmark problems raises doubts about the credibility of reported test-error rates. Verifying whether a learned model is overfitted to a test set is challenging as independent test…

Machine Learning · Computer Science 2019-11-15 Roman Werpachowski , András György , Csaba Szepesvári

Overfitting describes a machine learning phenomenon where the model fits too closely to the training data, resulting in poor generalization. While this occurrence is thoroughly documented for many forms of supervised learning, it is not…

Machine Learning · Computer Science 2024-08-23 Zachary Rabin , Jim Davis , Benjamin Lewis , Matthew Scherreik

Overfitting is a phenomenon that occurs when a machine learning model is trained for too long and focused too much on the exact fitness of the training samples to the provided training labels and cannot keep track of the predictive rules…

Machine Learning · Computer Science 2025-09-22 Nuri Korhan , Samet Bayram

Overfitting is a well-known issue in machine learning that occurs when a model struggles to generalize its predictions to new, unseen data beyond the scope of its training set. Traditional techniques to mitigate overfitting include early…

Machine Learning · Computer Science 2025-12-09 Flavio Giorgi , Fabiano Veglianti , Fabrizio Silvestri , Gabriele Tolomei

Assisted by the availability of data and high performance computing, deep learning techniques have achieved breakthroughs and surpassed human performance empirically in difficult tasks, including object recognition, speech recognition, and…

Machine Learning · Computer Science 2019-01-23 Shaeke Salman , Xiuwen Liu

Overfitting and generalization is an important concept in Machine Learning as only models that generalize are interesting for general applications. Yet some students have trouble learning this important concept through lectures and…

Machine Learning · Computer Science 2022-09-08 Matias Valdenegro-Toro , Matthia Sabatelli

Machine learning models suffer from overfitting, which is caused by a lack of labeled data. To tackle this problem, we proposed a framework of regularization methods, called density-fixing, that can be used commonly for supervised and…

Machine Learning · Computer Science 2020-09-08 Masanari Kimura , Ryohei Izawa

In software engineering, deep learning models are increasingly deployed for critical tasks such as bug detection and code review. However, overfitting remains a challenge that affects the quality, reliability, and trustworthiness of…

Software Engineering · Computer Science 2024-05-21 Hao Li , Gopi Krishnan Rajbahadur , Dayi Lin , Cor-Paul Bezemer , Zhen Ming , Jiang

Machine learning models that are overfitted/overtrained are more vulnerable to knowledge leakage, which poses a risk to privacy. Suppose we download or receive a model from a third-party collaborator without knowing its training accuracy.…

Machine Learning · Computer Science 2023-06-01 Hossein Rezaei , Mohammad Sabokrou

Overfitting is the bane of data analysts, even when data are plentiful. Formal approaches to understanding this problem focus on statistical inference and generalization of individual analysis procedures. Yet the practice of data analysis…

Machine Learning · Computer Science 2015-09-28 Cynthia Dwork , Vitaly Feldman , Moritz Hardt , Toniann Pitassi , Omer Reingold , Aaron Roth

Changepoint detection is commonly formulated by minimizing the sum of in-sample losses to quantify the model's overall fit. However, for flexible modeling procedures -- especially those involving high-dimensional parameter spaces or…

Methodology · Statistics 2026-05-05 Chengde Qian , Guanghui Wang , Zhaojun Wang , Changliang Zou

Genetic Programming has been very successful in solving a large area of problems but its use as a machine learning algorithm has been limited so far. One of the reasons is the problem of overfitting which cannot be solved or suppresed as…

Neural and Evolutionary Computing · Computer Science 2015-05-05 Jan Žegklitz , Petr Pošík

Excessive reuse of test data has become commonplace in today's machine learning workflows. Popular benchmarks, competitions, industrial scale tuning, among other applications, all involve test data reuse beyond guidance by statistical…

Machine Learning · Computer Science 2019-05-30 Horia Mania , John Miller , Ludwig Schmidt , Moritz Hardt , Benjamin Recht

Statistical machine learning theory often tries to give generalization guarantees of machine learning models. Those models naturally underlie some fluctuation, as they are based on a data sample. If we were unlucky, and gathered a sample…

Machine Learning · Computer Science 2022-11-21 Alexander Mey

Methods of performing anomaly detection on high-dimensional data sets are needed, since algorithms which are trained on data are only expected to perform well on data that is similar to the training data. There are theoretical results on…

Machine Learning · Computer Science 2020-11-13 Forrest Laine , Claire Tomlin

Modern machine learning models with a large number of parameters often generalize well despite perfectly interpolating noisy training data - a phenomenon known as benign overfitting. A foundational explanation for this in linear…

Machine Learning · Statistics 2025-11-18 Yuta Kondo

The reason for Meta Overfitting can be attributed to two factors: Mutual Non-exclusivity and the Lack of diversity, consequent to which a single global function can fit the support set data of all the meta-training tasks and fail to…

Machine Learning · Computer Science 2024-11-12 Manas Gogoi , Sambhavi Tiwari , Shekhar Verma

Adversarial training is a popular method to robustify models against adversarial attacks. However, it exhibits much more severe overfitting than training on clean inputs. In this work, we investigate this phenomenon from the perspective of…

Machine Learning · Computer Science 2024-12-18 Chen Liu , Zhichao Huang , Mathieu Salzmann , Tong Zhang , Sabine Süsstrunk

We present an information-theoretic framework for understanding overfitting and underfitting in machine learning and prove the formal undecidability of determining whether an arbitrary classification algorithm will overfit a dataset.…

Machine Learning · Computer Science 2020-11-10 Daniel Bashir , George D. Montanez , Sonia Sehra , Pedro Sandoval Segura , Julius Lauw

Common practice in modern machine learning involves fitting a large number of parameters relative to the number of observations. These overparameterized models can exhibit surprising generalization behavior, e.g., ``double descent'' in the…

Machine Learning · Statistics 2024-10-03 Pratik Patil , Jin-Hong Du , Ryan J. Tibshirani
‹ Prev 1 2 3 10 Next ›