English
Related papers

Related papers: ConsistentFeature: A Plug-and-Play Component for N…

200 papers

Neural networks have attracted a lot of attention due to its success in applications such as natural language processing and computer vision. For large scale data, due to the tremendous number of parameters in neural networks, overfitting…

Machine Learning · Statistics 2022-07-05 Xiaoxi Shen , Jinghang Lin

Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding…

Computer Vision and Pattern Recognition · Computer Science 2022-01-11 Claudio Filipi Gonçalves dos Santos , João Paulo Papa

Overfitting is a well-known issue in machine learning that occurs when a model struggles to generalize its predictions to new, unseen data beyond the scope of its training set. Traditional techniques to mitigate overfitting include early…

Machine Learning · Computer Science 2025-12-09 Flavio Giorgi , Fabiano Veglianti , Fabrizio Silvestri , Gabriele Tolomei

This paper aims to compare different regularization strategies to address a common phenomenon, severe overfitting, in embedding-based neural networks for NLP. We chose two widely studied neural models and tasks as our testbed. We tried…

Computation and Language · Computer Science 2015-08-18 Hao Peng , Lili Mou , Ge Li , Yunchuan Chen , Yangyang Lu , Zhi Jin

Overfitting is a phenomenon that occurs when a machine learning model is trained for too long and focused too much on the exact fitness of the training samples to the provided training labels and cannot keep track of the predictive rules…

Machine Learning · Computer Science 2025-09-22 Nuri Korhan , Samet Bayram

Deep neural networks are learning models with a very high capacity and therefore prone to over-fitting. Many regularization techniques such as Dropout, DropConnect, and weight decay all attempt to solve the problem of over-fitting by…

Machine Learning · Computer Science 2016-12-06 Armen Aghajanyan

Neural network models and deep models are one of the leading and state of the art models in machine learning. Most successful deep neural models are the ones with many layers which highly increases their number of parameters. Training such…

Machine Learning · Computer Science 2018-07-17 Soufiane Belharbi

Although numerous methods to reduce the overfitting of convolutional neural networks (CNNs) exist, it is still not clear how to confidently measure the degree of overfitting. A metric reflecting the overfitting level might be, however,…

Machine Learning · Computer Science 2022-09-28 Svetlana Pavlitskaya , Joël Oswald , J. Marius Zöllner

Machine learning models suffer from overfitting, which is caused by a lack of labeled data. To tackle this problem, we proposed a framework of regularization methods, called density-fixing, that can be used commonly for supervised and…

Machine Learning · Computer Science 2020-09-08 Masanari Kimura , Ryohei Izawa

Model regularization requires extensive manual tuning to balance complexity against overfitting. Cross-regularization resolves this tradeoff by directly adapting regularization parameters through validation gradients during training. The…

Machine Learning · Computer Science 2025-06-25 Carlos Stein Brito

Convolution neural networks have achieved remarkable performance in many tasks of computing vision. However, CNN tends to bias to low frequency components. They prioritize capturing low frequency patterns which lead them fail when suffering…

Machine Learning · Computer Science 2020-07-08 Weiyu Guo , Yidong Ouyang

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Yi Wang , Zhen-Peng Bian , Junhui Hou , Lap-Pui Chau

Adversarial training (AT) is currently one of the most successful methods to obtain the adversarial robustness of deep neural networks. However, the phenomenon of robust overfitting, i.e., the robustness starts to decrease significantly…

Machine Learning · Computer Science 2021-12-23 Jihoon Tack , Sihyun Yu , Jongheon Jeong , Minseon Kim , Sung Ju Hwang , Jinwoo Shin

One major challenge in training Deep Neural Networks is preventing overfitting. Many techniques such as data augmentation and novel regularizers such as Dropout have been proposed to prevent overfitting without requiring a massive amount of…

Machine Learning · Computer Science 2016-06-13 Michael Cogswell , Faruk Ahmed , Ross Girshick , Larry Zitnick , Dhruv Batra

Deep neural networks possess strong representational capacity yet remain vulnerable to overfitting, primarily because neurons tend to co-adapt in ways that, while capturing complex and fine-grained feature interactions, also reinforce…

Machine Learning · Computer Science 2025-12-16 Gelesh G Omathil , Sreeja CS

Despite being able to capture a range of features of the data, high accuracy models trained with supervision tend to make similar predictions. This seemingly implies that high-performing models share similar biases regardless of training…

Machine Learning · Computer Science 2022-04-27 Raphael Gontijo-Lopes , Yann Dauphin , Ekin D. Cubuk

Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance. Injecting noises to hidden units during training, e.g., dropout, is…

Machine Learning · Computer Science 2017-11-10 Hyeonwoo Noh , Tackgeun You , Jonghwan Mun , Bohyung Han

Assisted by the availability of data and high performance computing, deep learning techniques have achieved breakthroughs and surpassed human performance empirically in difficult tasks, including object recognition, speech recognition, and…

Machine Learning · Computer Science 2019-01-23 Shaeke Salman , Xiuwen Liu

The strength of machine learning models stems from their ability to learn complex function approximations from data; however, this strength also makes training deep neural networks challenging. Notably, the complex models tend to memorize…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Mofassir ul Islam Arif , Mohsan Jameel , Josif Grabocka , Lars Schmidt-Thieme

Overfitting is one of the most common problems when training deep neural networks on comparatively small datasets. Here, we demonstrate that neural network activation sparsity is a reliable indicator for overfitting which we utilize to…

Machine Learning · Computer Science 2020-02-24 Karim Huesmann , Soeren Klemm , Lars Linsen , Benjamin Risse
‹ Prev 1 2 3 10 Next ›