Related papers: Robust Neural Network Classification via Double Re…

Manifold Regularized Discriminative Neural Networks

Unregularized deep neural networks (DNNs) can be easily overfit with a limited sample size. We argue that this is mostly due to the disriminative nature of DNNs which directly model the conditional probability (or score) of labels given the…

Machine Learning · Computer Science 2016-01-11 Shuangfei Zhai , Zhongfei Zhang

Improving Generalization of Deep Fault Detection Models in the Presence of Mislabeled Data

Mislabeled samples are ubiquitous in real-world datasets as rule-based or expert labeling is usually based on incorrect assumptions or subject to biased opinions. Neural networks can "memorize" these mislabeled samples and, as a result,…

Machine Learning · Computer Science 2021-11-24 Katharina Rombach , Gabriel Michau , Olga Fink

Regularized Linear Regression for Binary Classification

Regularized linear regression is a promising approach for binary classification problems in which the training set has noisy labels since the regularization term can help to avoid interpolating the mislabeled data points. In this paper we…

Machine Learning · Computer Science 2023-11-07 Danil Akhtiamov , Reza Ghane , Babak Hassibi

Deep Bilevel Learning

We present a novel regularization approach to train neural networks that enjoys better generalization and test error than standard stochastic gradient descent. Our approach is based on the principles of cross-validation, where a validation…

Computer Vision and Pattern Recognition · Computer Science 2018-09-06 Simon Jenni , Paolo Favaro

Regularizing Neural Networks by Penalizing Confident Output Distributions

We systematically explore regularizing neural networks by penalizing low entropy output distributions. We show that penalizing low entropy output distributions, which has been shown to improve exploration in reinforcement learning, acts as…

Neural and Evolutionary Computing · Computer Science 2017-01-24 Gabriel Pereyra , George Tucker , Jan Chorowski , Łukasz Kaiser , Geoffrey Hinton

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

Large training datasets almost always contain examples with inaccurate or incorrect labels. Deep Neural Networks (DNNs) tend to overfit training label noise, resulting in poorer model performance in practice. To address this problem, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Chen Gong , Kong Bin , Eric J. Seibel , Xin Wang , Youbing Yin , Qi Song

rSDNet: Unified Robust Neural Learning against Label Noise and Adversarial Attacks

Neural networks are central to modern artificial intelligence, yet their training remains highly sensitive to data contamination. Standard neural classifiers are trained by minimizing the categorical cross-entropy loss, corresponding to…

Machine Learning · Statistics 2026-03-19 Suryasis Jana , Abhik Ghosh

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Over-parameterized deep neural networks trained by simple first-order methods are known to be able to fit any labeling of data. Such over-fitting ability hinders generalization when mislabeled training examples are present. On the other…

Machine Learning · Computer Science 2020-10-06 Wei Hu , Zhiyuan Li , Dingli Yu

SoftTarget Regularization: An Effective Technique to Reduce Over-Fitting in Neural Networks

Deep neural networks are learning models with a very high capacity and therefore prone to over-fitting. Many regularization techniques such as Dropout, DropConnect, and weight decay all attempt to solve the problem of over-fitting by…

Machine Learning · Computer Science 2016-12-06 Armen Aghajanyan

Guided Layer-wise Learning for Deep Models using Side Information

Training of deep models for classification tasks is hindered by local minima problems and vanishing gradients, while unsupervised layer-wise pretraining does not exploit information from class labels. Here, we propose a new regularization…

Machine Learning · Computer Science 2019-11-07 Pavel Sulimov , Elena Sukmanova , Roman Chereshnev , Attila Kertesz-Farkas

Dual Pattern Learning Networks by Empirical Dual Prediction Risk Minimization

Motivated by the observation that humans can learn patterns from two given images at one time, we propose a dual pattern learning network architecture in this paper. Unlike conventional networks, the proposed architecture has two input…

Computer Vision and Pattern Recognition · Computer Science 2018-06-12 Haimin Zhang , Min Xu

Learning Deep Networks from Noisy Labels with Dropout Regularization

Large datasets often have unreliable labels-such as those obtained from Amazon's Mechanical Turk or social media platforms-and classifiers trained on mislabeled datasets often exhibit poor performance. We present a simple, effective…

Computer Vision and Pattern Recognition · Computer Science 2017-05-10 Ishan Jindal , Matthew Nokleby , Xuewen Chen

Label-Based Diversity Measure Among Hidden Units of Deep Neural Networks: A Regularization Method

Although the deep structure guarantees the powerful expressivity of deep networks (DNNs), it also triggers serious overfitting problem. To improve the generalization capacity of DNNs, many strategies were developed to improve the diversity…

Machine Learning · Computer Science 2021-04-06 Chenguang Zhang , Yuexian Hou , Dawei Song , Liangzhu Ge , Yaoshuai Yao

Robust Linear Classification from Limited Training Data

We consider the problem of linear classification under general loss functions in the limited-data setting. Overfitting is a common problem here. The standard approaches to prevent overfitting are dimensionality reduction and regularization.…

Machine Learning · Computer Science 2021-11-22 Deepayan Chakrabarti

Selective Learning: Towards Robust Calibration with Dynamic Regularization

Miscalibration in deep learning refers to there is a discrepancy between the predicted confidence and performance. This problem usually arises due to the overfitting problem, which is characterized by learning everything presented in the…

Machine Learning · Computer Science 2024-07-16 Zongbo Han , Yifeng Yang , Changqing Zhang , Linjun Zhang , Joey Tianyi Zhou , Qinghua Hu

Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Recent works have shown that deep neural networks can achieve super-human performance in a wide range of image classification tasks in the medical imaging domain. However, these works have primarily focused on classification accuracy,…

Computer Vision and Pattern Recognition · Computer Science 2020-09-10 Gongbo Liang , Yu Zhang , Xiaoqin Wang , Nathan Jacobs

DL-Reg: A Deep Learning Regularization Technique using Linear Regression

Regularization plays a vital role in the context of deep learning by preventing deep neural networks from the danger of overfitting. This paper proposes a novel deep learning regularization method named as DL-Reg, which carefully reduces…

Machine Learning · Computer Science 2020-11-05 Maryam Dialameh , Ali Hamzeh , Hossein Rahmani

Fighting over-fitting with quantization for learning deep neural networks on noisy labels

The rising performance of deep neural networks is often empirically attributed to an increase in the available computational power, which allows complex models to be trained upon large amounts of annotated data. However, increased model…

Computer Vision and Pattern Recognition · Computer Science 2023-03-22 Gauthier Tallec , Edouard Yvinec , Arnaud Dapogny , Kevin Bailly

Improving the Certified Robustness of Neural Networks via Consistency Regularization

A range of defense methods have been proposed to improve the robustness of neural networks on adversarial examples, among which provable defense methods have been demonstrated to be effective to train neural networks that are certifiably…

Machine Learning · Computer Science 2021-01-21 Mengting Xu , Tao Zhang , Zhongnian Li , Daoqiang Zhang

AL2: Progressive Activation Loss for Learning General Representations in Classification Neural Networks

The large capacity of neural networks enables them to learn complex functions. To avoid overfitting, networks however require a lot of training data that can be expensive and time-consuming to collect. A common practical approach to…

Machine Learning · Computer Science 2020-03-10 Majed El Helou , Frederike Dümbgen , Sabine Süsstrunk