Related papers: Guided Layer-wise Learning for Deep Models using S…

Regularizing Class-wise Predictions via Self-knowledge Distillation

Deep neural networks with millions of parameters may suffer from poor generalization due to overfitting. To mitigate the issue, we propose a new regularization method that penalizes the predictive distribution between similar samples. In…

Machine Learning · Computer Science 2020-04-08 Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Label-Based Diversity Measure Among Hidden Units of Deep Neural Networks: A Regularization Method

Although the deep structure guarantees the powerful expressivity of deep networks (DNNs), it also triggers a serious overfitting problem. To improve the generalization capacity of DNNs, many strategies were developed to improve the diversity…

Machine Learning · Computer Science 2021-04-06 Chenguang Zhang, Yuexian Hou, Dawei Song, Liangzhu Ge, Yaoshuai Yao

Regularising Deep Networks with Deep Generative Models

We develop a new method for regularising neural networks. We learn a probability distribution over the activations of all layers of the model and then insert imputed values into the network during training. We obtain a posterior for an…

Machine Learning · Computer Science 2019-10-14 Matthew Willetts, Alexander Camuto, Stephen Roberts, Chris Holmes

Proximal Mapping for Deep Regularization

Underpinning the success of deep learning are effective regularizations that allow a variety of priors in data to be modeled, for example robustness to adversarial perturbations and correlations between multiple modalities. However, most…

Machine Learning · Computer Science 2020-06-16 Mao Li, Yingyi Ma, Xinhua Zhang

Adaptive Discriminative Regularization for Visual Classification

How to improve discriminative feature learning is central in classification. Existing works address this problem by explicitly increasing inter-class separability and intra-class similarity, whether by constructing positive and negative…

Machine Learning · Computer Science 2024-08-21 Qingsong Zhao, Yi Wang, Shuguang Dou, Chen Gong, Yin Wang, Cairong Zhao

Generalized Deep Learning-based Proximal Gradient Descent for MR Reconstruction

The data consistency for the physical forward model is crucial in inverse problems, especially in MR imaging reconstruction. The standard way is to unroll an iterative algorithm into a neural network with a forward model embedded. The…

Image and Video Processing · Electrical Eng. & Systems 2023-06-28 Guanxiong Luo, Mengmeng Kuang, Peng Cao

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

Training machine learning models based on neural networks requires large datasets, which may contain sensitive information. The models, however, should not expose private information from these datasets. Differentially private SGD [DP-SGD]…

Machine Learning · Computer Science 2024-09-26 Francisco Aguilera-Martínez, Fernando Berzal

Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks

Works on implicit regularization have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In deep linear networks, it has been shown that gradient descent…

Machine Learning · Computer Science 2023-06-02 Dan Zhao

Learning Compact Neural Networks with Regularization

Proper regularization is critical for speeding up training, improving generalization performance, and learning compact models that are cost efficient. We propose and analyze regularized gradient descent algorithms for learning shallow…

Machine Learning · Computer Science 2018-06-08 Samet Oymak

Robust Neural Network Classification via Double Regularization

The presence of mislabeled observations in data is a notoriously challenging problem in statistics and machine learning, associated with poor generalization properties for both traditional classifiers and, perhaps even more so, flexible…

Machine Learning · Statistics 2022-02-09 Olof Zetterqvist, Rebecka Jörnsten, Johan Jonasson

Hidden Classification Layers: Enhancing linear separability between classes in neural networks layers

In the context of classification problems, Deep Learning (DL) approaches represent the state of the art. Many DL approaches are based on variations of standard multi-layer feed-forward neural networks. These are also referred to as deep networks.…

Machine Learning · Computer Science 2023-11-21 Andrea Apicella, Francesco Isgrò, Roberto Prevete

Multi-level Distance Regularization for Deep Metric Learning

We propose a novel distance-based regularization method for deep metric learning called Multi-level Distance Regularization (MDR). MDR explicitly disturbs a learning procedure by regularizing pairwise distances between embedding vectors…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Yonghyun Kim, Wonpyo Park

With Greater Distance Comes Worse Performance: On the Perspective of Layer Utilization and Model Generalization

Generalization of deep neural networks remains one of the main open problems in machine learning. Previous theoretical works focused on deriving tight bounds of model complexity, while empirical works revealed that neural networks exhibit…

Machine Learning · Computer Science 2022-01-31 James Wang, Cheng-Lin Yang

Deep supervised learning using local errors

Error backpropagation is a highly effective mechanism for learning high-quality hierarchical features in deep networks. Updating the features or weights in one layer, however, requires waiting for the propagation of error signals from…

Neural and Evolutionary Computing · Computer Science 2017-11-21 Hesham Mostafa, Vishwajith Ramesh, Gert Cauwenberghs

Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias

Gradient regularization (GR) is a method that penalizes the gradient norm of the training loss during training. While some studies have reported that GR can improve generalization performance, little attention has been paid to it from the…

Machine Learning · Computer Science 2023-02-06 Ryo Karakida, Tomoumi Takase, Tomohiro Hayase, Kazuki Osawa

Learning Not to Learn: Training Deep Neural Networks with Biased Data

We propose a novel regularization algorithm to train deep neural networks when the data at training time is severely biased. Since a neural network efficiently learns the data distribution, a network is likely to learn the bias information to…

Computer Vision and Pattern Recognition · Computer Science 2019-04-16 Byungju Kim, Hyunwoo Kim, Kyungsu Kim, Sungjin Kim, Junmo Kim

DL-Reg: A Deep Learning Regularization Technique using Linear Regression

Regularization plays a vital role in deep learning by protecting deep neural networks from overfitting. This paper proposes a novel deep learning regularization method named DL-Reg, which carefully reduces…

Machine Learning · Computer Science 2020-11-05 Maryam Dialameh, Ali Hamzeh, Hossein Rahmani

Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations

Neural networks are among the most accurate supervised learning methods in use today, but their opacity makes them difficult to trust in critical applications, especially when conditions in training differ from those in test. Recent work on…

Machine Learning · Computer Science 2017-11-15 Andrew Slavin Ross, Michael C. Hughes, Finale Doshi-Velez

Multilevel-in-Layer Training for Deep Neural Network Regression

A common challenge in regression is that for many problems, the degrees of freedom required for a high-quality solution also allow for overfitting. Regularization is a class of strategies that seek to restrict the range of possible…

Machine Learning · Computer Science 2022-11-15 Colin Ponce, Ruipeng Li, Christina Mao, Panayot Vassilevski

Training Neural Networks with Local Error Signals

Supervised training of neural networks for classification is typically performed with a global loss function. The loss function provides a gradient for the output layer, and this gradient is back-propagated to hidden layers to dictate an…

Machine Learning · Statistics 2019-05-09 Arild Nøkland, Lars Hiller Eidnes