Related papers: Learning Less-Overlapping Representations

AL2: Progressive Activation Loss for Learning General Representations in Classification Neural Networks

The large capacity of neural networks enables them to learn complex functions. To avoid overfitting, networks however require a lot of training data that can be expensive and time-consuming to collect. A common practical approach to…

Machine Learning · Computer Science 2020-03-10 Majed El Helou, Frederike Dümbgen, Sabine Süsstrunk

Learning Compact Neural Networks with Regularization

Proper regularization is critical for speeding up training, improving generalization performance, and learning compact models that are cost efficient. We propose and analyze regularized gradient descent algorithms for learning shallow…

Machine Learning · Computer Science 2018-06-08 Samet Oymak

Learning Sparse Visual Representations with Leaky Capped Norm Regularizers

Sparsity inducing regularization is an important part for learning over-complete visual representations. Despite the popularity of $\ell_1$ regularization, in this paper, we investigate the usage of non-convex regularizations in this…

Machine Learning · Computer Science 2017-11-09 Jianqiao Wangni, Dahua Lin

Reducing Overfitting in Deep Networks by Decorrelating Representations

One major challenge in training Deep Neural Networks is preventing overfitting. Many techniques such as data augmentation and novel regularizers such as Dropout have been proposed to prevent overfitting without requiring a massive amount of…

Machine Learning · Computer Science 2016-06-13 Michael Cogswell, Faruk Ahmed, Ross Girshick, Larry Zitnick, Dhruv Batra

Learning a smooth kernel regularizer for convolutional neural networks

Modern deep neural networks require a tremendous amount of data to train, often needing hundreds or thousands of labeled examples to learn an effective representation. For these networks to work with less data, more structure must be built…

Computer Vision and Pattern Recognition · Computer Science 2019-03-06 Reuben Feinman, Brenden M. Lake

Reproducibility of predictive networks for mouse visual cortex

Deep predictive models of neuronal activity have recently enabled several new discoveries about the selectivity and invariance of neurons in the visual cortex. These models learn a shared set of nonlinear basis functions, which are linearly…

Neurons and Cognition · Quantitative Biology 2024-06-19 Polina Turishcheva, Max Burg, Fabian H. Sinz, Alexander Ecker

Neural Networks Regularization Through Representation Learning

Neural network models and deep models are one of the leading and state of the art models in machine learning. Most successful deep neural models are the ones with many layers which highly increases their number of parameters. Training such…

Machine Learning · Computer Science 2018-07-17 Soufiane Belharbi

ConsistentFeature: A Plug-and-Play Component for Neural Network Regularization

Over-parameterized neural network models often lead to significant performance discrepancies between training and test sets, a phenomenon known as overfitting. To address this, researchers have proposed numerous regularization techniques…

Machine Learning · Computer Science 2025-01-27 RuiZhe Jiang, Haotian Lei

Adaptive Low-Rank Regularization with Damping Sequences to Restrict Lazy Weights in Deep Networks

Overfitting is one of the critical problems in deep neural networks. Many regularization schemes try to prevent overfitting blindly. However, they decrease the convergence speed of training algorithms. Adaptive regularization schemes can…

Machine Learning · Computer Science 2021-06-18 Mohammad Mahdi Bejani, Mehdi Ghatee

Sample-based Regularization: A Transfer Learning Strategy Toward Better Generalization

Training a deep neural network with a small amount of data is a challenging problem as it is vulnerable to overfitting. However, one of the practical difficulties that we often face is to collect many samples. Transfer learning is a…

Machine Learning · Computer Science 2020-07-13 Yunho Jeon, Yongseok Choi, Jaesun Park, Subin Yi, Dongyeon Cho, Jiwon Kim

Weight Compander: A Simple Weight Reparameterization for Regularization

Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce weight compander (WC), a novel effective method to improve generalization by reparameterizing…

Machine Learning · Computer Science 2023-06-30 Rinor Cakaj, Jens Mehnert, Bin Yang

Regularized Linear Regression for Binary Classification

Regularized linear regression is a promising approach for binary classification problems in which the training set has noisy labels since the regularization term can help to avoid interpolating the mislabeled data points. In this paper we…

Machine Learning · Computer Science 2023-11-07 Danil Akhtiamov, Reza Ghane, Babak Hassibi

On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity

Most complex machine learning and modelling techniques are prone to over-fitting and may subsequently generalise poorly to future data. Artificial neural networks are no different in this regard and, despite having a level of implicit…

Machine Learning · Statistics 2022-05-26 Vincent Szolnoky, Viktor Andersson, Balazs Kulcsar, Rebecka Jörnsten

ReCoRe: Regularized Contrastive Representation Learning of World Model

While recent model-free Reinforcement Learning (RL) methods have demonstrated human-level effectiveness in gaming environments, their success in everyday tasks like visual navigation has been limited, particularly under significant…

Machine Learning · Computer Science 2024-04-04 Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla

Spectral regularization for adversarially-robust representation learning

The vulnerability of neural network classifiers to adversarial attacks is a major obstacle to their deployment in safety-critical applications. Regularization of network parameters during training can be used to improve adversarial…

Machine Learning · Computer Science 2024-05-28 Sheng Yang, Jacob A. Zavatone-Veth, Cengiz Pehlevan

A Comparative Study on Regularization Strategies for Embedding-based Neural Networks

This paper aims to compare different regularization strategies to address a common phenomenon, severe overfitting, in embedding-based neural networks for NLP. We chose two widely studied neural models and tasks as our testbed. We tried…

Computation and Language · Computer Science 2015-08-18 Hao Peng, Lili Mou, Ge Li, Yunchuan Chen, Yangyang Lu, Zhi Jin

Transfer RL across Observation Feature Spaces via Model-Based Regularization

In many reinforcement learning (RL) applications, the observation space is specified by human developers and restricted by physical realizations, and may thus be subject to dramatic changes over time (e.g. increased number of observable…

Machine Learning · Computer Science 2022-04-07 Yanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew Cohen, Furong Huang

Learning Disentangled Textual Representations via Statistical Measures of Similarity

When working with textual data, a natural application of disentangled representations is fair classification where the goal is to make predictions without being biased (or influenced) by sensitive attributes that may be present in the data…

Computation and Language · Computer Science 2022-10-10 Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida

Neural Networks Regularization Through Class-wise Invariant Representation Learning

Training deep neural networks is known to require a large number of training samples. However, in many applications only few training samples are available. In this work, we tackle the issue of training neural networks for classification…

Machine Learning · Computer Science 2017-12-25 Soufiane Belharbi, Clément Chatelain, Romain Hérault, Sébastien Adam

Robustness via Deep Low-Rank Representations

We investigate the effect of the dimensionality of the representations learned in Deep Neural Networks (DNNs) on their robustness to input perturbations, both adversarial and random. To achieve low dimensionality of learned representations,…

Machine Learning · Computer Science 2020-02-20 Amartya Sanyal, Varun Kanade, Philip H. S. Torr, Puneet K. Dokania