English
Related papers

Related papers: SVMax: A Feature Embedding Regularizer

200 papers

Deep predictive models of neuronal activity have recently enabled several new discoveries about the selectivity and invariance of neurons in the visual cortex. These models learn a shared set of nonlinear basis functions, which are linearly…

Neurons and Cognition · Quantitative Biology 2024-06-19 Polina Turishcheva , Max Burg , Fabian H. Sinz , Alexander Ecker

Using weight decay to penalize the L2 norms of weights in neural networks has been a standard training practice to regularize the complexity of networks. In this paper, we show that a family of regularizers, including weight decay, is…

Machine Learning · Computer Science 2022-06-09 Ziquan Liu , Yufei Cui , Antoni B. Chan

We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning…

Machine Learning · Computer Science 2016-06-07 Tim Salimans , Diederik P. Kingma

This paper aims to compare different regularization strategies to address a common phenomenon, severe overfitting, in embedding-based neural networks for NLP. We chose two widely studied neural models and tasks as our testbed. We tried…

Computation and Language · Computer Science 2015-08-18 Hao Peng , Lili Mou , Ge Li , Yunchuan Chen , Yangyang Lu , Zhi Jin

Normalization methods improve both optimization and generalization of ConvNets. To further boost performance, the recently-proposed switchable normalization (SN) provides a new perspective for deep learning: it learns to select different…

Computer Vision and Pattern Recognition · Computer Science 2019-03-12 Wenqi Shao , Tianjian Meng , Jingyu Li , Ruimao Zhang , Yudian Li , Xiaogang Wang , Ping Luo

Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce weight compander (WC), a novel effective method to improve generalization by reparameterizing…

Machine Learning · Computer Science 2023-06-30 Rinor Cakaj , Jens Mehnert , Bin Yang

Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered…

Machine Learning · Computer Science 2020-06-25 Jary Pomponi , Simone Scardapane , Vincenzo Lomonaco , Aurelio Uncini

The architecture and the parameters of neural networks are often optimized independently, which requires costly retraining of the parameters whenever the architecture is modified. In this work we instead focus on growing the architecture…

Machine Learning · Computer Science 2022-06-08 Utku Evci , Bart van Merriënboer , Thomas Unterthiner , Max Vladymyrov , Fabian Pedregosa

We propose a novel regularization method, called \textit{volumization}, for neural networks. Inspired by physics, we define a physical volume for the weight parameters in neural networks, and we show that this method is an effective way of…

Machine Learning · Computer Science 2020-04-02 Liu Ziyin , Zihao Wang , Makoto Yamada , Masahito Ueda

We employ triplet loss as a feature embedding regularizer to boost classification performance. Standard architectures, like ResNet and Inception, are extended to support both losses with minimal hyper-parameter tuning. This promotes…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Ahmed Taha , Yi-Ting Chen , Teruhisa Misu , Abhinav Shrivastava , Larry Davis

Different techniques have emerged in the deep learning scenario, such as Convolutional Neural Networks, Deep Belief Networks, and Long Short-Term Memory Networks, to cite a few. In lockstep, regularization methods, which aim to prevent…

Machine Learning · Computer Science 2020-07-28 Claudio Filipi Goncalves do Santos , Danilo Colombo , Mateus Roder , João Paulo Papa

The vulnerability of neural network classifiers to adversarial attacks is a major obstacle to their deployment in safety-critical applications. Regularization of network parameters during training can be used to improve adversarial…

Machine Learning · Computer Science 2024-05-28 Sheng Yang , Jacob A. Zavatone-Veth , Cengiz Pehlevan

Feature selection in learning to rank has recently emerged as a crucial issue. Whereas several preprocessing approaches have been proposed, only a few works have been focused on integrating the feature selection into the learning process.…

Machine Learning · Computer Science 2015-07-03 Léa Laporte , Rémi Flamary , Stephane Canu , Sébastien Déjean , Josiane Mothe

Effective regularisation of neural networks is essential to combat overfitting due to the large number of parameters involved. We present an empirical analogue to the Lipschitz constant of a feed-forward neural network, which we refer to as…

Machine Learning · Statistics 2018-07-03 Henry Gouk , Bernhard Pfahringer , Eibe Frank , Michael Cree

Weight sharing has become a de facto standard in neural architecture search because it enables the search to be done on commodity hardware. However, recent works have empirically shown a ranking disorder between the performance of…

Machine Learning · Computer Science 2021-04-13 Kaicheng Yu , Rene Ranftl , Mathieu Salzmann

Deep neural networks (DNNs) have become increasingly important due to their excellent empirical performance on a wide range of problems. However, regularization is generally achieved by indirect means, largely due to the complex set of…

Machine Learning · Computer Science 2018-07-02 Amal Rannen Triki , Maxim Berman , Matthew B. Blaschko

Works on implicit regularization have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In deep linear networks, it has been shown that gradient descent…

Machine Learning · Computer Science 2023-06-02 Dan Zhao

Deep learning methods achieve great success recently on many computer vision problems, with image classification and object detection as the prominent examples. In spite of these practical successes, optimization of deep networks remains an…

Computer Vision and Pattern Recognition · Computer Science 2017-03-21 Kui Jia

Over the recent years, various deep learning-based methods were proposed for extracting a fixed-dimensional embedding vector from speech signals. Although the deep learning-based embedding extraction methods have shown good performance in…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-08 Woo Hyun Kang , Jahangir Alam , Abderrahim Fathan

We propose a compression based continual task learning method that can dynamically grow a neural network. Inspired from the recent model compression techniques, we employ compression-aware training and perform low-rank weight approximations…

Computer Vision and Pattern Recognition · Computer Science 2020-09-16 Varigonda Pavan Teja , Priyadarshini Panda
‹ Prev 1 2 3 10 Next ›