Related papers: Convolutional Normalization: Improving Deep Convol…

Normalized Convolutional Neural Network

We introduce a Normalized Convolutional Neural Layer, a novel approach to normalization in convolutional networks. Unlike conventional methods, this layer normalizes the rows of the im2col matrix during convolution, making it inherently…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Dongsuk Kim , Geonhee Lee , Myungjae Lee , Shin Uk Kang , Dongmin Kim

Regularization and Optimization strategies in Deep Convolutional Neural Network

Convolution Neural Networks, known as ConvNets exceptionally perform well in many complex machine learning tasks. The architecture of ConvNets demands the huge and rich amount of data and involves with a vast number of parameters that leads…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Pushparaja Murugan , Shanmugasundaram Durairaj

Filter Bank Regularization of Convolutional Neural Networks

Regularization techniques are widely used to improve the generality, robustness, and efficiency of deep convolutional neural networks (DCNNs). In this paper, we propose a novel approach of regulating DCNN convolutional kernels by a…

Machine Learning · Computer Science 2019-11-28 Seyed Mehdi Ayyoubzadeh , Xiaolin Wu

Preprint: Norm Loss: An efficient yet effective regularization method for deep neural networks

Convolutional neural network training can suffer from diverse issues like exploding or vanishing gradients, scaling-based weight space symmetry and covariant-shift. In order to address these issues, researchers develop weight regularization…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Theodoros Georgiou , Sebastian Schmitt , Thomas Bäck , Wei Chen , Michael Lew

Canonical convolutional neural networks

We introduce canonical weight normalization for convolutional neural networks. Inspired by the canonical tensor decomposition, we express the weight tensors in so-called canonical networks as scaled sums of outer vector products. In…

Machine Learning · Computer Science 2022-06-06 Lokesh Veeramacheneni , Moritz Wolter , Reinhard Klein , Jochen Garcke

Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?

This paper seeks to answer the question: as the (near-) orthogonality of weights is found to be a favorable property for training deep convolutional neural networks, how can we enforce it in more effective and easy-to-use ways? We develop…

Machine Learning · Computer Science 2018-10-23 Nitin Bansal , Xiaohan Chen , Zhangyang Wang

Learning a smooth kernel regularizer for convolutional neural networks

Modern deep neural networks require a tremendous amount of data to train, often needing hundreds or thousands of labeled examples to learn an effective representation. For these networks to work with less data, more structure must be built…

Computer Vision and Pattern Recognition · Computer Science 2019-03-06 Reuben Feinman , Brenden M. Lake

Convolutional Normalization

As the deep neural networks are being applied to complex tasks, the size of the networks and architecture increases and their topology becomes more complicated too. At the same time, training becomes slow and at some instances inefficient.…

Machine Learning · Computer Science 2021-02-22 Massimiliano Esposito , Nader Ganaba

Kernel Normalized Convolutional Networks

Existing convolutional neural network architectures frequently rely upon batch normalization (BatchNorm) to effectively train the model. BatchNorm, however, performs poorly with small batch sizes, and is inapplicable to differential…

Machine Learning · Computer Science 2024-03-06 Reza Nasirigerdeh , Reihaneh Torkzadehmahani , Daniel Rueckert , Georgios Kaissis

FQ-Conv: Fully Quantized Convolution for Efficient and Accurate Inference

Deep neural networks (DNNs) can be made hardware-efficient by reducing the numerical precision of the weights and activations of the network and by improving the network's resilience to noise. However, this gain in efficiency often comes at…

Machine Learning · Computer Science 2019-12-20 Bram-Ernst Verhoef , Nathan Laubeuf , Stefan Cosemans , Peter Debacker , Ioannis Papistas , Arindam Mallik , Diederik Verkest

New Interpretations of Normalization Methods in Deep Learning

In recent years, a variety of normalization methods have been proposed to help train neural networks, such as batch normalization (BN), layer normalization (LN), weight normalization (WN), group normalization (GN), etc. However,…

Machine Learning · Computer Science 2020-06-17 Jiacheng Sun , Xiangyong Cao , Hanwen Liang , Weiran Huang , Zewei Chen , Zhenguo Li

Enhancing Generalization in Convolutional Neural Networks through Regularization with Edge and Line Features

This paper proposes a novel regularization approach to bias Convolutional Neural Networks (CNNs) toward utilizing edge and line features in their hidden layers. Rather than learning arbitrary kernels, we constrain the convolution layers to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Christoph Linse , Beatrice Brückner , Thomas Martinetz

ACDC: Weight Sharing in Atom-Coefficient Decomposed Convolution

Convolutional Neural Networks (CNNs) are known to be significantly over-parametrized, and difficult to interpret, train and adapt. In this paper, we introduce a structural regularization across convolutional kernels in a CNN. In our…

Computer Vision and Pattern Recognition · Computer Science 2020-09-08 Ze Wang , Xiuyuan Cheng , Guillermo Sapiro , Qiang Qiu

Generalization Bounds for Convolutional Neural Networks

Convolutional neural networks (CNNs) have achieved breakthrough performances in a wide range of applications including image classification, semantic segmentation, and object detection. Previous research on characterizing the generalization…

Machine Learning · Statistics 2019-10-04 Shan Lin , Jingwei Zhang

Convolutional Neural Networks with Dynamic Regularization

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Yi Wang , Zhen-Peng Bian , Junhui Hou , Lap-Pui Chau

Training Deep Neural Networks Without Batch Normalization

Training neural networks is an optimization problem, and finding a decent set of parameters through gradient descent can be a difficult task. A host of techniques has been developed to aid this process before and during the training phase.…

Machine Learning · Computer Science 2020-08-19 Divya Gaur , Joachim Folz , Andreas Dengel

Optimal Weighted Convolution for Classification and Denosing

We introduce a novel weighted convolution operator that enhances traditional convolutional neural networks (CNNs) by integrating a spatial density function into the convolution operator. This extension enables the network to differentially…

Computer Vision and Pattern Recognition · Computer Science 2025-06-02 Simone Cammarasana , Giuseppe Patanè

Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization

Deep convolutional neural networks are known to be unstable during training at high learning rate unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster…

Machine Learning · Computer Science 2019-12-02 Brendan Ruff , Taylor Beck , Joscha Bach

Large Norms of CNN Layers Do Not Hurt Adversarial Robustness

Since the Lipschitz properties of convolutional neural networks (CNNs) are widely considered to be related to adversarial robustness, we theoretically characterize the $\ell_1$ norm and $\ell_\infty$ norm of 2D multi-channel convolutional…

Machine Learning · Computer Science 2021-08-17 Youwei Liang , Dong Huang

Optimization and Generalization Guarantees for Weight Normalization

Weight normalization (WeightNorm) is widely used in practice for the training of deep neural networks and modern deep learning libraries have built-in implementations of it. In this paper, we provide the first theoretical characterizations…

Machine Learning · Computer Science 2025-01-22 Pedro Cisneros-Velarde , Zhijie Chen , Sanmi Koyejo , Arindam Banerjee