English
Related papers

Related papers: Convolutional Normalization: Improving Deep Convol…

200 papers

We introduce a Normalized Convolutional Neural Layer, a novel approach to normalization in convolutional networks. Unlike conventional methods, this layer normalizes the rows of the im2col matrix during convolution, making it inherently…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Dongsuk Kim , Geonhee Lee , Myungjae Lee , Shin Uk Kang , Dongmin Kim

Convolution Neural Networks, known as ConvNets exceptionally perform well in many complex machine learning tasks. The architecture of ConvNets demands the huge and rich amount of data and involves with a vast number of parameters that leads…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Pushparaja Murugan , Shanmugasundaram Durairaj

Regularization techniques are widely used to improve the generality, robustness, and efficiency of deep convolutional neural networks (DCNNs). In this paper, we propose a novel approach of regulating DCNN convolutional kernels by a…

Machine Learning · Computer Science 2019-11-28 Seyed Mehdi Ayyoubzadeh , Xiaolin Wu

Convolutional neural network training can suffer from diverse issues like exploding or vanishing gradients, scaling-based weight space symmetry and covariant-shift. In order to address these issues, researchers develop weight regularization…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Theodoros Georgiou , Sebastian Schmitt , Thomas Bäck , Wei Chen , Michael Lew

We introduce canonical weight normalization for convolutional neural networks. Inspired by the canonical tensor decomposition, we express the weight tensors in so-called canonical networks as scaled sums of outer vector products. In…

Machine Learning · Computer Science 2022-06-06 Lokesh Veeramacheneni , Moritz Wolter , Reinhard Klein , Jochen Garcke

This paper seeks to answer the question: as the (near-) orthogonality of weights is found to be a favorable property for training deep convolutional neural networks, how can we enforce it in more effective and easy-to-use ways? We develop…

Machine Learning · Computer Science 2018-10-23 Nitin Bansal , Xiaohan Chen , Zhangyang Wang

Modern deep neural networks require a tremendous amount of data to train, often needing hundreds or thousands of labeled examples to learn an effective representation. For these networks to work with less data, more structure must be built…

Computer Vision and Pattern Recognition · Computer Science 2019-03-06 Reuben Feinman , Brenden M. Lake

As the deep neural networks are being applied to complex tasks, the size of the networks and architecture increases and their topology becomes more complicated too. At the same time, training becomes slow and at some instances inefficient.…

Machine Learning · Computer Science 2021-02-22 Massimiliano Esposito , Nader Ganaba

Existing convolutional neural network architectures frequently rely upon batch normalization (BatchNorm) to effectively train the model. BatchNorm, however, performs poorly with small batch sizes, and is inapplicable to differential…

Machine Learning · Computer Science 2024-03-06 Reza Nasirigerdeh , Reihaneh Torkzadehmahani , Daniel Rueckert , Georgios Kaissis

Deep neural networks (DNNs) can be made hardware-efficient by reducing the numerical precision of the weights and activations of the network and by improving the network's resilience to noise. However, this gain in efficiency often comes at…

In recent years, a variety of normalization methods have been proposed to help train neural networks, such as batch normalization (BN), layer normalization (LN), weight normalization (WN), group normalization (GN), etc. However,…

Machine Learning · Computer Science 2020-06-17 Jiacheng Sun , Xiangyong Cao , Hanwen Liang , Weiran Huang , Zewei Chen , Zhenguo Li

This paper proposes a novel regularization approach to bias Convolutional Neural Networks (CNNs) toward utilizing edge and line features in their hidden layers. Rather than learning arbitrary kernels, we constrain the convolution layers to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Christoph Linse , Beatrice Brückner , Thomas Martinetz

Convolutional Neural Networks (CNNs) are known to be significantly over-parametrized, and difficult to interpret, train and adapt. In this paper, we introduce a structural regularization across convolutional kernels in a CNN. In our…

Computer Vision and Pattern Recognition · Computer Science 2020-09-08 Ze Wang , Xiuyuan Cheng , Guillermo Sapiro , Qiang Qiu

Convolutional neural networks (CNNs) have achieved breakthrough performances in a wide range of applications including image classification, semantic segmentation, and object detection. Previous research on characterizing the generalization…

Machine Learning · Statistics 2019-10-04 Shan Lin , Jingwei Zhang

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Yi Wang , Zhen-Peng Bian , Junhui Hou , Lap-Pui Chau

Training neural networks is an optimization problem, and finding a decent set of parameters through gradient descent can be a difficult task. A host of techniques has been developed to aid this process before and during the training phase.…

Machine Learning · Computer Science 2020-08-19 Divya Gaur , Joachim Folz , Andreas Dengel

We introduce a novel weighted convolution operator that enhances traditional convolutional neural networks (CNNs) by integrating a spatial density function into the convolution operator. This extension enables the network to differentially…

Computer Vision and Pattern Recognition · Computer Science 2025-06-02 Simone Cammarasana , Giuseppe Patanè

Deep convolutional neural networks are known to be unstable during training at high learning rate unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster…

Machine Learning · Computer Science 2019-12-02 Brendan Ruff , Taylor Beck , Joscha Bach

Since the Lipschitz properties of convolutional neural networks (CNNs) are widely considered to be related to adversarial robustness, we theoretically characterize the $\ell_1$ norm and $\ell_\infty$ norm of 2D multi-channel convolutional…

Machine Learning · Computer Science 2021-08-17 Youwei Liang , Dong Huang

Weight normalization (WeightNorm) is widely used in practice for the training of deep neural networks and modern deep learning libraries have built-in implementations of it. In this paper, we provide the first theoretical characterizations…

Machine Learning · Computer Science 2025-01-22 Pedro Cisneros-Velarde , Zhijie Chen , Sanmi Koyejo , Arindam Banerjee
‹ Prev 1 2 3 10 Next ›