Related papers: Learning Implicitly Recurrent CNNs Through Paramet…

Convexified Convolutional Neural Networks

We describe the class of convexified convolutional neural networks (CCNNs), which capture the parameter sharing of convolutional neural networks in a convex manner. By representing the nonlinear convolutional filters as vectors in a…

Machine Learning · Computer Science 2016-09-06 Yuchen Zhang , Percy Liang , Martin J. Wainwright

Low-Cost Parameterizations of Deep Convolutional Neural Networks

Convolutional Neural Networks (CNNs) filter the input data using a series of spatial convolution operators with compactly supported stencils and point-wise nonlinearities. Commonly, the convolution operators couple features from all…

Numerical Analysis · Computer Science 2018-10-04 Eran Treister , Lars Ruthotto , Michal Sharoni , Sapir Zafrani , Eldad Haber

Filter sharing: Efficient learning of parameters for volumetric convolutions

Typical convolutional neural networks (CNNs) have several millions of parameters and require a large amount of annotated data to train them. In medical applications where training data is hard to come by, these sophisticated machine…

Computer Vision and Pattern Recognition · Computer Science 2016-12-09 Rahul Venkataramani , Sheshadri Thiruvenkadam , Prasad Sudhakar , Hariharan Ravishankar , Vivek Vaidya

Convolutional Neural Networks with Layer Reuse

A convolutional layer in a Convolutional Neural Network (CNN) consists of many filters which apply convolution operation to the input, capture some special patterns and pass the result to the next layer. If the same patterns also occur at…

Computer Vision and Pattern Recognition · Computer Science 2019-02-04 Okan Köpüklü , Maryam Babaee , Stefan Hörmann , Gerhard Rigoll

ACDC: Weight Sharing in Atom-Coefficient Decomposed Convolution

Convolutional Neural Networks (CNNs) are known to be significantly over-parametrized, and difficult to interpret, train and adapt. In this paper, we introduce a structural regularization across convolutional kernels in a CNN. In our…

Computer Vision and Pattern Recognition · Computer Science 2020-09-08 Ze Wang , Xiuyuan Cheng , Guillermo Sapiro , Qiang Qiu

Recurrent Neural Networks to Correct Satellite Image Classification Maps

While initially devised for image categorization, convolutional neural networks (CNNs) are being increasingly used for the pixelwise semantic labeling of images. However, the proper nature of the most common CNN architectures makes them…

Computer Vision and Pattern Recognition · Computer Science 2017-04-24 Emmanuel Maggiori , Guillaume Charpiat , Yuliya Tarabalka , Pierre Alliez

Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition

Deep convolutional neural networks (CNNs) have been shown to be very successful in a wide range of image processing applications. However, due to their increasing number of model parameters and an increasing availability of large amounts of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Axel Klawonn , Martin Lanser , Janine Weber

Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing

Deep convolutional neural networks (DCNNs) have become the state-of-the-art (SOTA) approach for many computer vision tasks: image classification, object detection, semantic segmentation, etc. However, most SOTA networks are too large for…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Alireza Azadbakht , Saeed Reza Kheradpisheh , Ismail Khalfaoui-Hassani , Timothée Masquelier

The Power of Linear Combinations: Learning with Random Convolutions

Following the traditional paradigm of convolutional neural networks (CNNs), modern CNNs manage to keep pace with more recent, for example transformer-based, models by not only increasing model depth and width but also the kernel size. This…

Computer Vision and Pattern Recognition · Computer Science 2023-06-23 Paul Gavrikov , Janis Keuper

Towards Efficient Convolutional Network Models with Filter Distribution Templates

Increasing number of filters in deeper layers when feature maps are decreased is a widely adopted pattern in convolutional network design. It can be found in classical CNN architectures and in automatic discovered models. Even CNS methods…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Ramon Izquierdo-Cordova , Walterio Mayol-Cuevas

Training CNNs with Selective Allocation of Channels

Recent progress in deep convolutional neural networks (CNNs) have enabled a simple paradigm of architecture design: larger models typically achieve better accuracy. Due to this, in modern CNN architectures, it becomes more important to…

Machine Learning · Computer Science 2019-05-14 Jongheon Jeong , Jinwoo Shin

Convolutional Neural Networks with Recurrent Neural Filters

We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters. A convolution filter is typically implemented as a linear affine transformation followed by a non-linear…

Computation and Language · Computer Science 2018-08-29 Yi Yang

Deeply Shared Filter Bases for Parameter-Efficient Convolutional Neural Networks

Modern convolutional neural networks (CNNs) have massive identical convolution blocks, and, hence, recursive sharing of parameters across these blocks has been proposed to reduce the amount of parameters. However, naive sharing of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Woochul Kang , Daeyeon Kim

Hyper-Convolutions via Implicit Kernels for Medical Imaging

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares…

Image and Video Processing · Electrical Eng. & Systems 2022-02-08 Tianyu Ma , Alan Q. Wang , Adrian V. Dalca , Mert R. Sabuncu

RecNets: Channel-wise Recurrent Convolutional Neural Networks

In this paper, we introduce Channel-wise recurrent convolutional neural networks (RecNets), a family of novel, compact neural network architectures for computer vision tasks inspired by recurrent neural networks (RNNs). RecNets build upon…

Machine Learning · Computer Science 2020-03-23 George Retsinas , Athena Elafrou , Georgios Goumas , Petros Maragos

Associative Convolutional Layers

Motivated by the necessity for parameter efficiency in distributed machine learning and AI-enabled edge devices, we provide a general and easy to implement method for significantly reducing the number of parameters of Convolutional Neural…

Machine Learning · Computer Science 2019-08-13 Hamed Omidvar , Vahideh Akhlaghi , Massimo Franceschetti , Rajesh K. Gupta

Exploiting the Redundancy in Convolutional Filters for Parameter Reduction

Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many computer vision tasks over the years. However, this comes at the cost of heavy computation and memory intensive network designs, suggesting potential…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Kumara Kahatapitiya , Ranga Rodrigo

Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

Neural networks (NNs) struggle to efficiently solve certain problems, such as learning parities, even when there are simple learning algorithms for those problems. Can NNs discover learning algorithms on their own? We exhibit a NN…

Machine Learning · Computer Science 2023-01-18 Surbhi Goel , Sham Kakade , Adam Tauman Kalai , Cyril Zhang

Efficient Parameter Sampling for Neural Network Construction

The customizable nature of deep learning models have allowed them to be successful predictors in various disciplines. These models are often trained with respect to thousands or millions of instances for complicated problems, but the…

Machine Learning · Computer Science 2019-12-24 Drimik Roy Chowdhury , Muhammad Firmansyah Kasim

Towards Understanding Feature Learning in Parameter Transfer

Parameter transfer is a central paradigm in transfer learning, enabling knowledge reuse across tasks and domains by sharing model parameters between upstream and downstream models. However, when only a subset of parameters from the upstream…

Machine Learning · Computer Science 2026-01-08 Hua Yuan , Xuran Meng , Qiufeng Wang , Shiyu Xia , Ning Xu , Xu Yang , Jing Wang , Xin Geng , Yong Rui