Related papers: Separable Layers Enable Structured Efficient Linea…

Convolutional Networks with Dense Connectivity

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we…

Machine Learning · Computer Science 2020-01-09 Gao Huang, Zhuang Liu, Geoff Pleiss, Laurens van der Maaten, Kilian Q. Weinberger

Compute Better Spent: Replacing Dense Layers with Structured Matrices

Dense linear layers are the dominant computational bottleneck in foundation models. Identifying more efficient alternatives to dense matrices has enormous potential for building more compute-efficient models, as exemplified by the success…

Machine Learning · Computer Science 2024-06-11 Shikai Qiu, Andres Potapczynski, Marc Finzi, Micah Goldblum, Andrew Gordon Wilson

Exploring Feature Reuse in DenseNet Architectures

Densely Connected Convolutional Networks (DenseNets) have been shown to achieve state-of-the-art results on image classification tasks while using fewer parameters and computation than competing methods. Since each layer in this…

Computer Vision and Pattern Recognition · Computer Science 2018-06-07 Andy Hess

Densely Connected Convolutional Networks

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2018-01-30 Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

Reconstructing Pruned Filters using Cheap Spatial Transformations

We present an efficient alternative to the convolutional layer using cheap spatial transformations. This construction exploits an inherent spatial redundancy of the learned convolutional filters to enable a much greater parameter…

Computer Vision and Pattern Recognition · Computer Science 2023-08-25 Roy Miles, Krystian Mikolajczyk

Constructing Fast Network through Deconstruction of Convolution

Convolutional neural networks have achieved great success in various vision tasks; however, they incur heavy resource costs. By using deeper and wider networks, network accuracy can be improved rapidly. However, in an environment with…

Computer Vision and Pattern Recognition · Computer Science 2018-11-01 Yunho Jeon, Junmo Kim

CondenseNet: An Efficient DenseNet using Learned Group Convolutions

Deep neural networks are increasingly used on mobile devices, where computational resources are limited. In this paper we develop CondenseNet, a novel network architecture with unprecedented efficiency. It combines dense connectivity with a…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Gao Huang, Shichen Liu, Laurens van der Maaten, Kilian Q. Weinberger

CondConv: Conditionally Parameterized Convolutions for Efficient Inference

Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized…

Computer Vision and Pattern Recognition · Computer Science 2020-09-07 Brandon Yang, Gabriel Bender, Quoc V. Le, Jiquan Ngiam

Learning Features with Parameter-Free Layers

Trainable layers such as convolutional building blocks are the standard network design choices by learning parameters to capture the global context through successive spatial operations. When designing an efficient network, trainable layers…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo

Exploiting the Redundancy in Convolutional Filters for Parameter Reduction

Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many computer vision tasks over the years. However, this comes at the cost of heavy computation and memory intensive network designs, suggesting potential…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Kumara Kahatapitiya, Ranga Rodrigo

Accelerating Depthwise Separable Convolutions on Ultra-Low-Power Devices

Depthwise separable convolutions are a fundamental component in efficient Deep Neural Networks, as they reduce the number of parameters and operations compared to traditional convolutions while maintaining comparable accuracy. However,…

Machine Learning · Computer Science 2024-06-19 Francesco Daghero, Alessio Burrello, Massimo Poncino, Enrico Macii, Daniele Jahier Pagliari

Leveraging Sparse Linear Layers for Debuggable Deep Networks

We show how fitting sparse linear models over learned deep feature representations can lead to more debuggable neural networks. These networks remain highly accurate while also being more amenable to human interpretation, as we demonstrate…

Machine Learning · Computer Science 2021-05-12 Eric Wong, Shibani Santurkar, Aleksander Mądry

Deformably-Scaled Transposed Convolution

Transposed convolution is crucial for generating high-resolution outputs, yet has received little attention compared to convolution layers. In this work we revisit transposed convolution and introduce a novel layer that allows us to place…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Stefano B. Blumberg, Daniele Raví, Mou-Cheng Xu, Matteo Figini, Iasonas Kokkinos, Daniel C. Alexander

The Power of Sparsity in Convolutional Neural Networks

Deep convolutional networks are well-known for their high computational and memory demands. Given limited resources, how does one design a network that balances its size, training time, and prediction accuracy? A surprisingly effective…

Computer Vision and Pattern Recognition · Computer Science 2017-02-22 Soravit Changpinyo, Mark Sandler, Andrey Zhmoginov

Deformable Butterfly: A Highly Structured and Sparse Linear Transform

We introduce a new kind of linear transform named Deformable Butterfly (DeBut) that generalizes the conventional butterfly matrices and can be adapted to various input-output dimensions. It inherits the fine-to-coarse-grained learnable…

Computer Vision and Pattern Recognition · Computer Science 2022-03-28 Rui Lin, Jie Ran, King Hung Chiu, Graziano Chesi, Ngai Wong

Boosting Dilated Convolutional Networks with Mixed Tensor Decompositions

The driving force behind deep networks is their ability to compactly represent rich classes of functions. The primary notion for formally reasoning about this phenomenon is expressive efficiency, which refers to a situation where one…

Machine Learning · Computer Science 2018-02-14 Nadav Cohen, Ronen Tamari, Amnon Shashua

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Linear recurrent neural networks enable powerful long-range sequence modeling with constant memory usage and time-per-token during inference. These architectures hold promise for streaming applications at the edge, but deployment in…

Machine Learning · Computer Science 2025-08-14 Alessandro Pierro, Steven Abreu, Jonathan Timcheck, Philipp Stratmann, Andreas Wild, Sumit Bam Shrestha

Learning Task-Specific Generalized Convolutions in the Permutohedral Lattice

Dense prediction tasks typically employ encoder-decoder architectures, but the prevalent convolutions in the decoder are not image-adaptive and can lead to boundary artifacts. Different generalized convolution operations have been…

Computer Vision and Pattern Recognition · Computer Science 2019-09-10 Anne S. Wannenwetsch, Martin Kiefel, Peter V. Gehler, Stefan Roth

Convolutional Neural Networks with Layer Reuse

A convolutional layer in a Convolutional Neural Network (CNN) consists of many filters which apply convolution operation to the input, capture some special patterns and pass the result to the next layer. If the same patterns also occur at…

Computer Vision and Pattern Recognition · Computer Science 2019-02-04 Okan Köpüklü, Maryam Babaee, Stefan Hörmann, Gerhard Rigoll

Selective Depthwise Separable Convolution for Lightweight Joint Source-Channel Coding in Wireless Image Transmission

Depthwise separable convolutional (DSConv) layers have been successfully applied to deep learning (DL)-based joint source-channel coding (JSCC) schemes to reduce computational complexity. However, a systematic investigation of the layerwise…

Image and Video Processing · Electrical Eng. & Systems 2026-04-27 Ming Ye, Kui Cai, Cunhua Pan, Zhen Mei, Wanting Yang, Chunguo Li