English
Related papers

Related papers: Separable Layers Enable Structured Efficient Linea…

200 papers

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we…

Machine Learning · Computer Science 2020-01-09 Gao Huang , Zhuang Liu , Geoff Pleiss , Laurens van der Maaten , Kilian Q. Weinberger

Dense linear layers are the dominant computational bottleneck in foundation models. Identifying more efficient alternatives to dense matrices has enormous potential for building more compute-efficient models, as exemplified by the success…

Machine Learning · Computer Science 2024-06-11 Shikai Qiu , Andres Potapczynski , Marc Finzi , Micah Goldblum , Andrew Gordon Wilson

Densely Connected Convolutional Networks (DenseNets) have been shown to achieve state-of-the-art results on image classification tasks while using fewer parameters and computation than competing methods. Since each layer in this…

Computer Vision and Pattern Recognition · Computer Science 2018-06-07 Andy Hess

Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2018-01-30 Gao Huang , Zhuang Liu , Laurens van der Maaten , Kilian Q. Weinberger

We present an efficient alternative to the convolutional layer using cheap spatial transformations. This construction exploits an inherent spatial redundancy of the learned convolutional filters to enable a much greater parameter…

Computer Vision and Pattern Recognition · Computer Science 2023-08-25 Roy Miles , Krystian Mikolajczyk

Convolutional neural networks have achieved great success in various vision tasks; however, they incur heavy resource costs. By using deeper and wider networks, network accuracy can be improved rapidly. However, in an environment with…

Computer Vision and Pattern Recognition · Computer Science 2018-11-01 Yunho Jeon , Junmo Kim

Deep neural networks are increasingly used on mobile devices, where computational resources are limited. In this paper we develop CondenseNet, a novel network architecture with unprecedented efficiency. It combines dense connectivity with a…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Gao Huang , Shichen Liu , Laurens van der Maaten , Kilian Q. Weinberger

Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized…

Computer Vision and Pattern Recognition · Computer Science 2020-09-07 Brandon Yang , Gabriel Bender , Quoc V. Le , Jiquan Ngiam

Trainable layers such as convolutional building blocks are the standard network design choices by learning parameters to capture the global context through successive spatial operations. When designing an efficient network, trainable layers…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Dongyoon Han , YoungJoon Yoo , Beomyoung Kim , Byeongho Heo

Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many computer vision tasks over the years. However, this comes at the cost of heavy computation and memory intensive network designs, suggesting potential…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Kumara Kahatapitiya , Ranga Rodrigo

Depthwise separable convolutions are a fundamental component in efficient Deep Neural Networks, as they reduce the number of parameters and operations compared to traditional convolutions while maintaining comparable accuracy. However,…

Machine Learning · Computer Science 2024-06-19 Francesco Daghero , Alessio Burrello , Massimo Poncino , Enrico Macii , Daniele Jahier Pagliari

We show how fitting sparse linear models over learned deep feature representations can lead to more debuggable neural networks. These networks remain highly accurate while also being more amenable to human interpretation, as we demonstrate…

Machine Learning · Computer Science 2021-05-12 Eric Wong , Shibani Santurkar , Aleksander Mądry

Transposed convolution is crucial for generating high-resolution outputs, yet has received little attention compared to convolution layers. In this work we revisit transposed convolution and introduce a novel layer that allows us to place…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Stefano B. Blumberg , Daniele Raví , Mou-Cheng Xu , Matteo Figini , Iasonas Kokkinos , Daniel C. Alexander

Deep convolutional networks are well-known for their high computational and memory demands. Given limited resources, how does one design a network that balances its size, training time, and prediction accuracy? A surprisingly effective…

Computer Vision and Pattern Recognition · Computer Science 2017-02-22 Soravit Changpinyo , Mark Sandler , Andrey Zhmoginov

We introduce a new kind of linear transform named Deformable Butterfly (DeBut) that generalizes the conventional butterfly matrices and can be adapted to various input-output dimensions. It inherits the fine-to-coarse-grained learnable…

Computer Vision and Pattern Recognition · Computer Science 2022-03-28 Rui Lin , Jie Ran , King Hung Chiu , Graziano Chesi , Ngai Wong

The driving force behind deep networks is their ability to compactly represent rich classes of functions. The primary notion for formally reasoning about this phenomenon is expressive efficiency, which refers to a situation where one…

Machine Learning · Computer Science 2018-02-14 Nadav Cohen , Ronen Tamari , Amnon Shashua

Linear recurrent neural networks enable powerful long-range sequence modeling with constant memory usage and time-per-token during inference. These architectures hold promise for streaming applications at the edge, but deployment in…

Machine Learning · Computer Science 2025-08-14 Alessandro Pierro , Steven Abreu , Jonathan Timcheck , Philipp Stratmann , Andreas Wild , Sumit Bam Shrestha

Dense prediction tasks typically employ encoder-decoder architectures, but the prevalent convolutions in the decoder are not image-adaptive and can lead to boundary artifacts. Different generalized convolution operations have been…

Computer Vision and Pattern Recognition · Computer Science 2019-09-10 Anne S. Wannenwetsch , Martin Kiefel , Peter V. Gehler , Stefan Roth

A convolutional layer in a Convolutional Neural Network (CNN) consists of many filters which apply convolution operation to the input, capture some special patterns and pass the result to the next layer. If the same patterns also occur at…

Computer Vision and Pattern Recognition · Computer Science 2019-02-04 Okan Köpüklü , Maryam Babaee , Stefan Hörmann , Gerhard Rigoll

Depthwise separable convolutional (DSConv) layers have been successfully applied to deep learning (DL)-based joint source-channel coding (JSCC) schemes to reduce computational complexity. However, a systematic investigation of the layerwise…

Image and Video Processing · Electrical Eng. & Systems 2026-04-27 Ming Ye , Kui Cai , Cunhua Pan , Zhen Mei , Wanting Yang , Chunguo Li
‹ Prev 1 2 3 10 Next ›