Related papers: Transform-Based Feature Map Compression for CNN In…

Feature Map Transform Coding for Energy-Efficient CNN Inference

Convolutional neural networks (CNNs) achieve state-of-the-art accuracy in a variety of tasks in computer vision and beyond. One of the major obstacles hindering the ubiquitous use of CNNs for inference on low-power edge devices is their…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Brian Chmiel , Chaim Baskin , Ron Banner , Evgenii Zheltonozhskii , Yevgeny Yermolin , Alex Karbachevsky , Alex M. Bronstein , Avi Mendelson

Memory-Efficient CNN Accelerator Based on Interlayer Feature Map Compression

Existing deep convolutional neural networks (CNNs) generate massive interlayer feature data during network inference. To maintain real-time processing in embedded systems, large on-chip memory is required to buffer the interlayer feature…

Hardware Architecture · Computer Science 2021-10-13 Zhuang Shao , Xiaoliang Chen , Li Du , Lei Chen , Yuan Du , Wei Zhuang , Huadong Wei , Chenjia Xie , Zhongfeng Wang

DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture

We propose DepthTCM, a physics-aware end-to-end framework for depth map compression. In our framework of DepthTCM, the high-bit depth map is first converted to a conventional 3-channel image representation losslessly using a method inspired…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Young-Seo Chang , Yatong An , Jae-Sang Hyun

Wavelet Feature Maps Compression for Image-to-Image CNNs

Convolutional Neural Networks (CNNs) are known for requiring extensive computational resources, and quantization is among the best and most common methods for compressing them. While aggressive quantization (i.e., less than 4-bits) performs…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Shahaf E. Finder , Yair Zohav , Maor Ashkenazi , Eran Treister

Joint Matrix Decomposition for Deep Convolutional Neural Networks Compression

Deep convolutional neural networks (CNNs) with a large number of parameters require intensive computational resources, and thus are hard to be deployed in resource-constrained platforms. Decomposition-based methods, therefore, have been…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Shaowu Chen , Jiahao Zhou , Weize Sun , Lei Huang

Frequency-Aware Re-Parameterization for Over-Fitting Based Image Compression

Over-fitting-based image compression requires weights compactness for compression and fast convergence for practical use, posing challenges for deep convolutional neural networks (CNNs) based methods. This paper presents a simple…

Image and Video Processing · Electrical Eng. & Systems 2023-10-13 Yun Ye , Yanjie Pan , Qually Jiang , Ming Lu , Xiaoran Fang , Beryl Xu

CAT: Compression-Aware Training for bandwidth reduction

Convolutional neural networks (CNNs) have become the dominant neural network architecture for solving visual processing tasks. One of the major obstacles hindering the ubiquitous use of CNNs for inference is their relatively high memory…

Computer Vision and Pattern Recognition · Computer Science 2019-09-26 Chaim Baskin , Brian Chmiel , Evgenii Zheltonozhskii , Ron Banner , Alex M. Bronstein , Avi Mendelson

Tensor Reordering for CNN Compression

We show how parameter redundancy in Convolutional Neural Network (CNN) filters can be effectively reduced by pruning in spectral domain. Specifically, the representation extracted via Discrete Cosine Transform (DCT) is more conducive for…

Machine Learning · Computer Science 2020-10-26 Matej Ulicny , Vladimir A. Krylov , Rozenn Dahyot

Harmonic Convolutional Networks based on Discrete Cosine Transform

Convolutional neural networks (CNNs) learn filters in order to capture local correlation patterns in feature space. We propose to learn these filters as combinations of preset spectral filters defined by the Discrete Cosine Transform (DCT).…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Matej Ulicny , Vladimir A. Krylov , Rozenn Dahyot

MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression

Deep neural networks (DNNs) have become the state-of-the-art technique for machine learning tasks in various applications. However, due to their size and the computational complexity, large DNNs are not readily deployable on edge devices in…

Machine Learning · Computer Science 2018-05-31 Lazar Supic , Rawan Naous , Ranko Sredojevic , Aleksandra Faust , Vladimir Stojanovic

Focused Quantization for Sparse CNNs

Deep convolutional neural networks (CNNs) are powerful tools for a wide range of vision tasks, but the enormous amount of memory and compute resources required by CNNs pose a challenge in deploying them on constrained devices. Existing…

Machine Learning · Computer Science 2019-10-30 Yiren Zhao , Xitong Gao , Daniel Bates , Robert Mullins , Cheng-Zhong Xu

ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image Compression

Over the last few years, neural image compression has gained wide attention from research and industry, yielding promising end-to-end deep neural codecs outperforming their conventional counterparts in rate-distortion performance. Despite…

Computer Vision and Pattern Recognition · Computer Science 2023-07-14 Ahmed Ghorbel , Wassim Hamidouche , Luce Morin

Discrete Cosine Transform Network for Guided Depth Map Super-Resolution

Guided depth super-resolution (GDSR) is an essential topic in multi-modal image processing, which reconstructs high-resolution (HR) depth maps from low-resolution ones collected with suboptimal conditions with the help of HR RGB images of…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Zixiang Zhao , Jiangshe Zhang , Shuang Xu , Zudi Lin , Hanspeter Pfister

Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression

Compressing convolutional neural networks (CNNs) has received ever-increasing research focus. However, most existing CNN compression methods do not interpret their inherent structures to distinguish the implicit redundancy. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Yuchao Li , Shaohui Lin , Baochang Zhang , Jianzhuang Liu , David Doermann , Yongjian Wu , Feiyue Huang , Rongrong Ji

Transform Quantization for CNN (Convolutional Neural Network) Compression

In this paper, we compress convolutional neural network (CNN) weights post-training via transform quantization. Previous CNN quantization techniques tend to ignore the joint statistics of weights and activations, producing sub-optimal CNN…

Computer Vision and Pattern Recognition · Computer Science 2021-11-09 Sean I. Young , Wang Zhe , David Taubman , Bernd Girod

Efficient Neural Networks with Discrete Cosine Transform Activations

In this paper, we extend our previous work on the Expressive Neural Network (ENN), a multilayer perceptron with adaptive activation functions parametrized using the Discrete Cosine Transform (DCT). Building upon previous work that…

Machine Learning · Computer Science 2025-11-06 Marc Martinez-Gost , Sara Pepe , Ana Pérez-Neira , Miguel Ángel Lagunas

Compression strategies and space-conscious representations for deep neural networks

Recent advances in deep learning have made available large, powerful convolutional neural networks (CNN) with state-of-the-art performance in several real-world applications. Unfortunately, these large-sized models have millions of…

Machine Learning · Computer Science 2020-07-17 Giosuè Cataldo Marinò , Gregorio Ghidoli , Marco Frasca , Dario Malchiodi

DNN Feature Map Compression using Learned Representation over GF(2)

In this paper, we introduce a method to compress intermediate feature maps of deep neural networks (DNNs) to decrease memory storage and bandwidth requirements during inference. Unlike previous works, the proposed method is based on…

Computer Vision and Pattern Recognition · Computer Science 2018-08-17 Denis A. Gudovskiy , Alec Hodgkinson , Luca Rigazio

Discrete Cosine Transform Based Decorrelated Attention for Vision Transformers

Self-attention is central to the success of Transformer architectures; however, learning the query, key, and value projections from random initialization remains challenging and computationally expensive. In this paper, we propose two…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Hongyi Pan , Emadeldeen Hamdan , Xin Zhu , Ahmet Enis Cetin , Ulas Bagci

Compressing complex convolutional neural network based on an improved deep compression algorithm

Although convolutional neural network (CNN) has made great progress, large redundant parameters restrict its deployment on embedded devices, especially mobile devices. The recent compression works are focused on real-value convolutional…

Computer Vision and Pattern Recognition · Computer Science 2019-03-07 Jiasong Wu , Hongshan Ren , Youyong Kong , Chunfeng Yang , Lotfi Senhadji , Huazhong Shu