Related papers: Activation Map Compression through Tensor Decompos…

Accelerating the Low-Rank Decomposed Models

Tensor decomposition is a mathematically supported technique for data compression. It consists of applying some kind of a Low Rank Decomposition technique on the tensors or matrices in order to reduce the redundancy of the data. However, it…

Machine Learning · Computer Science 2025-05-27 Habib Hajimolahoseini , Walid Ahmed , Austin Wen , Yang Liu

Comprehensive Design Space Exploration for Tensorized Neural Network Hardware Accelerators

High-order tensor decomposition has been widely adopted to obtain compact deep neural networks for edge deployment. However, existing studies focus primarily on its algorithmic advantages such as accuracy and compression ratio-while…

Hardware Architecture · Computer Science 2025-11-26 Jinsong Zhang , Minghe Li , Jiayi Tian , Jinming Lu , Zheng Zhang

Beyond Low-rank Decomposition: A Shortcut Approach for Efficient On-Device Learning

On-device learning has emerged as a promising direction for AI development, particularly because of its potential to reduce latency issues and mitigate privacy risks associated with device-server communication, while improving energy…

Machine Learning · Computer Science 2025-07-25 Le-Trung Nguyen , Ael Quelennec , Van-Tam Nguyen , Enzo Tartaglione

Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection

Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been…

Machine Learning · Computer Science 2025-09-23 Ali Aghababaei-Harandi , Massih-Reza Amini

LANCE: Low Rank Activation Compression for Efficient On-Device Continual Learning

On-device learning is essential for personalization, privacy, and long-term adaptation in resource-constrained environments. Achieving this requires efficient learning, both fine-tuning existing models and continually acquiring new tasks…

Machine Learning · Computer Science 2026-03-18 Marco Paul E. Apolinario , Kaushik Roy

Tensor Decompositions for Signal Processing Applications From Two-way to Multiway Component Analysis

The widespread use of multi-sensor technology and the emergence of big datasets has highlighted the limitations of standard flat-view matrix models and the necessity to move towards more versatile data analysis tools. We show that…

Numerical Analysis · Computer Science 2015-06-19 A. Cichocki , D. Mandic , A-H. Phan , C. Caiafa , G. Zhou , Q. Zhao , L. De Lathauwer

Less Memory Means smaller GPUs: Backpropagation with Compressed Activations

The ever-growing scale of deep neural networks (DNNs) has lead to an equally rapid growth in computational resource requirements. Many recent architectures, most prominently Large Language Models, have to be trained using supercomputers…

Machine Learning · Computer Science 2024-09-19 Daniel Barley , Holger Fröning

Training Deep Networks with Structured Layers by Matrix Backpropagation

Deep neural network architectures have recently produced excellent results in a variety of areas in artificial intelligence and visual recognition, well surpassing traditional shallow architectures trained using hand-designed features. The…

Computer Vision and Pattern Recognition · Computer Science 2016-04-15 Catalin Ionescu , Orestis Vantzos , Cristian Sminchisescu

Tensor Networks for Latent Variable Analysis. Part I: Algorithms for Tensor Train Decomposition

Decompositions of tensors into factor matrices, which interact through a core tensor, have found numerous applications in signal processing and machine learning. A more general tensor model which represents data as an ordered network of…

Numerical Analysis · Computer Science 2016-09-30 Anh-Huy Phan , Andrzej Cichocki , Andre Uschmajew , Petr Tichavsky , George Luta , Danilo Mandic

Tensor Decompositions in Deep Learning

The paper surveys the topic of tensor decompositions in modern machine learning applications. It focuses on three active research topics of significant relevance for the community. After a brief review of consolidated works on multi-way…

Machine Learning · Computer Science 2020-02-28 Davide Bacciu , Danilo P. Mandic

Inverted Activations: Reducing Memory Footprint in Neural Network Training

The scaling of neural networks with increasing data and model sizes necessitates the development of more efficient deep learning algorithms. A significant challenge in neural network training is the memory footprint associated with…

Machine Learning · Computer Science 2024-10-08 Georgii Novikov , Ivan Oseledets

Improving Memory Utilization in Convolutional Neural Network Accelerators

While the accuracy of convolutional neural networks has achieved vast improvements by introducing larger and deeper network architectures, also the memory footprint for storing their parameters and activations has increased. This trend…

Image and Video Processing · Electrical Eng. & Systems 2021-04-07 Petar Jokic , Stephane Emery , Luca Benini

Tensor decomposition to Compress Convolutional Layers in Deep Learning

Feature extraction for tensor data serves as an important step in many tasks such as anomaly detection, process monitoring, image classification, and quality control. Although many methods have been proposed for tensor feature extraction,…

Machine Learning · Computer Science 2021-06-01 Yinan Wang , Weihong "Grace" Guo , Xiaowei Yue

DeepMapping: Learned Data Mapping for Lossless Compression and Efficient Lookup

Storing tabular data to balance storage and query efficiency is a long-standing research question in the database community. In this work, we argue and show that a novel DeepMapping abstraction, which relies on the impressive memorization…

Databases · Computer Science 2024-09-27 Lixi Zhou , K. Selçuk Candan , Jia Zou

DecoHD: Decomposed Hyperdimensional Classification under Extreme Memory Budgets

Decomposition is a proven way to shrink deep networks without changing input-output dimensionality or interface semantics. We bring this idea to hyperdimensional computing (HDC), where footprint cuts usually shrink the feature axis and…

Machine Learning · Computer Science 2026-02-04 Sanggeon Yun , Hyunwoo Oh , Ryozo Masukawa , Mohsen Imani

BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge

Training on the Edge enables neural networks to learn continuously from new data after deployment on memory-constrained edge devices. Previous work is mostly concerned with reducing the number of model parameters which is only beneficial…

Machine Learning · Computer Science 2021-11-01 Abdelrahman Hosny , Marina Neseem , Sherief Reda

Supervised Compression for Resource-Constrained Edge Computing Systems

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Yoshitomo Matsubara , Ruihan Yang , Marco Levorato , Stephan Mandt

Accelerating Convolutional Neural Networks via Activation Map Compression

The deep learning revolution brought us an extensive array of neural network architectures that achieve state-of-the-art performance in a wide variety of Computer Vision tasks including among others, classification, detection and…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Georgios Georgiadis

CP-decomposition with Tensor Power Method for Convolutional Neural Networks Compression

Convolutional Neural Networks (CNNs) has shown a great success in many areas including complex image classification tasks. However, they need a lot of memory and computational cost, which hinders them from running in relatively low-end…

Machine Learning · Computer Science 2017-01-26 Marcella Astrid , Seung-Ik Lee

Low-Rank+Sparse Tensor Compression for Neural Networks

Low-rank tensor compression has been proposed as a promising approach to reduce the memory and compute requirements of neural networks for their deployment on edge devices. Tensor compression reduces the number of parameters required to…

Machine Learning · Computer Science 2021-11-03 Cole Hawkins , Haichuan Yang , Meng Li , Liangzhen Lai , Vikas Chandra