Related papers: Accelerating Training using Tensor Decomposition

Accelerating the Low-Rank Decomposed Models

Tensor decomposition is a mathematically supported technique for data compression. It consists of applying some kind of a Low Rank Decomposition technique on the tensors or matrices in order to reduce the redundancy of the data. However, it…

Machine Learning · Computer Science 2025-05-27 Habib Hajimolahoseini, Walid Ahmed, Austin Wen, Yang Liu

Tensor Yard: One-Shot Algorithm of Hardware-Friendly Tensor-Train Decomposition for Convolutional Neural Networks

Nowadays Deep Learning became widely used in many economic, technical and scientific areas of human interest. It is clear that efficiency of solutions based on Deep Neural Networks should consider not only quality metric for the target…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Anuar Taskynov, Vladimir Korviakov, Ivan Mazurenko, Yepan Xiong

Applying Tensor Decomposition to image for Robustness against Adversarial Attack

Nowadays the deep learning technology is growing faster and shows dramatic performance in computer vision areas. However, it turns out a deep learning based model is highly vulnerable to some small perturbation called an adversarial attack.…

Computer Vision and Pattern Recognition · Computer Science 2020-03-06 Seungju Cho, Tae Joon Jun, Mingu Kang, Daeyoung Kim

Training and Inference within 1 Second -- Tackle Cross-Sensor Degradation of Real-World Pansharpening with Efficient Residual Feature Tailoring

Deep learning methods for pansharpening have advanced rapidly, yet models pretrained on data from a specific sensor often generalize poorly to data from other sensors. Existing methods to tackle such cross-sensor degradation include…

Computer Vision and Pattern Recognition · Computer Science 2025-11-21 Tianyu Xin, Jin-Liang Xiao, Zeyu Xia, Shan Yin, Liang-Jian Deng

Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization

Low Rank Decomposition (LRD) is a model compression technique applied to the weight tensors of deep learning models in order to reduce the number of trainable parameters and computational complexity. However, due to high number of new…

Machine Learning · Computer Science 2025-05-27 Habib Hajimolahoseini, Walid Ahmed, Yang Liu

Learning to Learn to Compress

In this paper we present an end-to-end meta-learned system for image compression. Traditional machine learning based approaches to image compression train one or more neural network for generalization performance. However, at inference…

Image and Video Processing · Electrical Eng. & Systems 2021-05-04 Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Miska Hannuksela, Emre Aksu, Esa Rahtu

Tensor Decomposition for Model Reduction in Neural Networks: A Review

Modern neural networks have revolutionized the fields of computer vision (CV) and Natural Language Processing (NLP). They are widely used for solving complex CV tasks and NLP tasks such as image classification, image generation, and machine…

Machine Learning · Computer Science 2023-07-25 Xingyi Liu, Keshab K. Parhi

Convolutional neural networks compression with low rank and sparse tensor decompositions

Convolutional neural networks show outstanding results in a variety of computer vision tasks. However, a neural network architecture design usually faces a trade-off between model performance and computational/memory complexity. For some…

Computer Vision and Pattern Recognition · Computer Science 2020-06-12 Pavel Kaloshin

Same accuracy, twice as fast: continuous training surpasses retraining from scratch

Continual learning aims to enable models to adapt to new datasets without losing performance on previously learned data, often assuming that prior data is no longer available. However, in many practical scenarios, both old and new data are…

Machine Learning · Computer Science 2025-03-03 Eli Verwimp, Guy Hacohen, Tinne Tuytelaars

Reducing the Training Time of Neural Networks by Partitioning

This paper presents a new method for pre-training neural networks that can decrease the total training time for a neural network while maintaining the final performance, which motivates its use on deep neural networks. By partitioning the…

Neural and Evolutionary Computing · Computer Science 2016-01-05 Conrado S. Miranda, Fernando J. Von Zuben

Convolutional neural networks with low-rank regularization

Large CNNs have delivered impressive performance in various computer vision applications. But the storage and computation requirements make it problematic for deploying these models on mobile devices. Recently, tensor decompositions have…

Machine Learning · Computer Science 2016-02-16 Cheng Tai, Tong Xiao, Yi Zhang, Xiaogang Wang, Weinan E

Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition

We address the computational barrier of deploying advanced deep learning segmentation models in clinical settings by studying the efficacy of network compression through tensor decomposition. We propose a post-training Tucker factorization…

Image and Video Processing · Electrical Eng. & Systems 2024-04-19 Tobias Weber, Jakob Dexl, David Rügamer, Michael Ingrisch

Speeding up Resnet Architecture with Layers Targeted Low Rank Decomposition

Compression of a neural network can help in speeding up both the training and the inference of the network. In this research, we study applying compression using low rank decomposition on network layers. Our research demonstrates that to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-25 Walid Ahmed, Habib Hajimolahoseini, Austin Wen, Yang Liu

Speeding up Deep Model Training by Sharing Weights and Then Unsharing

We propose a simple and efficient approach for training the BERT model. Our approach exploits the special structure of BERT that contains a stack of repeated modules (i.e., transformer encoders). Our proposed approach first trains BERT with…

Machine Learning · Computer Science 2021-10-11 Shuo Yang, Le Hou, Xiaodan Song, Qiang Liu, Denny Zhou

Matrix and tensor decompositions for training binary neural networks

This paper is on improving the training of binary neural networks in which both activations and weights are binary. While prior methods for neural network binarization binarize each filter independently, we propose to instead parametrize…

Computer Vision and Pattern Recognition · Computer Science 2019-04-17 Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

Accelerating the Training of Video Super-Resolution Models

Despite that convolution neural networks (CNN) have recently demonstrated high-quality reconstruction for video super-resolution (VSR), efficiently training competitive VSR models remains a challenging problem. It usually takes an order of…

Computer Vision and Pattern Recognition · Computer Science 2022-05-18 Lijian Lin, Xintao Wang, Zhongang Qi, Ying Shan

Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization

Tensor decomposition is one of the fundamental technique for model compression of deep convolution neural networks owing to its ability to reveal the latent relations among complex structures. However, most existing methods compress the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Bo-Shiuan Chu, Che-Rung Lee

Efficient Training with Denoised Neural Weights

Good weight initialization serves as an effective measure to reduce the training cost of a deep neural network (DNN) model. The choice of how to initialize parameters is challenging and may require manual tuning, which can be time-consuming…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Yifan Gong, Zheng Zhan, Yanyu Li, Yerlan Idelbayev, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren

On-FPGA Training with Ultra Memory Reduction: A Low-Precision Tensor Method

Various hardware accelerators have been developed for energy-efficient and real-time inference of neural networks on edge devices. However, most training is done on high-performance GPUs or servers, and the huge memory and computing costs…

Hardware Architecture · Computer Science 2021-04-21 Kaiqi Zhang, Cole Hawkins, Xiyuan Zhang, Cong Hao, Zheng Zhang

On Tensor Train Rank Minimization: Statistical Efficiency and Scalable Algorithm

Tensor train (TT) decomposition provides a space-efficient representation for higher-order tensors. Despite its advantage, we face two crucial limitations when we apply the TT decomposition to machine learning problems: the lack of…

Machine Learning · Statistics 2017-08-03 Masaaki Imaizumi, Takanori Maehara, Kohei Hayashi