Related papers: THC: Accelerating Distributed Deep Learning Using …

Accelerating Distributed Deep Learning using Lossless Homomorphic Compression

As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalability of distributed training systems.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-13 Haoyu Li , Yuchen Xu , Jiayi Chen , Rohit Dwivedula , Wenfei Wu , Keqiang He , Aditya Akella , Daehyeok Kim

Hybrid Tensor Decomposition in Neural Network Compression

Deep neural networks (DNNs) have enabled impressive breakthroughs in various artificial intelligence (AI) applications recently due to its capability of learning high-level features from big data. However, the current demand of DNNs for…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Bijiao Wu , Dingheng Wang , Guangshe Zhao , Lei Deng , Guoqi Li

Homomorphic Parameter Compression for Distributed Deep Learning Training

Distributed training of deep neural networks has received significant research interest, and its major approaches include implementations on multiple GPUs and clusters. Parallelization can dramatically improve the efficiency of training…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-29 Jaehee Jang , Byungook Na , Sungroh Yoon

A New Clustering-Based Technique for the Acceleration of Deep Convolutional Networks

Deep learning and especially the use of Deep Neural Networks (DNNs) provides impressive results in various regression and classification tasks. However, to achieve these results, there is a high demand for computing and storing resources.…

Machine Learning · Computer Science 2021-07-21 Erion-Vasilis Pikoulis , Christos Mavrokefalidis , Aris S. Lalos

HyperTune: Dynamic Hyperparameter Tuning For Efficient Distribution of DNN Training Over Heterogeneous Systems

Distributed training is a novel approach to accelerate Deep Neural Networks (DNN) training, but common training libraries fall short of addressing the distributed cases with heterogeneous processors or the cases where the processing nodes…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-17 Ali HeydariGorji , Siavash Rezaei , Mahdi Torabzadehkashi , Hossein Bobarshad , Vladimir Alves , Pai H. Chou

A Targeted Acceleration and Compression Framework for Low bit Neural Networks

1 bit deep neural networks (DNNs), of which both the activations and weights are binarized , are attracting more and more attention due to their high computational efficiency and low memory requirement . However, the drawback of large…

Computer Vision and Pattern Recognition · Computer Science 2019-07-12 Biao Qian , Yang Wang

Tensor Hypercontraction Error Correction Using Regression

Wavefunction-based quantum methods are some of the most accurate tools for predicting and analyzing the electronic structure of molecules, in particular for accounting for dynamical electron correlation. However, most methods of including…

Chemical Physics · Physics 2026-03-02 Ishna Satyarth , Eric C. Larson , Devin A. Matthews

Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression

The rapid growth in computing demands, particularly driven by artificial intelligence applications, has begun to exceed the capabilities of traditional electronic hardware. Optical computing offers a promising alternative due to its…

Hardware Architecture · Computer Science 2025-07-24 Shupeng Ning , Hanqing Zhu , Chenghao Feng , Jiaqi Gu , David Z. Pan , Ray T. Chen

THDC: Training Hyperdimensional Computing Models with Backpropagation

Hyperdimensional computing (HDC) offers lightweight learning for energy-constrained devices by encoding data into high-dimensional vectors. However, its reliance on ultra-high dimensionality and static, randomly initialized hypervectors…

Machine Learning · Computer Science 2026-02-03 Hanne Dejonghe , Sam Leroux

Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial…

Artificial Intelligence · Computer Science 2021-09-27 Isaac J. Sledge , Jose C. Principe

AC/DC: Alternating Compressed/DeCompressed Training of Deep Neural Networks

The increasing computational requirements of deep neural networks (DNNs) have led to significant interest in obtaining DNN models that are sparse, yet accurate. Recent work has investigated the even harder case of sparse training, where the…

Machine Learning · Computer Science 2021-12-16 Alexandra Peste , Eugenia Iofinova , Adrian Vladu , Dan Alistarh

Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges.…

Machine Learning · Computer Science 2020-05-12 Miao Yin , Siyu Liao , Xiao-Yang Liu , Xiaodong Wang , Bo Yuan

Channel-wise Feature Decorrelation for Enhanced Learned Image Compression

The emerging Learned Compression (LC) replaces the traditional codec modules with Deep Neural Networks (DNN), which are trained end-to-end for rate-distortion performance. This approach is considered as the future of image/video…

Image and Video Processing · Electrical Eng. & Systems 2024-07-08 Farhad Pakdaman , Moncef Gabbouj

TAGC: Optimizing Gradient Communication in Distributed Transformer Training

The increasing complexity of large language models (LLMs) necessitates efficient training strategies to mitigate the high computational costs associated with distributed training. A significant bottleneck in this process is gradient…

Machine Learning · Computer Science 2025-04-09 Igor Polyakov , Alexey Dukhanov , Egor Spirin

ThumbNet: One Thumbnail Image Contains All You Need for Recognition

Although deep convolutional neural networks (CNNs) have achieved great success in computer vision tasks, its real-world application is still impeded by its voracious demand of computational resources. Current works mostly seek to compress…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Chen Zhao , Bernard Ghanem

Learned Gradient Compression for Distributed Deep Learning

Training deep neural networks on large datasets containing high-dimensional data requires a large amount of computation. A solution to this problem is data-parallel distributed training, where a model is replicated into several…

Machine Learning · Computer Science 2021-03-18 Lusine Abrahamyan , Yiming Chen , Giannis Bekoulis , Nikos Deligiannis

Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Compression and Interpretability of Deep Neural Networks via Tucker Tensor Layer: From First Principles to Tensor Valued Back-Propagation

This work aims to help resolve the two main stumbling blocks in the application of Deep Neural Networks (DNNs), that is, the exceedingly large number of trainable parameters and their physical interpretability. This is achieved through a…

Machine Learning · Computer Science 2020-01-07 Giuseppe G. Calvi , Ahmad Moniri , Mahmoud Mahfouz , Qibin Zhao , Danilo P. Mandic

"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to…

Machine Learning · Statistics 2024-03-04 Lingyu Gu , Yongqi Du , Yuan Zhang , Di Xie , Shiliang Pu , Robert C. Qiu , Zhenyu Liao

TaxoNN: A Light-Weight Accelerator for Deep Neural Network Training

Emerging intelligent embedded devices rely on Deep Neural Networks (DNNs) to be able to interact with the real-world environment. This interaction comes with the ability to retrain DNNs, since environmental conditions change continuously in…

Hardware Architecture · Computer Science 2020-10-13 Reza Hojabr , Kamyar Givaki , Kossar Pourahmadi , Parsa Nooralinejad , Ahmad Khonsari , Dara Rahmati , M. Hassan Najafi