English

Enabling Unstructured Sparse Acceleration on Structured Sparse Accelerators

Machine Learning 2025-05-27 v3 Artificial Intelligence Hardware Architecture

Abstract

Exploiting sparsity in deep neural networks (DNNs) has been a promising area for meeting the growing computation requirements. To minimize the overhead of sparse acceleration, hardware designers have proposed structured sparsity support, but it provides limited flexibility and requires extra model fine-tuning. Moreover, any sparse model fine-tuned for certain structured sparse HW cannot be accelerated by other structured hardware. To enable acceleration using unstructured sparsity of DNNs on structured sparse hardware, we propose an approximation method leveraging the distributive property in linear algebra to turn any sparse tensor into a series of structured sparse tensors. We also develop a software framework, TASDER, to apply high-quality structured approximation on weights and activations of DNNs. Our method accelerates dense and sparse DNNs without fine-tuning and improves energy-delay-product (EDP) by up to 83% and 74%. It achieves up to 39% speed-up on a real system.

Keywords

Cite

@article{arxiv.2403.07953,
  title  = {Enabling Unstructured Sparse Acceleration on Structured Sparse Accelerators},
  author = {Geonhwa Jeong and Po-An Tsai and Abhimanyu R. Bambhaniya and Stephen W. Keckler and Tushar Krishna},
  journal= {arXiv preprint arXiv:2403.07953},
  year   = {2025}
}

Comments

This paper is accepted to MLSys 2025

R2 v1 2026-06-28T15:17:46.101Z