English

Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory

Hardware Architecture 2025-12-23 v1 Emerging Technologies

Abstract

Compute-In-Memory (CIM) systems, particularly those utilizing ReRAM and memristive technologies, offer a promising path toward energy-efficient neural network computation. However, conventional quantization and compression techniques often fail to fully optimize performance and efficiency in these architectures. In this work, we present a structured quantization method that combines sensitivity analysis with mixed-precision strategies to enhance weight storage and computational performance on ReRAM-based CIM systems. Our approach improves ReRAM Crossbar utilization, significantly reducing power consumption, latency, and computational load, while maintaining high accuracy. Experimental results show 86.33% accuracy at 70% compression, alongside a 40% reduction in power consumption, demonstrating the method's effectiveness for power-constrained applications.

Keywords

Cite

@article{arxiv.2512.19445,
  title  = {Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory},
  author = {Guan-Cheng Chen and Chieh-Lin Tsai and Pei-Hsuan Tsai and Yuan-Hao Chang},
  journal= {arXiv preprint arXiv:2512.19445},
  year   = {2025}
}
R2 v1 2026-07-01T08:37:01.528Z