Related papers: Software-Level Accuracy Using Stochastic Computing…

Non-Ideal Program-Time Conservation in Charge Trap Flash for Deep Learning

Training deep neural networks (DNNs) is computationally intensive but arrays of non-volatile memories like Charge Trap Flash (CTF) can accelerate DNN operations using in-memory computing. Specifically, the Resistive Processing Unit (RPU)…

Neural and Evolutionary Computing · Computer Science 2023-09-12 Shalini Shrivastava , Vivek Saraswat , Gayatri Dash , Samyak Chakrabarty , Udayan Ganguly

System-level Impact of Non-Ideal Program-Time of Charge Trap Flash (CTF) on Deep Neural Network

Learning of deep neural networks (DNN) using Resistive Processing Unit (RPU) architecture is energy-efficient as it utilizes dedicated neuromorphic hardware and stochastic computation of weight updates for in-memory computing. Charge Trap…

Neural and Evolutionary Computing · Computer Science 2024-02-16 S. Shrivastava , A. Biswas , S. Chakrabarty , G. Dash , V. Saraswat , U. Ganguly

Mixed-precision deep learning based on computational memory

Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally…

Emerging Technologies · Computer Science 2020-05-13 S. R. Nandakumar , Manuel Le Gallo , Christophe Piveteau , Vinay Joshi , Giovanni Mariani , Irem Boybat , Geethan Karunaratne , Riduan Khaddam-Aljameh , Urs Egger , Anastasios Petropoulos , Theodore Antonakopoulos , Bipin Rajendran , Abu Sebastian , Evangelos Eleftheriou

Mixed-precision training of deep neural networks using computational memory

Deep neural networks have revolutionized the field of machine learning by providing unprecedented human-like performance in solving many real-world problems such as image and speech recognition. Training of large DNNs, however, is a…

Emerging Technologies · Computer Science 2017-12-05 Nandakumar S. R. , Manuel Le Gallo , Irem Boybat , Bipin Rajendran , Abu Sebastian , Evangelos Eleftheriou

Accurate deep neural network inference using computational phase-change memory

In-memory computing is a promising non-von Neumann approach for making energy-efficient deep learning inference hardware. Crossbar arrays of resistive memory devices can be used to encode the network weights and perform efficient analog…

Emerging Technologies · Computer Science 2020-05-19 Vinay Joshi , Manuel Le Gallo , Simon Haefeli , Irem Boybat , S. R. Nandakumar , Christophe Piveteau , Martino Dazzi , Bipin Rajendran , Abu Sebastian , Evangelos Eleftheriou

Ultra-low Energy charge trap flash based synapse enabled by parasitic leakage mitigation

Brain-inspired computation promises complex cognitive tasks at biological energy efficiencies. The brain contains $10^4$ synapses per neuron. Hence, ultra-low energy, high-density synapses are needed for spiking neural networks (SNN). In…

Emerging Technologies · Computer Science 2020-12-22 Shalini Shrivastava , Tanmay Chavan , Udayan Ganguly

CAP-RAM: A Charge-Domain In-Memory Computing 6T-SRAM for Accurate and Precision-Programmable CNN Inference

A compact, accurate, and bitwidth-programmable in-memory computing (IMC) static random-access memory (SRAM) macro, named CAP-RAM, is presented for energy-efficient convolutional neural network (CNN) inference. It leverages a novel…

Hardware Architecture · Computer Science 2021-07-07 Zhiyu Chen , Zhanghao Yu , Qing Jin , Yan He , Jingyu Wang , Sheng Lin , Dai Li , Yanzhi Wang , Kaiyuan Yang

CFTrack: Enhancing Lightweight Visual Tracking through Contrastive Learning and Feature Matching

Achieving both efficiency and strong discriminative ability in lightweight visual tracking is a challenge, especially on mobile and edge devices with limited computational resources. Conventional lightweight trackers often struggle with…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Juntao Liang , Jun Hou , Weijun Zhang , Yong Wang

Training LSTM Networks with Resistive Cross-Point Devices

In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional…

Machine Learning · Computer Science 2023-02-17 Tayfun Gokmen , Malte Rasch , Wilfried Haensch

Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator

Due to their growing popularity and computational cost, deep neural networks (DNNs) are being targeted for hardware acceleration. A popular architecture for DNN acceleration, adopted by the Google Tensor Processing Unit (TPU), utilizes a…

Machine Learning · Computer Science 2018-02-20 Jeff Zhang , Tianyu Gu , Kanad Basu , Siddharth Garg

Space-Time Adaptive Processing Using Random Matrix Theory Under Limited Training Samples

Space-time adaptive processing (STAP) is one of the most effective approaches to suppressing ground clutters in airborne radar systems. It basically takes two forms, i.e., full-dimension STAP (FD-STAP) and reduced-dimension STAP (RD-STAP).…

Information Theory · Computer Science 2022-02-11 Di Song , Shengyao Chen , Feng Xi , Zhong Liu

Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization

Transformers have achieved remarkable success in sequence modeling and beyond but suffer from quadratic computational and memory complexities with respect to the length of the input sequence. Leveraging techniques include sparse and linear…

Machine Learning · Computer Science 2022-08-02 Tan Nguyen , Richard G. Baraniuk , Robert M. Kirby , Stanley J. Osher , Bao Wang

Learning a Low-Rank Feature Representation: Achieving Better Trade-Off between Stability and Plasticity in Continual Learning

In continual learning, networks confront a trade-off between stability and plasticity when trained on a sequence of tasks. To bolster plasticity without sacrificing stability, we propose a novel training algorithm called LRFR. This approach…

Machine Learning · Computer Science 2023-12-15 Zhenrong Liu , Yang Li , Yi Gong , Yik-Chung Wu

Artificial Synapse based on ULTRARAM Memory Device for Neuromorphic Applications

The memory demands of large-scale deep neural networks (DNNs) require synaptic weight values to be stored and updated in off-chip memory like dynamic random-access memory, which reduces energy efficiency and increases training time.…

Applied Physics · Physics 2025-10-08 Abhishek Kumar , Peter D. Hodgson , Manus Hayne , Avirup Dasgupta

Efficient and Fault-Tolerant Memristive Neural Networks with In-Situ Training

Neuromorphic architectures, which incorporate parallel and in-memory processing, are crucial for accelerating artificial neural network (ANN) computations. This work presents a novel memristor-based multi-layer neural network (memristive…

Emerging Technologies · Computer Science 2025-07-29 Santlal Prajapat , Manobendra Nath Mondal , Susmita Sur-Kolay

Learning to be Reproducible: Custom Loss Design for Robust Neural Networks

To enhance the reproducibility and reliability of deep learning models, we address a critical gap in current training methodologies: the lack of mechanisms that ensure consistent and robust performance across runs. Our empirical analysis…

Machine Learning · Computer Science 2026-01-05 Waqas Ahmed , Sheeba Samuel , Kevin Coakley , Birgitta Koenig-Ries , Odd Erik Gundersen

Stuck-at Faults in ReRAM Neuromorphic Circuit Array and their Correction through Machine Learning

In this paper, we study the inference accuracy of the Resistive Random Access Memory (ReRAM) neuromorphic circuit due to stuck-at faults (stuck-on, stuck-off, and stuck at a certain resistive value). A simulation framework using Python is…

Hardware Architecture · Computer Science 2024-08-16 Vedant Sawal , Hiu Yung Wong

Representable Matrices: Enabling High Accuracy Analog Computation for Inference of DNNs using Memristors

Analog computing based on memristor technology is a promising solution to accelerating the inference phase of deep neural networks (DNNs). A fundamental problem is to map an arbitrary matrix to a memristor crossbar array (MCA) while…

Emerging Technologies · Computer Science 2019-11-28 Baogang Zhang , Necati Uysal , Deliang Fan , Rickard Ewetz

MXFormer: A Microscaling Floating-Point Charge-Trap Transistor Compute-in-Memory Transformer Accelerator

The proliferation of Transformer models is often constrained by the significant computational and memory bandwidth demands of deployment. To address this, we present MXFormer, a novel, hybrid, weight-stationary Compute-in-Memory (CIM)…

Hardware Architecture · Computer Science 2026-02-16 George Karfakis , Samyak Chakrabarty , Vinod Kurian Jacob , Siyun Qiao , Subramanian S. Iyer , Sudhakar Pamarti , Puneet Gupta

Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI

Recent research demonstrated the promise of using resistive random access memory (ReRAM) as an emerging technology to perform inherently parallel analog domain in-situ matrix-vector multiplication -- the intensive and key computation in…

Machine Learning · Computer Science 2021-06-22 Geng Yuan , Zhiheng Liao , Xiaolong Ma , Yuxuan Cai , Zhenglun Kong , Xuan Shen , Jingyan Fu , Zhengang Li , Chengming Zhang , Hongwu Peng , Ning Liu , Ao Ren , Jinhui Wang , Yanzhi Wang