Related papers: Software-Level Accuracy Using Stochastic Computing…

200 papers

Training deep neural networks (DNNs) is computationally intensive but arrays of non-volatile memories like Charge Trap Flash (CTF) can accelerate DNN operations using in-memory computing. Specifically, the Resistive Processing Unit (RPU)…

Neural and Evolutionary Computing · Computer Science 2023-09-12 Shalini Shrivastava, Vivek Saraswat, Gayatri Dash, Samyak Chakrabarty, Udayan Ganguly

Learning of deep neural networks (DNN) using Resistive Processing Unit (RPU) architecture is energy-efficient as it utilizes dedicated neuromorphic hardware and stochastic computation of weight updates for in-memory computing. Charge Trap…

Neural and Evolutionary Computing · Computer Science 2024-02-16 S. Shrivastava, A. Biswas, S. Chakrabarty, G. Dash, V. Saraswat, U. Ganguly

Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally…

Deep neural networks have revolutionized the field of machine learning by providing unprecedented human-like performance in solving many real-world problems such as image and speech recognition. Training of large DNNs, however, is a…

Emerging Technologies · Computer Science 2017-12-05 Nandakumar S. R., Manuel Le Gallo, Irem Boybat, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

In-memory computing is a promising non-von Neumann approach for making energy-efficient deep learning inference hardware. Crossbar arrays of resistive memory devices can be used to encode the network weights and perform efficient analog…

Brain-inspired computation promises complex cognitive tasks at biological energy efficiencies. The brain contains $10^4$ synapses per neuron. Hence, ultra-low energy, high-density synapses are needed for spiking neural networks (SNN). In…

Emerging Technologies · Computer Science 2020-12-22 Shalini Shrivastava, Tanmay Chavan, Udayan Ganguly

A compact, accurate, and bitwidth-programmable in-memory computing (IMC) static random-access memory (SRAM) macro, named CAP-RAM, is presented for energy-efficient convolutional neural network (CNN) inference. It leverages a novel…

Hardware Architecture · Computer Science 2021-07-07 Zhiyu Chen, Zhanghao Yu, Qing Jin, Yan He, Jingyu Wang, Sheng Lin, Dai Li, Yanzhi Wang, Kaiyuan Yang

Achieving both efficiency and strong discriminative ability in lightweight visual tracking is a challenge, especially on mobile and edge devices with limited computational resources. Conventional lightweight trackers often struggle with…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Juntao Liang, Jun Hou, Weijun Zhang, Yong Wang

In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional…

Machine Learning · Computer Science 2023-02-17 Tayfun Gokmen, Malte Rasch, Wilfried Haensch

Due to their growing popularity and computational cost, deep neural networks (DNNs) are being targeted for hardware acceleration. A popular architecture for DNN acceleration, adopted by the Google Tensor Processing Unit (TPU), utilizes a…

Machine Learning · Computer Science 2018-02-20 Jeff Zhang, Tianyu Gu, Kanad Basu, Siddharth Garg

Space-time adaptive processing (STAP) is one of the most effective approaches to suppressing ground clutter in airborne radar systems. It takes two forms: full-dimension STAP (FD-STAP) and reduced-dimension STAP (RD-STAP).…

Information Theory · Computer Science 2022-02-11 Di Song, Shengyao Chen, Feng Xi, Zhong Liu

Transformers have achieved remarkable success in sequence modeling and beyond but suffer from quadratic computational and memory complexities with respect to the length of the input sequence. Leveraging techniques include sparse and linear…

Machine Learning · Computer Science 2022-08-02 Tan Nguyen, Richard G. Baraniuk, Robert M. Kirby, Stanley J. Osher, Bao Wang

In continual learning, networks confront a trade-off between stability and plasticity when trained on a sequence of tasks. To bolster plasticity without sacrificing stability, we propose a novel training algorithm called LRFR. This approach…

Machine Learning · Computer Science 2023-12-15 Zhenrong Liu, Yang Li, Yi Gong, Yik-Chung Wu

The memory demands of large-scale deep neural networks (DNNs) require synaptic weight values to be stored and updated in off-chip memory like dynamic random-access memory, which reduces energy efficiency and increases training time.…

Applied Physics · Physics 2025-10-08 Abhishek Kumar, Peter D. Hodgson, Manus Hayne, Avirup Dasgupta

Neuromorphic architectures, which incorporate parallel and in-memory processing, are crucial for accelerating artificial neural network (ANN) computations. This work presents a novel memristor-based multi-layer neural network (memristive…

Emerging Technologies · Computer Science 2025-07-29 Santlal Prajapat, Manobendra Nath Mondal, Susmita Sur-Kolay

To enhance the reproducibility and reliability of deep learning models, we address a critical gap in current training methodologies: the lack of mechanisms that ensure consistent and robust performance across runs. Our empirical analysis…

Machine Learning · Computer Science 2026-01-05 Waqas Ahmed, Sheeba Samuel, Kevin Coakley, Birgitta Koenig-Ries, Odd Erik Gundersen

In this paper, we study the inference accuracy of the Resistive Random Access Memory (ReRAM) neuromorphic circuit due to stuck-at faults (stuck-on, stuck-off, and stuck at a certain resistive value). A simulation framework using Python is…

Hardware Architecture · Computer Science 2024-08-16 Vedant Sawal, Hiu Yung Wong

Analog computing based on memristor technology is a promising solution to accelerating the inference phase of deep neural networks (DNNs). A fundamental problem is to map an arbitrary matrix to a memristor crossbar array (MCA) while…

Emerging Technologies · Computer Science 2019-11-28 Baogang Zhang, Necati Uysal, Deliang Fan, Rickard Ewetz

The proliferation of Transformer models is often constrained by the significant computational and memory bandwidth demands of deployment. To address this, we present MXFormer, a novel, hybrid, weight-stationary Compute-in-Memory (CIM)…

Hardware Architecture · Computer Science 2026-02-16 George Karfakis, Samyak Chakrabarty, Vinod Kurian Jacob, Siyun Qiao, Subramanian S. Iyer, Sudhakar Pamarti, Puneet Gupta

Recent research demonstrated the promise of using resistive random access memory (ReRAM) as an emerging technology to perform inherently parallel analog domain in-situ matrix-vector multiplication -- the intensive and key computation in…
