Related papers: NN-PARS: A Parallelized Neural Network Based Circu…

CSM-NN: Current Source Model Based Logic Circuit Simulation -- A Neural Network Approach

The miniaturization of transistors down to 5nm and beyond, plus the increasing complexity of integrated circuits, significantly aggravate short channel effects, and demand analysis and optimization of more design corners and modes.…

Machine Learning · Computer Science 2020-02-14 Mohammad Saeed Abrishami , Massoud Pedram , Shahin Nazarian

Artificial Neural Network for Performance Modeling and Optimization of CMOS Analog Circuits

This paper presents an implementation of multilayer feed forward neural networks (NN) to optimize CMOS analog circuits. For modeling and design recently neural network computational modules have got acceptance as an unorthodox and useful…

Neural and Evolutionary Computing · Computer Science 2012-12-13 Mriganka Chakraborty

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction

With the growing model size, deep neural networks (DNN) are increasingly trained over massive GPU accelerators, which demands a proper parallelization plan that transforms a DNN model into fine-grained tasks and then schedules them to GPUs…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-24 Zhiqi Lin , Youshan Miao , Guodong Liu , Xiaoxiang Shi , Quanlu Zhang , Fan Yang , Saeed Maleki , Yi Zhu , Xu Cao , Cheng Li , Mao Yang , Lintao Zhang , Lidong Zhou

Implementation of Tensor Network Simulation TN-Sim under NWQ-Sim

Large-scale tensor network simulations are crucial for developing robust complexity-theoretic bounds on classical quantum simulation, enabling circuit cutting approaches, and optimizing circuit compilation, all of which aid efficient…

Quantum Physics · Physics 2026-01-09 Aaron C. Hoyt , Jonathan S. Bersson , Sean Garner , Chenxu Liu , Ang Li

PARSIR: a Package for Effective Parallel Discrete Event Simulation on Multi-processor Machines

In this article we present PARSIR (PARallel SImulation Runner), a package that enables the effective exploitation of shared-memory multi-processor machines for running discrete event simulation models. PARSIR is a compile/run-time…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-02 Francesco Quaglia

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Linear recurrent neural networks enable powerful long-range sequence modeling with constant memory usage and time-per-token during inference. These architectures hold promise for streaming applications at the edge, but deployment in…

Machine Learning · Computer Science 2025-08-14 Alessandro Pierro , Steven Abreu , Jonathan Timcheck , Philipp Stratmann , Andreas Wild , Sumit Bam Shrestha

Parallelizable Neural Turing Machines

We introduce a parallelizable simplification of Neural Turing Machine (NTM), referred to as P-NTM, which redesigns the core operations of the original architecture to enable efficient scan-based parallel execution. We evaluate the proposed…

Neural and Evolutionary Computing · Computer Science 2026-02-24 Gabriel Faria , Arnaldo Candido Junior

Sparse Spiking Neural-like Membrane Systems on Graphics Processing Units

The parallel simulation of Spiking Neural P systems is mainly based on a matrix representation, where the graph inherent to the neural model is encoded in an adjacency matrix. The simulation algorithm is based on a matrix-vector…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-09 Javier Hernández-Tello , Miguel Ángel Martínez-del-Amor , David Orellana-Martín , Francis George C. Cabarle

Neural Network Accelerated Process Design of Polycrystalline Microstructures

Computational experiments are exploited in finding a well-designed processing path to optimize material structures for desired properties. This requires understanding the interplay between the processing-(micro)structure-property linkages…

Computational Engineering, Finance, and Science · Computer Science 2023-05-04 Junrong Lin , Mahmudul Hasan , Pinar Acar , Jose Blanchet , Vahid Tarokh

Parareal Neural Networks Emulating a Parallel-in-time Algorithm

As deep neural networks (DNNs) become deeper, the training time increases. In this perspective, multi-GPU parallel computing has become a key tool in accelerating the training of DNNs. In this paper, we introduce a novel methodology to…

Numerical Analysis · Mathematics 2024-07-08 Chang-Ock Lee , Youngkyu Lee , Jongho Park

Single stream parallelization of generalized LSTM-like RNNs on a GPU

Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training procedure. Parallelization of the training…

Neural and Evolutionary Computing · Computer Science 2015-11-25 Kyuyeon Hwang , Wonyong Sung

GSR-GNN: Training Acceleration and Memory-Saving Framework of Deep GNNs on Circuit Graph

Graph Neural Networks (GNNs) show strong promise for circuit analysis, but scaling to modern large-scale circuit graphs is limited by GPU memory and training cost, especially for deep models. We revisit deep GNNs for circuit graphs and show…

Machine Learning · Computer Science 2026-03-31 Yuebo Luo , Shiyang Li , Yifei Feng , Vishal Kancharla , Shaoyi Huang , Caiwen Ding

Physics-Informed Neural Networks for Device and Circuit Modeling: A Case Study of NeuroSPICE

We present NeuroSPICE, a physics-informed neural network (PINN) framework for device and circuit simulation. Unlike conventional SPICE, which relies on time-discretized numerical solvers, NeuroSPICE leverages PINNs to solve circuit…

Artificial Intelligence · Computer Science 2026-04-28 Chien-Ting Tung , Chenming Hu

Parallel-in-Time Nonlinear Optimal Control via GPU-native Sequential Convex Programming

Real-time trajectory optimization for nonlinear constrained autonomous systems is critical and typically performed by CPU-based sequential solvers. Specifically, reliance on global sparse linear algebra or the serial nature of dynamic…

Robotics · Computer Science 2026-03-13 Yilin Zou , Zhong Zhang , Maxime Robic , Fanghua Jiang

Block-Sparse Recurrent Neural Networks

Recurrent Neural Networks (RNNs) are used in state-of-the-art models in domains such as speech recognition, machine translation, and language modelling. Sparsity is a technique to reduce compute and memory requirements of deep learning…

Machine Learning · Computer Science 2017-11-09 Sharan Narang , Eric Undersander , Gregory Diamos

Scalable Parameter Design for Superconducting Quantum Circuits with Graph Neural Networks

To demonstrate supremacy of quantum computing, increasingly large-scale superconducting quantum computing chips are being designed and fabricated. However, the complexity of simulating quantum systems poses a significant challenge to…

Quantum Physics · Physics 2025-07-25 Hao Ai , Yu-xi Liu

NASS: Optimizing Secure Inference via Neural Architecture Search

Due to increasing privacy concerns, neural network (NN) based secure inference (SI) schemes that simultaneously hide the client inputs and server models attract major research interests. While existing works focused on developing secure…

Cryptography and Security · Computer Science 2020-02-18 Song Bian , Weiwen Jiang , Qing Lu , Yiyu Shi , Takashi Sato

nuGPR: GPU-Accelerated Gaussian Process Regression with Iterative Algorithms and Low-Rank Approximations

Gaussian Process Regression (GPR) is an important type of supervised machine learning model with inherent uncertainty measure in its predictions. We propose a new framework, nuGPR, to address the well-known challenge of high computation…

Machine Learning · Computer Science 2025-10-15 Ziqi Zhao , Vivek Sarin

HEP-BNN: A Framework for Finding Low-Latency Execution Configurations of BNNs on Heterogeneous Multiprocessor Platforms

Binarized Neural Networks (BNNs) significantly reduce the computation and memory demands with binarized weights and activations compared to full-precision NNs. Executing a layer in a BNN on different devices of a heterogeneous…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-13 Leonard David Bereholschi , Ching-Chi Lin , Mikail Yayla , Jian-Jia Chen

Narses: A Scalable Flow-Based Network Simulator

Most popular, modern network simulators, such as ns, are targeted towards simulating low-level protocol details. These existing simulators are not intended for simulating large distributed applications with many hosts and many concurrent…

Performance · Computer Science 2007-05-23 TJ Giuli , Mary Baker