Related papers: Parallelizable Neural Turing Machines

Implementing Neural Turing Machines

Neural Turing Machines (NTMs) are an instance of Memory Augmented Neural Networks, a new class of recurrent neural networks which decouple computation from memory by introducing an external memory unit. NTMs have demonstrated superior…

Machine Learning · Computer Science 2018-08-21 Mark Collier, Joeran Beel

Single stream parallelization of generalized LSTM-like RNNs on a GPU

Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training procedure. Parallelization of the training…

Neural and Evolutionary Computing · Computer Science 2015-11-25 Kyuyeon Hwang, Wonyong Sung

Structured Memory for Neural Turing Machines

Neural Turing Machines (NTM) contain memory component that simulates "working memory" in the brain to store and retrieve information to ease simple algorithms learning. So far, only linearly organized memory is proposed, and during…

Artificial Intelligence · Computer Science 2015-10-27 Wei Zhang, Yang Yu, Bowen Zhou

Neural GPUs Learn Algorithms

Learning an algorithm from examples is a fundamental problem that has been widely studied. Recently it has been addressed using neural networks, in particular by Neural Turing Machines (NTMs). These are fully differentiable computers that…

Machine Learning · Computer Science 2016-03-16 Łukasz Kaiser, Ilya Sutskever

Towards Work-Efficient Parallel Parameterized Algorithms

Parallel parameterized complexity theory studies how fixed-parameter tractable (fpt) problems can be solved in parallel. Previous theoretical work focused on parallel algorithms that are very fast in principle, but did not take into account…

Data Structures and Algorithms · Computer Science 2019-02-21 Max Bannach, Malte Skambath, Till Tantau

A provably stable neural network Turing Machine

We introduce a neural stack architecture, including a differentiable parametrized stack operator that approximates stack push and pop operations for suitable choices of parameters that explicitly represents a stack. We prove the stability…

Machine Learning · Computer Science 2022-09-20 John Stogin, Ankur Mali, C Lee Giles

Parallel, Self Organizing, Consensus Neural Networks

A new neural network architecture (PSCNN) is developed to improve performance and speed of such networks. The architecture has all the advantages of the previous models such as self-organization and possesses some other superior…

Neural and Evolutionary Computing · Computer Science 2020-08-06 Homayoun Valafar, Faramarz Valafar, Okan Ersoy

Deep Neural Machine Translation with Weakly-Recurrent Units

Recurrent neural networks (RNNs) have represented for years the state of the art in neural machine translation. Recently, new architectures have been proposed, which can leverage parallel computation on GPUs better than classical RNNs.…

Computation and Language · Computer Science 2018-05-14 Mattia Antonino Di Gangi, Marcello Federico

Effective Parallelisation for Machine Learning

We present a novel parallelisation scheme that simplifies the adaptation of learning algorithms to growing amounts of data as well as growing needs for accurate and confident predictions in critical applications. In contrast to other…

Machine Learning · Computer Science 2018-10-09 Michael Kamp, Mario Boley, Olana Missura, Thomas Gärtner

Neural Turing Machines

We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is…

Neural and Evolutionary Computing · Computer Science 2014-12-11 Alex Graves, Greg Wayne, Ivo Danihelka

Learning Numeracy: Binary Arithmetic with Neural Turing Machines

One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their recurrent connections. Neural Turing Machines (NTMs) attempt to mitigate this issue by…

Neural and Evolutionary Computing · Computer Science 2024-12-20 Jacopo Castellini

Learning Operations on a Stack with Neural Turing Machines

Multiple extensions of Recurrent Neural Networks (RNNs) have been proposed recently to address the difficulty of storing information over long time periods. In this paper, we experiment with the capacity of Neural Turing Machines (NTMs) to…

Machine Learning · Computer Science 2016-12-05 Tristan Deleu, Joseph Dureau

Neural Field Turing Machine: A Differentiable Spatial Computer

We introduce the Neural Field Turing Machine (NFTM), a differentiable architecture that unifies symbolic computation, physical simulation, and perceptual inference within continuous spatial fields. NFTM combines a neural controller,…

Neural and Evolutionary Computing · Computer Science 2025-09-04 Akash Malhotra, Nacéra Seghouani

Efficient Parallelization of Message Passing Neural Network Potentials for Large-scale Molecular Dynamics

Machine learning potentials have achieved great success in accelerating atomistic simulations. Many of them relying on atom-centered local descriptors are natural for parallelization. More recent message passing neural network (MPNN) models…

Chemical Physics · Physics 2025-06-10 Junfan Xia, Bin Jiang

Parallelizing Legendre Memory Unit Training

Recently, a new recurrent neural network (RNN) named the Legendre Memory Unit (LMU) was proposed and shown to achieve state-of-the-art performance on several benchmark datasets. Here we leverage the linear time-invariant (LTI) memory…

Machine Learning · Computer Science 2021-05-12 Narsimha Chilkuri, Chris Eliasmith

Parallel Recursive LSTM

Transformers have become the dominant architecture for sequence modeling by using self-attention to enable expressive and highly parallel processing. However, the resulting quadratic time and memory costs limit efficiency in long-context…

Machine Learning · Computer Science 2026-05-19 Tristan Gaudreault, Yongyi Mao

Parallel Scheduling Self-attention Mechanism: Generalization and Optimization

Over the past few years, self-attention is shining in the field of deep learning, especially in the domain of natural language processing (NLP). Its impressive effectiveness, along with ubiquitous implementations, have aroused our interest…

Machine Learning · Computer Science 2020-12-03 Mingfei Yu, Masahiro Fujita

An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks

Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation…

Machine Learning · Computer Science 2019-12-02 Julia El Zini, Yara Rizk, Mariette Awad

Neural Turing Machines: Convergence of Copy Tasks

The architecture of neural Turing machines is differentiable end to end and is trainable with gradient descent methods. Due to their large unfolded depth Neural Turing Machines are hard to train and because of their linear access of…

Neural and Evolutionary Computing · Computer Science 2016-12-08 Janez Aleš

TNT: Improving Chunkwise Training for Test-Time Memorization

Recurrent neural networks (RNNs) with deep test-time memorization modules, such as Titans and TTT, represent a promising, linearly-scaling paradigm distinct from Transformers. While these expressive models do not yet match the peak…

Machine Learning · Computer Science 2025-11-11 Zeman Li, Ali Behrouz, Yuan Deng, Peilin Zhong, Praneeth Kacham, Mahdi Karami, Meisam Razaviyayn, Vahab Mirrokni