Related papers: Parallelized Spatiotemporal Binding

Probabilistic Synchronous Parallel

Most machine learning and deep neural network algorithms rely on certain iterative algorithms to optimise their utility/cost functions, e.g. Stochastic Gradient Descent. In distributed learning, the networked nodes have to work…

Distributed, Parallel, and Cluster Computing · Computer Science · 2017-10-06 · Liang Wang, Ben Catterall, Richard Mortier

Parallelizing non-linear sequential models over the sequence length

Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought…

Machine Learning · Computer Science · 2024-01-17 · Yi Heng Lim, Qi Zhu, Joshua Selfridge, Muhammad Firmansyah Kasim

Parallel Spatio-Temporal Attention-Based TCN for Multivariate Time Series Prediction

As industrial systems become more complex and monitoring sensors for everything from surveillance to our health become more ubiquitous, multivariate time series prediction is taking an important place in the smooth-running of our society. A…

Machine Learning · Computer Science · 2022-03-03 · Fan Jin, Ke Zhang, Yipan Huang, Yifei Zhu, Baiping Chen

ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models

Recurrent Neural Networks (RNNs) laid the foundation for sequence modeling, but their intrinsic sequential nature restricts parallel computation, creating a fundamental barrier to scaling. This has led to the dominance of parallelizable…

Machine Learning · Computer Science · 2025-11-04 · Federico Danieli, Pau Rodriguez, Miguel Sarabia, Xavier Suau, Luca Zappella

STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks

Spiking Neural Networks (SNNs), as one of the algorithmic models in neuromorphic computing, have gained a great deal of research attention owing to temporal information processing capability, low power consumption, and high biological…

Neural and Evolutionary Computing · Computer Science · 2023-06-07 · Chengting Yu, Zheming Gu, Da Li, Gaoang Wang, Aili Wang, Erping Li

Structured State Space Models for In-Context Reinforcement Learning

Structured state space sequence (S4) models have recently achieved state-of-the-art performance on long-range sequence modeling tasks. These models also have fast inference speeds and parallelisable training, making them potentially useful…

Machine Learning · Computer Science · 2023-11-27 · Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob Foerster, Satinder Singh, Feryal Behbahani

SpikeSTAG: Spatial-Temporal Forecasting via GNN-SNN Collaboration

Spiking neural networks (SNNs), inspired by the spiking behavior of biological neurons, offer a distinctive approach for capturing the complexities of temporal data. However, their potential for spatial modeling in multivariate time-series…

Machine Learning · Computer Science · 2025-08-19 · Bang Hu, Changze Lv, Mingjie Li, Yunpeng Liu, Xiaoqing Zheng, Fengzhe Zhang, Wei Cao, Fan Zhang

Elastic Bulk Synchronous Parallel Model for Distributed Deep Learning

The bulk synchronous parallel (BSP) is a celebrated synchronization model for general-purpose parallel computing that has successfully been employed for distributed training of machine learning models. A prevalent shortcoming of the BSP is…

Machine Learning · Computer Science · 2020-01-07 · Xing Zhao, Manos Papagelis, Aijun An, Bao Xin Chen, Junfeng Liu, Yonggang Hu

Scalable Spatiotemporal Graph Neural Networks

Neural forecasting of spatiotemporal time series drives both research and industrial innovation in several relevant application domains. Graph neural networks (GNNs) are often the core component of the forecasting architecture. However, in…

Machine Learning · Computer Science · 2023-02-21 · Andrea Cini, Ivan Marisca, Filippo Maria Bianchi, Cesare Alippi

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Deep Recurrent Neural Network architectures, though remarkably capable at modeling sequences, lack an intuitive high-level spatio-temporal structure. That is, while many problems in computer vision inherently have an underlying high-level…

Computer Vision and Pattern Recognition · Computer Science · 2016-04-12 · Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena

Scalable Spatiotemporal Inference with Biased Scan Attention Transformer Neural Processes

Neural Processes (NPs) are a rapidly evolving class of models designed to directly model the posterior predictive distribution of stochastic processes. While early architectures were developed primarily as a scalable alternative to Gaussian…

Machine Learning · Computer Science · 2026-04-28 · Daniel Jenson, Jhonathan Navott, Piotr Grynfelder, Mengyan Zhang, Makkunda Sharma, Elizaveta Semenova, Seth Flaxman

Block-State Transformers

State space models (SSMs) have shown impressive results on tasks that require modeling long-range dependencies and efficiently scale to long sequences owing to their subquadratic runtime complexity. Originally designed for continuous…

Computation and Language · Computer Science · 2023-10-31 · Mahan Fathi, Jonathan Pilault, Orhan Firat, Christopher Pal, Pierre-Luc Bacon, Ross Goroshin

Sequence Parallelism: Long Sequence Training from System Perspective

Transformer achieves promising results on various tasks. However, self-attention suffers from quadratic memory requirements with respect to the sequence length. Existing work focuses on reducing time and space complexity from an algorithm…

Machine Learning · Computer Science · 2022-05-24 · Shenggui Li, Fuzhao Xue, Chaitanya Baranwal, Yongbin Li, Yang You

Parallel, Self Organizing, Consensus Neural Networks

A new neural network architecture (PSCNN) is developed to improve performance and speed of such networks. The architecture has all the advantages of the previous models such as self-organization and possesses some other superior…

Neural and Evolutionary Computing · Computer Science · 2020-08-06 · Homayoun Valafar, Faramarz Valafar, Okan Ersoy

MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive Learning

Spatiotemporal predictive learning, which predicts future frames through historical prior knowledge with the aid of deep learning, is widely used in many fields. Previous work essentially improves the model performance by widening or…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-01 · Zhifeng Ma, Hao Zhang, Jie Liu

Parallel Recursive LSTM

Transformers have become the dominant architecture for sequence modeling by using self-attention to enable expressive and highly parallel processing. However, the resulting quadratic time and memory costs limit efficiency in long-context…

Machine Learning · Computer Science · 2026-05-19 · Tristan Gaudreault, Yongyi Mao

Parallelizable memory recurrent units

With the emergence of massively parallel processing units, parallelization has become a desirable property for new sequence models. The ability to parallelize the processing of sequences with respect to the sequence length during training…

Machine Learning · Computer Science · 2026-05-19 · Florent De Geeter, Gaspard Lambrechts, Damien Ernst, Guillaume Drion

Sequential-Parallel Duality in Prefix Scannable Models

Modern neural sequence models are designed to meet the dual mandate of parallelizable training and fast sequential inference. Recent developments have given rise to various models, such as Gated Linear Attention (GLA) and Mamba, that…

Machine Learning · Computer Science · 2026-03-12 · Morris Yau, Sharut Gupta, Valerie Engelmayer, Kazuki Irie, Stefanie Jegelka, Jacob Andreas

Enhancing Spatiotemporal Prediction Model using Modular Design and Beyond

Predictive learning uses a known state to generate a future state over a period of time. It is a challenging task to predict spatiotemporal sequence because the spatiotemporal sequence varies both in time and space. The mainstream method is…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-05 · Haoyu Pan, Hao Wu, Tan Yang

Parallel Long Short-Term Memory for Multi-stream Classification

Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome with respect to a sequence of inputs. Recurrent hidden cells…

Machine Learning · Computer Science · 2017-02-15 · Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato De Mori