Related papers: Accelerator-Oriented Algorithm Transformation for …

Towards Chip-on-Chip Neuroscience: Fast Mining of Frequent Episodes Using Graphics Processors

Computational neuroscience is being revolutionized with the advent of multi-electrode arrays that provide real-time, dynamic perspectives into brain function. Mining event streams from these chips is critical to understanding the firing…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-05-15 Yong Cao, Debprakash Patnaik, Sean Ponce, Jeremy Archuleta, Patrick Butler, Wu-chun Feng, Naren Ramakrishnan

Inferring Neuronal Network Connectivity from Spike Data: A Temporal Datamining Approach

Understanding the functioning of a neural system in terms of its underlying circuitry is an important problem in neuroscience. Recent developments in electrophysiology and imaging allow one to simultaneously record activities of hundreds of…

Databases · Computer Science 2008-03-11 Debprakash Patnaik, P. S. Sastry, K. P. Unnikrishnan

Inferring Neuronal Network Connectivity using Time-constrained Episodes

Discovering frequent episodes in event sequences is an interesting data mining task. In this paper, we argue that this framework is very effective for analyzing multi-neuronal spike train data. Analyzing spike train data is an important…

Databases · Computer Science 2008-03-10 Debprakash Patnaik, P. S. Sastry, K. P. Unnikrishnan

Efficient Discovery of Large Synchronous Events in Neural Spike Streams

We address the problem of finding patterns from multi-neuronal spike trains that give us insights into the multi-neuronal codes used in the brain and help us design better brain-computer interfaces. We focus on the synchronous firings of…

Neural and Evolutionary Computing · Computer Science 2010-06-09 Raajay Viswanathan, P. S. Sastry, K. P. Unnikrishnan

PAGANI: A Parallel Adaptive GPU Algorithm for Numerical Integration

We present a new adaptive parallel algorithm for the challenging problem of multi-dimensional numerical integration on massively parallel architectures. Adaptive algorithms have demonstrated the best performance, but efficient many-core…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-24 Ioannis Sakiotis, Kamesh Arumugam, Marc Paterno, Desh Ranjan, Balša Terzić, Mohammad Zubair

Streaming Algorithms for Pattern Discovery over Dynamically Changing Event Sequences

Discovering frequent episodes over event sequences is an important data mining task. In many applications, events constituting the data sequence arrive as a stream, at furious rates, and recent trends (or frequent episodes) can change and…

Machine Learning · Computer Science 2012-05-22 Debprakash Patnaik, Naren Ramakrishnan, Srivatsan Laxman, Badrish Chandramouli

Discovering general partial orders in event streams

Frequent episode discovery is a popular framework for pattern discovery in event streams. An episode is a partially ordered set of nodes with each node associated with an event type. Efficient (and separate) algorithms exist for episode…

Artificial Intelligence · Computer Science 2009-12-11 Avinash Achar, Srivatsan Laxman, Raajay Viswanathan, P. S. Sastry

Temporal data mining for root-cause analysis of machine faults in automotive assembly lines

Engine assembly is a complex and heavily automated distributed-control process, with large amounts of fault data logged every day. We describe an application of temporal data mining for analyzing fault logs in an engine assembly plant.…

Machine Learning · Computer Science 2009-04-30 Srivatsan Laxman, Basel Shadid, P. S. Sastry, K. P. Unnikrishnan

Towards Scalable GPU-Accelerated SNN Training via Temporal Fusion

Drawing on the intricate structures of the brain, Spiking Neural Networks (SNNs) emerge as a transformative development in artificial intelligence, closely emulating the complex dynamics of biological neural networks. While SNNs show…

Artificial Intelligence · Computer Science 2024-08-02 Yanchen Li, Jiachun Li, Kebin Sun, Luziwei Leng, Ran Cheng

Arbitrarily large iterative tomographic reconstruction on multiple GPUs using the TIGRE toolbox

Tomographic image sizes keep increasing over time, and while the GPUs that compute the tomographic reconstruction are also increasing in memory size, they are not doing so fast enough to reconstruct the largest datasets. This problem is…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-05-10 Ander Biguri, Reuben Lindroos, Robert Bryll, Hossein Towsyfyan, Hans Deyhle, Richard Boardman, Mark Mavrogordato, Manjit Dosanjh, Steven Hancock, Thomas Blumensath

A unified view of Automata-based algorithms for Frequent Episode Discovery

The Frequent Episode Discovery framework is a popular framework in Temporal Data Mining with many applications. Over the years, many different notions of episode frequency have been proposed, along with different algorithms for episode…

Artificial Intelligence · Computer Science 2010-07-06 Avinash Achar, Srivatsan Laxman, P. S. Sastry

Accelerating Exact and Approximate Inference for (Distributed) Discrete Optimization with GPUs

Discrete optimization is a central problem in artificial intelligence. The optimization of the aggregated cost of a network of cost functions arises in a variety of problems including (W)CSP, DCOP, as well as optimization in stochastic…

Artificial Intelligence · Computer Science 2018-01-12 Ferdinando Fioretto, Enrico Pontelli, William Yeoh, Rina Dechter

Exploring the Limits of GPUs With Parallel Graph Algorithms

In this paper, we explore the limits of graphics processors (GPUs) for general purpose parallel computing by studying problems that require highly irregular data access patterns: parallel graph algorithms for list ranking and connected…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-02-25 Frank Dehne, Kumanan Yogaratnam

Manycore processing of repeated range queries over massive moving objects observations

The ability to process significant amounts of continuously updated spatial data in a timely manner is mandatory for an increasing number of applications. Parallelism enables such applications to face this data-intensive challenge and allows the devised…

Databases · Computer Science 2014-11-13 Francesco Lettich, Salvatore Orlando, Claudio Silvestri, Christian S. Jensen

Parallelizing non-linear sequential models over the sequence length

Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought…

Machine Learning · Computer Science 2024-01-17 Yi Heng Lim, Qi Zhu, Joshua Selfridge, Muhammad Firmansyah Kasim

User-friendly Foundation Model Adapters for Multivariate Time Series Classification

Foundation models, while highly effective, are often resource-intensive, requiring substantial inference time and memory. This paper addresses the challenge of making these models more accessible with limited computational resources by…

Machine Learning · Computer Science 2024-09-20 Vasilii Feofanov, Romain Ilbert, Malik Tiomoko, Themis Palpanas, Ievgen Redko

Optimizing Performance of Recurrent Neural Networks on GPUs

As recurrent neural networks become larger and deeper, training times for single networks are rising into weeks or even months. As such, there is a significant incentive to improve the performance and scalability of these networks. While…

Machine Learning · Computer Science 2016-04-08 Jeremy Appleyard, Tomas Kocisky, Phil Blunsom

Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

Transformer models have achieved state-of-the-art performance across various application domains and have gradually become the foundation of advanced large deep learning (DL) models. However, how to train these models over multiple GPUs…

Machine Learning · Computer Science 2022-11-28 Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi, Xiaonan Nie, Hailin Zhang, Bin Cui

Scaling Behaviors of Evolutionary Algorithms on GPUs: When Does Parallelism Pay Off?

Evolutionary algorithms (EAs) are increasingly implemented on graphics processing units (GPUs) to leverage parallel processing capabilities for enhanced efficiency. However, existing studies largely emphasize the raw speedup obtained by…

Neural and Evolutionary Computing · Computer Science 2026-01-28 Xinmeng Yu, Tao Jiang, Ran Cheng, Yaochu Jin, Kay Chen Tan

GPU-based Split algorithm for Large-Scale CVRPSD

Dynamic programming (DP) is a cornerstone of combinatorial optimization, yet its inherently sequential structure has long limited its scalability in scenario-based stochastic programming (SP). This paper introduces a GPU-accelerated…

Optimization and Control · Mathematics 2025-11-25 Jingyi Zhao, Linxin Yang, Haohua Zhang, Tian Ding