Related papers: STEP: A Distributed Multi-threading Framework Tow…

STEP: Structured Training and Evaluation Platform for benchmarking trajectory prediction models

While trajectory prediction plays a critical role in enabling safe and effective path-planning in automated vehicles, standardized practices for evaluating such models remain underdeveloped. Recent efforts have aimed to unify dataset…

Machine Learning · Computer Science 2025-09-19 Julian F. Schumann, Anna Mészáros, Jens Kober, Arkady Zgonnikov

Distributed Management of Massive Data: an Efficient Fine-Grain Data Access Scheme

This paper addresses the problem of efficiently storing and accessing massive data blocks in a large-scale distributed environment, while providing efficient fine-grain access to data subsets. This issue is crucial in the context of…

Distributed, Parallel, and Cluster Computing · Computer Science 2008-10-14 Bogdan Nicolae, Gabriel Antoniu, Luc Bougé

Hidden States as Early Signals: Step-level Trace Evaluation and Pruning for Efficient Test-Time Scaling

Large Language Models (LLMs) can enhance reasoning capabilities through test-time scaling by generating multiple traces. However, the combination of lengthy reasoning traces with multiple sampling introduces substantial computation and high…

Machine Learning · Computer Science 2026-04-29 Zhixiang Liang, Beichen Huang, Zheng Wang, Minjia Zhang

STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking

Spiking Transformers have recently emerged as promising architectures for combining the efficiency of spiking neural networks with the representational power of self-attention. However, the lack of standardized implementations, evaluation…

Neural and Evolutionary Computing · Computer Science 2025-12-24 Sicheng Shen, Dongcheng Zhao, Linghao Feng, Zeyang Yue, Jindong Li, Tenglong Li, Guobin Shen, Yi Zeng

Distributed Programming via Safe Closure Passing

Programming systems incorporating aspects of functional programming, e.g., higher-order functions, are becoming increasingly popular for large-scale distributed programming. New frameworks such as Apache Spark leverage functional techniques…

Programming Languages · Computer Science 2016-02-12 Philipp Haller, Heather Miller

Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies

Decision making in multi-agent systems (MAS) is a great challenge due to enormous state and joint action spaces as well as uncertainty, making centralized control generally infeasible. Decentralized control offers better scalability and…

Artificial Intelligence · Computer Science 2019-01-28 Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien

A Case for Stale Synchronous Distributed Model for Declarative Recursive Computation

A large class of traditional graph and data mining algorithms can be concisely expressed in Datalog, and other Logic-based languages, once aggregates are allowed in recursion. In fact, for most BigData algorithms, the difficult semantic…

Programming Languages · Computer Science 2019-07-25 Ariyam Das, Carlo Zaniolo

Intelligible Protocol Learning for Resource Allocation in 6G O-RAN Slicing

An adaptive standardized protocol is essential for addressing inter-slice resource contention and conflict in network slicing. Traditional protocol standardization is a cumbersome task that yields hardcoded predefined protocols, resulting…

Networking and Internet Architecture · Computer Science 2024-07-02 Farhad Rezazadeh, Hatim Chergui, Shuaib Siddiqui, Josep Mangues, Houbing Song, Walid Saad, Mehdi Bennis

STOMP: A Tool for Evaluation of Scheduling Policies in Heterogeneous Multi-Processors

The proliferation of heterogeneous chip multiprocessors in recent years has reached unprecedented levels. Traditional homogeneous platforms have shown fundamental limitations when it comes to enabling high-performance yet-ultra-low-power…

Hardware Architecture · Computer Science 2020-07-29 Augusto Vega, Aporva Amarnath, John-David Wellman, Hiwot Kassa, Subhankar Pal, Hubertus Franke, Alper Buyuktosunoglu, Ronald Dreslinski, Pradip Bose

Automating Distributed Tiered Storage Management in Cluster Computing

Data-intensive platforms such as Hadoop and Spark are routinely used to process massive amounts of data residing on distributed file systems like HDFS. Increasing memory sizes and new hardware technologies (e.g., NVRAM, SSDs) have recently…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-22 Herodotos Herodotou, Elena Kakoulli

Less Can Be More: Unsupervised Graph Pruning for Large-scale Dynamic Graphs

The prevalence of large-scale graphs poses great challenges in time and storage for training and deploying graph neural networks (GNNs). Several recent works have explored solutions for pruning the large original graph into a small and…

Machine Learning · Computer Science 2023-05-19 Jintang Li, Sheng Tian, Ruofan Wu, Liang Zhu, Welong Zhao, Changhua Meng, Liang Chen, Zibin Zheng, Hongzhi Yin

MESH: A Flexible Distributed Hypergraph Processing System

With the rapid growth of large online social networks, the ability to analyze large-scale social structure and behavior has become critically important, and this has led to the development of several scalable graph processing systems. In…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-05-14 Benjamin Heintz, Rankyung Hong, Shivangi Singh, Gaurav Khandelwal, Corey Tesdahl, Abhishek Chandra

DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters

The increasing complexity of deep neural networks (DNNs) has made it challenging to exploit existing large-scale data processing pipelines for handling massive data and parameters involved in DNN training. Distributed computing platforms…

Machine Learning · Computer Science 2016-10-04 Hanjoo Kim, Jaehong Park, Jaehee Jang, Sungroh Yoon

Splash: User-friendly Programming Interface for Parallelizing Stochastic Algorithms

Stochastic algorithms are efficient approaches to solving machine learning and optimization problems. In this paper, we propose a general framework called Splash for parallelizing stochastic algorithms on multi-node distributed systems.…

Machine Learning · Computer Science 2015-09-24 Yuchen Zhang, Michael I. Jordan

Primitives for Dynamic Big Model Parallelism

When training large machine learning models with many variables or parameters, a single machine is often inadequate since the model may be too large to fit in memory, while training can take a long time even with stochastic updates. A…

Machine Learning · Statistics 2014-06-19 Seunghak Lee, Jin Kyu Kim, Xun Zheng, Qirong Ho, Garth A. Gibson, Eric P. Xing

NEST: Network- and Memory-Aware Device Placement For Distributed Deep Learning

The growing scale of deep learning demands distributed training frameworks that jointly reason about parallelism, memory, and network topology. Prior works often rely on heuristic or topology-agnostic search, handling communication and…

Machine Learning · Computer Science 2026-05-26 Irene Wang, Vishnu Varma Venkata, Arvind Krishnamurthy, Divya Mahajan

Scalable data abstractions for distributed parallel computations

The ability to express a program as a hierarchical composition of parts is an essential tool in managing the complexity of software and a key abstraction this provides is to separate the representation of data from the computation. Many…

Programming Languages · Computer Science 2012-10-04 James Hanlon, Simon J. Hollis, David May

Distributed Estimation in Large Scale Wireless Sensor Networks via a Two Step Group-based Approach

We consider the problem of collaborative distributed estimation in a large scale sensor network with statistically dependent sensor observations. In a collaborative setup, the aim is to maximize the overall estimation performance by modeling…

Signal Processing · Electrical Eng. & Systems 2022-03-21 Shan Zhang, Pranay Sharma, Baocheng Geng, Pramod K. Varshney

Easy Acceleration with Distributed Arrays

High level programming languages and GPU accelerators are powerful enablers for a wide range of applications. Achieving scalable vertical (within a compute node), horizontal (across compute nodes), and temporal (over different generations…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-21 Jeremy Kepner, Chansup Byun, LaToya Anderson, William Arcand, David Bestor, William Bergeron, Alex Bonn, Daniel Burrill, Vijay Gadepally, Ryan Haney, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Charles Yee, Peter Michaleas

Efficient Approximate Temporal Triangle Counting in Streaming with Predictions

Triangle counting is a fundamental and widely studied problem on static graphs, and recently on temporal graphs, where edges carry information on the timings of the associated events. Streaming processing and resource efficiency are crucial…

Data Structures and Algorithms · Computer Science 2025-06-17 Giorgio Venturin, Ilie Sarpe, Fabio Vandin