Related papers: DisTRaC: Accelerating High Performance Compute Pro…

Rethinking Storage Management for Data Processing Pipelines in Cloud Data Centers

Data processing frameworks such as Apache Beam and Apache Spark are used for a wide range of applications, from logs analysis to data preparation for DNN training. It is thus unsurprising that there has been a large amount of work on…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-07 Ubaid Ullah Hafeez , Martin Maas , Mustafa Uysal , Richard McDougall

Challenges and Considerations for Utilizing Burst Buffers in High-Performance Computing

As high-performance computing (HPC) moves into the exascale era, computer scientists and engineers must find innovative ways of transferring and processing unprecedented amounts of data. As the scale and complexity of the applications…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-09-30 Melissa Romanus , Robert B. Ross , Manish Parashar

Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning

Today, deep learning is an essential technology for our life. To solve more complex problems with deep learning, both sizes of training datasets and neural networks are increasing. To train a model with large datasets and networks,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-05 Takaaki Fukai , Kento Sato , Takahiro Hirofuchi

Conceptual and Technical Challenges for High Performance Computing

High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific and real life problems. The advent of multicore architectures is noticeable in the HPC history, because it has brought the underlying…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-07 Claude Tadonki

Periodic I/O scheduling for super-computers

With the ever-growing need of data in HPC applications, the congestion at the I/O level becomes critical in super-computers. Architectural enhancement such as burst-buffers and pre-fetching are added to machines, but are not sufficient to…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-02-23 Guillaume Aupy , Ana Gainaru , Valentin Le Fèvre

PystachIO: Efficient Distributed GPU Query Processing with PyTorch over Fast Networks & Fast Storage

The AI hardware boom has led modern data centers to adopt HPC-style architectures centered on distributed, GPU-centric computation. Large GPU clusters interconnected by fast RDMA networks and backed by high-bandwidth NVMe storage enable…

Databases · Computer Science 2026-05-21 Jigao Luo , Nils Boeschen , Muhammad El-Hindi , Carsten Binnig

High-throughput Execution of Hierarchical Analysis Pipelines on Hybrid Cluster Platforms

We propose, implement, and experimentally evaluate a runtime middleware to support high-throughput execution on hybrid cluster machines of large-scale analysis applications. A hybrid cluster machine consists of computation nodes which have…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-09-18 George Teodoro , Tony Pan , Tahsin M. Kurc , Jun Kong , Lee A. D. Cooper , Joel H. Saltz

Accelerating Fast Fourier Transforms Using Hadoop and CUDA

There has been considerable research into improving Fast Fourier Transform (FFT) performance through parallelization and optimization for specialized hardware. However, even with those advancements, processing of very large files, over 1TB…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-07-28 Rostislav Tsiomenko , Bradley S. Rees

Big Data Analytics on Traditional HPC Infrastructure Using Two-Level Storage

Data-intensive computing has become one of the major workloads on traditional high-performance computing (HPC) clusters. Currently, deploying data-intensive computing software framework on HPC clusters still faces performance and…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-13 Pengfei Xuan , Jeffrey Denton , Rong Ge , Pradip K. Srimani , Feng Luo

The Petascale DTN Project: High Performance Data Transfer for HPC Facilities

The movement of large-scale (tens of Terabytes and larger) data sets between high performance computing (HPC) facilities is an important and increasingly critical capability. A growing number of scientific collaborations rely on HPC…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-09-10 Eli Dart , William Allcock , Wahid Bhimji , Tim Boerner , Ravinderjeet Cheema , Andrew Cherry , Brent Draney , Salman Habib , Damian Hazen , Jason Hill , Matt Kollross , Suzanne Parete-Koon , Daniel Pelfrey , Adrian Pope , Jeff Porter , David Wheeler

GraphH: High Performance Big Graph Analytics in Small Clusters

It is common for real-world applications to analyze big graphs using distributed graph processing systems. Popular in-memory systems require an enormous amount of resources to handle big graphs. While several out-of-core approaches have…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-08-08 Peng Sun , Yonggang Wen , Ta Nguyen Binh Duong , Xiaokui Xiao

Exploring high performance distributed file storage using LDPC codes

We explore the feasibility of implementing a reliable, high performance, distributed storage system on a commodity computing cluster. Files are distributed across storage nodes using erasure coding with small Low-Density Parity-Check (LDPC)…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Benjamin Gaidioz , Birger Koblitz , Nuno Santos

Systems for Memory Disaggregation: Challenges & Opportunities

Memory disaggregation addresses memory imbalance in a cluster by decoupling CPU and memory allocations of applications while also increasing the effective memory capacity for (memory-intensive) applications beyond the local memory limit…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-07 Anil Yelam

Overview of Caching Mechanisms to Improve Hadoop Performance

Nowadays distributed computing environments, large amounts of data are generated from different resources with a high velocity, rendering the data difficult to capture, manage, and process within existing relational databases. Hadoop is a…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-24 Rana Ghazali , Douglas G. Down

Making Caches Work for Graph Analytics

Modern hardware systems are heavily underutilized when running large-scale graph applications. While many in-memory graph frameworks have made substantial progress in optimizing these applications, we show that it is still possible to…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-15 Yunming Zhang , Vladimir Kiriansky , Charith Mendis , Matei Zaharia , Saman Amarasinghe

Mitigating Shared Storage Congestion Using Control Theory

Efficient data access in High-Performance Computing (HPC) systems is essential to the performance of intensive computing tasks. Traditional optimizations of the I/O stack aim to improve peak performance but are often workload specific and…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Thomas Collignon , Kouds Halitim , Raphaël Bleuse , Sophie Cerf , Bogdan Robu , Éric Rutten , Lionel Seinturier , Alexandre van Kempen

CFS: A Distributed File System for Large Scale Container Platforms

We propose CFS, a distributed file system for large scale container platforms. CFS supports both sequential and random file accesses with optimized storage for both large files and small files, and adopts different replication protocols for…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-11 Haifeng Liu , Wei Ding , Yuan Chen , Weilong Guo , Shuoran Liu , Tianpeng Li , Mofei Zhang , Jianxing Zhao , Hongyin Zhu , Zhengyi Zhu

Software Resource Disaggregation for HPC with Serverless Computing

Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large pools of unused memory and increase the…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-29 Marcin Copik , Marcin Chrapek , Larissa Schmid , Alexandru Calotoiu , Torsten Hoefler

Building Block Components to Control a Data Rate in the Apache Hadoop Compute Platform

Resource management is one of the most indispensable components of cluster-level infrastructure layers. Users of such systems should be able to specify their job requirements as a configuration parameter (CPU, RAM, disk I/O, network I/O)…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-10-23 Tien Van Do , Binh T. Vu , Nam H. Do , Lóránt Farkas , Csaba Rotter , Tamás Tarjányi

Selecting Efficient Cluster Resources for Data Analytics: When and How to Allocate for In-Memory Processing?

Distributed dataflow systems such as Apache Spark or Apache Flink enable parallel, in-memory data processing on large clusters of commodity hardware. Consequently, the appropriate amount of memory to allocate to the cluster is a crucial…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-08 Jonathan Will , Lauritz Thamsen , Dominik Scheinert , Odej Kao