Related papers: RDMA vs. RPC for Implementing Distributed Data Str…

200 papers

In this work, we aim to evaluate different Distributed Lock Management service designs with Remote Direct Memory Access (RDMA). Specifically, we implement and evaluate the centralized and the RDMA-enabled lock manager designs for fast…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-07-21 Yeounoh Chung, Erfan Zamanian

Deep learning emerges as an important new resource-intensive workload and has been successfully applied in computer vision, speech, natural language processing, and so on. Distributed deep learning is becoming a necessity to cope with…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-23 Jilong Xue, Youshan Miao, Cheng Chen, Ming Wu, Lintao Zhang, Lidong Zhou

RDMA is an exciting technology that enables a host to access the memory of a remote host without involving the remote CPU. Prior work shows how to use RDMA to improve the performance of distributed in-memory storage systems. However, RDMA…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-08 Stanko Novakovic, Yizhou Shan, Aasheesh Kolli, Michael Cui, Yiying Zhang, Haggai Eran, Liran Liss, Michael Wei, Dan Tsafrir, Marcos Aguilera

High-performance clusters and datacenters pose increasingly demanding requirements on storage systems. If these systems do not operate at scale, applications are doomed to become I/O bound and waste compute cycles. To accelerate the data…

Networking and Internet Architecture · Computer Science 2022-06-22 Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler

The drive towards exascale computing is opening an enormous opportunity for more realistic and precise simulations of natural phenomena. The process of simulation, however, involves not only the numerical computation of predictions but also…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-21 Allan Santos, Hermano Lustosa, Fabio Porto, Bruno Schulze

Coordinating concurrent access to a shared resource using mutual exclusion is a fundamental problem in computation. In this paper, we present a novel approach to mutual exclusion designed specifically for distributed systems leveraging a…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-23 Jacob Nelson-Slivon, Lewis Tseng, Roberto Palmieri

Remote Direct Memory Access (RDMA) is an efficient way to improve the performance of traditional client-server systems. Currently, there are two main design paradigms for RDMA-accelerated systems. The first allows the clients to directly…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-12-25 Maomeng Su, Mingxing Zhang, Kang Chen, Yongwei Wu, Guoliang Li

Remote Direct Memory Access (RDMA) is a technology that allows direct memory access from the memory of one computer into that of another without involving either one's operating system. This enables high-throughput, low-latency networking,…

Logic in Computer Science · Computer Science 2026-05-12 Parosh Aziz Abdulla, Mohamed Faouzi Atig, Govind Rajanbabu, Stephan Spengler

Remote memory access (RMA) is an emerging high-performance programming model that uses RDMA hardware directly. Yet, accessing remote memories cannot invoke activities at the target which complicates implementation and limits performance of…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-20 Maciej Besta, Torsten Hoefler

Memory disaggregation is being considered as a strong alternative to traditional architecture to deal with the memory under-utilization in data centers. Disaggregated memory can adapt to dynamically changing memory requirements for the data…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-04-11 Amit Puri, John Jose, Tamarapalli Venkatesh

Overheads in Operating System kernel network stacks and sockets have been hindering OSes from managing networking operations efficiently for years. Moreover, when building Remote Procedure Calls over TCP, certain TCP features do not match…

Networking and Internet Architecture · Computer Science 2021-07-06 Ioannis Argyroulis

In this paper, we conduct systematic measurement studies to show that the high memory bandwidth consumption of modern distributed applications can lead to a significant drop of network throughput and a large increase of tail latency in…

We can use a hybrid memory system consisting of DRAM and Intel Optane DC Persistent Memory (called DCPM in this paper), as DCPM has been commercially available since April 2019. Even if the latency for DCPM is several times higher than…

Performance · Computer Science 2020-08-31 Kazuichi Oe

Remote Procedure Call (RPC) is a widely used abstraction for cloud computing. The programmer specifies type information for each remote procedure, and a compiler generates stub code linked into each application to marshal and unmarshal…

Networking and Internet Architecture · Computer Science 2023-04-18 Jingrong Chen, Yongji Wu, Shihan Lin, Yechen Xu, Xinhao Kong, Thomas Anderson, Matthew Lentz, Xiaowei Yang, Danyang Zhuo

Grid Computing is a type of parallel and distributed system that is designed to provide reliable access to data and computational resources in wide area networks. These resources are distributed in different geographical locations, however…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-09-27 Sheida Dayyani, Mohammad Reza Khayyambashi

With Dynamic Resource Management (DRM) the resources assigned to a job can be changed dynamically during its execution. From the system's perspective, DRM opens a new level of flexibility in resource allocation and job scheduling and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-27 Dominik Huber, Martin Schreiber, Martin Schulz, Howard Pritchard, Daniel Holmes

Modern interconnects offer remote direct memory access (RDMA) features. Yet, most applications rely on explicit message passing for communications despite its unwanted overheads. The MPI-3.0 standard defines a programming interface for…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-01 Robert Gerstenberger, Maciej Besta, Torsten Hoefler

This paper presents and evaluates various mechanisms for remote access to memory in distributed systems based on two distinct HPC clusters. We compare solutions based on the shared storage and MPI (over Infiniband and Slingshot) to the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-03 Jan Meizner, Maciej Malawski

This work-in-progress report presents both the design and partial evaluation of distributed execution indexing, a technique for microservice applications that precisely identifies dynamic instances of inter-service remote procedure calls…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-20 Christopher S. Meiklejohn, Rohan Padhye, Heather Miller

Conventional wisdom holds that an efficient interface between an OS running on a CPU and a high-bandwidth I/O device should use Direct Memory Access (DMA) to offload data transfer, descriptor rings for buffering and queuing, and interrupts…

Hardware Architecture · Computer Science 2025-04-25 Anastasiia Ruzhanskaia, Pengcheng Xu, David Cock, Timothy Roscoe