Related papers: Self-stabilization Overhead: an Experimental Case …

Self-Stabilizing and Private Distributed Shared Atomic Memory in Seldomly Fair Message Passing Networks

We study the problem of privately emulating shared memory in message-passing networks. The system includes clients that store and retrieve replicated information on N servers, out of which e are malicious. When a client access a malicious…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-06-12 Shlomi Dolev , Thomas Petig , Elad Michael Schiller

A Coded Shared Atomic Memory Algorithm for Message Passing Architectures

This paper considers the communication and storage costs of emulating atomic (linearizable) multi-writer multi-reader shared memory in distributed message-passing systems. The paper contains three main contributions: (1) We present a atomic…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-07-17 Viveck R. Cadambe , Nancy Lynch , Muriel Médard , Peter Musial

ARES: Adaptive, Reconfigurable, Erasure coded, atomic Storage

Atomicity or strong consistency is one of the fundamental, most intuitive, and hardest to provide primitives in distributed shared memory emulations. To ensure survivability, scalability, and availability of a storage service in the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-05-31 Nicolas Nicolaou , Viveck Cadambe , N. Prakash , Andria Trigeorgi , Kishori M. Konwar , Nancy Lynch , Muriel Medard

An Empirical Study of the Repair Performance of Novel Coding Schemes for Networked Distributed Storage Systems

Erasure coding techniques are getting integrated in networked distributed storage systems as a way to provide fault-tolerance at the cost of less storage overhead than traditional replication. Redundancy is maintained over time through…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-06-12 Lluis Pamies-Juarez , Frédérique Oggier , Anwitaman Datta

Storage-Optimized Data-Atomic Algorithms for Handling Erasures and Errors in Distributed Storage Systems

Erasure codes are increasingly being studied in the context of implementing atomic memory objects in large scale asynchronous distributed storage systems. When compared with the traditional replication based schemes, erasure codes have the…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-15 Kishori M. Konwar , N. Prakash , Erez Kantor , Nancy Lynch , Muriel Medard , Alexander A. Schwarzmann

Storage-Efficient Shared Memory Emulation

We study the design of storage-efficient algorithms for emulating atomic shared memory over an asynchronous, distributed message-passing system. Our first algorithm is an atomic single-writer multi-reader algorithm based on a novel…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-06-27 Marwen Zorgui , Robert Mateescu , Filip Blagojevic , Cyril Guyot , Zhiying Wang

A Layered Architecture for Erasure-Coded Consistent Distributed Storage

Motivated by emerging applications to the edge computing paradigm, we introduce a two-layer erasure-coded fault-tolerant distributed storage system offering atomic access for read and write operations. In edge computing, clients interact…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-06-01 Kishori M. Konwar , N. Prakash , Nancy Lynch , Muriel Medard

A New Design Framework for Heterogeneous Uncoded Storage Elastic Computing

Elasticity is one important feature in modern cloud computing systems and can result in computation failure or significantly increase computing time. Such elasticity means that virtual machines over the cloud can be preempted under a short…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-26 Mingyue Ji , Xiang Zhang , Kai Wan

Decentralized Uncoded Storage Elastic Computing with Heterogeneous Computation Speeds

Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in…

Information Theory · Computer Science 2024-03-04 Wenbo Huang , Xudong You , Kai Wan , Robert Caiming Qiu , Mingyue Ji

Self-Stabilizing Distributed Cooperative Reset

Self-stabilization is a versatile fault-tolerance approach that characterizes the ability of a system to eventually resume a correct behavior after any finite number of transient faults. In this paper, we propose a self-stabilizing reset…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-23 Stéphane Devismes , Colette Johnen

CausalEC: A Causally Consistent Data Storage Algorithm based on Cross-Object Erasure Coding

Current causally consistent data storage algorithms use partial or full replication to ensure data access to clients over a distributed setting. We develop, for the first time, an erasure coding-based algorithm called CausalEC that ensures…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-23 Viveck R. Cadambe , Shihang Lyu

Time-Efficient Constant-Space-Overhead Fault-Tolerant Quantum Computation

Scaling up quantum computers to attain substantial speedups over classical computing requires fault tolerance. Conventionally, protocols for fault-tolerant quantum computation demand excessive space overheads by using many physical qubits…

Quantum Physics · Physics 2025-01-29 Hayata Yamasaki , Masato Koashi

Coded Elastic Computing on Machines with Heterogeneous Storage and Computation Speed

We study the optimal design of heterogeneous Coded Elastic Computing (CEC) where machines have varying computation speeds and storage. CEC introduced by Yang et al. in 2018 is a framework that mitigates the impact of elastic events, where…

Information Theory · Computer Science 2020-08-13 Nicholas Woolsey , Rong-Rong Chen , Mingyue Ji

Exploring Fault-Tolerant Erasure Codes for Scalable All-Flash Array Clusters

Large-scale systems with all-flash arrays have become increasingly common in many computing segments. To make such systems resilient, we can adopt erasure coding such as Reed-Solomon (RS) code as an alternative to replication because…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-21 Sungjoon Koh , Jie Zhang , Miryeong Kwon , Jungyeon Yoon , David Donofrio , Nam Sung Kim , Myoungsoo Jung

Self-repairing Homomorphic Codes for Distributed Storage Systems

Erasure codes provide a storage efficient alternative to replication based redundancy in (networked) storage systems. They however entail high communication overhead for maintenance, when some of the encoded fragments are lost and need to…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-11-24 Frederique Oggier , Anwitaman Datta

Space Bounds for Reliable Storage: Fundamental Limits of Coding

We study the inherent space requirements of shared storage algorithms in asynchronous fault-prone systems. Previous works use codes to achieve a better storage cost than the well-known replication approach. However, a closer look reveals…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-07-21 Alexander Spiegelman , Yuval Cassuto , Gregory Chockler , Idit Keidar

The CORE Storage Primitive: Cross-Object Redundancy for Efficient Data Repair & Access in Erasure Coded Storage

Erasure codes are an integral part of many distributed storage systems aimed at Big Data, since they provide high fault-tolerance for low overheads. However, traditional erasure codes are inefficient on reading stored data in degraded…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-06-27 Kyumars Sheykh Esmaili , Lluis Pamies-Juarez , Anwitaman Datta

Self-Stabilizing Snapshot Objects for Asynchronous Fail-Prone Network Systems

A snapshot object simulates the behavior of an array of single-writer/multi-reader shared registers that can be read atomically. Delporte-Gallet et al. proposed two fault-tolerant algorithms for snapshot objects in asynchronous crash-prone…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-03 Chryssis Georgiou , Oskar Lundström , Elad Michael Schiller

Understanding System Characteristics of Online Erasure Coding on Scalable, Distributed and Large-Scale SSD Array Systems

Large-scale systems with arrays of solid state disks (SSDs) have become increasingly common in many computing segments. To make such systems resilient, we can adopt erasure coding such as Reed-Solomon (RS) code as an alternative to…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-09-20 Sungjoon Koh , Jie Zhang , Miryeong Kwon , Jungyeon Yoon , David Donofrio , Namsung Kim , Myoungsoo Jung

Information-Theoretic Lower Bounds on the Storage Cost of Shared Memory Emulation

The focus of this paper is to understand storage costs of emulating an atomic shared memory over an asynchronous, distributed message passing system. Previous literature has developed several shared memory emulation algorithms based on…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-05-26 Viveck R. Cadambe , Zhiying Wang , Nancy Lynch