Related papers: Resilient Cloud-based Replication with Low Latency

Spider: A BFT Architecture for Geo-Replicated Cloud Services

Traditionally, Byzantine fault tolerance (BFT) in geo-replicated systems is achieved by executing complex agreement protocols over large-distance communication links, and therefore typically incurs high response times. In this paper we…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-12 Michael Eischer , Tobias Distler

Byzantine Cluster-Sending in Expected Constant Communication

Traditional resilient systems operate on fully-replicated fault-tolerant clusters, which limits their scalability and performance. One way to make the step towards resilient high-performance systems that can deal with huge workloads, is by…

Databases · Computer Science 2021-08-20 Jelle Hellings , Mohammad Sadoghi

Controlled conflict resolution for replicated document

Collaborative working is increasingly popular, but it presents challenges due to the need for high responsiveness and disconnected work support. To address these challenges the data is optimistically replicated at the edges of the network,…

Databases · Computer Science 2012-12-12 Stéphane Martin , Mehdi Ahmed-Nacer , Pascal Urso

Byzantine Resilient Computing with the Cloud

We study a framework for modeling distributed network systems assisted by a reliable and powerful cloud service. Our framework aims at capturing hybrid systems based on a point to point message passing network of machines, with the…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-09-29 John Augustine , Jeffin Biju , Shachar Meir , David Peleg , Srikkanth Ramachandran , Aishwarya Thiruvengadam

Asynchronous Reconfiguration with Byzantine Failures

Replicated services are inherently vulnerable to failures and security breaches. In a long-running system, it is, therefore, indispensable to maintain a reconfiguration mechanism that would replace faulty replicas with correct ones. An…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-22 Petr Kuznetsov , Andrei Tonkikh

A Cluster Based Replication Architecture for Load Balancing in Peer-to-Peer Content Distribution

In P2P systems, large volumes of data are declustered naturally across a large number of peers. But it is very difficult to control the initial data distribution because every user has the freedom to share any data with other users. The…

Networking and Internet Architecture · Computer Science 2010-09-24 S. Ayyasamy , S. N. Sivanandam

Tuning the Tail Latency of Distributed Queries Using Replication

Querying graph data with low latency is an important requirement in application domains such as social networks and knowledge graphs. Graph queries perform multiple hops between vertices. When data is partitioned and stored across multiple…

Databases · Computer Science 2022-12-21 Nathan Ng , Hung Le , Marco Serafini

PISTIS: An Event-Triggered Real-Time Byzantine-Resilient Protocol Suite

The accelerated digitalisation of society along with technological evolution have extended the geographical span of cyber-physical systems. Two main threats have made the reliable and real-time control of these systems challenging: (i)…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-19 David Kozhaya , Jeremie Decouchant , Vincent Rahli , Paulo Esteves-Verissimo

A Comparative Study of Replication Techniques in Grid Computing Systems

Grid Computing is a type of parallel and distributed systems that is designed to provide reliable access to data and computational resources in wide area networks. These resources are distributed in different geographical locations, however…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-09-27 Sheida Dayyani , Mohammad Reza Khayyambashi

Stretching Multi-Ring Paxos

Internet-scale services rely on data partitioning and replication to provide scalable performance and high availability. Moreover, to reduce user-perceived response times and tolerate disasters (i.e., the failure of a whole datacenter),…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-04-21 Samuel Benz , Leandro Pacheco de Sousa , Fernando Pedone

Egalitarian Byzantine Fault Tolerance

Minimizing end-to-end latency in geo-replicated systems usually makes it necessary to compromise on resilience, resource efficiency, or throughput performance, because existing approaches either tolerate only crashes, require additional…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-08 Michael Eischer , Tobias Distler

GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph

The inherent connectivity and dependency of graph-structured data, combined with its unique topology-driven access patterns, pose fundamental challenges to conventional data replication and request routing strategies in geo-distributed…

Databases · Computer Science 2025-10-22 Feng Yao , Xiaokang Yang , Shufeng Gong , Song Yu , Yanfeng Zhang , Ge Yu

A Resilient Distributed Algorithm for Solving Linear Equations

This paper presents a resilient distributed algorithm for solving a system of linear algebraic equations over a multi-agent network in the presence of Byzantine agents capable of arbitrarily introducing untrustworthy information in…

Systems and Control · Electrical Eng. & Systems 2023-04-04 Jingxuan Zhu , Alvaro Velasquez , Ji Liu

The fault-tolerant cluster-sending problem

The development of fault-tolerant distributed systems that can tolerate Byzantine behavior has traditionally been focused on consensus protocols, which support fully-replicated designs. For the development of more sophisticated…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-06 Jelle Hellings , Mohammad Sadoghi

Scaling Strongly Consistent Replication

Strong consistency replication helps keep application logic simple and provides significant benefits for correctness and manageability. Unfortunately, the adoption of strongly-consistent replication protocols has been curbed due to their…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-01-22 Aleksey Charapko , Ailidani Ailijiang , Murat Demirbas

Granular Synchrony

Today's mainstream network timing models for distributed computing are synchrony, partial synchrony, and asynchrony. These models are coarse-grained and often make either too strong or too weak assumptions about the network. This paper…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-29 Neil Giridharan , Ittai Abraham , Natacha Crooks , Kartik Nayak , Ling Ren

Efficient Replication of Queued Tasks for Latency Reduction in Cloud Systems

In cloud computing systems, assigning a job to multiple servers and waiting for the earliest copy to finish is an effective method to combat the variability in response time of individual servers. Although adding redundant replicas always…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-21 Gauri Joshi , Emina Soljanin , Gregory Wornell

A QoS-Aware Intelligent Replica Management Architecture for Content Distribution in Peer-to-Peer Overlay Networks

The large scale content distribution systems were improved broadly using the replication techniques. The demanded contents can be brought closer to the clients by multiplying the source of information geographically, which in turn reduce…

Networking and Internet Architecture · Computer Science 2009-12-14 S. Ayyasamy , S. N. Sivanandam

Non-determinism in Byzantine Fault-Tolerant Replication

Service replication distributes an application over many processes for tolerating faults, attacks, and misbehavior among a subset of the processes. The established state-machine replication paradigm inherently requires the application to be…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-20 Christian Cachin , Simon Schubert , Marko Vukolić

GeoShield: Byzantine Fault Detection and Recovery for Geo-Distributed Real-Time Cyber-Physical Systems

Large-scale cyber-physical systems (CPS), such as railway control systems and smart grids, consist of geographically distributed subsystems that are connected via unreliable, asynchronous inter-region networks. Their scale and distribution…

Cryptography and Security · Computer Science 2025-11-20 Yifan Cai , Linh Thi Xuan Phan