Related papers: Building State Machine Replication Using Practical…

Optimistic Parallel State-Machine Replication

State-machine replication, a fundamental approach to fault tolerance, requires replicas to execute commands deterministically, which usually results in sequential execution of commands. Sequential execution limits performance and underuses…

Distributed, Parallel, and Cluster Computing · Computer Science · 2014-04-29 · Parisa Jalili Marandi, Fernando Pedone

Early Scheduling in Parallel State Machine Replication

State machine replication is a standard approach to fault tolerance. One of the key assumptions of state machine replication is that replicas must execute operations deterministically and thus serially. To benefit from multi-core servers,…

Distributed, Parallel, and Cluster Computing · Computer Science · 2018-05-15 · Eduardo Alchieri, Fernando Dotti, Fernando Pedone

Stream-based State-Machine Replication

Developing state-machine replication protocols for practical use is a complex and labor-intensive process because of the myriad of essential tasks (e.g., deployment, communication, recovery) that need to be taken into account in an…

Distributed, Parallel, and Cluster Computing · Computer Science · 2021-06-25 · Laura Lawniczak, Tobias Distler

Reducing asynchrony to synchronized rounds

Synchronous computation models simplify the design and the verification of fault-tolerant distributed systems. For efficiency reasons such systems are designed and implemented using an asynchronous semantics. In this paper, we bridge the…

Programming Languages · Computer Science · 2019-01-23 · Andrei Damien, Cezara Dragoi, Alexandru Militaru, Josef Widder

SCAR: Strong Consistency using Asynchronous Replication with Minimal Coordination

Data replication is crucial in modern distributed systems as a means to provide high availability. Many techniques have been proposed to utilize replicas to improve a system's performance, often requiring expensive coordination or…

Databases · Computer Science · 2019-03-04 · Yi Lu, Xiangyao Yu, Samuel Madden

State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

With the slowdown of Moore's law, CPU-oriented packet processing in software will be significantly outpaced by emerging line speeds of network interface cards (NICs). Single-core packet-processing throughput has saturated. We consider the…

Networking and Internet Architecture · Computer Science · 2024-06-18 · Qiongwen Xu, Sebastiano Miano, Xiangyu Gao, Tao Wang, Adithya Murugadass, Songyuan Zhang, Anirudh Sivaraman, Gianni Antichi, Srinivas Narayana

Efficient Synchronization of State-based CRDTs

To ensure high availability in large scale distributed systems, Conflict-free Replicated Data Types (CRDTs) relax consistency by allowing immediate query and update operations at the local replica, with no need for remote synchronization.…

Distributed, Parallel, and Cluster Computing · Computer Science · 2019-03-12 · Vitor Enes, Paulo Sérgio Almeida, Carlos Baquero, João Leitão

HT-Ring Paxos: Theory of High Throughput State-Machine Replication for Clustered Data Centers

Implementations of state-machine replication (SMR) predominantly use variants of Paxos. Some recent variants of Paxos, such as Ring Paxos, Multi-Ring Paxos, S-Paxos, and HT-Paxos, achieve very high throughput. However, to meet…

Distributed, Parallel, and Cluster Computing · Computer Science · 2015-07-16 · Vinit Kumar, Ajay Agarwal

Cora: Accelerating Stateful Network Applications with SmartNICs

With the growing performance requirements on networked applications, there is a new trend of offloading stateful network applications to SmartNICs to improve performance and reduce the total cost of ownership. However, offloading stateful…

Networking and Internet Architecture · Computer Science · 2024-10-30 · Shaoke Xi, Jiaqi Gao, Mengqi Liu, Jiamin Cao, Fuliang Li, Kai Bu, Kui Ren, Minlan Yu, Dennis Cai, Ennan Zhai

HT-Paxos: High Throughput State-Machine Replication Protocol for Large Clustered Data Centers

Paxos is a prominent theory of state machine replication. Recent data-intensive systems that implement state machine replication generally require high throughput. Earlier versions of Paxos, such as classical Paxos, fast Paxos…

Distributed, Parallel, and Cluster Computing · Computer Science · 2014-07-07 · Vinit Kumar, Ajay Agarwal

Leaderless State-Machine Replication: Specification, Properties, Limits (Extended Version)

Modern Internet services commonly replicate critical data across several geographical locations using state-machine replication (SMR). Due to their reliance on a leader replica, classical SMR protocols offer limited scalability and…

Distributed, Parallel, and Cluster Computing · Computer Science · 2020-08-07 · Tuanir França Rezende, Pierre Sutra

Practically-Self-Stabilizing Virtual Synchrony

Virtual synchrony is an important abstraction that is proven to be extremely useful when implemented over asynchronous, typically large, message-passing distributed systems. Fault tolerant design is a key criterion for the success of such…

Distributed, Parallel, and Cluster Computing · Computer Science · 2018-04-26 · Shlomi Dolev, Chryssis Georgiou, Ioannis Marcoullis, Elad Michael Schiller

Rethinking State-Machine Replication for Parallelism

State-machine replication, a fundamental approach to designing fault-tolerant services, requires commands to be executed in the same order by all replicas. Moreover, command execution must be deterministic: each replica must produce the…

Distributed, Parallel, and Cluster Computing · Computer Science · 2013-11-26 · Parisa Jalili Marandi, Carlos Eduardo Bezerra, Fernando Pedone

Efficient Distributed Data Structures for Future Many-core Architectures

We study general techniques for implementing distributed data structures on top of future many-core architectures with non-cache-coherent or partially cache-coherent memory. With the goal of contributing towards what might become, in the…

Distributed, Parallel, and Cluster Computing · Computer Science · 2024-04-09 · Panagiota Fatourou, Nikolaos D. Kallimanis, Eleni Kanellou, Odysseas Makridakis, Christi Symeonidou

Optimal state replication in stateful data planes

In SDN stateful data planes, switches can execute algorithms to process traffic based on local states. This approach allows decisions to be offloaded from the controller to the switches, reducing the latency of reacting to network events. We…

Networking and Internet Architecture · Computer Science · 2020-04-14 · Abubakar Siddique Muqaddas, German Sviridov, Paolo Giaccone, Andrea Bianco

A Comparative Study of Replication Techniques in Grid Computing Systems

Grid computing is a type of parallel and distributed system designed to provide reliable access to data and computational resources in wide area networks. These resources are distributed in different geographical locations, however…

Distributed, Parallel, and Cluster Computing · Computer Science · 2013-09-27 · Sheida Dayyani, Mohammad Reza Khayyambashi

Cross-Chain State Machine Replication

This paper considers the classical state machine replication (SMR) problem in a distributed system model inspired by cross-chain exchanges. We propose a novel SMR protocol adapted for this model. Each state machine transition takes $O(n)$…

Distributed, Parallel, and Cluster Computing · Computer Science · 2022-06-15 · Yingjie Xue, Maurice Herlihy

A Model for Communication in Clusters of Multi-core Machines

A common paradigm for scientific computing is distributed message-passing systems, and a common approach to these systems is to implement them across clusters of high-performance workstations. As multi-core architectures become increasingly…

Distributed, Parallel, and Cluster Computing · Computer Science · 2012-05-01 · Christine Task, Arun Chauhan

Few-Round Distributed Principal Component Analysis: Closing the Statistical Efficiency Gap by Consensus

Distributed algorithms and theories are called for in this era of big data. Under weaker local signal-to-noise ratios, we improve upon the celebrated one-round distributed principal component analysis (PCA) algorithm designed in the spirit…

Methodology · Statistics · 2025-07-01 · ZeYu Li, Xinsheng Zhang, Wang Zhou

More Iterations per Second, Same Quality -- Why Asynchronous Algorithms may Drastically Outperform Traditional Ones

In this paper, we consider the convergence of a very general asynchronous-parallel algorithm called ARock, which takes many well-known asynchronous algorithms as special cases (gradient descent, proximal gradient, Douglas Rachford, ADMM,…

Optimization and Control · Mathematics · 2017-08-28 · Robert Hannah, Wotao Yin