English
Related papers

Related papers: Rethinking State-Machine Replication for Paralleli…

200 papers

State-machine replication, a fundamental approach to fault tolerance, requires replicas to execute commands deterministically, which usually results in sequential execution of commands. Sequential execution limits performance and underuses…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-04-29 Parisa Jalili Marandi , Fernando Pedone

State machine replication is standard approach to fault tolerance. One of the key assumptions of state machine replication is that replicas must execute operations deterministically and thus serially. To benefit from multi-core servers,…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-15 Eduardo Alchieri , Fernando Dotti , Fernando Pedone

Linearizability is a well-known correctness property for concurrent and distributed systems. In the past, it was also used to prove the design and implementation of replicated state-machines correct. State-machine replication (SMR) is a…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-03 Franz J. Hauck , Alexander Heß

State Machine Replication (SMR) is a fundamental approach to designing service with fault tolerance. However, its requirement for the deterministic execution of transactions often results in single-threaded replicas, which cannot fully…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-27 Gang Wu1 , Guodong Zhao , Yidong Song

Modern Internet services commonly replicate critical data across several geographical locations using state-machine replication (SMR). Due to their reliance on a leader replica, classical SMR protocols offer limited scalability and…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-07 Tuanir França Rezende , Pierre Sutra

State machine replication (SMR) is a replication technique that ensures fault tolerance by duplicating a service. Geo-replicated SMR is an enhanced version of SMR that distributes replicas in separate geographical locations, making the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-06 Kohya Shiozaki , Junya Nakamura

This paper considers the classical state machine replication (SMR) problem in a distributed system model inspired by cross-chain exchanges. We propose a novel SMR protocol adapted for this model. Each state machine transition takes $O(n)$…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-15 Yingjie Xue , Maurice Herlihy

With the slowdown of Moore's law, CPU-oriented packet processing in software will be significantly outpaced by emerging line speeds of network interface cards (NICs). Single-core packet-processing throughput has saturated. We consider the…

Networking and Internet Architecture · Computer Science 2024-06-18 Qiongwen Xu , Sebastiano Miano , Xiangyu Gao , Tao Wang , Adithya Murugadass , Songyuan Zhang , Anirudh Sivaraman , Gianni Antichi , Srinivas Narayana

Despite the promising performance of state space models (SSMs) in long sequence modeling, limitations still exist. Advanced SSMs like S5 and S6 (Mamba) in addressing non-uniform sampling, their recursive structures impede efficient SSM…

Machine Learning · Computer Science 2024-06-11 Biqing Qi , Junqi Gao , Kaiyan Zhang , Dong Li , Jianxing Liu , Ligang Wu , Bowen Zhou

We present a new state transfer method for geographic State Machine Replication (SMR) that dynamically allocates the state to be transferred among replicas according to changes in communication bandwidths. SMR is a method that improves…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-12 Tairi Chiba , Ren Ohmura , Junya Nakamura

State-space models (SSMs) have recently attention as an efficient alternative to computationally expensive attention-based models for sequence modeling. They rely on linear recurrences to integrate information over time, enabling fast…

Machine Learning · Computer Science 2026-01-01 Mahdi Karami , Ali Behrouz , Peilin Zhong , Razvan Pascanu , Vahab Mirrokni

Consensus, state-machine replication (SMR) and total order broadcast (TOB) protocols are notorious for being poorly scalable with the number of participating nodes. Despite the recent race to reduce overall message complexity of…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-03-14 Chrysoula Stathakopoulou , Matej Pavlovic , Marko Vukolić

Geographic state machine replication (SMR) is a replication method in which replicas of a service are located on multiple continents to improve the fault tolerance of a general service. Nowadays, geographic SMR is easily realized using…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-12 Shota Numakura , Junya Nakamura , Ren Ohmura

This paper proposes a parallelizable algorithm for linear-quadratic model predictive control (MPC) problems with state and input constraints. The algorithm itself is based on a parallel MPC scheme that has originally been designed for…

Optimization and Control · Mathematics 2022-07-04 Jiahe Shi , Yuning Jiang , Juraj Oravec , Boris Houska

Stochastic simulations need multiple replications in order to build confidence intervals for their results. Even if we do not need a large amount of replications, it is a good practice to speed-up the whole simulation time using the…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-01-08 Jonathan Passerat-Palmbach , Jonathan Caux , Pridi Siregar , Claude Mazel , David Hill

This paper proposes a new state transfer method for geographic state machine replication (SMR) that dynamically allocates the state to be transferred among replicas according to changes in communication bandwidths. SMR improves fault…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-04-20 Tairi Chiba , Ren Ohmura , Junya Nakamura

Deterministic execution offers many benefits for debugging, fault tolerance, and security. Running parallel programs deterministically is usually difficult and costly, however - especially if we desire system-enforced determinism, ensuring…

Operating Systems · Computer Science 2010-05-20 Amittai Aviram , Shu-Chun Weng , Sen Hu , Bryan Ford

We introduce a parallelizable simplification of Neural Turing Machine (NTM), referred to as P-NTM, which redesigns the core operations of the original architecture to enable efficient scan-based parallel execution. We evaluate the proposed…

Neural and Evolutionary Computing · Computer Science 2026-02-24 Gabriel Faria , Arnaldo Candido Junior

Synchronous programs are used extensively in implementation of safety critical embedded software. Imperative synchronous programming languages model multiple Finite State Machines (FSMs) executing in lockstep at logical clock ticks. The…

Programming Languages · Computer Science 2025-08-05 Avinash Malik

In large-scale LLM pre-training systems with 100k+ GPUs, failures become the norm rather than the exception, and restart costs can dominate wall-clock training time. However, existing fault-tolerance mechanisms are largely unprepared for…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-29 Jin Lee , Zhonghao Chen , Xuhang He , Robert Underwood , Bogdan Nicolae , Franck Cappello , Xiaoyi Lu , Sheng Di , Zheng Zhang
‹ Prev 1 2 3 10 Next ›