Related papers: Practically-Self-Stabilizing Virtual Synchrony

Self-stabilizing Reconfiguration

Current reconfiguration techniques are based on starting the system in a consistent configuration, in which all participating entities are in their initial state. Starting from that state, the system must preserve consistency as long as a…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-07 Shlomi Dolev, Chryssis Georgiou, Ioannis Marcoullis, Elad M. Schiller

Self-Stabilizing Paxos

We present the first self-stabilizing consensus and replicated state machine for asynchronous message passing systems. The scheme does not require that all participants make a certain number of steps prior to reaching a practically infinite…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-05-21 Peva Blanchard, Shlomi Dolev, Joffroy Beauquier, Sylvie Delaët

Self-Stabilizing Distributed Cooperative Reset

Self-stabilization is a versatile fault-tolerance approach that characterizes the ability of a system to eventually resume a correct behavior after any finite number of transient faults. In this paper, we propose a self-stabilizing reset…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-23 Stéphane Devismes, Colette Johnen

Robust Self-Triggered Control Approaches Optimizing Sampling Sequences with Synchronous Measurements

Feedback control algorithms traditionally rely on periodic execution on digital platforms. While this simplifies design and analysis, it often leads to inefficient resource usage (e.g., CPU, network bandwidth) in embedded control and shared…

Systems and Control · Electrical Eng. & Systems 2025-11-04 Abbas Tariverdi

Resilience of a synchronized multi-agent system

Fault tolerance is increasingly important for unmanned autonomous vehicles. For example, in a multi-robot system, the agents need the ability to effectively detect and tolerate internal failures in order to continue performing their tasks…

Combinatorics · Mathematics 2016-05-02 S. Bereg, L. E. Caraballo, J. M. Díaz-Báñez, M. A. Lopez

Towards Optimal Synchronous Counting

Consider a complete communication network of $n$ nodes, where the nodes receive a common clock pulse. We study the synchronous $c$-counting problem: given any starting state and up to $f$ faulty nodes with arbitrary behaviour, the task is…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-24 Christoph Lenzen, Joel Rybicki, Jukka Suomela

Practically-Self-Stabilizing Vector Clocks in the Absence of Execution Fairness

Vector clock algorithms are basic wait-free building blocks that facilitate causal ordering of events. As wait-free algorithms, they are guaranteed to complete their operations within a finite number of steps. Stabilizing algorithms allow…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-12-25 Iosif Salem, Elad Michael Schiller

Fault-tolerant Algorithms for Tick-Generation in Asynchronous Logic: Robust Pulse Generation

Today's hardware technology presents a new challenge in designing robust systems. Deep submicron VLSI technology introduced transient and permanent faults that were never considered in low-level system designs in the past. Still, robustness…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-19 Danny Dolev, Matthias Fuegger, Christoph Lenzen, Ulrich Schmid

The R(1)W(1) Communication Model for Self-Stabilizing Distributed Algorithms

Self-stabilization is a versatile methodology in the design of fault-tolerant distributed algorithms for transient faults. A self-stabilizing system automatically recovers from any kind and any finite number of transient faults. This…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-07 Hirotsugu Kakugawa, Sayaka Kamei, Masahiro Shibata, Fukuhito Ooshita

Effective Numerical Simulation of Fault Transient System

Power systems, including synchronous generator systems, are typical systems that strive for stable operation. In this article, we numerically study the fault transient process of a synchronous generator system based on the first benchmark…

Numerical Analysis · Mathematics 2025-02-25 Sixu Wu, Feng Ji, Lu Gao, Ruili Zhang, Cunwei Tang, Yifa Tang

Early-Stabilizing Counting

Synchronous Counting is the task of reaching agreement on a common round counter in a synchronous system of $n$ nodes with up to $t$ Byzantine faults in a self-stabilizing manner. That is, after transient faults may have arbitrarily…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-19 Christoph Lenzen, Julian Loss

Early Scheduling in Parallel State Machine Replication

State machine replication is a standard approach to fault tolerance. One of the key assumptions of state machine replication is that replicas must execute operations deterministically and thus serially. To benefit from multi-core servers,…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-15 Eduardo Alchieri, Fernando Dotti, Fernando Pedone

Self-stabilizing Byzantine-tolerant Recycling

Numerous distributed applications, such as cloud computing and distributed ledgers, necessitate the system to invoke asynchronous consensus objects an unbounded number of times, where the completion of one consensus instance is followed by…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-28 Chryssis Georgiou, Michel Raynal, Elad M. Schiller

Dynamic FTSS in Asynchronous Systems: the Case of Unison

Distributed fault-tolerance can mask the effect of a limited number of permanent faults, while self-stabilization provides forward recovery after an arbitrary number of transient faults hit the system. FTSS protocols combine the best of both…

Data Structures and Algorithms · Computer Science 2011-02-11 Swan Dubois, Maria Potop-Butucaru, Sébastien Tixeuil

Self-stabilizing Total-order Broadcast

The problem of total-order (uniform reliable) broadcast is fundamental in fault-tolerant distributed computing since it abstracts a broad set of problems requiring processes to uniformly deliver messages in the same order in which they were…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-30 Oskar Lundström, Michel Raynal, Elad Michael Schiller

Efficient counting with optimal resilience

Consider a complete communication network of $n$ nodes, where the nodes receive a common clock pulse. We study the synchronous $c$-counting problem: given any starting state and up to $f$ faulty nodes with arbitrary behaviour, the task is…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-12-24 Christoph Lenzen, Joel Rybicki, Jukka Suomela

Automated Synthesis of Distributed Self-Stabilizing Protocols

In this paper, we introduce an SMT-based method that automatically synthesizes a distributed self-stabilizing protocol from a given high-level specification and network topology. Unlike existing approaches, where synthesis algorithms…

Software Engineering · Computer Science 2023-06-22 Fathiyeh Faghih, Borzoo Bonakdarpour, Sebastien Tixeuil, Sandeep Kulkarni

Self-stabilizing Multivalued Consensus in Asynchronous Crash-prone Systems

The problem of multivalued consensus is fundamental in the area of fault-tolerant distributed computing since it abstracts a very broad set of agreement problems in which processes have to uniformly decide on a specific value v in V, where…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-08 Oskar Lundström, Michel Raynal, Elad Michael Schiller

Self-stabilising Byzantine Clock Synchronisation is Almost as Easy as Consensus

We give fault-tolerant algorithms for establishing synchrony in distributed systems in which each of the $n$ nodes has its own clock. Our algorithms operate in a very strong fault model: we require self-stabilisation, i.e., the initial…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-12 Christoph Lenzen, Joel Rybicki

Interactive Consistency in practical, mostly-asynchronous systems

Interactive consistency is the problem in which n nodes, where up to t may be byzantine, each with its own private value, run an algorithm that allows all non-faulty nodes to infer the values of each other node. This problem is relevant to…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-07-28 Panos Diamantopoulos, Stathis Maneas, Christos Patsonakis, Nikos Chondros, Mema Roussopoulos