English

uBFT: Microsecond-scale BFT using Disaggregated Memory [Extended Version]

Distributed, Parallel, and Cluster Computing 2026-02-17 v5

Abstract

We propose uBFT, the first State-Machine Replication (SMR) system to achieve microsecond-scale latency in data centers, while using only 2f+12f{+}1 replicas to tolerate ff Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential as pure crashes appear to be a mere illusion with real-life systems reportedly failing in many unexpected ways. uBFT relies on a small non-tailored trusted computing base -- disaggregated memory -- and consumes a practically bounded amount of memory (both local and disaggregated). uBFT is based on a novel abstraction called Consistent Tail Broadcast, which we use to prevent equivocation while bounding memory. We implement uBFT using RDMA-based disaggregated memory and obtain an end-to-end latency of as little as 10us. This is at least 50×\times faster than MinBFT , a state of the art 2f+12f{+}1 BFT SMR based on Intel's SGX. We use uBFT to replicate two key-value stores (Memcached and Redis), as well as a financial order matching engine (Liquibook). These applications have low latency (up to 20us) and become Byzantine tolerant with as little as 10us more. The price for uBFT is a small amount of reliable disaggregated memory (less than 1 MiB), which in our prototype consists of a small number of memory servers connected through RDMA and replicated for fault tolerance.

Keywords

Cite

@article{arxiv.2210.17174,
  title  = {uBFT: Microsecond-scale BFT using Disaggregated Memory [Extended Version]},
  author = {Marcos K. Aguilera and Naama Ben-David and Rachid Guerraoui and Antoine Murat and Athanasios Xygkis and Igor Zablotchi},
  journal= {arXiv preprint arXiv:2210.17174},
  year   = {2026}
}