English
Related papers

Related papers: Microsecond Consensus for Microsecond Applications

200 papers

Synchronous Mirroring (SM) is a standard approach to building highly-available and fault-tolerant enterprise storage systems. SM ensures strong data consistency by maintaining multiple exact data replicas and synchronously propagating every…

The private chain-based Internet of Things (IoT) system ensures the security of cross-organizational data sharing. As a widely used consensus model in private chains, the leader-based state-machine replication (SMR) model meets the…

Networking and Internet Architecture · Computer Science 2021-09-21 Dongjie Zhu , Haiwen Du , Yundong Sun , Zhaoshuo Tian

Replicating data across multiple data centers not only allows moving the data closer to the user and, thus, reduces latency for applications, but also increases the availability in the event of a data center failure. Therefore, it is not…

Databases · Computer Science 2012-03-28 Tim Kraska , Gene Pang , Michael J. Franklin , Samuel Madden

Modern web applications replicate their data across the globe and require strong consistency guarantees for their most critical data. These guarantees are usually provided via state-machine replication (SMR). Recent advances in SMR have…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-27 Vitor Enes , Carlos Baquero , Alexey Gotsman , Pierre Sutra

Edge devices such as smartwatches and smart glasses cannot continuously run even the smallest 100M-1B parameter language models due to power and compute constraints, yet cloud inference introduces multi-second latencies that break the…

Computation and Language · Computer Science 2026-04-22 Wen Cheng , Tuochao Chen , Karim Helwani , Sriram Srinivasan , Luke Zettlemoyer , Shyamnath Gollakota

State-machine replication, a fundamental approach to designing fault-tolerant services, requires commands to be executed in the same order by all replicas. Moreover, command execution must be deterministic: each replica must produce the…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-11-26 Parisa Jalili Marandi , Carlos Eduardo Bezerra , Fernando Pedone

This work elaborates on a High performance computing (HPC) architecture based on Simple Linux Utility for Resource Management (SLURM) [1] for deploying heterogeneous Large Language Models (LLMs) into a scalable inference engine. Dynamic…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-26 Anderson de Lima Luiz , Shubham Vijay Kurlekar , Munir Georges

Production state-machine replication (SMR) implementations are complex, multi-layered architectures comprising data dissemination, ordering, execution, and reconfiguration components. Existing research consensus protocols rarely discuss…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-11 Allen Clement , Natacha Crooks , Neil Giridharan , Alex Shamis

As we have entered Exascale computing, the faults in high-performance systems are expected to increase considerably. To compensate for a higher failure rate, the standard checkpoint/restart technique would need to create checkpoints at a…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-26 Sarthak Joshi , Sathish Vadhiyar

Developing state-machine replication protocols for practical use is a complex and labor-intensive process because of the myriad of essential tasks (e.g., deployment, communication, recovery) that need to be taken into account in an…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-25 Laura Lawniczak , Tobias Distler

Traditional approaches to replication require client requests to be ordered before making them durable by copying them to replicas. As a result, clients must wait for two round-trip times (RTTs) before updates complete. In this paper, we…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-30 Seo Jin Park , John Ousterhout

Recently, we saw the emergence of consensus-based database systems that promise resilience against failures, strong data provenance, and federated data management. Typically, these fully-replicated systems are operated on top of a…

Databases · Computer Science 2020-11-04 Suyash Gupta , Jelle Hellings , Mohammad Sadoghi

Large-scale distributed Multiuser MIMO (MU-MIMO) is a promising wireless network architecture that combines the advantages of "massive MIMO" and "small cells." It consists of several Access Points (APs) connected to a central server via a…

Networking and Internet Architecture · Computer Science 2015-04-02 Ryan Rogalin , Ozgun Bursalioglu , Haralabos Papadopoulos , Giuseppe Caire , Andreas Molisch , Antonios Michaloliakos , Vlad Balan , Konstantinos Psounis

Sequential computation is well understood but does not scale well with current technology. Within the next decade, systems will contain large numbers of processors with potentially thousands of processors per chip. Despite this, many…

Hardware Architecture · Computer Science 2015-11-17 James Hanlon

In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Specifically, it is necessary for IMDBs to have efficient snapshot performance to support certain special…

Databases · Computer Science 2018-10-12 Liang Li , Guoren Wang , Gang Wu , Ye Yuan , Lei Chen , Xiang Lian

Data management systems have traditionally been designed to support either long-running analytics queries or short-lived transactions, but an increasing number of applications need both. For example, online games, socio-mobile apps, and…

Databases · Computer Science 2012-05-31 Benjamin Sowell , Wojciech Golab , Mehul A. Shah

State-machine replication, a fundamental approach to fault tolerance, requires replicas to execute commands deterministically, which usually results in sequential execution of commands. Sequential execution limits performance and underuses…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-04-29 Parisa Jalili Marandi , Fernando Pedone

Replicated state machines (RSMs) cannot communicate effectively today as there is no formal framework or efficient protocol to do so. To address this issue, we introduce a new primitive, Cross-Cluster Consistent Broadcast (C3B) and present…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-25 Reginald Frank , Micah Murray , Chawinphat Tankuranand , Junseo Yoo , Ethan Xu , Natacha Crooks , Suyash Gupta , Manos Kapritsos

Computational models have become one of the prevalent methods to model complex phenomena. To accurately model complex interactions, such as detailed biomolecular interactions, scientists often rely on multiscale models comprised of several…

Despite the promising performance of state space models (SSMs) in long sequence modeling, limitations still exist. Advanced SSMs like S5 and S6 (Mamba) in addressing non-uniform sampling, their recursive structures impede efficient SSM…

Machine Learning · Computer Science 2024-06-11 Biqing Qi , Junqi Gao , Kaiyan Zhang , Dong Li , Jianxing Liu , Ligang Wu , Bowen Zhou
‹ Prev 1 2 3 10 Next ›