Related papers: Parallel Deferred Update Replication

Hybrid Transactional Replication: State-Machine and Deferred-Update Replication Combined

We propose Hybrid Transactional Replication (HTR), a novel replication scheme for highly dependable services. It combines two schemes: a transaction is executed either optimistically by only one service replica in the deferred update mode…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-26 Tadeusz Kobus , Maciej Kokociński , Paweł T. Wojciechowski

Fast Updates on Read-Optimized Databases Using Multi-Core CPUs

Read-optimized columnar databases use differential updates to handle writes by maintaining a separate write-optimized delta partition which is periodically merged with the read-optimized and compressed main partition. This merge process…

Databases · Computer Science 2015-03-19 Jens Krueger , Changkyu Kim , Martin Grund , Nadathur Satish , David Schwalb , Jatin Chhugani , Hasso Plattner , Pradeep Dubey , Alexander Zeier

Reducing DRAM Refresh Overheads with Refresh-Access Parallelism

This article summarizes the idea of "refresh-access parallelism," which was published in HPCA 2014, and examines the work's significance and future potential. The overarching objective of our HPCA 2014 paper is to reduce the significant…

Hardware Architecture · Computer Science 2018-05-04 K. K. Chang , D. Lee , Z. Chishti , A. R. Alameldeen , C. Wilkerson , Y. Kim , O. Mutlu

Parallel $k$-Core Decomposition with Batched Updates and Asynchronous Reads

Maintaining a dynamic $k$-core decomposition is an important problem that identifies dense subgraphs in dynamically changing graphs. Recent work by Liu et al. [SPAA 2022] presents a parallel batch-dynamic algorithm for maintaining an…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-17 Quanquan C. Liu , Julian Shun , Igor Zablotchi

Rethinking State-Machine Replication for Parallelism

State-machine replication, a fundamental approach to designing fault-tolerant services, requires commands to be executed in the same order by all replicas. Moreover, command execution must be deterministic: each replica must produce the…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-11-26 Parisa Jalili Marandi , Carlos Eduardo Bezerra , Fernando Pedone

Unobtrusive Deferred Update Stabilization for Efficient Geo-Replication

In this paper we propose a novel approach to manage the throughput vs latency tradeoff that emerges when managing updates in geo-replicated systems. Our approach consists in allowing full concurrency when processing local updates and using…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-02-08 Chathuri Gunawardhana , Manuel Bravo , Luís Rodrigues

Reducing Performance Impact of DRAM Refresh by Parallelizing Refreshes with Accesses

Modern DRAM cells are periodically refreshed to prevent data loss due to leakage. Commodity DDR DRAM refreshes cells at the rank level. This degrades performance significantly because it prevents an entire rank from serving memory requests…

Hardware Architecture · Computer Science 2016-01-26 Kevin Kai-Wei Chang , Donghyuk Lee , Zeshan Chishti , Alaa R. Alameldeen , Chris Wilkerson , Yoongu Kim , Onur Mutlu

Parallel Online Learning

In this work we study parallelization of online learning, a core primitive in machine learning. In a parallel environment all known approaches for parallel online learning lead to delayed updates, where the model is updated using…

Machine Learning · Computer Science 2011-03-23 Daniel Hsu , Nikos Karampatziakis , John Langford , Alex Smola

Improving DRAM Performance by Parallelizing Refreshes with Accesses

Modern DRAM cells are periodically refreshed to prevent data loss due to leakage. Commodity DDR DRAM refreshes cells at the rank level. This degrades performance significantly because it prevents an entire rank from serving memory requests…

Hardware Architecture · Computer Science 2017-12-22 Kevin K. Chang , Donghyuk Lee , Zeshan Chishti , Alaa R. Alameldeen , Chris Wilkerson , Yoongu Kim , Onur Mutlu

RADAR: Recall Augmentation through Deferred Asynchronous Retrieval

Modern large-scale recommender systems employ multi-stage ranking funnel (Retrieval, Pre-ranking, Ranking) to balance engagement and computational constraints (latency, CPU). However, the initial retrieval stage, often relying on efficient…

Information Retrieval · Computer Science 2025-06-10 Amit Jaspal , Qian Dang , Ajantha Ramineni

Highly Available Queue-oriented Speculative Transaction Processing

Deterministic database systems have received increasing attention from the database research community in recent years. Despite their current limitations, recent proposals of distributed deterministic transaction processing systems…

Databases · Computer Science 2021-07-26 Thamir M. Qadah , Mohammad Sadoghi

State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

With the slowdown of Moore's law, CPU-oriented packet processing in software will be significantly outpaced by emerging line speeds of network interface cards (NICs). Single-core packet-processing throughput has saturated. We consider the…

Networking and Internet Architecture · Computer Science 2024-06-18 Qiongwen Xu , Sebastiano Miano , Xiangyu Gao , Tao Wang , Adithya Murugadass , Songyuan Zhang , Anirudh Sivaraman , Gianni Antichi , Srinivas Narayana

Delayed Asynchronous Iterative Graph Algorithms

Iterative graph algorithms often compute intermediate values and update them as computation progresses. Updated output values are used as inputs for computations in current or subsequent iterations; hence the number of iterations required…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-05 Mark P. Blanco , Scott McMillan , Tze Meng Low

A Unified Performance-Cost Landscape of Parallel p-bit Ising Machines Based on Update Dynamics

Parallel p-bit Ising machines are a promising platform for fast and energy-efficient combinatorial optimization, but their scalability depends on update synchronization, hardware delay, and architectural cost. In this work, we establish a…

Emerging Technologies · Computer Science 2026-04-03 Naoya Onizawa , Takahiro Hanyu

A new efficient explicit Deferred Correction framework: analysis and applications to hyperbolic PDEs and adaptivity

The Deferred Correction (DeC) is an iterative procedure, characterized by increasing accuracy at each iteration, which can be used to design numerical methods for systems of ODEs. The main advantage of such framework is the automatic way of…

Numerical Analysis · Mathematics 2023-11-09 Lorenzo Micalizzi , Davide Torlo

Age-Memory Trade-off in Read-Copy-Update

In the realm of shared memory systems, the challenge of reader-writer synchronization is closely coupled with the potential for readers to access outdated updates. Read-Copy-Update (RCU) is a synchronization primitive that allows for…

Information Theory · Computer Science 2024-02-13 Vishakha Ramani , Jiachen Chen , Roy D. Yates

Minimizing Content Staleness in Dynamo-Style Replicated Storage Systems

Consistency in data storage systems requires any read operation to return the most recent written version of the content. In replicated storage systems, consistency comes at the price of delay due to large-scale write and read operations.…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-04 Jing Zhong , Roy D. Yates , Emina Soljanin

TSUE: A Two-Stage Data Update Method for an Erasure Coded Cluster File System

Compared to replication-based storage systems, erasure-coded storage incurs significantly higher overhead during data updates. To address this issue, various parity logging methods have been pro- posed. Nevertheless, due to the long update…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-25 Zheng Wei , Jing Xing , Yida Gu , Wenjing Huang , Dong Dai , Guangming Tan , Dingwen Tao

More Iterations per Second, Same Quality -- Why Asynchronous Algorithms may Drastically Outperform Traditional Ones

In this paper, we consider the convergence of a very general asynchronous-parallel algorithm called ARock, that takes many well-known asynchronous algorithms as special cases (gradient descent, proximal gradient, Douglas Rachford, ADMM,…

Optimization and Control · Mathematics 2017-08-28 Robert Hannah , Wotao Yin

DeepPCR: Parallelizing Sequential Operations in Neural Networks

Parallelization techniques have become ubiquitous for accelerating inference and training of deep neural networks. Despite this, several operations are still performed in a sequential manner. For instance, the forward and backward passes…

Machine Learning · Computer Science 2023-10-30 Federico Danieli , Miguel Sarabia , Xavier Suau , Pau Rodríguez , Luca Zappella