Related papers: Barrier Enabled IO Stack for Flash Storage

High Throughput Push Based Storage Manager

The storage manager, as a key component of the database system, is responsible for organizing, reading, and delivering data to the execution engine for processing. According to the data serving mechanism, existing storage managers are…

Databases · Computer Science 2019-05-20 Ye Zhu

LBICA: A Load Balancer for I/O Cache Architectures

In recent years, enterprise Solid-State Drives (SSDs) have been used in the caching layer of high-performance servers to close the growing performance gap between processing units and the storage subsystem. SSD-based I/O caching is typically not…

Performance · Computer Science 2018-12-21 Saba Ahmadian, Reza Salkhordeh, Hossein Asadi

Memory Bounds for Concurrent Bounded Queues

Concurrent data structures often require additional memory for handling synchronization issues in addition to memory for storing elements. Depending on the amount of this additional memory, implementations can be more or less…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-17 Vitaly Aksenov, Nikita Koval, Petr Kuznetsov, Anton Paramonov

A Smart Background Scheduler for Storage Systems

In today's enterprise storage systems, supported data services such as snapshot delete or drive rebuild can cause tremendous performance interference if executed inline along with heavy foreground IO, often leading to missing SLOs (Service…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-03 Maher Kachmar, David Kaeli

FB: A Flexible Buffer Management Scheme for Data Center Switches

Today, network devices share buffer across priority queues to avoid drops during transient congestion. While cost-effective most of the time, this sharing can cause undesired interference among seemingly independent traffic. As a result,…

Networking and Internet Architecture · Computer Science 2021-05-25 Maria Apostolaki, Vamsi Addanki, Manya Ghobadi, Laurent Vanbever

Differentiating Network Flows for Priority-Aware Scheduling of Incoming Packets in Real-Time IoT Systems

When IP-packet processing is unconditionally carried out on behalf of an operating system kernel thread, processing systems can experience overload in high incoming traffic scenarios. This is especially worrying for embedded real-time…

Networking and Internet Architecture · Computer Science 2022-04-20 Christoph Blumschein, Ilja Behnke, Lauritz Thamsen, Odej Kao

BPF for storage: an exokernel-inspired approach

The overhead of the kernel storage path accounts for half of the access latency for new NVMe storage devices. We explore using BPF to reduce this overhead, by injecting user-defined functions deep in the kernel's I/O processing stack. When…

Operating Systems · Computer Science 2021-02-26 Yu Jian Wu, Hongyi Wang, Yuhong Zhong, Asaf Cidon, Ryan Stutsman, Amy Tai, Junfeng Yang

Efficient Constrained Codes That Enable Page Separation in Modern Flash Memories

The pivotal storage density win achieved by solid-state devices over magnetic devices recently is a result of multiple innovations in physics, architecture, and signal processing. Constrained coding is used in Flash devices to increase…

Information Theory · Computer Science 2023-11-15 Ahmed Hareedy, Simeng Zheng, Paul Siegel, Robert Calderbank

Enlightening Flash Storage to Stream Writes by Objects

For a write request, today's flash storage cannot distinguish the logical object it comes from. In such object-oblivious flash devices, concurrent writes from different objects are simply packed in their arrival order to flash memory blocks;…

Databases · Computer Science 2022-01-13 Jong-Hyeok Park, Gihwan Oh, Sang-Won Lee

LQoCo: Learning to Optimize Cache Capacity Overloading in Storage Systems

Cache plays an important role in maintaining high and stable performance (i.e., high throughput, low tail latency, and low throughput jitter) in storage systems. Existing rule-based cache management methods, coupled with engineers' manual…

Hardware Architecture · Computer Science 2022-03-28 Ji Zhang, Xijun Li, Xiyao Zhou, Mingxuan Yuan, Zhuo Cheng, Keji Huang, Yifan Li

Design of a Timer Queue Supporting Dynamic Update Operations

Large-scale timers are ubiquitous in network processing, including flow table entry expiration control in software defined network (SDN) switches, MAC address aging in Ethernet bridges, and retransmission timeout management in TCP/IP…

Networking and Internet Architecture · Computer Science 2025-08-15 Zekun Wang, Binghao Yue, Weitao Pan, Jiangyi Shi, Yue Hao

Cache Blocking for Flux Reconstruction: Extension to Navier-Stokes Equations and Anti-aliasing

In this article, cache blocking is implemented for the Navier Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary…

Fluid Dynamics · Physics 2024-11-22 Semih Akkurt, Freddie Witherden, Peter Vincent

WLFC: Write Less in Flash-based Cache

Flash-based disk caches, for example Bcache and Flashcache, have gained tremendous popularity in industry in the last decade because of their low energy consumption, non-volatile nature, and high I/O speed. But these cache systems have a worse…

Operating Systems · Computer Science 2023-11-16 Chaos Dong, Fang Wang, Jianshun Zhang

uFLIP: Understanding Flash IO Patterns

Does the advent of flash devices constitute a radical change for secondary storage? How should database systems adapt to this new form of secondary storage? Before we can answer these questions, we need to fully understand the performance…

Performance · Computer Science 2009-09-15 Luc Bouganim, Björn Jónsson, Philippe Bonnet

Don't Thrash: How to Cache Your Hash on Flash

This paper presents new alternatives to the well-known Bloom filter data structure. The Bloom filter, a compact data structure supporting set insertion and membership queries, has found wide application in databases, storage systems, and…

Databases · Computer Science 2012-08-02 Michael A. Bender, Martin Farach-Colton, Rob Johnson, Russell Kraner, Bradley C. Kuszmaul, Dzejla Medjedovic, Pablo Montes, Pradeep Shetty, Richard P. Spillane, Erez Zadok

Flash-based Extended Cache for Higher Throughput and Faster Recovery

Considering the current price gap between disk and flash memory drives, for applications dealing with large scale data, it will be economically more sensible to use flash memory drives to supplement disk drives rather than to replace them.…

Databases · Computer Science 2012-08-02 Woon-Hak Kang, Sang-Won Lee, Bongki Moon

Fast Shared-Memory Barrier Synchronization for a 1024-Cores RISC-V Many-Core Cluster

Synchronization is likely the most critical performance killer in shared-memory parallel programs. With the rise of multi-core and many-core processors, the relative impact on performance and energy overhead of synchronization is bound to…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-21 Marco Bertuletti, Samuel Riedel, Yichao Zhang, Alessandro Vanelli-Coralli, Luca Benini

When to use 3D Die-Stacked Memory for Bandwidth-Constrained Big Data Workloads

Response time requirements for big data processing systems are shrinking. To meet this strict response time requirement, many big data systems store all or most of their data in main memory to reduce the access latency. Main memory…

Hardware Architecture · Computer Science 2016-08-29 Jason Lowe-Power, Mark D. Hill, David A. Wood

Unfair by design: eBPF-based scheduling of mixed database workloads

Modern database systems increasingly co-schedule time-sensitive and background tasks. In such mixed workloads, background tasks should ideally utilize only spare CPU capacity without interfering with latency-critical requests. While some…

Databases · Computer Science 2026-05-05 Carl-Elliott Bilodeau-Savaria, Jan Kristof Nidzwetzki, Stefanie Scherzinger, Bettina Kemme

Plan-based Job Scheduling for Supercomputers with Shared Burst Buffers

The ever-increasing gap between compute and I/O performance in HPC platforms, together with the development of novel NVMe storage devices (NVRAM), led to the emergence of the burst buffer concept - an intermediate persistent storage layer…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-11 Jan Kopanski, Krzysztof Rzadca