Related papers: RStore: A Distributed Multi-version Document Store

Harnessing Correlations in Distributed Erasure Coded Key-Value Stores

Motivated by applications of distributed storage systems to cloud-based key-value stores, the multi-version coding problem has been recently formulated to efficiently store frequently updated data in asynchronous decentralized storage…

Information Theory · Computer Science 2019-03-12 Ramy E. Ali , Viveck Cadambe

Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff

The relative ease of collaborative data science and analysis has led to a proliferation of many thousands or millions of $versions$ of the same datasets in many scientific and commercial domains, acquired or constructed at various stages of…

Databases · Computer Science 2015-05-21 Souvik Bhattacherjee , Amit Chavan , Silu Huang , Amol Deshpande , Aditya Parameswaran

Harnessing Correlations in Distributed Erasure-Coded Key-Value Stores

Motivated by applications of distributed storage systems to key-value stores, the multi-version coding problem was formulated to efficiently store frequently updated data in asynchronous decentralized storage systems. Inspired by…

Information Theory · Computer Science 2019-03-14 Ramy E. Ali , Viveck R. Cadambe

Distributed Versioned Object Storage -- Alternatives at the OSD layer (Poster Extended Abstract)

The ability to store multiple versions of a data item is a powerful primitive that has had a wide variety of uses: relational databases, transactional memory, version control systems, to name a few. However, each implementation uses a very…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-06-17 Ivo Jimenez , Carlos Maltzahn , Jay Lofstead

UStore: A Distributed Storage With Rich Semantics

Today's storage systems expose abstractions which are either too low-level (e.g., key-value store, raw-block store) that they require developers to re-invent the wheels, or too high-level (e.g., relational databases, Git) that they lack…

Databases · Computer Science 2017-02-10 Anh Dinh , Ji Wang , Sheng Wang , Gang Chen , Wei-Ngan Chin , Qian Lin , Beng Chin Ooi , Pingcheng Ruan , Kian-Lee Tan , Zhongle Xie , Hao Zhang , Meihui Zhang

VStore: A Data Store for Analytics on Large Videos

We present VStore, a data store for supporting fast, resource-efficient analytics over large archival videos. VStore manages video ingestion, storage, retrieval, and consumption. It controls video formats along the video data path. It is…

Databases · Computer Science 2019-02-19 Tiantu Xu , Luis Materon Botelho , Felix Xiaozhu Lin

RapidStore: An Efficient Dynamic Graph Storage System for Concurrent Queries

Dynamic graph storage systems are essential for real-time applications such as social networks and recommendation, where graph data continuously evolves. However, they face significant challenges in efficiently handling concurrent read and…

Databases · Computer Science 2025-07-02 Chiyu Hao , Jixian Su , Shixuan Sun , Hao Zhang , Sen Gao , Jianwen Zhao , Chenyi Zhang , Jieru Zhao , Chen Chen , Minyi Guo

DynoStore: A wide-area distribution system for the management of data over heterogeneous storage

Data distribution across different facilities offers benefits such as enhanced resource utilization, increased resilience through replication, and improved performance by processing data near its source. However, managing such data is…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-02 Dante D. Sanchez-Gallegos , J. L. Gonzalez-Compean , Maxime Gonthier , Valerie Hayot-Sasson , J. Gregory Pauloski , Haochen Pan , Kyle Chard , Jesus Carretero , Ian Foster

BlobSeer: How to Enable Efficient Versioning for Large Object Storage under Heavy Access Concurrency

To accommodate the needs of large-scale distributed P2P systems, scalable data management strategies are required, allowing applications to efficiently cope with continuously growing, highly dis tributed data. This paper addresses the…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-09-30 Bogdan Nicolae , Gabriel Antoniu , Luc Bougé

Multi-Version Coding - An Information Theoretic Perspective of Consistent Distributed Storage

In applications of distributed storage systems to distributed computing and implementation of key- value stores, the following property, usually referred to as consistency in computer science and engineering, is an important requirement: as…

Information Theory · Computer Science 2015-10-20 Zhiying Wang , Viveck R. Cadambe

Reliable Data Storage in Distributed Hash Tables

Distributed Hash Tables offer a resilient lookup service for unstable distributed environments. Resilient data storage, however, requires additional data replication and maintenance algorithms. These algorithms can have an impact on both…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Matthew Leslie

Fundamental Limits of Erasure-Coded Key-Value Stores with Side Information

In applications of distributed storage systems to modern key-value stores, the stored data is highly dynamic due to frequent updates. The multi-version coding problem was formulated to study the cost of storing dynamic data in distributed…

Information Theory · Computer Science 2019-05-27 Ramy E. Ali , Viveck Cadambe , Jaime Llorca , Antonia Tulino

Distributed Caching for Complex Querying of Raw Arrays

As applications continue to generate multi-dimensional data at exponentially increasing rates, fast analytics to extract meaningful results is becoming extremely important. The database community has developed array databases that alleviate…

Databases · Computer Science 2018-03-19 Weijie Zhao , Florin Rusu , Bin Dong , Kesheng Wu , Anna Y. Q. Ho , Peter Nugent

ReStore: In-Memory REplicated STORagE for Rapid Recovery in Fault-Tolerant Algorithms

Fault-tolerant distributed applications require mechanisms to recover data lost via a process failure. On modern cluster systems it is typically impractical to request replacement resources after such a failure. Therefore, applications have…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-26 Lukas Hübner , Demian Hespe , Peter Sanders , Alexandros Stamatakis

PHD-Store: An Adaptive SPARQL Engine with Dynamic Partitioning for Distributed RDF Repositories

Many repositories utilize the versatile RDF model to publish data. Repositories are typically distributed and geographically remote, but data are interconnected (e.g., the Semantic Web) and queried globally by a language such as SPARQL. Due…

Databases · Computer Science 2014-05-21 Razen Al-Harbi , Yasser Ebrahim , Panos Kalnis

Robust Dynamic Coded Distributed Storage with Partially Storage Constrained Servers

We consider the problem of Robust Dynamic Coded Distributed Storage (RDCDS) with partially storage constrained servers where the goal is to enable robust (resilient to server dropouts) and efficient (as measured by the communication costs)…

Information Theory · Computer Science 2025-11-12 Chen Zhao , Haobo Jia , Zhuqing Jia

Secrecy and Accessibility in Distributed Storage

A distributed storage system (DSS) needs to be efficiently accessible and repairable. Recently, considerable effort has been made towards the latter, while the former is usually not considered, since a trivial solution exists in the form of…

Information Theory · Computer Science 2020-01-16 Lukas Holzbaur , Stanislav Kruglik , Alexey Frolov , Antonia Wachter-Zeh

SynchroStore: A Cost-Based Fine-Grained Incremental Compaction for Hybrid Workloads

This study proposes a novel storage engine, SynchroStore, designed to address the inefficiency of update operations in columnar storage systems based on Log-Structured Merge Trees (LSM-Trees) under hybrid workload scenarios. While columnar…

Databases · Computer Science 2025-03-25 Yinan Zhang , Huiqi Hu , Xuan Zhou

Quorum Sensing for Regenerating Codes in Distributed Storage

Distributed storage systems with replication are well known for storing large amount of data. A large number of replication is done in order to provide reliability. This makes the system expensive. Various methods have been proposed over…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-10-01 Mit Sheth , Krishna Gopal Benerjee , Manish K. Gupta

An Overview of Codes Tailor-made for Better Repairability in Networked Distributed Storage Systems

The continuously increasing amount of digital data generated by today's society asks for better storage solutions. This survey looks at a new generation of coding techniques designed specifically for the needs of distributed networked…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-01-31 Anwitaman Datta , Frederique Oggier