English

Verifying Concurrent Multicopy Search Structures

Programming Languages 2021-09-14 v1 Logic in Computer Science

Abstract

Multicopy search structures such as log-structured merge (LSM) trees are optimized for high insert/update/delete (collectively known as upsert) performance. In such data structures, an upsert on key kk, which adds (k,v)(k,v) where vv can be a value or a tombstone, is added to the root node even if kk is already present in other nodes. Thus there may be multiple copies of kk in the search structure. A search on kk aims to return the value associated with the most recent upsert. We present a general framework for verifying linearizability of concurrent multicopy search structures that abstracts from the underlying representation of the data structure in memory, enabling proof-reuse across diverse implementations. Based on our framework, we propose template algorithms for a) LSM structures forming arbitrary directed acyclic graphs and b) differential file structures, and formally verify these templates in the concurrent separation logic Iris. We also instantiate the LSM template to obtain the first verified concurrent in-memory LSM tree implementation.

Keywords

Cite

@article{arxiv.2109.05631,
  title  = {Verifying Concurrent Multicopy Search Structures},
  author = {Nisarg Patel and Siddharth Krishna and Dennis Shasha and Thomas Wies},
  journal= {arXiv preprint arXiv:2109.05631},
  year   = {2021}
}

Comments

Extended version of an article to appear in OOPSLA'21

R2 v1 2026-06-24T05:53:59.473Z