English

Multiset Combinatorial Batch Codes

Discrete Mathematics 2017-01-11 v1 Combinatorics

Abstract

Batch codes, first introduced by Ishai, Kushilevitz, Ostrovsky, and Sahai, mimic a distributed storage of a set of nn data items on mm servers, in such a way that any batch of kk data items can be retrieved by reading at most some tt symbols from each server. Combinatorial batch codes, are replication-based batch codes in which each server stores a subset of the data items. In this paper, we propose a generalization of combinatorial batch codes, called multiset combinatorial batch codes (MCBC), in which nn data items are stored in mm servers, such that any multiset request of kk items, where any item is requested at most rr times, can be retrieved by reading at most tt items from each server. The setup of this new family of codes is motivated by recent work on codes which enable high availability and parallel reads in distributed storage systems. The main problem under this paradigm is to minimize the number of items stored in the servers, given the values of n,m,k,r,tn,m,k,r,t, which is denoted by N(n,k,m,t;r)N(n,k,m,t;r). We first give a necessary and sufficient condition for the existence of MCBCs. Then, we present several bounds on N(n,k,m,t;r)N(n,k,m,t;r) and constructions of MCBCs. In particular, we determine the value of N(n,k,m,1;r)N(n,k,m,1;r) for any nk1r(mk1)(mk+1)A(m,4,k2)n\geq \left\lfloor\frac{k-1}{r}\right\rfloor{m\choose k-1}-(m-k+1)A(m,4,k-2), where A(m,4,k2)A(m,4,k-2) is the maximum size of a binary constant weight code of length mm, distance four and weight k2k-2. We also determine the exact value of N(n,k,m,1;r)N(n,k,m,1;r) when r{k,k1}r\in\{k,k-1\} or k=mk=m.

Cite

@article{arxiv.1701.02708,
  title  = {Multiset Combinatorial Batch Codes},
  author = {Hui Zhang and Eitan Yaakobi and Natalia Silberstein},
  journal= {arXiv preprint arXiv:1701.02708},
  year   = {2017}
}