Coding sets with asymmetric information
Abstract
We study the following one-way asymmetric transmission problem, also a variant of model-based compressed sensing: a resource-limited encoder has to report a small set from a universe of items to a more powerful decoder (server). The distinguishing feature is asymmetric information: the subset is comprised of i.i.d. samples from a prior distribution , and is only known to the decoder. The goal for the encoder is to encode obliviously, while achieving the information-theoretic bound of , i.e., the Shannon entropy bound. We first show that any such compression scheme must be {\em randomized}, if it gains non-trivially from the prior . This stands in contrast to the symmetric case (when both the encoder and decoder know ), where the Huffman code provides a near-optimal deterministic solution. On the other hand, a rather simple argument shows that, when , a random linear code achieves near-optimal communication rate of about bits. Alas, the resulting scheme has prohibitive decoding time: about . Our main result is a computationally efficient and linear coding scheme, which achieves an -competitive communication ratio compared to the optimal benchmark, and runs in time. Our "multi-level" coding scheme uses a combination of hashing and syndrome-decoding of Reed-Solomon codes, and relies on viewing the (unknown) prior as a rather small convex combination of uniform ("flat") distributions.
Cite
@article{arxiv.1707.04875,
title = {Coding sets with asymmetric information},
author = {Alexandr Andoni and Javad Ghaderi and Daniel Hsu and Dan Rubenstein and Omri Weinstein},
journal= {arXiv preprint arXiv:1707.04875},
year = {2018}
}