Parity Queries for Binary Classification
Abstract
Consider a query-based data acquisition problem that aims to recover the values of binary variables from parity (XOR) measurements of chosen subsets of the variables. Assume the response model where only a randomly selected subset of the measurements is received. We propose a method for designing a sequence of queries so that the variables can be identified with high probability using as few () measurements as possible. We define the query difficulty as the average size of the query subsets and the sample complexity as the minimum number of measurements required to attain a given recovery accuracy. We obtain fundamental trade-offs between recovery accuracy, query difficulty, and sample complexity. In particular, the necessary and sufficient sample complexity required for recovering all variables with high probability is and the sample complexity for recovering a fixed proportion of the variables for is , where .
Cite
@article{arxiv.1809.00901,
title = {Parity Queries for Binary Classification},
author = {Hye Won Chung and Ji Oon Lee and Doyeon Kim and Alfred O. Hero},
journal= {arXiv preprint arXiv:1809.00901},
year = {2019}
}
Comments
26 pages, 4 figures