English

Accelerating Large-Scale Inference with Anisotropic Vector Quantization

Machine Learning 2020-12-08 v5 Machine Learning

Abstract

Quantization based techniques are the current state-of-the-art for scaling maximum inner product search to massive databases. Traditional approaches to quantization aim to minimize the reconstruction error of the database points. Based on the observation that for a given query, the database points that have the largest inner products are more relevant, we develop a family of anisotropic quantization loss functions. Under natural statistical assumptions, we show that quantization with these loss functions leads to a new variant of vector quantization that more greatly penalizes the parallel component of a datapoint's residual relative to its orthogonal component. The proposed approach achieves state-of-the-art results on the public benchmarks available at \url{ann-benchmarks.com}.

Keywords

Cite

@article{arxiv.1908.10396,
  title  = {Accelerating Large-Scale Inference with Anisotropic Vector Quantization},
  author = {Ruiqi Guo and Philip Sun and Erik Lindgren and Quan Geng and David Simcha and Felix Chern and Sanjiv Kumar},
  journal= {arXiv preprint arXiv:1908.10396},
  year   = {2020}
}