Related papers: Binary Embedding: Fundamental Limits and Fast Algo…

Near-Optimal Sample Complexity Bounds for Circulant Binary Embedding

Binary embedding is the problem of mapping points from a high-dimensional space to a Hamming cube in lower dimension while preserving pairwise distances. An efficient way to accomplish this is to make use of fast embedding techniques…

Data Structures and Algorithms · Computer Science 2016-03-15 Samet Oymak

Faster Binary Embeddings for Preserving Euclidean Distances

We propose a fast, distance-preserving, binary embedding algorithm to transform a high-dimensional dataset $\mathcal{T}\subseteq\mathbb{R}^n$ into binary sequences in the cube $\{\pm 1\}^m$. When $\mathcal{T}$ consists of well-spread (i.e.,…

Information Theory · Computer Science 2021-03-11 Jinjie Zhang, Rayan Saab

Fast Binary Embedding via Circulant Downsampled Matrix -- A Data-Independent Approach

Binary embedding of high-dimensional data aims to produce low-dimensional binary codes while preserving discriminative power. State-of-the-art methods often suffer from high computation and storage costs. We present a simple and fast…

Information Theory · Computer Science 2016-01-26 Sung-Hsien Hsieh, Chun-Shien Lu, Soo-Chang Pei

Fast binary embeddings, and quantized compressed sensing with structured matrices

This paper deals with two related problems, namely distance-preserving binary embeddings and quantization for compressed sensing. First, we propose fast methods to replace points from a subset $\mathcal{X} \subset \mathbb{R}^n$, associated…

Information Theory · Computer Science 2018-07-19 Thang Huynh, Rayan Saab

Near-Optimal Bounds for Binary Embeddings of Arbitrary Sets

We study embedding a subset $K$ of the unit sphere to the Hamming cube $\{-1,+1\}^m$. We characterize the tradeoff between distortion and sample complexity $m$ in terms of the Gaussian width $\omega(K)$ of the set. For subspaces and several…

Machine Learning · Computer Science 2015-12-15 Samet Oymak, Ben Recht

Near-lossless Binarization of Word Embeddings

Word embeddings are commonly used as a starting point in many NLP models to achieve state-of-the-art performances. However, with a large vocabulary and many dimensions, these floating-point representations are expensive both in terms of…

Computation and Language · Computer Science 2020-01-23 Julien Tissier, Christophe Gravier, Amaury Habrard

Binary embeddings with structured hashed projections

We consider the hashing mechanism for constructing binary embeddings, that involves pseudo-random projections followed by nonlinear (sign function) mappings. The pseudo-random projection is described by a matrix, where not all entries are…

Machine Learning · Computer Science 2016-07-04 Anna Choromanska, Krzysztof Choromanski, Mariusz Bojarski, Tony Jebara, Sanjiv Kumar, Yann LeCun

Large-margin Learning of Compact Binary Image Encodings

The use of high-dimensional features has become a normal practice in many computer vision applications. The large dimension of these features is a limiting factor upon the number of data points which may be effectively stored and processed,…

Computer Vision and Pattern Recognition · Computer Science 2015-06-18 Sakrapee Paisitkriangkrai, Chunhua Shen, Anton van den Hengel

Global Hashing System for Fast Image Search

Hashing methods have been widely investigated for fast approximate nearest neighbor searching in large data sets. Most existing methods use binary vectors in lower dimensional spaces to represent data points that are usually real vectors of…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Dayong Tian, Dacheng Tao

Projection Bank: From High-dimensional Data to Medium-length Binary Codes

Recently, very high-dimensional feature representations, e.g., Fisher Vector, have achieved excellent performance for visual recognition and retrieval. However, these lengthy representations always cause extremely heavy computational and…

Computer Vision and Pattern Recognition · Computer Science 2015-09-17 Li Liu, Mengyang Yu, Ling Shao

Fast metric embedding into the Hamming cube

We consider the problem of embedding a subset of $\mathbb{R}^n$ into a low-dimensional Hamming cube in an almost isometric way. We construct a simple, data-oblivious, and computationally efficient map that achieves this task with high…

Probability · Mathematics 2022-09-07 Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

Metric Embeddings Beyond Bi-Lipschitz Distortion via Sherali-Adams

Metric embeddings are a widely used method in algorithm design, where generally a "complex" metric is embedded into a simpler, lower-dimensional one. Historically, the theoretical computer science community has focused on bi-Lipschitz…

Data Structures and Algorithms · Computer Science 2025-05-19 Ainesh Bakshi, Vincent Cohen-Addad, Samuel B. Hopkins, Rajesh Jayaram, Silvio Lattanzi

Fast Large-Scale Discrete Optimization Based on Principal Coordinate Descent

Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are…

Optimization and Control · Mathematics 2021-05-18 Huan Xiong, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Positive semi-definite embedding for dimensionality reduction and out-of-sample extensions

In machine learning or statistics, it is often desirable to reduce the dimensionality of a sample of data points in a high dimensional space $\mathbb{R}^d$. This paper introduces a dimensionality reduction method where the embedding…

Machine Learning · Computer Science 2021-11-30 Michaël Fanuel, Antoine Aspeel, Jean-Charles Delvenne, Johan A. K. Suykens

Circulant Binary Embedding

Binary embedding of high-dimensional data requires long codes to preserve the discriminative power of the input space. Traditional binary coding methods often suffer from very high computation and storage costs in such a scenario. To…

Machine Learning · Statistics 2014-05-14 Felix X. Yu, Sanjiv Kumar, Yunchao Gong, Shih-Fu Chang

Implicit Sparse Code Hashing

We address the problem of converting large-scale high-dimensional image data into binary codes so that approximate nearest-neighbor search over them can be efficiently performed. Different from most of the existing unsupervised approaches…

Computer Vision and Pattern Recognition · Computer Science 2015-12-02 Tsung-Yu Lin, Tsung-Wei Ke, Tyng-Luh Liu

Hashing with Mutual Information

Binary vector embeddings enable fast nearest neighbor retrieval in large databases of high-dimensional objects, and play an important role in many practical applications, such as image and video retrieval. We study the problem of learning…

Computer Vision and Pattern Recognition · Computer Science 2018-06-26 Fatih Cakir, Kun He, Sarah Adel Bargal, Stan Sclaroff

Near-Isometric Binary Hashing for Large-scale Datasets

We develop a scalable algorithm to learn binary hash codes for indexing large-scale datasets. Near-isometric binary hashing (NIBH) is a data-dependent hashing scheme that quantizes the output of a learned low-dimensional embedding to obtain…

Data Structures and Algorithms · Computer Science 2016-03-15 Amirali Aghazadeh, Andrew Lan, Anshumali Shrivastava, Richard Baraniuk

Supervised Hashing Using Graph Cuts and Boosted Decision Trees

Embedding image features into a binary Hamming space can improve both the speed and accuracy of large-scale query-by-example image retrieval systems. Supervised hashing aims to map the original features to compact binary codes in a manner…

Machine Learning · Computer Science 2016-11-17 Guosheng Lin, Chunhua Shen, Anton van den Hengel

On Binary Embedding using Circulant Matrices

Binary embeddings provide efficient and powerful ways to perform operations on large-scale data. However, binary embedding typically requires long codes in order to preserve the discriminative power of the input space. Thus binary coding…

Data Structures and Algorithms · Computer Science 2015-12-08 Felix X. Yu, Aditya Bhaskara, Sanjiv Kumar, Yunchao Gong, Shih-Fu Chang