Related papers: On Binary Embedding using Circulant Matrices

Circulant Binary Embedding

Binary embedding of high-dimensional data requires long codes to preserve the discriminative power of the input space. Traditional binary coding methods often suffer from very high computation and storage costs in such a scenario. To…

Machine Learning · Statistics 2014-05-14 Felix X. Yu , Sanjiv Kumar , Yunchao Gong , Shih-Fu Chang

Fast Binary Embedding via Circulant Downsampled Matrix -- A Data-Independent Approach

Binary embedding of high-dimensional data aims to produce low-dimensional binary codes while preserving discriminative power. State-of-the-art methods often suffer from high computation and storage costs. We present a simple and fast…

Information Theory · Computer Science 2016-01-26 Sung-Hsien Hsieh , Chun-Shien Lu , Soo-Chang Pei

BinConv: A Neural Architecture for Ordinal Encoding in Time-Series Forecasting

Recent work in time series forecasting has explored reformulating regression as a classification task. By discretizing the continuous target space into bins and predicting over a fixed set of classes, these approaches benefit from more…

Machine Learning · Computer Science 2025-08-28 Andrei Chernov , Vitaliy Pozdnyakov , Ilya Makarov

Search Efficient Binary Network Embedding

Traditional network embedding primarily focuses on learning a continuous vector representation for each node, preserving network structure and/or node content information, such that off-the-shelf machine learning algorithms can be easily…

Social and Information Networks · Computer Science 2023-01-02 Daokun Zhang , Jie Yin , Xingquan Zhu , Chengqi Zhang

Near-Optimal Sample Complexity Bounds for Circulant Binary Embedding

Binary embedding is the problem of mapping points from a high-dimensional space to a Hamming cube in lower dimension while preserving pairwise distances. An efficient way to accomplish this is to make use of fast embedding techniques…

Data Structures and Algorithms · Computer Science 2016-03-15 Samet Oymak

End-to-end Binary Representation Learning via Direct Binary Embedding

Learning binary representation is essential to large-scale computer vision tasks. Most existing algorithms require a separate quantization constraint to learn effective hashing functions. In this work, we present Direct Binary Embedding…

Computer Vision and Pattern Recognition · Computer Science 2017-06-06 Liu Liu , Alireza Rahimpour , Ali Taalimi , Hairong Qi

Faster Binary Embeddings for Preserving Euclidean Distances

We propose a fast, distance-preserving, binary embedding algorithm to transform a high-dimensional dataset $\mathcal{T}\subseteq\mathbb{R}^n$ into binary sequences in the cube $\{\pm 1\}^m$. When $\mathcal{T}$ consists of well-spread (i.e.,…

Information Theory · Computer Science 2021-03-11 Jinjie Zhang , Rayan Saab

Projection Bank: From High-dimensional Data to Medium-length Binary Codes

Recently, very high-dimensional feature representations, e.g., Fisher Vector, have achieved excellent performance for visual recognition and retrieval. However, these lengthy representations always cause extremely heavy computational and…

Computer Vision and Pattern Recognition · Computer Science 2015-09-17 Li Liu , Mengyang Yu , Ling Shao

Efficient data hashing with structured binary embeddings

We present here new mechanisms for hashing data via binary embeddings. Contrary to most of the techniques presented before, the embedding matrix of our mechanism is highly structured. That enables us to perform hashing more efficiently and…

Data Structures and Algorithms · Computer Science 2015-05-14 Krzysztof Choromanski

Coded Computing via Binary Linear Codes: Designs and Performance Limits

We consider the problem of coded distributed computing where a large linear computational job, such as a matrix multiplication, is divided into $k$ smaller tasks, encoded using an $(n,k)$ linear code, and performed over $n$ distributed…

Information Theory · Computer Science 2021-10-06 Mahdi Soleymani , Mohammad Vahid Jamali , Hessam Mahdavifar

Beyond Embeddings: Interpretable Feature Extraction for Binary Code Similarity

Binary code similarity detection is a core task in reverse engineering. It supports malware analysis and vulnerability discovery by identifying semantically similar code in different contexts. Modern methods have progressed from manually…

Artificial Intelligence · Computer Science 2025-09-30 Charles E. Gagnon , Steven H. H. Ding , Philippe Charland , Benjamin C. M. Fung

Binary Embedding: Fundamental Limits and Fast Algorithm

Binary embedding is a nonlinear dimension reduction methodology where high dimensional data are embedded into the Hamming cube while preserving the structure of the original space. Specifically, for an arbitrary $N$ distinct points in…

Data Structures and Algorithms · Computer Science 2019-01-24 Xinyang Yi , Constantine Caramanis , Eric Price

Learning Binary Autoencoder-Based Codes with Progressive Training

Error correcting codes play a central role in digital communication, ensuring that transmitted information can be accurately reconstructed despite channel impairments. Recently, autoencoder (AE) based approaches have gained attention for…

Information Theory · Computer Science 2025-11-13 Vukan Ninkovic , Dejan Vukobratovic

Fast binary embeddings with Gaussian circulant matrices: improved bounds

We consider the problem of encoding a finite set of vectors into a small number of bits while approximately retaining information on the angular distances between the vectors. By deriving improved variance bounds related to binary Gaussian…

Information Theory · Computer Science 2017-12-27 Sjoerd Dirksen , Alexander Stollenwerk

Binary embeddings with structured hashed projections

We consider the hashing mechanism for constructing binary embeddings, that involves pseudo-random projections followed by nonlinear (sign function) mappings. The pseudo-random projection is described by a matrix, where not all entries are…

Machine Learning · Computer Science 2016-07-04 Anna Choromanska , Krzysztof Choromanski , Mariusz Bojarski , Tony Jebara , Sanjiv Kumar , Yann LeCun

Recurrent Binary Embedding for GPU-Enabled Exhaustive Retrieval from Billion-Scale Semantic Vectors

Rapid advances in GPU hardware and multiple areas of Deep Learning open up a new opportunity for billion-scale information retrieval with exhaustive search. Building on top of the powerful concept of semantic learning, this paper proposes a…

Information Retrieval · Computer Science 2018-02-20 Ying Shan , Jian Jiao , Jie Zhu , JC Mao

Clustering the Sketch: A Novel Approach to Embedding Table Compression

Embedding tables are used by machine learning systems to work with categorical features. In modern Recommendation Systems, these tables can be very large, necessitating the development of new methods for fitting them in memory, even during…

Machine Learning · Computer Science 2023-10-24 Henry Ling-Hei Tsang , Thomas Dybdahl Ahle

SAFE: Self-Attentive Function Embeddings for Binary Similarity

The binary similarity problem consists in determining if two functions are similar by only considering their compiled form. Advanced techniques for binary similarity recently gained momentum as they can be applied in several fields, such as…

Cryptography and Security · Computer Science 2019-12-20 Luca Massarelli , Giuseppe Antonio Di Luna , Fabio Petroni , Leonardo Querzoni , Roberto Baldoni

Binary Code based Hash Embedding for Web-scale Applications

Nowadays, deep learning models are widely adopted in web-scale applications such as recommender systems, and online advertising. In these applications, embedding learning of categorical features is crucial to the success of deep learning…

Information Retrieval · Computer Science 2021-09-07 Bencheng Yan , Pengjie Wang , Jinquan Liu , Wei Lin , Kuang-Chih Lee , Jian Xu , Bo Zheng

Binary Embedding-based Retrieval at Tencent

Large-scale embedding-based retrieval (EBR) is the cornerstone of search-related industrial applications. Given a user query, the system of EBR aims to identify relevant information from a large corpus of documents that may be tens or…

Information Retrieval · Computer Science 2023-02-20 Yukang Gan , Yixiao Ge , Chang Zhou , Shupeng Su , Zhouchuan Xu , Xuyuan Xu , Quanchao Hui , Xiang Chen , Yexin Wang , Ying Shan