Related papers: Compositional Embeddings Using Complementary Parti…

Learning Compressed Embeddings for On-Device Inference

In deep learning, embeddings are widely used to represent categorical entities such as words, apps, and movies. An embedding layer maps each entity to a unique vector, causing the layer's memory requirement to be proportional to the number…

Machine Learning · Computer Science 2022-03-22 Niketan Pansare , Jay Katukuri , Aditya Arora , Frank Cipollone , Riyaaz Shaik , Noyan Tokgozoglu , Chandru Venkataraman

Embedding Compression in Recommender Systems: A Survey

To alleviate the problem of information explosion, recommender systems are widely deployed to provide personalized information filtering services. Usually, embedding tables are employed in recommender systems to transform high-dimensional…

Information Retrieval · Computer Science 2024-08-07 Shiwei Li , Huifeng Guo , Xing Tang , Ruiming Tang , Lu Hou , Ruixuan Li , Rui Zhang

Mixed-Precision Embeddings for Large-Scale Recommendation Models

Embedding techniques have become essential components of large databases in the deep learning era. By encoding discrete entities, such as words, items, or graph nodes, into continuous vector spaces, embeddings facilitate more efficient…

Information Retrieval · Computer Science 2024-10-18 Shiwei Li , Zhuoqi Hu , Xing Tang , Haozhao Wang , Shijie Xu , Weihong Luo , Yuhua Li , Xiuqiang He , Ruixuan Li

Learning to Collide: Recommendation System Model Compression with Learned Hash Functions

A key characteristic of deep recommendation models is the immense memory requirements of their embedding tables. These embedding tables can often reach hundreds of gigabytes which increases hardware requirements and training cost. A common…

Information Retrieval · Computer Science 2022-03-31 Benjamin Ghaemmaghami , Mustafa Ozdal , Rakesh Komuravelli , Dmitriy Korchev , Dheevatsa Mudigere , Krishnakumar Nair , Maxim Naumov

Clustering the Sketch: A Novel Approach to Embedding Table Compression

Embedding tables are used by machine learning systems to work with categorical features. In modern Recommendation Systems, these tables can be very large, necessitating the development of new methods for fitting them in memory, even during…

Machine Learning · Computer Science 2023-10-24 Henry Ling-Hei Tsang , Thomas Dybdahl Ahle

Binary Code based Hash Embedding for Web-scale Applications

Nowadays, deep learning models are widely adopted in web-scale applications such as recommender systems, and online advertising. In these applications, embedding learning of categorical features is crucial to the success of deep learning…

Information Retrieval · Computer Science 2021-09-07 Bencheng Yan , Pengjie Wang , Jinquan Liu , Wei Lin , Kuang-Chih Lee , Jian Xu , Bo Zheng

Experimental Analysis of Large-scale Learnable Vector Storage Compression

Learnable embedding vector is one of the most important applications in machine learning, and is widely used in various database-related domains. However, the high dimensionality of sparse data in recommendation tasks and the huge volume of…

Machine Learning · Computer Science 2024-02-14 Hailin Zhang , Penghao Zhao , Xupeng Miao , Yingxia Shao , Zirui Liu , Tong Yang , Bin Cui

A Universal Framework for Compressing Embeddings in CTR Prediction

Accurate click-through rate (CTR) prediction is vital for online advertising and recommendation systems. Recent deep learning advancements have improved the ability to capture feature interactions and understand user interests. However,…

Information Retrieval · Computer Science 2025-02-24 Kefan Wang , Hao Wang , Kenan Song , Wei Guo , Kai Cheng , Zhi Li , Yong Liu , Defu Lian , Enhong Chen

Deep Feature Embedding for Tabular Data

Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper…

Machine Learning · Computer Science 2024-09-02 Yuqian Wu , Hengyi Luo , Raymond S. T. Lee

Learning to Embed Categorical Features without Embedding Tables for Recommendation

Embedding learning of categorical features (e.g. user/item IDs) is at the core of various recommendation models including matrix factorization and neural collaborative filtering. The standard approach creates an embedding table where each…

Machine Learning · Computer Science 2021-06-08 Wang-Cheng Kang , Derek Zhiyuan Cheng , Tiansheng Yao , Xinyang Yi , Ting Chen , Lichan Hong , Ed H. Chi

Semantic IDs for Music Recommendation

Training recommender systems for next-item recommendation often requires unique embeddings to be learned for each item, which may take up most of the trainable parameters for a model. Shared embeddings, such as using content information,…

Information Retrieval · Computer Science 2025-07-28 M. Jeffrey Mei , Florian Henkel , Samuel E. Sandberg , Oliver Bembom , Andreas F. Ehmann

Embedding in Recommender Systems: A Survey

Recommender systems have become an essential component of many online platforms, providing personalized recommendations to users. A crucial aspect is embedding techniques that convert the high-dimensional discrete features, such as user and…

Information Retrieval · Computer Science 2025-10-23 Maolin Wang , Xinjian Zhao , Wanyu Wang , Sheng Zhang , Jiansheng Li , Bowen Yu , Binhao Wang , Shucheng Zhou , Dawei Yin , Qing Li , Ruocheng Guo , Xiangyu Zhao

Learnable Embedding Sizes for Recommender Systems

The embedding-based representation learning is commonly used in deep learning recommendation models to map the raw sparse features to dense vectors. The traditional embedding manner that assigns a uniform size to all features has two…

Machine Learning · Computer Science 2021-03-12 Siyi Liu , Chen Gao , Yihong Chen , Depeng Jin , Yong Li

Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems

Learning high-quality feature embeddings efficiently and effectively is critical for the performance of web-scale machine learning systems. A typical model ingests hundreds of features with vocabularies on the order of millions to billions…

Machine Learning · Computer Science 2024-06-19 Benjamin Coleman , Wang-Cheng Kang , Matthew Fahrbach , Ruoxi Wang , Lichan Hong , Ed H. Chi , Derek Zhiyuan Cheng

Mem-Rec: Memory Efficient Recommendation System using Alternative Representation

Deep learning-based recommendation systems (e.g., DLRMs) are widely used AI models to provide high-quality personalized recommendations. Training data used for modern recommendation systems commonly includes categorical features taking on…

Information Retrieval · Computer Science 2026-01-06 Gopi Krishna Jha , Anthony Thomas , Nilesh Jain , Sameh Gobriel , Tajana Rosing , Ravi Iyer

The Future is Sparse: Embedding Compression for Scalable Retrieval in Recommender Systems

Industry-scale recommender systems face a core challenge: representing entities with high cardinality, such as users or items, using dense embeddings that must be accessible during both training and inference. However, as embedding sizes…

Information Retrieval · Computer Science 2025-05-19 Petr Kasalický , Martin Spišák , Vojtěch Vančura , Daniel Bohuněk , Rodrigo Alves , Pavel Kordík

Beyond Accuracy: Measuring Representation Capacity of Embeddings to Preserve Structural and Contextual Information

Effective representation of data is crucial in various machine learning tasks, as it captures the underlying structure and context of the data. Embeddings have emerged as a powerful technique for data representation, but evaluating their…

Machine Learning · Computer Science 2023-09-21 Sarwan Ali

Neural Input Search for Large Scale Recommendation Models

Recommendation problems with large numbers of discrete items, such as products, webpages, or videos, are ubiquitous in the technology industry. Deep neural networks are being increasingly used for these recommendation problems. These models…

Machine Learning · Computer Science 2019-07-11 Manas R. Joglekar , Cong Li , Jay K. Adams , Pranav Khaitan , Quoc V. Le

Learning Compact Compositional Embeddings via Regularized Pruning for Recommendation

Latent factor models are the dominant backbones of contemporary recommender systems (RSs) given their performance advantages, where a unique vector embedding with a fixed dimensionality (e.g., 128) is required to represent each entity…

Information Retrieval · Computer Science 2023-09-11 Xurong Liang , Tong Chen , Quoc Viet Hung Nguyen , Jianxin Li , Hongzhi Yin

Group Preserving Label Embedding for Multi-Label Classification

Multi-label learning is concerned with the classification of data with multiple class labels. This is in contrast to the traditional classification problem where every data instance has a single label. Due to the exponential size of output…

Machine Learning · Computer Science 2018-12-27 Vikas Kumar , Arun K Pujari , Vineet Padmanabhan , Venkateswara Rao Kagita