Related papers: Learnable Embedding Sizes for Recommender Systems

Learning Compact Compositional Embeddings via Regularized Pruning for Recommendation

Latent factor models are the dominant backbones of contemporary recommender systems (RSs) given their performance advantages, where a unique vector embedding with a fixed dimensionality (e.g., 128) is required to represent each entity…

Information Retrieval · Computer Science 2023-09-11 Xurong Liang , Tong Chen , Quoc Viet Hung Nguyen , Jianxin Li , Hongzhi Yin

Mixed-Precision Embeddings for Large-Scale Recommendation Models

Embedding techniques have become essential components of large databases in the deep learning era. By encoding discrete entities, such as words, items, or graph nodes, into continuous vector spaces, embeddings facilitate more efficient…

Information Retrieval · Computer Science 2024-10-18 Shiwei Li , Zhuoqi Hu , Xing Tang , Haozhao Wang , Shijie Xu , Weihong Luo , Yuhua Li , Xiuqiang He , Ruixuan Li

Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems

Huge embedding tables in modern deep learning recommender models (DLRM) require prohibitively large memory during training and inference. This paper proposes FIITED, a system to automatically reduce the memory footprint via FIne-grained…

Information Retrieval · Computer Science 2024-10-15 Qinyi Luo , Penghan Wang , Wei Zhang , Fan Lai , Jiachen Mao , Xiaohan Wei , Jun Song , Wei-Yu Tsai , Shuai Yang , Yuxi Hu , Xuehai Qian

Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction

Feature embeddings are one of the most essential steps when training deep learning based Click-Through Rate prediction models, which map high-dimensional sparse features to dense embedding vectors. Classic human-crafted embedding size…

Information Retrieval · Computer Science 2022-08-18 Tesi Xiao , Xia Xiao , Ming Chen , Youlong Chen

Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems

Modern deep learning-based recommendation systems exploit hundreds to thousands of different categorical features, each with millions of different categories ranging from clicks to posts. To respect the natural diversity within the…

Machine Learning · Computer Science 2020-06-30 Hao-Jun Michael Shi , Dheevatsa Mudigere , Maxim Naumov , Jiyan Yang

Embedding Compression in Recommender Systems: A Survey

To alleviate the problem of information explosion, recommender systems are widely deployed to provide personalized information filtering services. Usually, embedding tables are employed in recommender systems to transform high-dimensional…

Information Retrieval · Computer Science 2024-08-07 Shiwei Li , Huifeng Guo , Xing Tang , Ruiming Tang , Lu Hou , Ruixuan Li , Rui Zhang

OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction

Learning embedding table plays a fundamental role in Click-through rate(CTR) prediction from the view of the model performance and memory usage. The embedding table is a two-dimensional tensor, with its axes indicating the number of feature…

Information Retrieval · Computer Science 2022-09-07 Fuyuan Lyu , Xing Tang , Hong Zhu , Huifeng Guo , Yingxue Zhang , Ruiming Tang , Xue Liu

Personalized Elastic Embedding Learning for On-Device Recommendation

To address privacy concerns and reduce network latency, there has been a recent trend of compressing cumbersome recommendation models trained on the cloud and deploying compact recommender models to resource-limited devices for the…

Information Retrieval · Computer Science 2023-11-17 Ruiqi Zheng , Liang Qu , Tong Chen , Kai Zheng , Yuhui Shi , Hongzhi Yin

PEP: Parameter Ensembling by Perturbation

Ensembling is now recognized as an effective approach for increasing the predictive performance and calibration of deep networks. We introduce a new approach, Parameter Ensembling by Perturbation (PEP), that constructs an ensemble of…

Machine Learning · Computer Science 2020-10-27 Alireza Mehrtash , Purang Abolmaesumi , Polina Golland , Tina Kapur , Demian Wassermann , William M. Wells

Training with Multi-Layer Embeddings for Model Reduction

Modern recommendation systems rely on real-valued embeddings of categorical features. Increasing the dimension of embedding vectors improves model accuracy but comes at a high cost to model size. We introduce a multi-layer embedding…

Machine Learning · Computer Science 2020-06-11 Benjamin Ghaemmaghami , Zihao Deng , Benjamin Cho , Leo Orshansky , Ashish Kumar Singh , Mattan Erez , Michael Orshansky

EncodeRec: An Embedding Backbone for Recommendation Systems

Recent recommender systems increasingly leverage embeddings from large pre-trained language models (PLMs). However, such embeddings exhibit two key limitations: (1) PLMs are not explicitly optimized to produce structured and discriminative…

Computation and Language · Computer Science 2026-01-19 Guy Hadad , Neomi Rabaev , Bracha Shapira

Experimental Analysis of Large-scale Learnable Vector Storage Compression

Learnable embedding vector is one of the most important applications in machine learning, and is widely used in various database-related domains. However, the high dimensionality of sparse data in recommendation tasks and the huge volume of…

Machine Learning · Computer Science 2024-02-14 Hailin Zhang , Penghao Zhao , Xupeng Miao , Yingxia Shao , Zirui Liu , Tong Yang , Bin Cui

LEAP: Learnable Pruning for Transformer-based Models

Pruning is an effective method to reduce the memory footprint and computational cost associated with large natural language processing models. However, current pruning algorithms either only focus on one pruning category, e.g., structured…

Computation and Language · Computer Science 2022-05-24 Zhewei Yao , Xiaoxia Wu , Linjian Ma , Sheng Shen , Kurt Keutzer , Michael W. Mahoney , Yuxiong He

The Future is Sparse: Embedding Compression for Scalable Retrieval in Recommender Systems

Industry-scale recommender systems face a core challenge: representing entities with high cardinality, such as users or items, using dense embeddings that must be accessible during both training and inference. However, as embedding sizes…

Information Retrieval · Computer Science 2025-05-19 Petr Kasalický , Martin Spišák , Vojtěch Vančura , Daniel Bohuněk , Rodrigo Alves , Pavel Kordík

Scalable Dynamic Embedding Size Search for Streaming Recommendation

Recommender systems typically represent users and items by learning their embeddings, which are usually set to uniform dimensions and dominate the model parameters. However, real-world recommender systems often operate in streaming…

Information Retrieval · Computer Science 2026-02-05 Yunke Qu , Liang Qu , Tong Chen , Xiangyu Zhao , Quoc Viet Hung Nguyen , Hongzhi Yin

Learning Effective and Efficient Embedding via an Adaptively-Masked Twins-based Layer

Embedding learning for categorical features is crucial for the deep learning-based recommendation models (DLRMs). Each feature value is mapped to an embedding vector via an embedding learning process. Conventional methods configure a fixed…

Machine Learning · Computer Science 2021-08-27 Bencheng Yan , Pengjie Wang , Kai Zhang , Wei Lin , Kuang-Chih Lee , Jian Xu , Bo Zheng

Learning Multi-granular Quantized Embeddings for Large-Vocab Categorical Features in Recommender Systems

Recommender system models often represent various sparse features like users, items, and categorical features via embeddings. A standard approach is to map each unique feature value to an embedding vector. The size of the produced embedding…

Information Retrieval · Computer Science 2020-08-26 Wang-Cheng Kang , Derek Zhiyuan Cheng , Ting Chen , Xinyang Yi , Dong Lin , Lichan Hong , Ed H. Chi

Learning Elastic Embeddings for Customizing On-Device Recommenders

In today's context, deploying data-driven services like recommendation on edge devices instead of cloud servers becomes increasingly attractive due to privacy and network latency concerns. A common practice in building compact on-device…

Information Retrieval · Computer Science 2021-06-07 Tong Chen , Hongzhi Yin , Yujia Zheng , Zi Huang , Yang Wang , Meng Wang

Learning to Collide: Recommendation System Model Compression with Learned Hash Functions

A key characteristic of deep recommendation models is the immense memory requirements of their embedding tables. These embedding tables can often reach hundreds of gigabytes which increases hardware requirements and training cost. A common…

Information Retrieval · Computer Science 2022-03-31 Benjamin Ghaemmaghami , Mustafa Ozdal , Rakesh Komuravelli , Dmitriy Korchev , Dheevatsa Mudigere , Krishnakumar Nair , Maxim Naumov

Clustered Embedding Learning for Recommender Systems

In recent years, recommender systems have advanced rapidly, where embedding learning for users and items plays a critical role. A standard method learns a unique embedding vector for each user and item. However, such a method has two…

Artificial Intelligence · Computer Science 2023-02-13 Yizhou Chen , Guangda Huzhang , Anxiang Zeng , Qingtao Yu , Hui Sun , Heng-yi Li , Jingyi Li , Yabo Ni , Han Yu , Zhiming Zhou