Related papers: RecSSD: Near Data Processing for Solid State Drive…

200 papers

Personalized recommendation systems leverage deep learning models and account for the majority of data center AI cycles. Their performance is dominated by memory-bound sparse embedding operations with unique irregular memory access patterns…

Recommendation systems have gained wide popularity for a variety of personalized suggestion tasks, but the ever-increasing amount of user data makes real-time processing of recommendation systems difficult. NAND flash memory-based…

Hardware Architecture · Computer Science 2026-04-29 Jangho Baik, Sunghyun Kim, Gisan Ji, Wonbo Shim, Sungju Ryu

Solid-State Drives (SSDs) have significant performance advantages over traditional Hard Disk Drives (HDDs), such as lower latency and higher throughput. Significantly higher price per capacity and limited lifetime, however, prevent…

Hardware Architecture · Computer Science 2021-11-08 Shahriar Ebrahimi, Reza Salkhordeh, Seyed Ali Osia, Ali Taheri, Hamid Reza Rabiee, Hossein Asadi

Neural personalized recommendation is the cornerstone of a wide collection of cloud services and products, constituting a significant share of the compute demand of cloud infrastructure. Thus, improving the execution efficiency of neural…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-10 Udit Gupta, Samuel Hsia, Vikram Saraph, Xiaodong Wang, Brandon Reagen, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu

Recommendation systems (RecSys) suggest items to users by predicting their preferences based on historical data. Typical RecSys handle large embedding tables and many embedding table related operations. The memory size and bandwidth of the…

Hardware Architecture · Computer Science 2022-02-22 Mengyuan Li, Ann Franchesca Laguna, Dayane Reis, Xunzhao Yin, Michael Niemier, Xiaobo Sharon Hu

Absence of large-scale labeled data in the practitioner's target domain can be a bottleneck to applying machine learning algorithms in practice. Transfer learning is a popular strategy for leveraging additional data to improve the…

Machine Learning · Computer Science 2022-06-22 Tianshi Cao, Sasha Doubov, David Acuna, Sanja Fidler

The need to recognise long-term dependencies in sequential data such as video streams has made Long Short-Term Memory (LSTM) networks a prominent Artificial Intelligence model for many emerging applications. However, the high computational…

Signal Processing · Electrical Eng. & Systems 2019-10-31 Alexandros Kouris, Stylianos I. Venieris, Michail Rizakis, Christos-Savvas Bouganis

Recurrent neural networks (RNNs) were recently proposed for the session-based recommendation task. The models showed promising improvements over traditional recommendation approaches. In this work, we further study RNN-based models for…

Machine Learning · Computer Science 2016-09-19 Yong Kiam Tan, Xinxing Xu, Yong Liu

With the increasing popularity of recommendation systems (RecSys), the demand for compute resources in datacenters has surged. However, the model-wise resource allocation employed in current RecSys model serving architectures falls short in…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-06-12 Yujeong Choi, Jiin Kim, Minsoo Rhu

We propose RecShard, a fine-grained embedding table (EMB) partitioning and placement technique for deep learning recommendation models (DLRMs). RecShard is designed based on two key observations. First, not all EMBs are equal, nor all rows…

Machine Learning · Computer Science 2022-01-26 Geet Sethi, Bilge Acun, Niket Agarwal, Christos Kozyrakis, Caroline Trippel, Carole-Jean Wu

Personalized recommendation models (RecSys) are one of the most popular machine learning workloads serviced by hyperscalers. A critical challenge of training RecSys is its high memory capacity requirements, reaching hundreds of GBs to TBs of…

Hardware Architecture · Computer Science 2022-05-11 Youngeun Kwon, Minsoo Rhu

Deep neural networks are widely used in personalized recommendation systems. Unlike regular DNN inference workloads, recommendation inference is memory-bound due to the many random memory accesses needed to look up the embedding tables. The…

The widespread application of deep learning has changed the landscape of computation in the data center. In particular, personalized recommendation for content ranking is now largely accomplished leveraging deep neural networks. However,…

Existing solid state drive (SSD) simulators unfortunately lack hardware and/or software architecture models. Consequently, they are far from capturing the critical features of contemporary SSD devices. More importantly, while the…

Hardware Architecture · Computer Science 2017-09-15 Myoungsoo Jung, Jie Zhang, Ahmed Abulila, Miryeong Kwon, Narges Shahidi, John Shalf, Nam Sung Kim, Mahmut Kandemir

The idea of the computational storage device (CSD) has come a long way since at least the 1990s [1], [2]. By embedding computing resources within storage devices, CSDs could potentially offload computational tasks from CPUs and enable near-data…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-23 Chao Shi, Anthony Manschula, Tabassum Mahmud, Zeren Yang, Mai Zheng, Yong Chen, Jim Wayda, Matthew Wolf, Byungwoo Bang

Recommender systems are tools that support online users by pointing them to potential items of interest in situations of information overload. In recent years, the class of session-based recommendation algorithms received more attention in…

Information Retrieval · Computer Science 2020-09-29 Malte Ludewig, Noemi Mauro, Sara Latifi, Dietmar Jannach

Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector databases and many data center applications, such as person re-identification and recommendation systems. It is also fundamental to retrieval augmented…

Hardware Architecture · Computer Science 2024-05-30 Yitu Wang, Shiyu Li, Qilin Zheng, Linghao Song, Zongwang Li, Andrew Chang, Hai "Helen" Li, Yiran Chen

Transfer learning has proven to be a successful technique to train deep learning models in the domains where little training data is available. The dominant approach is to pretrain a model on a large generic dataset such as ImageNet and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Xi Yan, David Acuna, Sanja Fidler

Latent factor models are the most popular backbones for today's recommender systems owing to their prominent performance. Latent factor models represent users and items as real-valued embedding vectors for pairwise similarity computation,…

Information Retrieval · Computer Science 2026-02-11 Yunke Qu, Tong Chen, Xiangyu Zhao, Lizhen Cui, Kai Zheng, Hongzhi Yin

The Big Data trend is putting strain on modern storage systems, which have to support high-performance I/O accesses for the large quantities of data. With the prevalent Von Neumann computing architecture, this data is constantly moved back…

Hardware Architecture · Computer Science 2021-12-02 Corne Lukken, Giulia Frascaria, Animesh Trivedi