Related papers: Dynamic Superblock Pruning for Fast Learned Sparse…

Efficiency Optimizations for Superblock-based Sparse Retrieval

Learned sparse retrieval (LSR) is a popular method for first-stage retrieval because it combines the semantic matching of language models with efficient CPU-friendly algorithms. Previous work aggregates blocks into "superblocks" to quickly…

Information Retrieval · Computer Science 2026-02-04 Parker Carlson, Wentai Xie, Rohil Shah, Tao Yang

Faster Learned Sparse Retrieval with Block-Max Pruning

Learned sparse retrieval systems aim to combine the effectiveness of contextualized language models with the scalability of conventional data structures such as inverted indexes. Nevertheless, the indexes generated by these systems exhibit…

Information Retrieval · Computer Science 2024-05-03 Antonio Mallia, Torsten Suel, Nicola Tonellotto

Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

This paper revisits cluster-based retrieval that partitions the inverted index into multiple groups and skips the index partially at cluster and document levels during online inference using a learned sparse representation. It proposes an…

Information Retrieval · Computer Science 2024-04-16 Yifan Qiao, Shanxiu He, Yingrui Yang, Parker Carlson, Tao Yang

A Static Pruning Study on Sparse Neural Retrievers

Sparse neural retrievers, such as DeepImpact, uniCOIL and SPLADE, have been introduced recently as an efficient and effective way to perform retrieval with inverted indexes. They aim to learn term importance and, in some cases, document…

Information Retrieval · Computer Science 2023-04-26 Carlos Lassance, Simon Lupart, Hervé Dejean, Stéphane Clinchant, Nicola Tonellotto

Optimizing Guided Traversal for Fast Learned Sparse Retrieval

Recent studies show that BM25-driven dynamic index skipping can greatly accelerate MaxScore-based document retrieval based on the learned sparse representation derived by DeepImpact. This paper investigates the effectiveness of such a…

Information Retrieval · Computer Science 2023-05-03 Yifan Qiao, Yingrui Yang, Haixin Lin, Tao Yang

Pairing Clustered Inverted Indexes with kNN Graphs for Fast Approximate Retrieval over Learned Sparse Representations

Learned sparse representations form an effective and interpretable class of embeddings for text retrieval. While exact top-k retrieval over such embeddings faces efficiency challenges, a recent algorithm called Seismic has enabled…

Information Retrieval · Computer Science 2024-10-22 Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini

Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

This paper proposes a dual skipping guidance scheme with hybrid scoring to accelerate document retrieval that uses learned sparse representations while still delivering a good relevance. This scheme uses both lexical BM25 and learned neural…

Information Retrieval · Computer Science 2022-04-26 Yifan Qiao, Yingrui Yang, Haixin Lin, Tianbo Xiong, Xiyue Wang, Tao Yang

Sparse Subspace Clustering: Algorithm, Theory, and Applications

In many real-world problems, we are dealing with collections of high-dimensional data, such as images, videos, text and web documents, DNA microarray data, and more. Often, high-dimensional data lie close to low-dimensional structures…

Computer Vision and Pattern Recognition · Computer Science 2013-02-06 Ehsan Elhamifar, Rene Vidal

Pushing the Efficiency Limit Using Structured Sparse Convolutions

Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin

Learning Stable Multilevel Dictionaries for Sparse Representations

Sparse representations using learned dictionaries are being increasingly used with success in several data processing and machine learning applications. The availability of abundant training data necessitates the development of efficient,…

Computer Vision and Pattern Recognition · Computer Science 2013-09-26 Jayaraman J. Thiagarajan, Karthikeyan Natesan Ramamurthy, Andreas Spanias

SpaDE: Improving Sparse Representations using a Dual Document Encoder for First-stage Retrieval

Sparse document representations have been widely used to retrieve relevant documents via exact lexical matching. Owing to the pre-computed inverted index, it supports fast ad-hoc search but incurs the vocabulary mismatch problem. Although…

Information Retrieval · Computer Science 2023-10-06 Eunseong Choi, Sunkyung Lee, Minjin Choi, Hyeseon Ko, Young-In Song, Jongwuk Lee

Learning Instance-wise Sparsity for Accelerating Deep Models

Exploring deep convolutional neural networks of high efficiency and low memory usage is very essential for a wide variety of machine learning tasks. Most of existing approaches used to accelerate deep models by manipulating parameters or…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Chuanjian Liu, Yunhe Wang, Kai Han, Chunjing Xu, Chang Xu

Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

Large scale deep learning provides a tremendous opportunity to improve the quality of content recommendation systems by employing both wider and deeper models, but this comes at great infrastructural cost and carbon footprint in modern data…

Machine Learning · Computer Science 2020-10-22 Mao Ye, Dhruv Choudhary, Jiecao Yu, Ellie Wen, Zeliang Chen, Jiyan Yang, Jongsoo Park, Qiang Liu, Arun Kejariwal

Learning Multi-Stage Sparsification for Maximum Clique Enumeration

We propose a multi-stage learning approach for pruning the search space of maximum clique enumeration, a fundamental computationally difficult problem arising in various network analysis tasks. In each stage, our approach learns the…

Machine Learning · Computer Science 2019-10-02 Marco Grassia, Juho Lauri, Sourav Dutta, Deepak Ajwani

Sparsifying Sparse Representations for Passage Retrieval by Top-$k$ Masking

Sparse lexical representation learning has demonstrated much progress in improving passage retrieval effectiveness in recent models such as DeepImpact, uniCOIL, and SPLADE. This paper describes a straightforward yet effective approach for…

Information Retrieval · Computer Science 2021-12-20 Jheng-Hong Yang, Xueguang Ma, Jimmy Lin

Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Learned sparse document representations using a transformer-based neural model has been found to be attractive in both relevance effectiveness and time efficiency. This paper describes a representation sparsification scheme based on hard…

Information Retrieval · Computer Science 2023-06-21 Yifan Qiao, Yingrui Yang, Shanxiu He, Tao Yang

Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern…

Machine Learning · Statistics 2016-02-16 Kazuya Nakagawa, Shinya Suzumura, Masayuki Karasuyama, Koji Tsuda, Ichiro Takeuchi

Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution

Image super-resolution (SR) has witnessed extensive neural network designs from CNN to transformer architectures. However, prevailing SR models suffer from prohibitive memory footprint and intensive computations, which limits further…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Jiamian Wang, Huan Wang, Yulun Zhang, Yun Fu, Zhiqiang Tao

Online Sparse Subspace Clustering

This paper focuses on the sparse subspace clustering problem, and develops an online algorithmic solution to cluster data points on-the-fly, without revisiting the whole dataset. The strategy involves an online solution of a sparse…

Optimization and Control · Mathematics 2024-07-16 Liam Madden, Stephen Becker, Emiliano Dall'Anese

Dictionary Optimization for Block-Sparse Representations

Recent work has demonstrated that using a carefully designed dictionary instead of a predefined one, can improve the sparsity in jointly representing a class of signals. This has motivated the derivation of learning methods for designing a…

Information Theory · Computer Science 2010-05-04 Kevin Rosenblum, Lihi Zelnik-Manor, Yonina C. Eldar