Related papers: Query Clustering using Segment Specific Context Em…

Towards Semantic Query Segmentation

Query Segmentation is one of the critical components for understanding users' search intent in Information Retrieval tasks. It involves grouping tokens in the search query into meaningful phrases which help downstream tasks like search…

Information Retrieval · Computer Science 2017-07-26 Ajinkya Kale , Thrivikrama Taula , Sanjika Hewavitharana , Amit Srivastava

Document clustering with evolved multiword search queries

Text clustering holds significant value across various domains due to its ability to identify patterns and group related information. Current approaches which rely heavily on a computed similarity measure between documents are often limited…

Information Retrieval · Computer Science 2025-04-09 Laurence Hirsch , Robin Hirsch , Bayode Ogunleye

The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters

Exploiting information induced from (query-specific) clustering of top-retrieved documents has long been proposed as a means for improving precision at the very top ranks of the returned results. We present a novel language model approach…

Information Retrieval · Computer Science 2014-01-17 Oren Kurland , Eyal Krikon

Search Result Clustering in Collaborative Sound Collections

The large size of nowadays' online multimedia databases makes retrieving their content a difficult and time-consuming task. Users of online sound collections typically submit search queries that express a broad intent, often making the…

Information Retrieval · Computer Science 2020-06-16 Xavier Favory , Frederic Font , Xavier Serra

Search Result Clustering via Randomized Partitioning of Query-Induced Subgraphs

In this paper, we present an approach to search result clustering, using partitioning of underlying link graph. We define the notion of "query-induced subgraph" and formulate the problem of search result clustering as a problem of efficient…

Information Retrieval · Computer Science 2008-11-27 Aleksandar Bradic

Clustering Text Using Attention

Clustering Text has been an important problem in the domain of Natural Language Processing. While there are techniques to cluster text based on using conventional clustering techniques on top of contextual or non-contextual vector space…

Computation and Language · Computer Science 2022-01-11 Lovedeep Singh

Queries Are Not Alone: Clustering Text Embeddings for Video Search

The rapid proliferation of video content across various platforms has highlighted the urgent need for advanced video retrieval systems. Traditional methods, which primarily depend on directly matching textual queries with video metadata,…

Information Retrieval · Computer Science 2025-10-10 Peyang Liu , Xi Wang , Ziqiang Cui , Wei Ye

Web Pages Clustering: A New Approach

The rapid growth of web has resulted in vast volume of information. Information availability at a rapid speed to the user is vital. English language (or any for that matter) has lot of ambiguity in the usage of words. So there is no…

Information Retrieval · Computer Science 2011-08-30 Jeevan H E , Prashanth P P , Punith Kumar S N , Vinay Hegde

Toward Word Embedding for Personalized Information Retrieval

This paper presents preliminary works on using Word Embedding (word2vec) for query expansion in the context of Personalized Information Retrieval. Traditionally, word embeddings are learned on a general corpus, like Wikipedia. In this work…

Information Retrieval · Computer Science 2016-06-23 Nawal Ould-Amer , Philippe Mulhem , Mathias Gery

ESMC: MLLM-Based Embedding Selection for Explainable Multiple Clustering

Typical deep clustering methods, while achieving notable progress, can only provide one clustering result per dataset. This limitation arises from their assumption of a fixed underlying data distribution, which may fail to meet user needs…

Machine Learning · Computer Science 2025-12-02 Xinyue Wang , Yuheng Jia , Hui Liu , Junhui Hou

Query Intent Detection from the SEO Perspective

Google users have different intents from their queries such as acquiring information, buying products, comparing or simulating services, looking for products, and so on. Understanding the right intention of users helps to provide i) better…

Information Retrieval · Computer Science 2020-06-17 Samin Mohammadi , Mathieu Chapon , Arthur Fremond

Clustering Prominent People and Organizations in Topic-Specific Text Corpora

Named entities in text documents are the names of people, organization, location or other types of objects in the documents that exist in the real world. A persisting research challenge is to use computational techniques to identify such…

Computation and Language · Computer Science 2019-07-09 Abdulkareem Alsudais , Hovig Tchalian

Information Retrieval in long documents: Word clustering approach for improving Semantics

In this paper, we propose an alternative to deep neural networks for semantic information retrieval for the case of long documents. This new approach exploiting clustering techniques to take into account the meaning of words in Information…

Information Retrieval · Computer Science 2025-07-29 Paul Mbathe Mekontchou , Armel Fotsoh , Bernabe Batchakui , Eddy Ella

Query Expansion Based on Clustered Results

Query expansion is a functionality of search engines that suggests a set of related queries for a user-issued keyword query. Typical corpus-driven keyword query expansion approaches return popular words in the results as expanded queries.…

Information Retrieval · Computer Science 2011-04-19 Ziyang Liu , Sivaramakrishnan Natarajan , Yi Chen

Vec2GC -- A Graph Based Clustering Method for Text Representations

NLP pipelines with limited or no labeled data, rely on unsupervised methods for document processing. Unsupervised approaches typically depend on clustering of terms or documents. In this paper, we introduce a novel clustering algorithm,…

Information Retrieval · Computer Science 2023-04-13 Rajesh N Rao , Manojit Chakraborty

Document Clustering based on Topic Maps

Importance of document clustering is now widely acknowledged by researchers for better management, smart navigation, efficient filtering, and concise summarization of large collection of documents like World Wide Web (WWW). The next…

Information Retrieval · Computer Science 2011-12-30 Muhammad Rafi , M. Shahid Shaikh , Amir Farooq

Faster Exact Search using Document Clustering

We show how full-text search based on inverted indices can be accelerated by clustering the documents without losing results (SeCluD -- SEarch with CLUstered Documents). We develop a fast multilevel clustering algorithm that explicitly uses…

Information Retrieval · Computer Science 2014-11-06 Jonathan Dimond , Peter Sanders

Links: A High-Dimensional Online Clustering Method

We present a novel algorithm, called Links, designed to perform online clustering on unit vectors in a high-dimensional Euclidean space. The algorithm is appropriate when it is necessary to cluster data efficiently as it streams in, and is…

Machine Learning · Statistics 2018-01-31 Philip Andrew Mansfield , Quan Wang , Carlton Downey , Li Wan , Ignacio Lopez Moreno

Consensus Clustering: An Embedding Perspective, Extension and Beyond

Consensus clustering fuses diverse basic partitions (i.e., clustering results obtained from conventional clustering methods) into an integrated one, which has attracted increasing attention in both academic and industrial areas due to its…

Machine Learning · Computer Science 2019-06-04 Hongfu Liu , Zhiqiang Tao , Zhengming Ding

Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback

While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimensions, such as the authors mood, gender, age, or sentiment.…

Information Retrieval · Computer Science 2014-01-22 Sajib Dasgupta , Vincent Ng