English

Query Clustering using Segment Specific Context Embeddings

Information Retrieval 2016-11-08 v2 Computation and Language

Abstract

This paper presents a novel query clustering approach to capture the broad interest areas of users querying search engines. We make use of recent advances in NLP - word2vec and extend it to get query2vec, vector representations of queries, based on query contexts, obtained from the top search results for the query and use a highly scalable Divide & Merge clustering algorithm on top of the query vectors, to get the clusters. We have tried this approach on a variety of segments, including Retail, Travel, Health, Phones and found the clusters to be effective in discovering user's interest areas which have high monetization potential.

Keywords

Cite

@article{arxiv.1608.01247,
  title  = {Query Clustering using Segment Specific Context Embeddings},
  author = {S. K Kolluru and Prasenjit Mukherjee},
  journal= {arXiv preprint arXiv:1608.01247},
  year   = {2016}
}

Comments

9 pages

R2 v1 2026-06-22T15:11:20.164Z