Related papers: Semantic Search and Recommendation Algorithm

An ontology-based approach for semantics ranking of the web search engines results

This work falls in the areas of information retrieval and semantic web, and aims to improve the evaluation of web search tools. Indeed, the huge number of information on the web as well as the growth of new inexperienced users creates new…

Information Retrieval · Computer Science 2012-12-12 Abdelkrim Bouramoul, Mohamed-Khireddine Kholladi, Bich-Liên Doan

WordNet-Based Information Retrieval Using Common Hypernyms and Combined Features

Text search based on lexical matching of keywords is not satisfactory due to polysemous and synonymous words. Semantic search that exploits word meanings, in general, improves search performance. In this paper, we survey WordNet-based…

Computation and Language · Computer Science 2018-07-17 Vuong M. Ngo, Tru H. Cao, Tuan M. V. Le

Keyword Embeddings for Query Suggestion

Nowadays, search engine users commonly rely on query suggestions to improve their initial inputs. Current systems are very good at recommending lexical adaptations or spelling corrections to users' queries. However, they often struggle to…

Information Retrieval · Computer Science 2023-01-24 Jorge Gabín, M. Eduardo Ares, Javier Parapar

How Does That Sound? Multi-Language SpokenName2Vec Algorithm Using Speech Generation and Deep Learning

Searching for information about a specific person is an online activity frequently performed by many users. In most cases, users are aided by queries containing a name and sending back to the web search engines for finding their will.…

Computation and Language · Computer Science 2020-07-23 Aviad Elyashar, Rami Puzis, Michael Fire

Concept-based indexing in text information retrieval

Traditional information retrieval systems rely on keywords to index documents and queries. In such systems, documents are retrieved based on the number of shared keywords with the query. This lexical-focused retrieval leads to inaccurate…

Information Retrieval · Computer Science 2013-03-08 Fatiha Boubekeur, Wassila Azzoug

Semantic Search At LinkedIn

Semantic search with large language models (LLMs) enables retrieval by meaning rather than keyword overlap, but scaling it requires major inference efficiency advances. We present LinkedIn's LLM-based semantic search framework for AI Job…

Information Retrieval · Computer Science 2026-02-10 Fedor Borisyuk, Sriram Vasudevan, Muchen Wu, Guoyao Li, Benjamin Le, Shaobo Zhang, Qianqi Kay Shen, Yuchin Juan, Kayhan Behdin, Liming Dong, Kaixu Yang, Shusen Jing, Ravi Pothamsetty, Rajat Arora, Sophie Yanying Sheng, Vitaly Abdrashitov, Yang Zhao, Lin Su, Xiaoqing Wang, Chujie Zheng, Sarang Metkar, Rupesh Gupta, Igor Lapchuk, David N. Racca, Madhumitha Mohan, Yanbo Li, Haojun Li, Saloni Gandhi, Xueying Lu, Chetan Bhole, Ali Hooshmand, Xin Yang, Raghavan Muthuregunathan, Jiajun Zhang, Mathew Teoh, Adam Coler, Abhinav Gupta, Xiaojing Ma, Sundara Raman Ramachandran, Morteza Ramezani, Yubo Wang, Lijuan Zhang, Richard Li, Jian Sheng, Chanh Nguyen, Yen-Chi Chen, Chuanrui Zhu, Claire Zhang, Jiahao Xu, Deepti Kulkarni, Qing Lan, Arvind Subramaniam, Ata Fatahibaarzi, Steven Shimizu, Yanning Chen, Zhipeng Wang, Ran He, Zhengze Zhou, Qingquan Song, Yun Dai, Caleb Johnson, Ping Liu, Shaghayegh Gharghabi, Gokulraj Mohanasundaram, Juan Bottaro, Santhosh Sachindran, Qi Guo, Yunxiang Ren, Chengming Jiang, Di Mo, Luke Simon, Jianqiang Shen, Jingwei Wu, Wenjing Zhang

Using Additional Indexes for Fast Full-Text Search of Phrases That Contain Frequently Used Words

Searches for phrases and word sets in large text arrays by means of additional indexes are considered. Their use may reduce the query-processing time by an order of magnitude in comparison with standard inverted files.

Information Retrieval · Computer Science 2018-11-27 A. B. Veretennikov

TinySearch -- Semantics based Search Engine using Bert Embeddings

Existing search engines use keyword matching or tf-idf based matching to map the query to the web-documents and rank them. They also consider other factors such as page rank, hubs-and-authority scores, knowledge graphs to make the results…

Information Retrieval · Computer Science 2019-08-08 Manish Patel

Affect Enriched Word Embeddings for News Information Retrieval

Distributed representations of words have shown to be useful to improve the effectiveness of IR systems in many sub-tasks like query expansion, retrieval and ranking. Algorithms like word2vec, GloVe and others are also key factors in many…

Information Retrieval · Computer Science 2019-09-05 Tommaso Teofili, Niyati Chhaya

Information Retrieval in long documents: Word clustering approach for improving Semantics

In this paper, we propose an alternative to deep neural networks for semantic information retrieval for the case of long documents. This new approach exploiting clustering techniques to take into account the meaning of words in Information…

Information Retrieval · Computer Science 2025-07-29 Paul Mbathe Mekontchou, Armel Fotsoh, Bernabe Batchakui, Eddy Ella

Estimating Text Similarity based on Semantic Concept Embeddings

Due to their ease of use and high accuracy, Word2Vec (W2V) word embeddings enjoy great success in the semantic representation of words, sentences, and whole documents as well as for semantic similarity estimation. However, they have the…

Computation and Language · Computer Science 2024-01-10 Tim vor der Brück, Marc Pouly

A Novel Approach to Enhance the Performance of Semantic Search in Bengali using Neural Net and other Classification Techniques

Search has for a long time been an important tool for users to retrieve information. Syntactic search is matching documents or objects containing specific keywords like user-history, location, preference etc. to improve the results.…

Computation and Language · Computer Science 2020-02-26 Arijit Das, Diganta Saha

Learning Passage Impacts for Inverted Indexes

Neural information retrieval systems typically use a cascading pipeline, in which a first-stage model retrieves a candidate set of documents and one or more subsequent stages re-rank this set using contextualized language models such as…

Information Retrieval · Computer Science 2021-04-27 Antonio Mallia, Omar Khattab, Nicola Tonellotto, Torsten Suel

Clever Search: A WordNet Based Wrapper for Internet Search Engines

This paper presents an approach to enhance search engines with information about word senses available in WordNet. The approach exploits information about the conceptual relations within the lexical-semantic net. In the wrapper for search…

Artificial Intelligence · Computer Science 2007-05-23 Peter M. Kruse, Andre Naujoks, Dietmar Roesner, Manuela Kunze

Indexing with WordNet synsets can improve Text Retrieval

The classical, vector space model for text retrieval is shown to give better results (up to 29% better in our experiments) if WordNet synsets are chosen as the indexing space, instead of word forms. This result is obtained for a manually…

cmp-lg · Computer Science 2007-05-23 Julio Gonzalo, Felisa Verdejo, Irina Chugur, Juan Cigarran

Content Based Document Recommender using Deep Learning

With the recent advancements in information technology there has been a huge surge in amount of data available. But information retrieval technology has not been able to keep up with this pace of information generation resulting in over…

Computation and Language · Computer Science 2017-10-24 Nishant Nikhil, Muktabh Mayank Srivastava

Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines

Vector representations and vector space modeling (VSM) play a central role in modern machine learning. We propose a novel approach to 'vector similarity searching' over dense semantic representations of words and documents that can be…

Information Retrieval · Computer Science 2017-06-06 Jan Rygl, Jan Pomikálek, Radim Řehůřek, Michal Růžička, Vít Novotný, Petr Sojka

Network-Efficient Distributed Word2vec Training System for Large Vocabularies

Word2vec is a popular family of algorithms for unsupervised training of dense vector representations of words on large text corpuses. The resulting vectors have been shown to capture semantic relationships among their corresponding words,…

Computation and Language · Computer Science 2016-06-29 Erik Ordentlich, Lee Yang, Andy Feng, Peter Cnudde, Mihajlo Grbovic, Nemanja Djuric, Vladan Radosavljevic, Gavin Owens

VectorSearch: Enhancing Document Retrieval with Semantic Embeddings and Optimized Search

Traditional retrieval methods have been essential for assessing document similarity but struggle with capturing semantic nuances. Despite advancements in latent semantic analysis (LSA) and deep learning, achieving comprehensive semantic…

Information Retrieval · Computer Science 2024-09-27 Solmaz Seyed Monir, Irene Lau, Shubing Yang, Dongfang Zhao

End-to-End Open Vocabulary Keyword Search

Recently, neural approaches to spoken content retrieval have become popular. However, they tend to be restricted in their vocabulary or in their ability to deal with imbalanced test settings. These restrictions limit their applicability in…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-25 Bolaji Yusuf, Alican Gok, Batuhan Gundogdu, Murat Saraclar