Related papers: Query Obfuscation Semantic Decomposition

Hiding in Plain Sight: Query Obfuscation via Random Multilingual Searches

Modern search engines extensively personalize results by building detailed user profiles based on query history and behaviour. While personalization can enhance relevance, it introduces privacy risks and can lead to filter bubbles. This…

Cryptography and Security · Computer Science 2025-08-14 Anton Firc, Jan Klusáček, Kamil Malinka

Obfuscation for Privacy-preserving Syntactic Parsing

The goal of homomorphic encryption is to encrypt data such that another party can operate on it without being explicitly exposed to the content of the original data. We introduce an idea for a privacy-preserving transformation on natural…

Computation and Language · Computer Science 2020-05-28 Zhifeng Hu, Serhii Havrylov, Ivan Titov, Shay B. Cohen

Distortion Search, A Web Search Privacy Heuristic

Search engines have vast technical capabilities to retain Internet search logs for each user and thus present major privacy vulnerabilities to both individuals and organizations in revealing user intent. Additionally, many of the web search…

Cryptography and Security · Computer Science 2025-06-11 Kato Mivule, Kenneth Hopkinson

Words Blending Boxes. Obfuscating Queries in Information Retrieval using Differential Privacy

Ensuring the effectiveness of search queries while protecting user privacy remains an open issue. When an Information Retrieval System (IRS) does not protect the privacy of its users, sensitive information may be disclosed through the…

Information Retrieval · Computer Science 2024-05-16 Francesco Luigi De Faveri, Guglielmo Faggioli, Nicola Ferro

Towards Semantic Query Segmentation

Query Segmentation is one of the critical components for understanding users' search intent in Information Retrieval tasks. It involves grouping tokens in the search query into meaningful phrases which help downstream tasks like search…

Information Retrieval · Computer Science 2017-07-26 Ajinkya Kale, Thrivikrama Taula, Sanjika Hewavitharana, Amit Srivastava

An Accuracy-Lossless Perturbation Method for Defending Privacy Attacks in Federated Learning

Although federated learning improves privacy of training data by exchanging local gradients or parameters rather than raw data, the adversary still can leverage local gradients and parameters to obtain local training data by launching…

Machine Learning · Computer Science 2021-08-17 Xue Yang, Yan Feng, Weijun Fang, Jun Shao, Xiaohu Tang, Shu-Tao Xia, Rongxing Lu

Scalable Semantic Matching of Queries to Ads in Sponsored Search Advertising

Sponsored search represents a major source of revenue for web search engines. This popular advertising model brings a unique possibility for advertisers to target users' immediate intent communicated through a search query, usually by…

Information Retrieval · Computer Science 2016-07-08 Mihajlo Grbovic, Nemanja Djuric, Vladan Radosavljevic, Fabrizio Silvestri, Ricardo Baeza-Yates, Andrew Feng, Erik Ordentlich, Lee Yang, Gavin Owens

Gradient Obfuscation Gives a False Sense of Security in Federated Learning

Federated learning has been proposed as a privacy-preserving machine learning framework that enables multiple clients to collaborate without sharing raw data. However, client privacy protection is not guaranteed by design in this framework.…

Cryptography and Security · Computer Science 2022-10-17 Kai Yue, Richeng Jin, Chau-Wai Wong, Dror Baron, Huaiyu Dai

De-Conflated Semantic Representations

One major deficiency of most semantic representation techniques is that they usually model a word type as a single point in the semantic space, hence conflating all the meanings that the word can have. Addressing this issue by learning…

Computation and Language · Computer Science 2016-08-08 Mohammad Taher Pilehvar, Nigel Collier

Name Disambiguation in Anonymized Graphs using Network Embedding

In real-world, our DNA is unique but many people share names. This phenomenon often causes erroneous aggregation of documents of multiple persons who are namesake of one another. Such mistakes deteriorate the performance of document…

Social and Information Networks · Computer Science 2017-09-12 Baichuan Zhang, Mohammad Al Hasan

Toward Word Embedding for Personalized Information Retrieval

This paper presents preliminary works on using Word Embedding (word2vec) for query expansion in the context of Personalized Information Retrieval. Traditionally, word embeddings are learned on a general corpus, like Wikipedia. In this work…

Information Retrieval · Computer Science 2016-06-23 Nawal Ould-Amer, Philippe Mulhem, Mathias Gery

A data-driven strategy to combine word embeddings in information retrieval

Word embeddings are vital descriptors of words in unigram representations of documents for many tasks in natural language processing and information retrieval. The representation of queries has been one of the most critical challenges in…

Information Retrieval · Computer Science 2021-05-28 Alfredo Silva, Marcelo Mendoza

Deep Neural Networks for Query Expansion using Word Embeddings

Query expansion is a method for alleviating the vocabulary mismatch problem present in information retrieval tasks. Previous works have shown that terms selected for query expansion by traditional methods such as pseudo-relevance feedback…

Information Retrieval · Computer Science 2018-11-09 Ayyoob Imani, Amir Vakili, Ali Montazer, Azadeh Shakery

Semantics-Preserved Distortion for Personal Privacy Protection in Information Management

In recent years, machine learning - particularly deep learning - has significantly impacted the field of information management. While several strategies have been proposed to restrict models from learning and memorizing sensitive…

Computation and Language · Computer Science 2024-07-10 Jiajia Li, Lu Yang, Letian Peng, Shitou Zhang, Ping Wang, Zuchao Li, Hai Zhao

Query Expansion Based on Clustered Results

Query expansion is a functionality of search engines that suggests a set of related queries for a user-issued keyword query. Typical corpus-driven keyword query expansion approaches return popular words in the results as expanded queries.…

Information Retrieval · Computer Science 2011-04-19 Ziyang Liu, Sivaramakrishnan Natarajan, Yi Chen

Contextualized Query Embeddings for Conversational Search

This paper describes a compact and effective model for low-latency passage retrieval in conversational search based on learned dense representations. Prior to our work, the state-of-the-art approach uses a multi-stage pipeline comprising…

Information Retrieval · Computer Science 2021-11-30 Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin

A Concept Knowledge-Driven Keywords Retrieval Framework for Sponsored Search

In sponsored search, retrieving synonymous keywords for exact match type is important for accurately targeted advertising. Data-driven deep learning-based method has been proposed to tackle this problem. An apparent disadvantage of this…

Information Retrieval · Computer Science 2021-02-23 Yijiang Lian, Yubo Liu, Zhicong Ye, Liang Yuan, Yanfeng Zhu, Min Zhao, Jianyi Cheng, Xinwei Feng

Representation Learning Models for Entity Search

We focus on the problem of learning distributed representations for entity search queries, named entities, and their short descriptions. With our representation learning models, the entity search query, named entity and description can be…

Computation and Language · Computer Science 2017-01-17 Shijia E, Yang Xiang, Mohan Zhang

Federated Visualization: A Privacy-preserving Strategy for Aggregated Visual Query

We present a novel privacy preservation strategy for decentralized visualization. The key idea is to imitate the flowchart of the federated learning framework, and reformulate the visualization process within a federated infrastructure. The…

Graphics · Computer Science 2022-02-10 Wei Chen, Yating Wei, Zhiyong Wang, Shuyue Zhou, Bingru Lin, Zhiguang Zhou

Unified Embedding Based Personalized Retrieval in Etsy Search

Embedding-based neural retrieval is a prevalent approach to address the semantic gap problem which often arises in product search on tail queries. In contrast, popular queries typically lack context and have a broad intent where additional…

Information Retrieval · Computer Science 2024-09-26 Rishikesh Jha, Siddharth Subramaniyam, Ethan Benjamin, Thrivikrama Taula