Related papers: Analytical Search

From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents

Information retrieval is a cornerstone of modern knowledge acquisition, enabling billions of queries each day across diverse domains. However, traditional keyword-based search engines are increasingly inadequate for handling complex,…

Information Retrieval · Computer Science 2025-07-04 Weizhi Zhang , Yangning Li , Yuanchen Bei , Junyu Luo , Guancheng Wan , Liangwei Yang , Chenxuan Xie , Yuyao Yang , Wei-Chieh Huang , Chunyu Miao , Henry Peng Zou , Xiao Luo , Yusheng Zhao , Yankai Chen , Chunkit Chan , Peilin Zhou , Xinyang Zhang , Chenwei Zhang , Jingbo Shang , Ming Zhang , Yangqiu Song , Irwin King , Philip S. Yu

A Survey on Open Dataset Search in the LLM Era: Retrospectives and Perspectives

High-quality datasets are typically required for accomplishing data-driven tasks, such as training medical diagnosis models, predicting real-time traffic conditions, or conducting experiments to validate research hypotheses. Consequently,…

Information Retrieval · Computer Science 2025-09-03 Pengyue Li , Sheng Wang , Hua Dai , Zhiyu Chen , Zhifeng Bao , Brian D. Davison

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

The advent of large language models (LLMs) has transformed information access and reasoning through open-ended natural language interaction. However, LLMs remain limited by static knowledge, factual hallucinations, and the inability to…

Artificial Intelligence · Computer Science 2025-10-29 Minhua Lin , Zongyu Wu , Zhichao Xu , Hui Liu , Xianfeng Tang , Qi He , Charu Aggarwal , Hui Liu , Xiang Zhang , Suhang Wang

A Survey of Generative Search and Recommendation in the Era of Large Language Models

With the information explosion on the Web, search and recommendation are foundational infrastructures to satisfying users' information needs. As the two sides of the same coin, both revolve around the same core research problem, matching…

Information Retrieval · Computer Science 2024-04-29 Yongqi Li , Xinyu Lin , Wenjie Wang , Fuli Feng , Liang Pang , Wenjie Li , Liqiang Nie , Xiangnan He , Tat-Seng Chua

Evolving Paradigms in Task-Based Search and Learning: A Comparative Analysis of Traditional Search Engine with LLM-Enhanced Conversational Search System

Large Language Models (LLMs) are rapidly reshaping information retrieval by enabling interactive, generative, and inference-driven search. While traditional keyword-based search remains central to web and academic information access, it…

Information Retrieval · Computer Science 2025-12-02 Zhitong Guan , Yi Wang

An ontology-based approach for semantics ranking of the web search engines results

This work falls in the areas of information retrieval and semantic web, and aims to improve the evaluation of web search tools. Indeed, the huge number of information on the web as well as the growth of new inexperienced users creates new…

Information Retrieval · Computer Science 2012-12-12 Abdelkrim Bouramoul , Mohamed-Khireddine Kholladi , Bich-Liên Doan

Rethinking Agentic RAG: Toward LLM-Driven Logical Retrieval Beyond Embeddings

Recent advances in RAG have shifted toward an agentic paradigm, where LLMs interact with retrieval systems over multiple turns and iteratively refine queries based on intermediate results. At the same time, LLMs have demonstrated a strong…

Information Retrieval · Computer Science 2026-05-27 Yuqi Zeng , Qixiang Deng , Yulei Wan , Ruiquan Jiang , Xiaoqing Zheng , Xuanjing Huang

Rethinking Dataset Discovery with DataScout

Dataset Search -- the process of finding appropriate datasets for a given task -- remains a critical yet under-explored challenge in data science workflows. Assessing dataset suitability for a task (e.g., training a classification model) is…

Human-Computer Interaction · Computer Science 2025-07-28 Rachel Lin , Bhavya Chopra , Wenjing Lin , Shreya Shankar , Madelon Hulsebos , Aditya G. Parameswaran

User-Centric Evidence Ranking for Attribution and Fact Verification

Attribution and fact verification are critical challenges in natural language processing for assessing information reliability. While automated systems and Large Language Models (LLMs) aim to retrieve and select concise evidence to support…

Computation and Language · Computer Science 2026-01-30 Guy Alt , Eran Hirsch , Serwar Basch , Ido Dagan , Oren Glickman

GraphSearch: An Agentic Deep Searching Workflow for Graph Retrieval-Augmented Generation

Graph Retrieval-Augmented Generation (GraphRAG) enhances factual reasoning in LLMs by structurally modeling knowledge through graph-based representations. However, existing GraphRAG approaches face two core limitations: shallow retrieval…

Computation and Language · Computer Science 2025-10-01 Cehao Yang , Xiaojun Wu , Xueyuan Lin , Chengjin Xu , Xuhui Jiang , Yuanliang Sun , Jia Li , Hui Xiong , Jian Guo

Beyond Relevance: Utility-Centric Retrieval in the LLM Era

Information retrieval systems have traditionally optimized for topical relevance-the degree to which retrieved documents match a query. However, relevance only approximates a deeper goal: utility, namely, whether retrieved information helps…

Information Retrieval · Computer Science 2026-04-13 Hengran Zhang , Minghao Tang , Keping Bi , Jiafeng Guo

Do We Still Need GraphRAG? Benchmarking RAG and GraphRAG for Agentic Search Systems

Retrieval-augmented generation (RAG) and its graph-based extensions (GraphRAG) are effective paradigms for improving large language model (LLM) reasoning by grounding generation in external knowledge. However, most existing RAG and GraphRAG…

Information Retrieval · Computer Science 2026-04-14 Dongzhe Fan , Zheyi Xue , Siyuan Liu , Qiaoyu Tan

RAG-Match: Retrieval-Augmented Knowledge Injection and Hierarchical Reasoning for Calibrated Semantic Relevance

Semantic relevance judgment for search is particularly challenging in knowledge-intensive scenarios, where accurate ranking requires not only semantic matching but also background grounding, multi-step reasoning, and well-calibrated…

Information Retrieval · Computer Science 2026-05-26 Hengjun Jiang , Liansheng Sun , Yan Jiang , Xiaojie Ke , Yongjin Wang , Xiangkun Liu , Cunxin Gu , Jian Xu , Guanjun Jiang

A Survey of Conversational Search

As a cornerstone of modern information access, search engines have become indispensable in everyday life. With the rapid advancements in AI and natural language processing (NLP) technologies, particularly large language models (LLMs),…

Computation and Language · Computer Science 2025-08-07 Fengran Mo , Kelong Mao , Ziliang Zhao , Hongjin Qian , Haonan Chen , Yiruo Cheng , Xiaoxi Li , Yutao Zhu , Zhicheng Dou , Jian-Yun Nie

Patience is all you need! An agentic system for performing scientific literature review

Large language models (LLMs) have grown in their usage to provide support for question answering across numerous disciplines. The models on their own have already shown promise for answering basic questions, however fail quickly where…

Information Retrieval · Computer Science 2025-04-15 David Brett , Anniek Myatt

A Survey of Reasoning-Intensive Retrieval: Progress and Challenges

Reasoning-Intensive Retrieval (RIR) targets retrieval settings where relevance is mediated by latent inferential links between a query and supporting evidence, rather than semantic similarity. Motivated by the emergent reasoning abilities…

Information Retrieval · Computer Science 2026-05-04 Yiyang Wei , Tingyu Song , Siyue Zhang , Yilun Zhao

Using Context to Improve the Evaluation of Information Retrieval Systems

The crucial role of the evaluation in the development of the information retrieval tools is useful evidence to improve the performance of these tools and the quality of results that they return. However, the classic evaluation approaches…

Information Retrieval · Computer Science 2011-06-01 Abdelkrim Bouramoul , Mohamed-Khireddine Kholladi , Bich-Lien Doan

Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines

A cross-disciplinary examination of the user behaviours involved in seeking and evaluating data is surprisingly absent from the research data discussion. This review explores the data retrieval literature to identify commonalities in how…

Digital Libraries · Computer Science 2020-03-13 Kathleen Gregory , Paul Groth , Helena Cousijn , Andrea Scharnhorst , Sally Wyatt

Benchmarking Deep Search over Heterogeneous Enterprise Data

We present a new benchmark for evaluating Deep Search--a realistic and complex form of retrieval-augmented generation (RAG) that requires source-aware, multi-hop reasoning over diverse, sparsed, but related sources. These include documents,…

Computation and Language · Computer Science 2025-07-01 Prafulla Kumar Choubey , Xiangyu Peng , Shilpa Bhagavath , Kung-Hsiang Huang , Caiming Xiong , Chien-Sheng Wu

Face the Facts! Evaluating RAG-based Pipelines for Professional Fact-Checking

Natural Language Processing and Generation systems have recently shown the potential to complement and streamline the costly and time-consuming job of professional fact-checkers. In this work, we lift several constraints of current…

Computation and Language · Computer Science 2025-10-30 Daniel Russo , Stefano Menini , Jacopo Staiano , Marco Guerini