Related papers: Benchmark for Complex Answer Retrieval

Characterizing Question Facets for Complex Answer Retrieval

Complex answer retrieval (CAR) is the process of retrieving answers to questions that have multifaceted or nuanced answers. In this work, we present two novel approaches for CAR based on the observation that question facets can vary in…

Information Retrieval · Computer Science 2018-05-03 Sean MacAvaney , Andrew Yates , Arman Cohan , Luca Soldaini , Kai Hui , Nazli Goharian , Ophir Frieder

Interactive Retrieval Based on Wikipedia Concepts

This paper presents a new user feedback mechanism based on Wikipedia concepts for interactive retrieval. In this mechanism, the system presents to the user a group of Wikipedia concepts, and the user can choose those relevant to refine…

Information Retrieval · Computer Science 2014-12-30 Lanbo Zhang

Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track

Large-scale text retrieval technology has been widely used in various practical business scenarios. This paper presents our systems for the TREC 2022 Deep Learning Track. We explain the hybrid text retrieval and multi-stage text ranking…

Information Retrieval · Computer Science 2023-08-24 Guangwei Xu , Yangzhao Zhang , Longhui Zhang , Dingkun Long , Pengjun Xie , Ruijie Guo

A Survey on Information Retrieval, Text Categorization, and Web Crawling

This paper is a survey discussing Information Retrieval concepts, methods, and applications. It goes deep into the document and query modelling involved in IR systems, in addition to pre-processing operations such as removing stop words and…

Information Retrieval · Computer Science 2012-12-11 Youssef Bassil

Overview of the TREC 2021 Fair Ranking Track

The TREC Fair Ranking Track aims to provide a platform for participants to develop and evaluate novel retrieval algorithms that can provide a fair exposure to a mixture of demographics or attributes, such as ethnicity, that are represented…

Information Retrieval · Computer Science 2023-02-22 Michael D. Ekstrand , Graham McDonald , Amifa Raj , Isaac Johnson

Overcoming low-utility facets for complex answer retrieval

Many questions cannot be answered simply; their answers must include numerous nuanced details and additional context. Complex Answer Retrieval (CAR) is the retrieval of answers to such questions. In their simplest form, these questions are…

Information Retrieval · Computer Science 2018-11-22 Sean MacAvaney , Andrew Yates , Arman Cohan , Luca Soldaini , Kai Hui , Nazli Goharian , Ophir Frieder

Overview of the TREC 2022 Fair Ranking Track

The TREC Fair Ranking Track aims to provide a platform for participants to develop and evaluate novel retrieval algorithms that can provide a fair exposure to a mixture of demographics or attributes, such as ethnicity, that are represented…

Information Retrieval · Computer Science 2023-02-14 Michael D. Ekstrand , Graham McDonald , Amifa Raj , Isaac Johnson

Topic Level Disambiguation for Weak Queries

Despite limited success, information retrieval (IR) systems today are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries).…

Information Retrieval · Computer Science 2015-02-18 Hui Zhang , Kiduk Yang , Elin Jacob

Use of Wikipedia categories on information retrieval research: a brief review

Wikipedia categories, a classification scheme built for organizing and describing Wikpedia articles, are being applied in computer science research. This paper adopts a systematic literature review approach, in order to identify different…

Digital Libraries · Computer Science 2020-04-22 Jesús Tramullas , Piedad Garrido-Picazo , Ana I. Sánchez-Casabón

WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval

With the rise in mobile and voice search, answer passage retrieval acts as a critical component of an effective information retrieval system for open domain question answering. Currently, there are no comparable collections that address…

Information Retrieval · Computer Science 2018-05-11 Daniel Cohen , Liu Yang , W. Bruce Croft

Neural Article Pair Modeling for Wikipedia Sub-article Matching

Nowadays, editors tend to separate different subtopics of a long Wiki-pedia article into multiple sub-articles. This separation seeks to improve human readability. However, it also has a deleterious effect on many Wikipedia-based tasks that…

Information Retrieval · Computer Science 2019-06-24 Muhao Chen , Changping Meng , Gang Huang , Carlo Zaniolo

A framework for contextual information retrieval from the WWW

Search engines are the most commonly used type of tool for finding relevant information on the Internet. However, today's search engines are far from perfect. Typical search queries are short, often one or two words, and can be ambiguous…

Information Retrieval · Computer Science 2014-07-24 Dilip K. Limbu , Andy M. Connor , Stephen G. MacDonell

Dynamic Information Retrieval: Theoretical Framework and Application

Theoretical frameworks like the Probability Ranking Principle and its more recent Interactive Information Retrieval variant have guided the development of ranking and retrieval algorithms for decades, yet they are not capable of helping us…

Information Retrieval · Computer Science 2016-01-19 Marc Sloan , Jun Wang

Overview of the TREC 2023 NeuCLIR Track

The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR) track is to study the impact of neural approaches to cross-language information retrieval. The track has created four collections, large collections of…

Information Retrieval · Computer Science 2024-04-15 Dawn Lawrie , Sean MacAvaney , James Mayfield , Paul McNamee , Douglas W. Oard , Luca Soldaini , Eugene Yang

Overview of the TREC 2025 Retrieval Augmented Generation (RAG) Track

The second edition of the TREC Retrieval Augmented Generation (RAG) Track advances research on systems that integrate retrieval and generation to address complex, real-world information needs. Building on the foundation of the inaugural…

Information Retrieval · Computer Science 2026-03-11 Shivani Upadhyay , Nandan Thakur , Ronak Pradeep , Nick Craswell , Daniel Campos , Jimmy Lin

Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval

With the development of deep learning and natural language processing techniques, pre-trained language models have been widely used to solve information retrieval (IR) problems. Benefiting from the pre-training and fine-tuning paradigm,…

Information Retrieval · Computer Science 2024-01-02 Weihang Su , Qingyao Ai , Xiangsheng Li , Jia Chen , Yiqun Liu , Xiaolong Wu , Shengluan Hou

IR-BERT: Leveraging BERT for Semantic Search in Background Linking for News Articles

This work describes our two approaches for the background linking task of TREC 2020 News Track. The main objective of this task is to recommend a list of relevant articles that the reader should refer to in order to understand the context…

Information Retrieval · Computer Science 2020-07-27 Anup Anand Deshmukh , Udhav Sethi

Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model

Generating high-quality answers consistently by providing contextual information embedded in the prompt passed to the Large Language Model (LLM) is dependent on the quality of information retrieval. As the corpus of contextual information…

Information Retrieval · Computer Science 2024-08-01 Sai Ganesh , Anupam Purwar , Gautam B

Reading Wikipedia to Answer Open-Domain Questions

This paper proposes to tackle open- domain question answering using Wikipedia as the unique knowledge source: the answer to any factoid question is a text span in a Wikipedia article. This task of machine reading at scale combines the…

Computation and Language · Computer Science 2017-05-01 Danqi Chen , Adam Fisch , Jason Weston , Antoine Bordes

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

We present a new dataset of Wikipedia articles each paired with a knowledge graph, to facilitate the research in conditional text generation, graph generation and graph representation learning. Existing graph-text paired datasets typically…

Computation and Language · Computer Science 2021-07-21 Luyu Wang , Yujia Li , Ozlem Aslan , Oriol Vinyals