English
Related papers

Related papers: Pyndri: a Python Interface to the Indri Search Eng…

200 papers

In recent years, the information retrieval (IR) community has witnessed the first successful applications of deep neural network models to short-text matching and ad-hoc retrieval. It is exciting to see the research on deep neural networks…

Information Retrieval · Computer Science 2017-02-17 Bhaskar Mitra , Fernando Diaz , Nick Craswell

We present a simple web search engine for indexing and searching html documents using python programming language. Because python is well known for its simple syntax and strong support for main operating systems, we hope it will be…

Information Retrieval · Computer Science 2012-02-10 Andri Mirzal

We present Spacerini, a tool that integrates the Pyserini toolkit for reproducible information retrieval research with Hugging Face to enable the seamless construction and deployment of interactive search engines. Spacerini makes…

Information Retrieval · Computer Science 2024-03-26 Christopher Akiki , Odunayo Ogundepo , Aleksandra Piktus , Xinyu Zhang , Akintunde Oladipo , Jimmy Lin , Martin Potthast

This paper presents rerankers, a Python library which provides an easy-to-use interface to the most commonly used re-ranking approaches. Re-ranking is an integral component of many retrieval pipelines; however, there exist numerous…

Information Retrieval · Computer Science 2024-09-04 Benjamin Clavié

A wide range of transformer-based language models have been proposed for information retrieval tasks. However, including transformer-based models in retrieval pipelines is often complex and requires substantial engineering effort. In this…

Information Retrieval · Computer Science 2025-04-16 Ferdinand Schlatt , Maik Fröbe , Matthias Hagen

Pyserini is an easy-to-use Python toolkit that supports replicable IR research by providing effective first-stage retrieval in a multi-stage ranking architecture. Our toolkit is self-contained as a standard Python package and comes with…

Information Retrieval · Computer Science 2021-02-22 Jimmy Lin , Xueguang Ma , Sheng-Chieh Lin , Jheng-Hong Yang , Ronak Pradeep , Rodrigo Nogueira

PyTerrier provides a declarative framework for building and experimenting with Information Retrieval (IR) pipelines. In this demonstration, we highlight several recent pipeline operations that improve their ability to be programmatically…

Information Retrieval · Computer Science 2026-02-02 Emmanouil Georgios Lionis , Craig Macdonald , Sean MacAvaney

The advent of deep machine learning platforms such as Tensorflow and Pytorch, developed in expressive high-level languages such as Python, have allowed more expressive representations of deep neural network architectures. We argue that such…

Information Retrieval · Computer Science 2020-07-29 Craig Macdonald , Nicola Tonellotto

Python data science libraries such as Pandas and NumPy have recently gained immense popularity. Although these libraries are feature-rich and easy to use, their scalability limitations require more robust computational resources. In this…

Databases · Computer Science 2024-07-17 Hesam Shahrokhi , Amirali Kaboli , Mahdi Ghorbani , Amir Shaikhha

Reviewing the literature to understand relevant threads of past work is a critical part of research and vehicle for learning. However, as the scientific literature grows the challenges for users to find and make sense of the many different…

Human-Computer Interaction · Computer Science 2022-08-17 Hyeonsu B. Kang , Joseph Chee Chang , Yongsung Kim , Aniket Kittur

We introduce pytrec_eval, a Python interface to the tree_eval information retrieval evaluation toolkit. pytrec_eval exposes the reference implementations of trec_eval within Python as a native extension. We show that pytrec_eval is around…

Information Retrieval · Computer Science 2018-06-06 Christophe Van Gysel , Maarten de Rijke

We give novel Python and R interfaces for the (Java) Tetrad project for causal modeling, search, and estimation. The Tetrad project is a mainstay in the literature, having been under consistent development for over 30 years. Some of its…

Mathematical Software · Computer Science 2023-08-16 Joseph D. Ramsey , Bryan Andrews

With the large diversity of platforms and devices used by students, web applications increasingly suggest themselves as the solution of choice. Developing adequate educational programming environments in the browser, however, remains a…

Programming Languages · Computer Science 2024-10-10 Clemens Bachmann , Alexandra Maximova , Tobias Kohn , Dennis Komm

While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework.…

Information Retrieval · Computer Science 2022-01-26 Cash Costello , Eugene Yang , Dawn Lawrie , James Mayfield

Web is title admittance today mainly relies on search engines. A large amount of data is hidden in the databases behind the search interfaces referred to as Hidden web, which needs to be indexed so in order to serve user query. In this…

Information Retrieval · Computer Science 2013-11-21 Sudhakar Ranjan , Komal K. Bhatia

MaRDI Open Interfaces is a software project aimed at improving reuse and interoperability in Scientific Computing by alleviating the difficulties of crossing boundaries between different programming languages, in which numerical packages…

Mathematical Software · Computer Science 2025-11-18 Dmitry I. Kabanov , Stephan Rave , Mario Ohlberger

A new Python API, integrated within the NLTK suite, offers access to the FrameNet 1.7 lexical database. The lexicon (structured in terms of frames) as well as annotated sentences can be processed programatically, or browsed with…

Computation and Language · Computer Science 2017-07-25 Nathan Schneider , Chuck Wooters

In this work we introduce repro_eval - a tool for reactive reproducibility studies of system-oriented information retrieval (IR) experiments. The corresponding Python package provides IR researchers with measures for different levels of…

Information Retrieval · Computer Science 2022-01-20 Timo Breuer , Nicola Ferro , Maria Maistro , Philipp Schaer

Pythonic code is idiomatic code that follows guiding principles and practices within the Python community. Offering performance and readability benefits, Pythonic code is claimed to be widely adopted by experienced Python developers, but…

Deep Web databases contain more than 90% of pertinent information of the Web. Despite their importance, users don't profit of this treasury. Many deep web services are offering competitive services in term of prices, quality of service, and…

Information Retrieval · Computer Science 2012-05-07 Radhouane Boughamoura , Lobna Hlaoua , Mohamed Nazih Omri
‹ Prev 1 2 3 10 Next ›