Related papers: TARexp: A Python Framework for Technology-Assisted…

Certifying One-Phase Technology-Assisted Reviews

Technology-assisted review (TAR) workflows based on iterative active learning are widely used in document review applications. Most stopping rules for one-phase TAR workflows lack valid statistical guarantees, which has discouraged their…

Information Retrieval · Computer Science 2021-08-31 David D. Lewis , Eugene Yang , Ophir Frieder

MLXP: A Framework for Conducting Replicable Experiments in Python

Replicability in machine learning (ML) research is increasingly concerning due to the utilization of complex non-deterministic algorithms and the dependence on numerous hyper-parameter choices, such as model architecture and training…

Machine Learning · Computer Science 2024-06-18 Michael Arbel , Alexandre Zouaoui

From Protocol to Screening: A Hybrid Learning Approach for Technology-Assisted Systematic Literature Reviews

In the medical domain, a Systematic Literature Review (SLR) attempts to collect all empirical evidence, that fit pre-specified eligibility criteria, in order to answer a specific research question. The process of preparing an SLR consists…

Information Retrieval · Computer Science 2020-11-20 Athanasios Lagopoulos , Grigorios Tsoumakas

Ark: An Open-source Python-based Framework for Robot Learning

Robotics has made remarkable hardware strides-from DARPA's Urban and Robotics Challenges to the first humanoid-robot kickboxing tournament-yet commercial autonomy still lags behind progress in machine learning. A major bottleneck is…

Robotics · Computer Science 2025-07-15 Magnus Dierking , Christopher E. Mower , Sarthak Das , Huang Helong , Jiacheng Qiu , Cody Reading , Wei Chen , Huidong Liang , Huang Guowei , Jan Peters , Quan Xingyue , Jun Wang , Haitham Bou-Ammar

Simplified Data Wrangling with ir_datasets

Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset documentation is scattered across the Internet and once one obtains a copy of the data, there are numerous different data formats to work with. Even…

Information Retrieval · Computer Science 2021-05-11 Sean MacAvaney , Andrew Yates , Sergey Feldman , Doug Downey , Arman Cohan , Nazli Goharian

Stopping Methods for Technology Assisted Reviews based on Point Processes

Technology Assisted Review (TAR), which aims to reduce the effort required to screen collections of documents for relevance, is used to develop systematic reviews of medical evidence and identify documents that must be disclosed in response…

Information Retrieval · Computer Science 2023-11-16 Mark Stevenson , Reem Bin-Hezam

RLStop: A Reinforcement Learning Stopping Method for TAR

We present RLStop, a novel Technology Assisted Review (TAR) stopping rule based on reinforcement learning that helps minimise the number of documents that need to be manually reviewed within TAR applications. RLStop is trained on example…

Information Retrieval · Computer Science 2024-06-10 Reem Bin-Hezam , Mark Stevenson

Using Chao's Estimator as a Stopping Criterion for Technology-Assisted Review

Technology-Assisted Review (TAR) aims to reduce the human effort required for screening processes such as abstract screening for systematic literature reviews. Human reviewers label documents as relevant or irrelevant during this process,…

Information Retrieval · Computer Science 2024-04-02 Michiel P. Bron , Peter G. M. van der Heijden , Ad J. Feelders , Arno P. J. M. Siebes

Beyond Experience Sampling: Evaluating Personal Informatics with Technology-Assisted Reconstruction

Experience Sampling has been considered the golden standard of in-situ measurement, yet, at the expense of high burden to participants. In this paper we propose Technology-Assisted Reconstruction (TAR), a methodological approach that…

Human-Computer Interaction · Computer Science 2012-07-10 Evangelos Karapanos

On Minimizing Cost in Legal Document Review Workflows

Technology-assisted review (TAR) refers to human-in-the-loop machine learning workflows for document review in legal discovery and other high recall review tasks. Attorneys and legal technologists have debated whether review should be a…

Information Retrieval · Computer Science 2021-06-21 Eugene Yang , David D. Lewis , Ophir Frieder

Heuristic Stopping Rules For Technology-Assisted Review

Technology-assisted review (TAR) refers to human-in-the-loop active learning workflows for finding relevant documents in large collections. These workflows often must meet a target for the proportion of relevant documents found (i.e.…

Information Retrieval · Computer Science 2021-06-21 Eugene Yang , David D. Lewis , Ophir Frieder

ir_explain: a Python Library of Explainable IR Methods

While recent advancements in Neural Ranking Models have resulted in significant improvements over traditional statistical retrieval models, it is generally acknowledged that the use of large neural architectures and the application of…

Information Retrieval · Computer Science 2025-05-13 Sourav Saha , Harsh Agarwal , V Venktesh , Avishek Anand , Swastik Mohanty , Debapriyo Majumdar , Mandar Mitra

The Information Retrieval Experiment Platform

We integrate ir_datasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and even blinded retrieval experiments. Standardization is…

Information Retrieval · Computer Science 2023-05-31 Maik Fröbe , Jan Heinrich Reimer , Sean MacAvaney , Niklas Deckers , Simon Reich , Janek Bevendorff , Benno Stein , Matthias Hagen , Martin Potthast

Jarvis-HEP: A lightweight Python framework for workflow composition and parameter scans in high-energy physics

High-energy physics phenomenology often requires linking multiple computational tools to evaluate observables, likelihoods, and experimental constraints across nontrivial parameter spaces. In this work, we introduce Jarvis-HEP, a…

High Energy Physics - Phenomenology · Physics 2026-04-29 Erdong Guo , Paul Jackson , Jin Min Yang , Pengxuan Zhu

Combining Counting Processes and Classification Improves a Stopping Rule for Technology Assisted Review

Technology Assisted Review (TAR) stopping rules aim to reduce the cost of manually assessing documents for relevance by minimising the number of documents that need to be examined to ensure a desired level of recall. This paper extends an…

Information Retrieval · Computer Science 2023-12-07 Reem Bin-Hezam , Mark Stevenson

repro_eval: A Python Interface to Reproducibility Measures of System-oriented IR Experiments

In this work we introduce repro_eval - a tool for reactive reproducibility studies of system-oriented information retrieval (IR) experiments. The corresponding Python package provides IR researchers with measures for different levels of…

Information Retrieval · Computer Science 2022-01-20 Timo Breuer , Nicola Ferro , Maria Maistro , Philipp Schaer

TAR on Social Media: A Framework for Online Content Moderation

Content moderation (removing or limiting the distribution of posts based on their contents) is one tool social networks use to fight problems such as harassment and disinformation. Manually screening all content is usually impractical given…

Information Retrieval · Computer Science 2021-08-31 Eugene Yang , David D. Lewis , Ophir Frieder

Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework.…

Information Retrieval · Computer Science 2022-01-26 Cash Costello , Eugene Yang , Dawn Lawrie , James Mayfield

Advancing Trace Recovery Evaluation - Applied Information Retrieval in a Software Engineering Context

Successful development of software systems involves efficient navigation among software artifacts. One state-of-practice approach to structure information is to establish trace links between artifacts, a practice that is also enforced by…

Software Engineering · Computer Science 2016-02-25 Markus Borg

Reproducible Experiment Platform

Data analysis in fundamental sciences nowadays is an essential process that pushes frontiers of our knowledge and leads to new discoveries. At the same time we can see that complexity of those analyses increases fast due to a)~enormous…

Data Analysis, Statistics and Probability · Physics 2016-01-20 Tatiana Likhomanenko , Alex Rogozhnikov , Alexander Baranov , Egor Khairullin , Andrey Ustyuzhanin