Related papers: A Data-Centric Framework for Composable NLP Workfl…

A Workflow Manager for Complex NLP and Content Curation Pipelines

We present a workflow manager for the flexible creation and customisation of NLP processing pipelines. The workflow manager addresses challenges in interoperability across various different NLP tasks and hardware-based resource usage. Based…

Computation and Language · Computer Science 2020-04-30 Julián Moreno-Schneider, Peter Bourgonje, Florian Kintzel, Georg Rehm

tasksource: A Dataset Harmonization Framework for Streamlined NLP Multi-Task Learning and Evaluation

The HuggingFace Datasets Hub hosts thousands of datasets, offering exciting opportunities for language model training and evaluation. However, datasets for a specific task type often have different schemas, making harmonization challenging.…

Computation and Language · Computer Science 2023-05-17 Damien Sileo

NLP for Local Governance Meeting Records: A Focus Article on Tasks, Datasets, Metrics and Benchmark

Local governance meeting records are official documents, in the form of minutes or transcripts, documenting how proposals, discussions, and procedural actions unfold during institutional meetings. While generally structured, these documents…

Computation and Language · Computer Science 2026-02-10 Ricardo Campos, José Pedro Evans, José Miguel Isidro, Miguel Marques, Luís Filipe Cunha, Alípio Jorge, Sérgio Nunes, Nuno Guimarães

STAMP 4 NLP -- An Agile Framework for Rapid Quality-Driven NLP Applications Development

The progress in natural language processing (NLP) research over the last years offers novel business opportunities for companies, such as automated user interaction or improved data analysis. Building sophisticated NLP applications requires…

Computation and Language · Computer Science 2021-11-17 Philipp Kohl, Oliver Schmidts, Lars Klöser, Henri Werth, Bodo Kraft, Albert Zündorf

A Review of Hybrid and Ensemble in Deep Learning for Natural Language Processing

This review presents a comprehensive exploration of hybrid and ensemble deep learning models within Natural Language Processing (NLP), shedding light on their transformative potential across diverse tasks such as Sentiment Analysis, Named…

Artificial Intelligence · Computer Science 2024-08-09 Jianguo Jia, Wen Liang, Youzhi Liang

Natural Language Processing in Biomedicine: A Unified System Architecture Overview

In modern electronic medical records (EMR) much of the clinically important data - signs and symptoms, symptom severity, disease status, etc. - are not provided in structured data fields, but rather are encoded in clinician generated…

Computation and Language · Computer Science 2014-08-10 Son Doan, Mike Conway, Tu Minh Phuong, Lucila Ohno-Machado

Towards Explainable NLP: A Generative Explanation Framework for Text Classification

Building explainable systems is a critical problem in the field of Natural Language Processing (NLP), since most machine learning models provide no explanations for the predictions. Existing approaches for explainable machine learning…

Computation and Language · Computer Science 2019-06-12 Hui Liu, Qingyu Yin, William Yang Wang

HugNLP: A Unified and Comprehensive Library for Natural Language Processing

In this paper, we introduce HugNLP, a unified and comprehensive library for natural language processing (NLP) with the prevalent backend of HuggingFace Transformers, which is designed for NLP researchers to easily utilize off-the-shelf…

Computation and Language · Computer Science 2023-03-01 Jianing Wang, Nuo Chen, Qiushi Sun, Wenkang Huang, Chengyu Wang, Ming Gao

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has…

Machine Learning · Computer Science 2023-09-20 Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu

Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing

Explainable NLP (ExNLP) has increasingly focused on collecting human-annotated textual explanations. These explanations are used downstream in three ways: as data augmentation to improve performance on a predictive task, as supervision to…

Computation and Language · Computer Science 2021-12-08 Sarah Wiegreffe, Ana Marasović

Composable NLP Workflows for BERT-based Ranking and QA System

There has been a lot of progress towards building NLP models that scale to multiple tasks. However, real-world systems contain multiple components and it is tedious to handle cross-task interaction with varying levels of text granularity.…

Computation and Language · Computer Science 2025-04-15 Gaurav Kumar, Murali Mohana Krishna Dandu

XNLP: An Interactive Demonstration System for Universal Structured NLP

Structured Natural Language Processing (XNLP) is an important subset of NLP that entails understanding the underlying semantic or syntactic structure of texts, which serves as a foundational component for many downstream applications.…

Computation and Language · Computer Science 2024-06-24 Hao Fei, Meishan Zhang, Min Zhang, Tat-Seng Chua

Neural Approaches to Entity-Centric Information Extraction

Artificial Intelligence (AI) has huge impact on our daily lives with applications such as voice assistants, facial recognition, chatbots, autonomously driving cars, etc. Natural Language Processing (NLP) is a cross-discipline of AI and…

Computation and Language · Computer Science 2023-04-18 Klim Zaporojets

A hybrid classical-quantum workflow for natural language processing

Natural language processing (NLP) problems are ubiquitous in classical computing, where they often require significant computational resources to infer sentence meanings. With the appearance of quantum computing hardware and simulators, it…

Quantum Physics · Physics 2020-10-09 Lee J. O'Riordan, Myles Doyle, Fabio Baruffa, Venkatesh Kannan

Beyond modeling: NLP Pipeline for efficient environmental policy analysis

As we enter the UN Decade on Ecosystem Restoration, creating effective incentive structures for forest and landscape restoration has never been more critical. Policy analysis is necessary for policymakers to understand the actors and rules…

Computation and Language · Computer Science 2022-01-19 Jordi Planas, Daniel Firebanks-Quevedo, Galina Naydenova, Ramansh Sharma, Cristina Taylor, Kathleen Buckingham, Rong Fang

A Tidy Data Model for Natural Language Processing using cleanNLP

The package cleanNLP provides a set of fast tools for converting a textual corpus into a set of normalized tables. The underlying natural language processing pipeline utilizes Stanford's CoreNLP library, exposing a number of annotation…

Computation and Language · Computer Science 2018-05-04 Taylor Arnold

Natural Language Processing for EHR-Based Computational Phenotyping

This article reviews recent advances in applying natural language processing (NLP) to Electronic Health Records (EHRs) for computational phenotyping. NLP-based computational phenotyping has numerous applications including diagnosis…

Computation and Language · Computer Science 2018-06-18 Zexian Zeng, Yu Deng, Xiaoyu Li, Tristan Naumann, Yuan Luo

Standardising the NLP Workflow: A Framework for Reproducible Linguistic Analysis

The introduction of large language models and other influential developments in AI-based language processing have led to an evolution in the methods available to quantitatively analyse language data. With the resultant growth of attention…

Computation and Language · Computer Science 2025-11-20 Yves Pauli, Jan-Bernard Marsman, Finn Rabe, Victoria Edkins, Roya Hüppi, Silvia Ciampelli, Akhil Ratan Misra, Nils Lang, Wolfram Hinzen, Iris Sommer, Philipp Homan

Natural Language Processing for Systems Engineering: Automatic Generation of Systems Modelling Language Diagrams

The design of complex engineering systems is an often long and articulated process that highly relies on engineers' expertise and professional judgment. As such, the typical pitfalls of activities involving the human factor often manifest…

Computation and Language · Computer Science 2022-11-22 Shaohong Zhong, Andrea Scarinci, Alice Cicirello

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Machine Learning · Computer Science 2025-12-19 Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang Han, Meiyi Qiang, Yalin Feng, Tianyi Bai, Zewei Pan, Ziyi Guo, Yizhen Jiang, Jingwen Deng, Qijie You, Peichao Lai, Tianyu Guo, Chi Hsu Tsai, Hengyi Feng, Rui Hu, Wenkai Yu, Junbo Niu, Bohan Zeng, Ruichuan An, Lu Ma, Jihao Huang, Yaowei Zheng, Conghui He, Linpeng Tang, Bin Cui, Weinan E, Wentao Zhang