Related papers: CLAIRLIB Documentation v1.03

ChemNLP: A Natural Language Processing based Library for Materials Chemistry Text Data

In this work, we present the ChemNLP library that can be used for 1) curating open access datasets for materials and chemistry literature, developing and comparing traditional machine learning, transformers and graph neural network models…

Materials Science · Physics 2024-02-19 Kamal Choudhary , Mathew L. Kelley

Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization

We present Mirror, an open-source platform for data exploration and analysis powered by large language models. Mirror offers an intuitive natural language interface for querying databases, and automatically generates executable SQL commands…

Databases · Computer Science 2023-03-16 Canwen Xu , Julian McAuley , Penghan Wang

LexNLP: Natural language processing and information extraction for legal and regulatory texts

LexNLP is an open source Python package focused on natural language processing and machine learning for legal and regulatory text. The package includes functionality to (i) segment documents, (ii) identify key text such as titles and…

Computation and Language · Computer Science 2018-06-12 Michael J Bommarito , Daniel Martin Katz , Eric M Detterman

How May I Help You? Using Neural Text Simplification to Improve Downstream NLP Tasks

The general goal of text simplification (TS) is to reduce text complexity for human consumption. This paper investigates another potential use of neural TS: assisting machines performing natural language processing (NLP) tasks. We evaluate…

Computation and Language · Computer Science 2021-09-15 Hoang Van , Zheng Tang , Mihai Surdeanu

An NLP Assistant for Clide

This report describes an NLP assistant for the collaborative development environment Clide, that supports the development of NLP applications by providing easy access to some common NLP data structures. The assistant visualizes text…

Computation and Language · Computer Science 2014-09-09 Tobias Kortkamp

NLP-SIR: A Natural Language Approach for Spreadsheet Information Retrieval

Spreadsheets are a ubiquitous software tool, used for a wide variety of tasks such as financial modelling, statistical analysis and inventory management. Extracting meaningful information from such data can be a difficult task, especially…

Software Engineering · Computer Science 2009-08-11 Derek Flood , Kevin Mc Daid , Fergal Mc Caffery

A Tidy Data Model for Natural Language Processing using cleanNLP

The package cleanNLP provides a set of fast tools for converting a textual corpus into a set of normalized tables. The underlying natural language processing pipeline utilizes Stanford's CoreNLP library, exposing a number of annotation…

Computation and Language · Computer Science 2018-05-04 Taylor Arnold

Comprehensive Implementation of TextCNN for Enhanced Collaboration between Natural Language Processing and System Recommendation

Natural Language Processing (NLP) is an important branch of artificial intelligence that studies how to enable computers to understand, process, and generate human language. Text classification is a fundamental task in NLP, which aims to…

Computation and Language · Computer Science 2024-03-18 Xiaonan Xu , Zheng Xu , Zhipeng Ling , Zhengyu Jin , ShuQian Du

The Cross-Lingual Arabic Information REtrieval (CLAIRE) System

Despite advances in neural machine translation, cross-lingual retrieval tasks in which queries and documents live in different natural language spaces remain challenging. Although neural translation models may provide an intuitive approach…

Information Retrieval · Computer Science 2021-07-30 Zhizhong Chen , Carsten Eickhoff

RLlib: Abstractions for Distributed Reinforcement Learning

Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable…

Artificial Intelligence · Computer Science 2018-07-02 Eric Liang , Richard Liaw , Philipp Moritz , Robert Nishihara , Roy Fox , Ken Goldberg , Joseph E. Gonzalez , Michael I. Jordan , Ion Stoica

Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment

With the increasing accessibility and utilization of multilingual documents, Cross-Lingual Information Retrieval (CLIR) has emerged as an important research area. Conventionally, CLIR tasks have been conducted under settings where the…

Information Retrieval · Computer Science 2026-04-08 Seongtae Hong , Youngjoon Jang , Jungseob Lee , Hyeonseok Moon , Heuiseok Lim

Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout

State-of-the-art solutions for Natural Language Processing (NLP) are able to capture a broad range of contexts, like the sentence-level context or document-level context for short documents. But these solutions are still struggling when it…

Computation and Language · Computer Science 2020-03-09 Filip Graliński , Tomasz Stanisławek , Anna Wróblewska , Dawid Lipiński , Agnieszka Kaliska , Paulina Rosalska , Bartosz Topolski , Przemysław Biecek

GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface

Information extraction (IE) is fundamental to numerous NLP applications, yet existing solutions often require specialized models for different tasks or rely on computationally expensive large language models. We present GLiNER2, a unified…

Computation and Language · Computer Science 2025-07-25 Urchade Zaratiana , Gil Pasternak , Oliver Boyd , George Hurn-Maloney , Ash Lewis

Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2017)

The large scale of scholarly publications poses a challenge for scholars in information seeking and sensemaking. Bibliometrics, information retrieval (IR), text mining and NLP techniques could help in these search and look-up activities,…

Digital Libraries · Computer Science 2017-06-09 Muthu Kumar Chandrasekaran , Kokil Jaidka , Philipp Mayr

Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification

Text simplification reduces the language complexity of professional content for accessibility purposes. End-to-end neural network models have been widely adopted to directly generate the simplified version of input text, usually functioning…

Computation and Language · Computer Science 2021-07-08 Cristina Garbacea , Mengtian Guo , Samuel Carton , Qiaozhu Mei

CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes

The CLEVR dataset has been used extensively in language grounded visual reasoning in Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR, that provides functionalities for…

Computation and Language · Computer Science 2020-10-05 Raeid Saqur , Ameet Deshpande

Transforming and Enriching Documents for the Semantic Web

We suggest to employ techniques from Natural Language Processing (NLP) and Knowledge Representation (KR) to transform existing documents into documents amenable for the Semantic Web. Semantic Web documents have at least part of their…

Artificial Intelligence · Computer Science 2007-05-23 Dietmar Roesner , Manuela Kunze , Sylke Kroetzsch

Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges

Natural Language Processing (NLP) is revolutionising the way both professionals and laypersons operate in the legal field. The considerable potential for NLP in the legal sector, especially in developing computational assistance tools for…

Computation and Language · Computer Science 2025-12-12 Farid Ariai , Joel Mackenzie , Gianluca Demartini

fairlib: A Unified Framework for Assessing and Improving Classification Fairness

This paper presents fairlib, an open-source framework for assessing and improving classification fairness. It provides a systematic framework for quickly reproducing existing baseline models, developing new methods, evaluating models with…

Machine Learning · Computer Science 2022-05-05 Xudong Han , Aili Shen , Yitong Li , Lea Frermann , Timothy Baldwin , Trevor Cohn

Natural language processing for word sense disambiguation and information extraction

This research work deals with Natural Language Processing (NLP) and extraction of essential information in an explicit form. The most common among the information management strategies is Document Retrieval (DR) and Information Filtering.…

Computation and Language · Computer Science 2020-04-07 K. R. Chowdhary