English
Related papers

Related papers: PyPLN: a Distributed Platform for Natural Language…

200 papers

The recent advances in natural language processing (NLP) are linked to training processes that require vast amounts of corpora. Access to this data is commonly not a trivial process due to resource dispersion and the need to maintain these…

Computation and Language · Computer Science 2024-01-30 Rúben Almeida , Ricardo Campos , Alípio Jorge , Sérgio Nunes

We present and make available pre-trained language models (Phraser, Word2Vec, Doc2Vec, FastText, and BERT) for the Brazilian legal language, a Python package with functions to facilitate their use, and a set of demonstrations/tutorials…

In recent years, the extraction of opinions and information from user-generated text has attracted a lot of interest, largely due to the unprecedented volume of content in Social Media. However, social researchers face some issues in…

The introduction of large language models and other influential developments in AI-based language processing have led to an evolution in the methods available to quantitatively analyse language data. With the resultant growth of attention…

The use of natural language processing (NLP) is gaining popularity in software engineering. In order to correctly perform NLP, we must pre-process the textual information to separate natural language from other information, such as log…

Software Engineering · Computer Science 2018-03-21 Mika V. Mäntylä , Fabio Calefato , Maelick Claes

We present PyThaiNLP, a free and open-source natural language processing (NLP) library for Thai language implemented in Python. It provides a wide range of software, models, and datasets for Thai language. We first provide a brief…

The success of Pre-Trained Models (PTMs) has reshaped the development of Natural Language Processing (NLP). Yet, it is not easy to obtain high-performing models and deploy them online for industrial practitioners. To bridge this gap,…

Computation and Language · Computer Science 2023-03-14 Chengyu Wang , Minghui Qiu , Chen Shi , Taolin Zhang , Tingting Liu , Lei Li , Jianing Wang , Ming Wang , Jun Huang , Wei Lin

LexNLP is an open source Python package focused on natural language processing and machine learning for legal and regulatory text. The package includes functionality to (i) segment documents, (ii) identify key text such as titles and…

Computation and Language · Computer Science 2018-06-12 Michael J Bommarito , Daniel Martin Katz , Eric M Detterman

Spark NLP is a Natural Language Processing (NLP) library built on top of Apache Spark ML. It provides simple, performant and accurate NLP annotations for machine learning pipelines that can scale easily in a distributed environment. Spark…

Computation and Language · Computer Science 2021-01-27 Veysel Kocaman , David Talby

In this work, we present VNLP: the first dedicated, complete, open-source, well-documented, lightweight, production-ready, state-of-the-art Natural Language Processing (NLP) package for the Turkish language. It contains a wide variety of…

Computation and Language · Computer Science 2024-03-05 Meliksah Turker , Mehmet Erdi Ari , Aydin Han

This paper describes AllenNLP, a platform for research on deep learning methods in natural language understanding. AllenNLP is designed to support researchers who want to build novel language understanding models quickly and easily. It is…

Computation and Language · Computer Science 2018-06-01 Matt Gardner , Joel Grus , Mark Neumann , Oyvind Tafjord , Pradeep Dasigi , Nelson Liu , Matthew Peters , Michael Schmitz , Luke Zettlemoyer

Large language models (LLMs) have significantly advanced natural language processing, but their progress has yet to be equal across languages. While most LLMs are trained in high-resource languages like English, multilingual models…

Computation and Language · Computer Science 2024-05-20 Nicholas Kluge Corrêa , Sophia Falk , Shiza Fatimah , Aniket Sen , Nythamar de Oliveira

Language is the medium for many political activities, from campaigns to news reports. Natural language processing (NLP) uses computational tools to parse text into key information that is needed for policymaking. In this chapter, we…

Computation and Language · Computer Science 2023-02-08 Zhijing Jin , Rada Mihalcea

A large scale collection of both semantic and natural language resources is essential to leverage active Software Engineering research areas such as code reuse and code comprehensibility. Existing machine learning models ingest data from…

Natural Language Processing offers new insights into language data across almost all disciplines and domains, and allows us to corroborate and/or challenge existing knowledge. The primary hurdles to widening participation in and use of…

Computation and Language · Computer Science 2021-05-31 Rebekah Baglini , Arthur Hjorth

Natural Language Processing (NLP) systems often make use of machine learning techniques that are unfamiliar to end-users who are interested in analyzing clinical records. Although NLP has been widely used in extracting information from…

Human-Computer Interaction · Computer Science 2017-07-10 Gaurav Trivedi , Phuong Pham , Wendy Chapman , Rebecca Hwa , Janyce Wiebe , Harry Hochheiser

The package cleanNLP provides a set of fast tools for converting a textual corpus into a set of normalized tables. The underlying natural language processing pipeline utilizes Stanford's CoreNLP library, exposing a number of annotation…

Computation and Language · Computer Science 2018-05-04 Taylor Arnold

NLP Workbench is a web-based platform for text mining that allows non-expert users to obtain semantic understanding of large-scale corpora using state-of-the-art text mining models. The platform is built upon latest pre-trained models and…

Computation and Language · Computer Science 2024-03-06 Peiran Yao , Matej Kosmajac , Abeer Waheed , Kostyantyn Guzhva , Natalie Hervieux , Denilson Barbosa

Natural language processing for the Turkic language family, spoken by over 200 million people across Eurasia, remains fragmented, with most languages lacking unified tooling and resources. We present TurkicNLP, an open-source Python library…

Computation and Language · Computer Science 2026-05-25 Sherzod Hakimov

This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation. Ascle is tailored for biomedical researchers and healthcare professionals with an easy-to-use, all-in-one solution…

‹ Prev 1 2 3 10 Next ›