Related papers: Rethinking Negative Sampling for Handling Missing …

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

In many scenarios, named entity recognition (NER) models severely suffer from unlabeled entity problem, where the entities of a sentence may not be fully annotated. Through empirical studies performed on synthetic datasets, we find two…

Computation and Language · Computer Science 2021-03-19 Yangming Li , Lemao Liu , Shuming Shi

Better Sampling of Negatives for Distantly Supervised Named Entity Recognition

Distantly supervised named entity recognition (DS-NER) has been proposed to exploit the automatically labeled training data instead of human annotations. The distantly annotated datasets are often noisy and contain a considerable number of…

Computation and Language · Computer Science 2023-05-23 Lu Xu , Lidong Bing , Wei Lu

Simplify and Robustify Negative Sampling for Implicit Collaborative Filtering

Negative sampling approaches are prevalent in implicit collaborative filtering for obtaining negative labels from massive unlabeled data. As two major concerns in negative sampling, efficiency and effectiveness are still not fully achieved…

Machine Learning · Computer Science 2020-09-09 Jingtao Ding , Yuhan Quan , Quanming Yao , Yong Li , Depeng Jin

Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss

We study learning named entity recognizers in the presence of missing entity annotations. We approach this setting as tagging with latent variables and propose a novel loss, the Expected Entity Ratio, to learn models in the presence of…

Computation and Language · Computer Science 2021-08-17 Thomas Effland , Michael Collins

Rethinking Negative Instances for Generative Named Entity Recognition

Large Language Models (LLMs) have demonstrated impressive capabilities for generalizing in unseen tasks. In the Named Entity Recognition (NER) task, recent advancements have seen the remarkable improvement of LLMs in a broad range of entity…

Computation and Language · Computer Science 2024-06-21 Yuyang Ding , Juntao Li , Pinzheng Wang , Zecheng Tang , Bowen Yan , Min Zhang

Focusing on Potential Named Entities During Active Label Acquisition

Named entity recognition (NER) aims to identify mentions of named entities in an unstructured text and classify them into predefined named entity classes. While deep learning-based pre-trained language models help to achieve good predictive…

Computation and Language · Computer Science 2023-06-16 Ali Osman Berk Sapci , Oznur Tastan , Reyyan Yeniterzi

Improving Named Entity Recognition with Attentive Ensemble of Syntactic Information

Named entity recognition (NER) is highly sensitive to sentential syntactic and semantic properties where entities may be extracted according to how they are used and placed in the running text. To model such properties, one could rely on…

Computation and Language · Computer Science 2020-10-30 Yuyang Nie , Yuanhe Tian , Yan Song , Xiang Ao , Xiang Wan

Semantically-Conditioned Negative Samples for Efficient Contrastive Learning

Negative sampling is a limiting factor w.r.t. the generalization of metric-learned neural networks. We show that uniform negative sampling provides little information about the class boundaries and thus propose three novel techniques for…

Machine Learning · Computer Science 2021-02-15 James O' Neill , Danushka Bollegala

Continual Named Entity Recognition without Catastrophic Forgetting

Continual Named Entity Recognition (CNER) is a burgeoning area, which involves updating an existing model by incorporating new entity types sequentially. Nevertheless, continual learning approaches are often severely afflicted by…

Computation and Language · Computer Science 2023-10-24 Duzhen Zhang , Wei Cong , Jiahua Dong , Yahan Yu , Xiuyi Chen , Yonggang Zhang , Zhen Fang

Few-shot Learning for Named Entity Recognition in Medical Text

Deep neural network models have recently achieved state-of-the-art performance gains in a variety of natural language processing (NLP) tasks (Young, Hazarika, Poria, & Cambria, 2017). However, these gains rely on the availability of large…

Computation and Language · Computer Science 2018-11-15 Maximilian Hofer , Andrey Kormilitzin , Paul Goldberg , Alejo Nevado-Holgado

Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning

In this work, we explore the way to perform named entity recognition (NER) using only unlabeled data and named entity dictionaries. To this end, we formulate the task as a positive-unlabeled (PU) learning problem and accordingly propose a…

Computation and Language · Computer Science 2019-06-12 Minlong Peng , Xiaoyu Xing , Qi Zhang , Jinlan Fu , Xuanjing Huang

A Little Annotation does a Lot of Good: A Study in Bootstrapping Low-resource Named Entity Recognizers

Most state-of-the-art models for named entity recognition (NER) rely on the availability of large amounts of labeled data, making them challenging to extend to new, lower-resourced languages. However, there are now several proposed…

Computation and Language · Computer Science 2019-08-27 Aditi Chaudhary , Jiateng Xie , Zaid Sheikh , Graham Neubig , Jaime G. Carbonell

Named Entity Recognition with Partially Annotated Training Data

Supervised machine learning assumes the availability of fully-labeled data, but in many cases, such as low-resource languages, the only data available is partially annotated. We study the problem of Named Entity Recognition (NER) with…

Computation and Language · Computer Science 2019-09-23 Stephen Mayhew , Snigdha Chaturvedi , Chen-Tse Tsai , Dan Roth

Evaluating Negative Sampling Approaches for Neural Topic Models

Negative sampling has emerged as an effective technique that enables deep learning models to learn better representations by introducing the paradigm of learn-to-compare. The goal of this approach is to add robustness to deep learning…

Computation and Language · Computer Science 2025-03-26 Suman Adhya , Avishek Lahiri , Debarshi Kumar Sanyal , Partha Pratim Das

Uncertainty Quantification for Named Entity Recognition via Full-Sequence and Subsequence Conformal Prediction

Named Entity Recognition (NER) serves as a foundational component in many natural language processing (NLP) pipelines. However, current NER models typically output a single predicted label sequence without any accompanying measure of…

Computation and Language · Computer Science 2026-01-27 Matthew Singer , Srijan Sengupta , Karl Pazdernik

CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset

The CoNLL-03 corpus is arguably the most well-known and utilized benchmark dataset for named entity recognition (NER). However, prior works found significant numbers of annotation errors, incompleteness, and inconsistencies in the data.…

Computation and Language · Computer Science 2023-10-26 Susanna Rücker , Alan Akbik

Empirical Study of Named Entity Recognition Performance Using Distribution-aware Word Embedding

With the fast development of Deep Learning techniques, Named Entity Recognition (NER) is becoming more and more important in the information extraction task. The greatest difficulty that the NER task faces is to keep the detectability even…

Computation and Language · Computer Science 2024-01-23 Xin Chen , Qi Zhao , Xinyang Liu

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data

Zero-shot named entity recognition (NER) is the task of detecting named entities of specific types (such as 'Person' or 'Medicine') without any training examples. Current research increasingly relies on large synthetic datasets,…

Computation and Language · Computer Science 2025-03-10 Jonas Golde , Patrick Haller , Max Ploner , Fabio Barth , Nicolaas Jedema , Alan Akbik

Generalisation in Named Entity Recognition: A Quantitative Analysis

Named Entity Recognition (NER) is a key NLP task, which is all the more challenging on Web and user-generated content with their diverse and continuously changing language. This paper aims to quantify how this diversity impacts…

Computation and Language · Computer Science 2017-03-09 Isabelle Augenstein , Leon Derczynski , Kalina Bontcheva

Named entity recognition in resumes

Named entity recognition (NER) is used to extract information from various documents and texts such as names and dates. It is important to extract education and work experience information from resumes in order to filter them. Considering…

Computation and Language · Computer Science 2023-06-23 Ege Kesim , Aysu Deliahmetoglu