Related papers: Learning Named Entity Tagger using Domain-Specific…

Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

We study the problem of training named entity recognition (NER) models using only distantly-labeled data, which can be automatically obtained by matching entity mentions in the raw text with entity types in a knowledge base. The biggest…

Computation and Language · Computer Science 2021-09-13 Yu Meng , Yunyi Zhang , Jiaxin Huang , Xuan Wang , Yu Zhang , Heng Ji , Jiawei Han

Noisy-Labeled NER with Confidence Estimation

Recent studies in deep learning have shown significant progress in named entity recognition (NER). Most existing works assume clean data annotation, yet a fundamental challenge in real-world scenarios is the large amount of noise from a…

Computation and Language · Computer Science 2021-04-13 Kun Liu , Yao Fu , Chuanqi Tan , Mosha Chen , Ningyu Zhang , Songfang Huang , Sheng Gao

DynClean: Training Dynamics-based Label Cleaning for Distantly-Supervised Named Entity Recognition

Distantly Supervised Named Entity Recognition (DS-NER) has attracted attention due to its scalability and ability to automatically generate labeled data. However, distant annotation introduces many mislabeled instances, limiting its…

Computation and Language · Computer Science 2025-04-08 Qi Zhang , Huitong Pan , Zhijia Chen , Longin Jan Latecki , Cornelia Caragea , Eduard Dragut

AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction

Deep neural models for named entity recognition (NER) have shown impressive results in overcoming label scarcity and generalizing to unseen entities by leveraging distant supervision and auxiliary information such as explanations. However,…

Computation and Language · Computer Science 2023-05-19 Dong-Ho Lee , Ravi Kiran Selvam , Sheikh Muhammad Sarwar , Bill Yuchen Lin , Fred Morstatter , Jay Pujara , Elizabeth Boschee , James Allan , Xiang Ren

Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-grained Student Ensemble

Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity problem in NER by automatically generating training samples. Unfortunately, the distant supervision may induce noisy labels, thus undermining…

Computation and Language · Computer Science 2022-12-14 Xiaoye Qu , Jun Zeng , Daizong Liu , Zhefeng Wang , Baoxing Huai , Pan Zhou

Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning

In this work, we explore the way to perform named entity recognition (NER) using only unlabeled data and named entity dictionaries. To this end, we formulate the task as a positive-unlabeled (PU) learning problem and accordingly propose a…

Computation and Language · Computer Science 2019-06-12 Minlong Peng , Xiaoyu Xing , Qi Zhang , Jinlan Fu , Xuanjing Huang

Focusing on Potential Named Entities During Active Label Acquisition

Named entity recognition (NER) aims to identify mentions of named entities in an unstructured text and classify them into predefined named entity classes. While deep learning-based pre-trained language models help to achieve good predictive…

Computation and Language · Computer Science 2023-06-16 Ali Osman Berk Sapci , Oznur Tastan , Reyyan Yeniterzi

Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations

Distantly supervised named entity recognition (DS-NER) has emerged as a cheap and convenient alternative to traditional human annotation methods, enabling the automatic generation of training data by aligning text with external resources.…

Computation and Language · Computer Science 2025-05-20 Yuyang Ding , Dan Qiao , Juntao Li , Jiajie Xu , Pingfu Chao , Xiaofang Zhou , Min Zhang

Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances

To achieve state-of-the-art performance, one still needs to train NER models on large-scale, high-quality annotated data, an asset that is both costly and time-intensive to accumulate. In contrast, real-world applications often resort to…

Computation and Language · Computer Science 2023-10-26 Zhendong Chu , Ruiyi Zhang , Tong Yu , Rajiv Jain , Vlad I Morariu , Jiuxiang Gu , Ani Nenkova

HAMNER: Headword Amplified Multi-span Distantly Supervised Method for Domain Specific Named Entity Recognition

To tackle Named Entity Recognition (NER) tasks, supervised methods need to obtain sufficient cleanly annotated data, which is labor and time consuming. On the contrary, distantly supervised methods acquire automatically annotated data using…

Computation and Language · Computer Science 2019-12-05 Shifeng Liu , Yifang Sun , Bing Li , Wei Wang , Xiang Zhao

Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning

Distantly supervised named entity recognition (DS-NER) efficiently reduces labor costs but meanwhile intrinsically suffers from the label noise due to the strong assumption of distant supervision. Typically, the wrongly labeled instances…

Computation and Language · Computer Science 2023-02-16 Xinghua Zhang , Bowen Yu , Tingwen Liu , Zhenyu Zhang , Jiawei Sheng , Mengge Xue , Hongbo Xu

DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants

Named entity recognition (NER) is usually developed and tested on text from well-written sources. However, in intelligent voice assistants, where NER is an important component, input to NER may be noisy because of user or speech recognition…

Computation and Language · Computer Science 2023-10-30 Deepak Muralidharan , Joel Ruben Antony Moniz , Weicheng Zhang , Stephen Pulman , Lin Li , Megan Barnes , Jingjing Pan , Jason Williams , Alex Acero

Named Entity Recognition on Noisy Data using Images and Text (1-page abstract)

Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locate and recognise named entities. Despite recent achievements, we still face limitations in correctly detecting and classifying entities,…

Information Retrieval · Computer Science 2018-09-07 Diego Esteves

Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning

Distantly-Supervised Named Entity Recognition (DS-NER) is widely used in real-world scenarios. It can effectively alleviate the burden of annotation by matching entities in existing knowledge bases with snippets in the text but suffer from…

Computation and Language · Computer Science 2025-07-04 Shuzheng Si , Helan Hu , Haozhe Zhao , Shuang Zeng , Kaikai An , Zefan Cai , Baobao Chang

ExpertNet: Adversarial Learning and Recovery Against Noisy Labels

Today's available datasets in the wild, e.g., from social media and open platforms, present tremendous opportunities and challenges for deep learning, as there is a significant portion of tagged images, but often with noisy, i.e. erroneous,…

Machine Learning · Computer Science 2020-07-14 Amirmasoud Ghiassi , Robert Birke , Rui Han , Lydia Y. Chen

Deep Active Learning for Named Entity Recognition

Deep learning has yielded state-of-the-art performance on many natural language processing tasks including named entity recognition (NER). However, this typically requires large amounts of labeled data. In this work, we demonstrate that the…

Computation and Language · Computer Science 2018-02-06 Yanyao Shen , Hyokun Yun , Zachary C. Lipton , Yakov Kronrod , Animashree Anandkumar

DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others…

Computation and Language · Computer Science 2024-06-21 Somnath Banerjee , Avik Dutta , Aaditya Agrawal , Rima Hazra , Animesh Mukherjee

Learning with Neighbor Consistency for Noisy Labels

Recent advances in deep learning have relied on large, labelled datasets to train high-capacity models. However, collecting large datasets in a time- and cost-efficient manner often results in label noise. We present a method for learning…

Computer Vision and Pattern Recognition · Computer Science 2022-07-07 Ahmet Iscen , Jack Valmadre , Anurag Arnab , Cordelia Schmid

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

Weak supervision has shown promising results in many natural language processing tasks, such as Named Entity Recognition (NER). Existing work mainly focuses on learning deep NER models only with weak supervision, i.e., without any human…

Computation and Language · Computer Science 2021-08-03 Haoming Jiang , Danqing Zhang , Tianyu Cao , Bing Yin , Tuo Zhao

Deep Self-Learning From Noisy Labels

ConvNets achieve good results when training from clean data, but learning from noisy labels significantly degrades performances and remains challenging. Unlike previous works constrained by many conditions, making them infeasible to real…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Jiangfan Han , Ping Luo , Xiaogang Wang