Related papers: Modeling sequential annotations for sequence label…

Truth Discovery in Sequence Labels from Crowds

Annotation quality and quantity positively affect the learning performance of sequence labeling, a vital task in Natural Language Processing. Hiring domain experts to annotate a corpus is very costly in terms of money and time.…

Human-Computer Interaction · Computer Science 2023-07-04 Nasim Sabetpour , Adithya Kulkarni , Sihong Xie , Qi Li

A Bayesian Approach for Sequence Tagging with Crowds

Current methods for sequence tagging, a core task in NLP, are data hungry, which motivates the use of crowdsourcing as a cheap way to obtain labelled data. However, annotators are often unreliable and current aggregation methods cannot…

Computation and Language · Computer Science 2019-09-09 Edwin Simpson , Iryna Gurevych

Learning Ambiguity from Crowd Sequential Annotations

Most crowdsourcing learning methods treat disagreement between annotators as noisy labelings while inter-disagreement among experts is often a good indicator for the ambiguity and uncertainty that is inherent in natural language. In this…

Computation and Language · Computer Science 2023-01-05 Xiaolei Lu

Label Selection Approach to Learning from Crowds

Supervised learning, especially supervised deep learning, requires large amounts of labeled data. One approach to collect large amounts of labeled data is by using a crowdsourcing platform where numerous workers perform the annotation…

Machine Learning · Computer Science 2023-08-22 Kosuke Yoshimura , Hisashi Kashima

Aggregating Soft Labels from Crowd Annotations Improves Uncertainty Estimation Under Distribution Shift

Selecting an effective training signal for machine learning tasks is difficult: expert annotations are expensive, and crowd-sourced annotations may not be reliable. Recent work has demonstrated that learning from a distribution over labels…

Computation and Language · Computer Science 2025-04-23 Dustin Wright , Isabelle Augenstein

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

Sequence labeling is a fundamental framework for various natural language processing problems. Its performance is largely influenced by the annotation quality and quantity in supervised learning scenarios, and obtaining ground truth labels…

Computation and Language · Computer Science 2020-04-17 Ouyu Lan , Xiao Huang , Bill Yuchen Lin , He Jiang , Liyuan Liu , Xiang Ren

Learning from Crowds with Sparse and Imbalanced Annotations

Traditional supervised learning requires ground truth labels for the training data, whose collection can be difficult in many cases. Recently, crowdsourcing has established itself as an efficient labeling solution through resorting to…

Machine Learning · Computer Science 2021-07-13 Ye Shi , Shao-Yuan Li , Sheng-Jun Huang

Multi-label Sequential Sentence Classification via Large Language Model

Sequential sentence classification (SSC) in scientific publications is crucial for supporting downstream tasks such as fine-grained information retrieval and extractive summarization. However, current SSC methods are constrained by model…

Computation and Language · Computer Science 2024-12-02 Mengfei Lan , Lecheng Zheng , Shufan Ming , Halil Kilicoglu

A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

Existing models for named entity recognition (NER) are mainly based on large-scale labeled datasets, which always obtain using crowdsourcing. However, it is hard to obtain a unified and correct label via majority voting from multiple…

Computation and Language · Computer Science 2023-07-28 Limao Xiong , Jie Zhou , Qunxi Zhu , Xiao Wang , Yuanbin Wu , Qi Zhang , Tao Gui , Xuanjing Huang , Jin Ma , Ying Shan

Reducing Spatial Labeling Redundancy for Semi-supervised Crowd Counting

Labeling is onerous for crowd counting as it should annotate each individual in crowd images. Recently, several methods have been proposed for semi-supervised crowd counting to reduce the labeling efforts. Given a limited labeling budget,…

Computer Vision and Pattern Recognition · Computer Science 2021-08-09 Yongtuo Liu , Sucheng Ren , Liangyu Chai , Hanjie Wu , Jing Qin , Dan Xu , Shengfeng He

Semi-Supervised Crowd Counting from Unlabeled Data

Automatic Crowd behavior analysis can be applied to effectively help the daily transportation statistics and planning, which helps the smart city construction. As one of the most important keys, crowd counting has drawn increasing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Haoran Duan , Fan Wan , Rui Sun , Zeyu Wang , Varun Ojha , Yu Guan , Hubert P. H. Shum , Bingzhang Hu , Yang Long

Analyzing the Granularity and Cost of Annotation in Clinical Sequence Labeling

Well-annotated datasets, as shown in recent top studies, are becoming more important for researchers than ever before in supervised machine learning (ML). However, the dataset annotation process and its related human labor costs remain…

Computation and Language · Computer Science 2021-08-24 Haozhan Sun , Chenchen Xu , Hanna Suominen

Active Crowd Counting with Limited Supervision

To learn a reliable people counter from crowd images, head center annotations are normally required. Annotating head centers is however a laborious and tedious process in dense crowds. In this paper, we present an active learning framework…

Computer Vision and Pattern Recognition · Computer Science 2020-07-16 Zhen Zhao , Miaojing Shi , Xiaoxiao Zhao , Li Li

Learning to Count in the Crowd from Limited Labeled Data

Recent crowd counting approaches have achieved excellent performance. However, they are essentially based on fully supervised paradigm and require large number of annotated samples. Obtaining annotations is an expensive and labour-intensive…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Vishwanath A. Sindagi , Rajeev Yasarla , Deepak Sam Babu , R. Venkatesh Babu , Vishal M. Patel

Learning Supervised Topic Models for Classification and Regression from Crowds

The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on…

Machine Learning · Statistics 2018-08-20 Filipe Rodrigues , Mariana Lourenço , Bernardete Ribeiro , Francisco Pereira

A Survey on Recent Advances in Sequence Labeling from Deep Learning Models

Sequence labeling (SL) is a fundamental research problem encompassing a variety of tasks, e.g., part-of-speech (POS) tagging, named entity recognition (NER), text chunking, etc. Though prevalent and effective in many downstream applications…

Computation and Language · Computer Science 2020-11-16 Zhiyong He , Zanbo Wang , Wei Wei , Shanshan Feng , Xianling Mao , Sheng Jiang

Crowd Counting with Sparse Annotation

This paper presents a new annotation method called Sparse Annotation (SA) for crowd counting, which reduces human labeling efforts by sparsely labeling individuals in an image. We argue that sparse labeling can reduce the redundancy of full…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Shiwei Zhang , Zhengzheng Wang , Qing Liu , Fei Wang , Wei Ke , Tong Zhang

Learning from Crowds by Modeling Common Confusions

Crowdsourcing provides a practical way to obtain large amounts of labeled data at a low cost. However, the annotation quality of annotators varies considerably, which imposes new challenges in learning a high-quality model from the…

Machine Learning · Computer Science 2021-06-15 Zhendong Chu , Jing Ma , Hongning Wang

A Comparative Study on Annotation Quality of Crowdsourcing and LLM via Label Aggregation

Whether Large Language Models (LLMs) can outperform crowdsourcing on the data annotation task is attracting interest recently. Some works verified this issue with the average performance of individual crowd workers and LLM workers on some…

Computation and Language · Computer Science 2024-01-19 Jiyi Li

Multi-Label Active Learning from Crowds

Multi-label active learning is a hot topic in reducing the label cost by optimally choosing the most valuable instance to query its label from an oracle. In this paper, we consider the poolbased multi-label active learning under the…

Machine Learning · Computer Science 2015-08-05 Shao-Yuan Li , Yuan Jiang , Zhi-Hua Zhou