English
Related papers

Related papers: Modeling sequential annotations for sequence label…

200 papers

Annotation quality and quantity positively affect the learning performance of sequence labeling, a vital task in Natural Language Processing. Hiring domain experts to annotate a corpus is very costly in terms of money and time.…

Human-Computer Interaction · Computer Science 2023-07-04 Nasim Sabetpour , Adithya Kulkarni , Sihong Xie , Qi Li

Current methods for sequence tagging, a core task in NLP, are data hungry, which motivates the use of crowdsourcing as a cheap way to obtain labelled data. However, annotators are often unreliable and current aggregation methods cannot…

Computation and Language · Computer Science 2019-09-09 Edwin Simpson , Iryna Gurevych

Most crowdsourcing learning methods treat disagreement between annotators as noisy labelings while inter-disagreement among experts is often a good indicator for the ambiguity and uncertainty that is inherent in natural language. In this…

Computation and Language · Computer Science 2023-01-05 Xiaolei Lu

Supervised learning, especially supervised deep learning, requires large amounts of labeled data. One approach to collect large amounts of labeled data is by using a crowdsourcing platform where numerous workers perform the annotation…

Machine Learning · Computer Science 2023-08-22 Kosuke Yoshimura , Hisashi Kashima

Selecting an effective training signal for machine learning tasks is difficult: expert annotations are expensive, and crowd-sourced annotations may not be reliable. Recent work has demonstrated that learning from a distribution over labels…

Computation and Language · Computer Science 2025-04-23 Dustin Wright , Isabelle Augenstein

Sequence labeling is a fundamental framework for various natural language processing problems. Its performance is largely influenced by the annotation quality and quantity in supervised learning scenarios, and obtaining ground truth labels…

Computation and Language · Computer Science 2020-04-17 Ouyu Lan , Xiao Huang , Bill Yuchen Lin , He Jiang , Liyuan Liu , Xiang Ren

Traditional supervised learning requires ground truth labels for the training data, whose collection can be difficult in many cases. Recently, crowdsourcing has established itself as an efficient labeling solution through resorting to…

Machine Learning · Computer Science 2021-07-13 Ye Shi , Shao-Yuan Li , Sheng-Jun Huang

Sequential sentence classification (SSC) in scientific publications is crucial for supporting downstream tasks such as fine-grained information retrieval and extractive summarization. However, current SSC methods are constrained by model…

Computation and Language · Computer Science 2024-12-02 Mengfei Lan , Lecheng Zheng , Shufan Ming , Halil Kilicoglu

Existing models for named entity recognition (NER) are mainly based on large-scale labeled datasets, which always obtain using crowdsourcing. However, it is hard to obtain a unified and correct label via majority voting from multiple…

Computation and Language · Computer Science 2023-07-28 Limao Xiong , Jie Zhou , Qunxi Zhu , Xiao Wang , Yuanbin Wu , Qi Zhang , Tao Gui , Xuanjing Huang , Jin Ma , Ying Shan

Labeling is onerous for crowd counting as it should annotate each individual in crowd images. Recently, several methods have been proposed for semi-supervised crowd counting to reduce the labeling efforts. Given a limited labeling budget,…

Computer Vision and Pattern Recognition · Computer Science 2021-08-09 Yongtuo Liu , Sucheng Ren , Liangyu Chai , Hanjie Wu , Jing Qin , Dan Xu , Shengfeng He

Automatic Crowd behavior analysis can be applied to effectively help the daily transportation statistics and planning, which helps the smart city construction. As one of the most important keys, crowd counting has drawn increasing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Haoran Duan , Fan Wan , Rui Sun , Zeyu Wang , Varun Ojha , Yu Guan , Hubert P. H. Shum , Bingzhang Hu , Yang Long

Well-annotated datasets, as shown in recent top studies, are becoming more important for researchers than ever before in supervised machine learning (ML). However, the dataset annotation process and its related human labor costs remain…

Computation and Language · Computer Science 2021-08-24 Haozhan Sun , Chenchen Xu , Hanna Suominen

To learn a reliable people counter from crowd images, head center annotations are normally required. Annotating head centers is however a laborious and tedious process in dense crowds. In this paper, we present an active learning framework…

Computer Vision and Pattern Recognition · Computer Science 2020-07-16 Zhen Zhao , Miaojing Shi , Xiaoxiao Zhao , Li Li

Recent crowd counting approaches have achieved excellent performance. However, they are essentially based on fully supervised paradigm and require large number of annotated samples. Obtaining annotations is an expensive and labour-intensive…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Vishwanath A. Sindagi , Rajeev Yasarla , Deepak Sam Babu , R. Venkatesh Babu , Vishal M. Patel

The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on…

Machine Learning · Statistics 2018-08-20 Filipe Rodrigues , Mariana Lourenço , Bernardete Ribeiro , Francisco Pereira

Sequence labeling (SL) is a fundamental research problem encompassing a variety of tasks, e.g., part-of-speech (POS) tagging, named entity recognition (NER), text chunking, etc. Though prevalent and effective in many downstream applications…

Computation and Language · Computer Science 2020-11-16 Zhiyong He , Zanbo Wang , Wei Wei , Shanshan Feng , Xianling Mao , Sheng Jiang

This paper presents a new annotation method called Sparse Annotation (SA) for crowd counting, which reduces human labeling efforts by sparsely labeling individuals in an image. We argue that sparse labeling can reduce the redundancy of full…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Shiwei Zhang , Zhengzheng Wang , Qing Liu , Fei Wang , Wei Ke , Tong Zhang

Crowdsourcing provides a practical way to obtain large amounts of labeled data at a low cost. However, the annotation quality of annotators varies considerably, which imposes new challenges in learning a high-quality model from the…

Machine Learning · Computer Science 2021-06-15 Zhendong Chu , Jing Ma , Hongning Wang

Whether Large Language Models (LLMs) can outperform crowdsourcing on the data annotation task is attracting interest recently. Some works verified this issue with the average performance of individual crowd workers and LLM workers on some…

Computation and Language · Computer Science 2024-01-19 Jiyi Li

Multi-label active learning is a hot topic in reducing the label cost by optimally choosing the most valuable instance to query its label from an oracle. In this paper, we consider the poolbased multi-label active learning under the…

Machine Learning · Computer Science 2015-08-05 Shao-Yuan Li , Yuan Jiang , Zhi-Hua Zhou
‹ Prev 1 2 3 10 Next ›