Related papers: Meta-learning Representations for Learning from Mu…

Learning from Multiple Annotator Noisy Labels via Sample-wise Label Fusion

Data lies at the core of modern deep learning. The impressive performance of supervised learning is built upon a base of massive accurately labeled data. However, in some real-world applications, accurate labeling might not be viable;…

Machine Learning · Computer Science 2022-07-26 Zhengqi Gao , Fan-Keng Sun , Mingran Yang , Sucheng Ren , Zikai Xiong , Marc Engeler , Antonio Burazer , Linda Wildling , Luca Daniel , Duane S. Boning

Learning From Noisy Singly-labeled Data

Supervised learning depends on annotated examples, which are taken to be the \emph{ground truth}. But these labels often come from noisy crowdsourcing platforms, like Amazon Mechanical Turk. Practitioners typically collect multiple labels…

Machine Learning · Computer Science 2018-05-22 Ashish Khetan , Zachary C. Lipton , Anima Anandkumar

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li , Yongkang Wong , Qi Zhao , Mohan Kankanhalli

Meta-learning of semi-supervised learning from tasks with heterogeneous attribute spaces

We propose a meta-learning method for semi-supervised learning that learns from multiple tasks with heterogeneous attribute spaces. The existing semi-supervised meta-learning methods assume that all tasks share the same attribute space,…

Machine Learning · Computer Science 2023-11-10 Tomoharu Iwata , Atsutoshi Kumagai

Learning from Multiple Annotators by Incorporating Instance Features

Learning from multiple annotators aims to induce a high-quality classifier from training instances, where each of them is associated with a set of possibly noisy labels provided by multiple annotators under the influence of their varying…

Machine Learning · Computer Science 2021-06-30 Jingzheng Li , Hailong Sun , Jiyi Li , Zhijun Chen , Renshuai Tao , Yufei Ge

Deep learning from crowds

Over the last few years, deep learning has revolutionized the field of machine learning by dramatically improving the state-of-the-art in various domains. However, as the size of supervised artificial neural networks grows, typically so…

Machine Learning · Statistics 2017-12-27 Filipe Rodrigues , Francisco Pereira

Label Selection Approach to Learning from Crowds

Supervised learning, especially supervised deep learning, requires large amounts of labeled data. One approach to collect large amounts of labeled data is by using a crowdsourcing platform where numerous workers perform the annotation…

Machine Learning · Computer Science 2023-08-22 Kosuke Yoshimura , Hisashi Kashima

Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances

To achieve state-of-the-art performance, one still needs to train NER models on large-scale, high-quality annotated data, an asset that is both costly and time-intensive to accumulate. In contrast, real-world applications often resort to…

Computation and Language · Computer Science 2023-10-26 Zhendong Chu , Ruiyi Zhang , Tong Yu , Rajiv Jain , Vlad I Morariu , Jiuxiang Gu , Ani Nenkova

Modeling Multiple Annotator Expertise in the Semi-Supervised Learning Scenario

Learning algorithms normally assume that there is at most one annotation or label per data point. However, in some scenarios, such as medical diagnosis and on-line collaboration,multiple annotations may be available. In either case,…

Machine Learning · Computer Science 2012-03-19 Yan Yan , Romer Rosales , Glenn Fung , Jennifer Dy

Annot-Mix: Learning with Noisy Class Labels from Multiple Annotators via a Mixup Extension

Training with noisy class labels impairs neural networks' generalization performance. In this context, mixup is a popular regularization technique to improve training robustness by making memorizing false class labels more difficult.…

Machine Learning · Computer Science 2024-05-07 Marek Herde , Lukas Lührs , Denis Huseljic , Bernhard Sick

Trustable Co-label Learning from Multiple Noisy Annotators

Supervised deep learning depends on massive accurately annotated examples, which is usually impractical in many real-world scenarios. A typical alternative is learning from multiple noisy annotators. Numerous earlier works assume that all…

Machine Learning · Computer Science 2022-03-09 Shikun Li , Tongliang Liu , Jiyong Tan , Dan Zeng , Shiming Ge

Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion

The predictive performance of supervised learning algorithms depends on the quality of labels. In a typical label collection process, multiple annotators provide subjective noisy estimates of the "truth" under the influence of their varying…

Machine Learning · Computer Science 2019-06-18 Ryutaro Tanno , Ardavan Saeedi , Swami Sankaranarayanan , Daniel C. Alexander , Nathan Silberman

An Active Learning Approach for Jointly Estimating Worker Performance and Annotation Reliability with Crowdsourced Data

Crowdsourcing platforms offer a practical solution to the problem of affordably annotating large datasets for training supervised classifiers. Unfortunately, poor worker performance frequently threatens to compromise annotation reliability,…

Machine Learning · Computer Science 2014-01-17 Liyue Zhao , Yu Zhang , Gita Sukthankar

Learning Image Labels On-the-fly for Training Robust Classification Models

Current deep learning paradigms largely benefit from the tremendous amount of annotated data. However, the quality of the annotations often varies among labelers. Multi-observer studies have been conducted to study these annotation…

Computer Vision and Pattern Recognition · Computer Science 2020-10-05 Xiaosong Wang , Ziyue Xu , Dong Yang , Leo Tam , Holger Roth , Daguang Xu

Meta-Learning for Neural Relation Classification with Distant Supervision

Distant supervision provides a means to create a large number of weakly labeled data at low cost for relation classification. However, the resulting labeled instances are very noisy, containing data with wrong labels. Many approaches have…

Computation and Language · Computer Science 2020-10-27 Zhenzhen Li , Jian-Yun Nie , Benyou Wang , Pan Du , Yuhan Zhang , Lixin Zou , Dongsheng Li

Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages

For high-resource languages like English, text classification is a well-studied task. The performance of modern NLP models easily achieves an accuracy of more than 90% in many standard datasets for text classification in English (Xie et…

Computation and Language · Computer Science 2022-06-06 Dawei Zhu , Michael A. Hedderich , Fangzhou Zhai , David Ifeoluwa Adelani , Dietrich Klakow

Exploiting Context for Robustness to Label Noise in Active Learning

Several works in computer vision have demonstrated the effectiveness of active learning for adapting the recognition model when new unlabeled data becomes available. Most of these works consider that labels obtained from the annotator are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Sudipta Paul , Shivkumar Chandrasekaran , B. S. Manjunath , Amit K. Roy-Chowdhury

CrowdTeacher: Robust Co-teaching with Noisy Answers & Sample-specific Perturbations for Tabular Data

Samples with ground truth labels may not always be available in numerous domains. While learning from crowdsourcing labels has been explored, existing models can still fail in the presence of sparse, unreliable, or diverging annotations.…

Machine Learning · Computer Science 2021-12-07 Mani Sotoodeh , Li Xiong , Joyce C. Ho

Multi-Class Classification from Noisy-Similarity-Labeled Data

A similarity label indicates whether two instances belong to the same class while a class label shows the class of the instance. Without class labels, a multi-class classifier could be learned from similarity-labeled pairwise data by meta…

Machine Learning · Computer Science 2020-02-18 Songhua Wu , Xiaobo Xia , Tongliang Liu , Bo Han , Mingming Gong , Nannan Wang , Haifeng Liu , Gang Niu

An Analysis of Active Learning Algorithms using Real-World Crowd-sourced Text Annotations

Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data and tremendously reduce human annotation effort in inducing a machine learning model. In a conventional active learning…

Machine Learning · Computer Science 2026-04-28 Varun Totakura , Ankita Singh , Yushun Dong , Shayok Chakraborty