Related papers: Active Learning with Tabular Language Models

Towards Computationally Feasible Deep Active Learning

Active learning (AL) is a prominent technique for reducing the annotation effort required for training machine learning models. Deep learning offers a solution for several essential obstacles to deploying AL in practice but introduces many…

Computation and Language · Computer Science 2022-05-10 Akim Tsvigun , Artem Shelmanov , Gleb Kuzmin , Leonid Sanochkin , Daniil Larionov , Gleb Gusev , Manvel Avetisian , Leonid Zhukov

Learning to Rank for Active Learning via Multi-Task Bilevel Optimization

Active learning is a promising paradigm to reduce the labeling cost by strategically requesting labels to improve model performance. However, existing active learning methods often rely on expensive acquisition function to compute,…

Machine Learning · Computer Science 2023-10-27 Zixin Ding , Si Chen , Ruoxi Jia , Yuxin Chen

Focusing on Potential Named Entities During Active Label Acquisition

Named entity recognition (NER) aims to identify mentions of named entities in an unstructured text and classify them into predefined named entity classes. While deep learning-based pre-trained language models help to achieve good predictive…

Computation and Language · Computer Science 2023-06-16 Ali Osman Berk Sapci , Oznur Tastan , Reyyan Yeniterzi

Table Detection with Active Learning

Efficient data annotation remains a critical challenge in machine learning, particularly for object detection tasks requiring extensive labeled data. Active learning (AL) has emerged as a promising solution to minimize annotation costs by…

Computer Vision and Pattern Recognition · Computer Science 2025-09-25 Somraj Gautam , Nachiketa Purohit , Gaurav Harit

ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming…

Machine Learning · Computer Science 2024-07-29 Hui Wen Goh , Jonas Mueller

Named Entity Recognition in Industrial Tables using Tabular Language Models

Specialized transformer-based models for encoding tabular data have gained interest in academia. Although tabular data is omnipresent in industry, applications of table transformers are still missing. In this paper, we study how these…

Artificial Intelligence · Computer Science 2022-09-30 Aneta Koleva , Martin Ringsquandl , Mark Buckley , Rakebul Hasan , Volker Tresp

Reducing Label Effort: Self-Supervised meets Active Learning

Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Javad Zolfaghari Bengar , Joost van de Weijer , Bartlomiej Twardowski , Bogdan Raducanu

Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient

Recently, several studies have investigated active learning (AL) for natural language processing tasks to alleviate data dependency. However, for query selection, most of these studies mainly rely on uncertainty-based sampling, which…

Computation and Language · Computer Science 2020-11-30 Yekyung Kim

On Dataset Transferability in Active Learning for Transformers

Active learning (AL) aims to reduce labeling costs by querying the examples most beneficial for model learning. While the effectiveness of AL for fine-tuning transformer-based pre-trained language models (PLMs) has been demonstrated, it is…

Machine Learning · Computer Science 2023-10-02 Fran Jelenić , Josip Jukić , Nina Drobac , Jan Šnajder

ALANNO: An Active Learning Annotation System for Mortals

Supervised machine learning has become the cornerstone of today's data-driven society, increasing the need for labeled data. However, the process of acquiring labels is often expensive and tedious. One possible remedy is to use active…

Machine Learning · Computer Science 2023-02-22 Josip Jukić , Fran Jelenić , Miroslav Bićanić , Jan Šnajder

ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation

Label aggregation such as majority voting is commonly used to resolve annotator disagreement in dataset creation. However, this may disregard minority values and opinions. Recent studies indicate that learning from individual annotations…

Computation and Language · Computer Science 2023-10-24 Xinpeng Wang , Barbara Plank

Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and…

Computation and Language · Computer Science 2021-04-06 Rishi Hazra , Parag Dutta , Shubham Gupta , Mohammed Abdul Qaathir , Ambedkar Dukkipati

Active Learning: Problem Settings and Recent Developments

In supervised learning, acquiring labeled training data for a predictive model can be very costly, but acquiring a large amount of unlabeled data is often quite easy. Active learning is a method of obtaining predictive models with high…

Machine Learning · Computer Science 2020-12-17 Hideitsu Hino

PartAL: Efficient Partial Active Learning in Multi-Task Visual Settings

Multi-task learning is central to many real-world applications. Unfortunately, obtaining labelled data for all tasks is time-consuming, challenging, and expensive. Active Learning (AL) can be used to reduce this burden. Existing techniques…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Nikita Durasov , Nik Dorndorf , Pascal Fua

Active Learning for Natural Language Generation

The field of Natural Language Generation (NLG) suffers from a severe shortage of labeled data due to the extremely expensive and time-consuming process involved in manual annotation. A natural approach for coping with this problem is active…

Computation and Language · Computer Science 2023-10-18 Yotam Perlitz , Ariel Gera , Michal Shmueli-Scheuer , Dafna Sheinwald , Noam Slonim , Liat Ein-Dor

How to Purchase Labels? A Cost-Effective Approach Using Active Learning Markets

We introduce and analyse active learning markets as a way to purchase labels, in situations where analysts aim to acquire additional data to improve model fitting, or to better train models for predictive analytics applications. This comes…

Machine Learning · Computer Science 2026-02-11 Xiwen Huang , Pierre Pinson

Optimizing Active Learning for Low Annotation Budgets

When we can not assume a large amount of annotated data , active learning is a good strategy. It consists in learning a model on a small amount of annotated data (annotation budget) and in choosing the best set of points to annotate in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Umang Aggarwal , Adrian Popescu , Céline Hudelot

Model Uncertainty based Active Learning on Tabular Data using Boosted Trees

Supervised machine learning relies on the availability of good labelled data for model training. Labelled data is acquired by human annotation, which is a cumbersome and costly process, often requiring subject matter experts. Active…

Machine Learning · Computer Science 2023-10-31 Sharath M Shankaranarayana

Bayesian active learning for production, a systematic study and a reusable library

Active learning is able to reduce the amount of labelling effort by using a machine learning model to query the user for specific inputs. While there are many papers on new active learning techniques, these techniques rarely satisfy the…

Machine Learning · Computer Science 2020-06-18 Parmida Atighehchian , Frédéric Branchaud-Charron , Alexandre Lacoste

Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and…

Machine Learning · Computer Science 2021-04-08 Rishi Hazra , Parag Dutta , Shubham Gupta , Mohammed Abdul Qaathir , Ambedkar Dukkipati