Related papers: A Practical Incremental Learning Framework For Spa…

200 papers

Active learning enables efficient model training by leveraging interactions between machine learning agents and human annotators. We study and propose a novel framework that formulates batch active learning from the sparse approximation's…

Machine Learning · Computer Science 2022-11-08 Maohao Shen, Bowen Jiang, Jacky Yibo Zhang, Oluwasanmi Koyejo

Large amounts of annotated data have become more important than ever, especially since the rise of deep learning techniques. However, manual annotations are costly. We propose a tool that enables researchers to create large, high-quality,…

Digital Libraries · Computer Science 2021-12-23 Franziska Weeber, Felix Hamborg, Karsten Donnay, Bela Gipp

Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data…

Computation and Language · Computer Science 2023-08-08 Philipp Kohl, Nils Freyer, Yoka Krämer, Henri Werth, Steffen Wolf, Bodo Kraft, Matthias Meinecke, Albert Zündorf

Despite considerable progress in neural relevance ranking techniques, search engines still struggle to process complex queries effectively - both in terms of precision and recall. Sparse and dense Pseudo-Relevance Feedback (PRF) approaches…

Information Retrieval · Computer Science 2023-12-06 Iain Mackie, Shubham Chatterjee, Sean MacAvaney, Jeffrey Dalton

Active learning enhances annotation efficiency by selecting the most revealing samples for labeling, thereby reducing reliance on extensive human input. Previous methods in semantic segmentation have centered on individual pixels or small…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Jinchao Ge, Zeyu Zhang, Minh Hieu Phan, Bowen Zhang, Akide Liu, Yang Zhao, Shuwen Zhao

Telephone transcription data can be very noisy due to speech recognition errors, disfluencies, etc. Not only is annotating such data very challenging for the annotators, but such data may also have many annotation errors even after…

Computation and Language · Computer Science 2022-11-03 Md Tahmid Rahman Laskar, Cheng Chen, Xue-Yong Fu, Shashi Bhushan TN

Construction of human-curated annotated datasets for abstractive text summarization (ATS) is very time-consuming and expensive because creating each instance requires a human annotator to read a long document and compose a shorter summary…

Despite the advantages of their low-resource settings, traditional sparse retrievers depend on exact matching approaches between high-dimensional bag-of-words (BoW) representations of both the queries and the collection. As a result,…

Information Retrieval · Computer Science 2022-08-11 Dahlia Shehata, Negar Arabzadeh, Charles L. A. Clarke

Entity Matching (EM) is a core data cleaning task, aiming to identify different mentions of the same real-world entity. Active learning is one way to address the challenge of scarce labeled data in practice, by dynamically collecting the…

Databases · Computer Science 2020-03-31 Venkata Vamsikrishna Meduri, Lucian Popa, Prithviraj Sen, Mohamed Sarwat

Entity linking aims to link ambiguous mentions to their corresponding entities in a knowledge base. One of the key challenges comes from insufficient labeled data for specific domains. Although dense retrievers have achieved excellent…

Computation and Language · Computer Science 2023-10-20 Yulin Chen, Zhenran Xu, Baotian Hu, Min Zhang

Active Learning (AL) aims to reduce annotation costs by strategically selecting the most informative samples for labeling. However, most active learning methods struggle in the low-budget regime where only a few labeled examples are…

Machine Learning · Computer Science 2025-04-08 Netta Shafir, Guy Hacohen, Daphna Weinshall

In this work we propose a pragmatic method that reduces the annotation cost for structured label spaces using active learning. Our approach leverages partial annotation, which reduces labeling costs for structured outputs by selecting only…

Computation and Language · Computer Science 2023-10-20 Zhisong Zhang, Emma Strubell, Eduard Hovy

When we cannot assume a large amount of annotated data, active learning is a good strategy. It consists of learning a model on a small amount of annotated data (annotation budget) and choosing the best set of points to annotate in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Umang Aggarwal, Adrian Popescu, Céline Hudelot

As large language models (LLMs) continue to scale up, their performance on various downstream tasks has significantly improved. However, evaluating their capabilities has become increasingly expensive, as performing inference on a large…

Computation and Language · Computer Science 2026-02-10 Taolin Zhang, Hang Guo, Wang Lu, Tao Dai, Shu-Tao Xia, Jindong Wang

This paper introduces a cost-efficient active learning (AL) framework for classification, featuring a novel query design called candidate set query. Unlike traditional AL queries requiring the oracle to examine all possible classes, our…

Machine Learning · Computer Science 2025-08-20 Yeho Gwon, Sehyun Hwang, Hoyoung Kim, Jungseul Ok, Suha Kwak

Active learning (AL) reduces human annotation costs for machine learning systems by strategically selecting the most informative unlabeled data for annotation, but performing it individually may still be insufficient due to restricted data…

Machine Learning · Computer Science 2025-04-25 Jun Zhang, Jue Wang, Huan Li, Zhongle Xie, Ke Chen, Lidan Shou

State-of-the-art question answering (QA) relies upon large amounts of training data for which labeling is time consuming and thus expensive. For this reason, customizing QA systems is challenging. As a remedy, we propose a novel framework…

Computation and Language · Computer Science 2020-11-10 Bernhard Kratzwald, Stefan Feuerriegel, Huan Sun

Despite the advantages of their low-resource settings, traditional sparse retrievers depend on exact matching approaches between high-dimensional bag-of-words (BoW) representations of both the queries and the collection. As a result,…

Information Retrieval · Computer Science 2024-04-16 Dahlia Shehata

Supervised deep learning requires a large number of training samples with annotations (e.g. class labels for classification tasks, pixel- or voxel-wise label maps for segmentation tasks), which are expensive and time-consuming to obtain.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Yuanhan Mo, Shuo Wang, Chengliang Dai, Rui Zhou, Zhongzhao Teng, Wenjia Bai, Yike Guo

Named entity recognition (NER) aims to identify mentions of named entities in an unstructured text and classify them into predefined named entity classes. While deep learning-based pre-trained language models help to achieve good predictive…

Computation and Language · Computer Science 2023-06-16 Ali Osman Berk Sapci, Oznur Tastan, Reyyan Yeniterzi