Related papers: Reference-based Weak Supervision for Answer Senten…

Adaptive Ranking-based Sample Selection for Weakly Supervised Class-imbalanced Text Classification

To obtain a large amount of training labels inexpensively, researchers have recently adopted the weak supervision (WS) paradigm, which leverages labeling rules to synthesize training labels rather than using individual annotations to…

Computation and Language · Computer Science 2022-10-10 Linxin Song, Jieyu Zhang, Tianxiang Yang, Masayuki Goto

Weak Supervision and Referring Attention for Temporal-Textual Association Learning

A system capturing the association between video frames and textual queries offers great potential for better video analysis. However, training such a system in a fully supervised way inevitably demands a meticulously curated video dataset…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 Zhiyuan Fang, Shu Kong, Zhe Wang, Charless Fowlkes, Yezhou Yang

Content-Based Weak Supervision for Ad-Hoc Re-Ranking

One challenge with neural ranking is the need for a large amount of manually-labeled relevance judgments for training. In contrast with prior work, we examine the use of weak supervision sources for training that yield pseudo query-document…

Information Retrieval · Computer Science 2019-07-08 Sean MacAvaney, Andrew Yates, Kai Hui, Ophir Frieder

Label Augmentation with Reinforced Labeling for Weak Supervision

Weak supervision (WS) is an alternative to traditional supervised learning that addresses the need for ground truth. Data programming is a practical WS approach that allows programmatic labeling of data samples using labeling functions (LFs)…

Machine Learning · Computer Science 2022-04-14 Gürkan Solmaz, Flavio Cirillo, Fabio Maresca, Anagha Gode Anil Kumar

Generalized Weak Supervision for Neural Information Retrieval

Neural ranking models (NRMs) have demonstrated effective performance in several information retrieval (IR) tasks. However, training NRMs often requires large-scale training data, which is difficult and expensive to obtain. To address this…

Information Retrieval · Computer Science 2023-04-19 Yen-Chieh Lien, Hamed Zamani, W. Bruce Croft

End-to-End Weak Supervision

Aggregating multiple sources of weak supervision (WS) can ease the data-labeling bottleneck prevalent in many machine learning applications, by replacing the tedious manual collection of ground truth labels. Current state of the art…

Machine Learning · Computer Science 2021-12-01 Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski

Weak Augmentation Guided Relational Self-Supervised Learning

Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations. However, most methods mainly focus on the instance level information (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu

Weakly-Supervised Open-Retrieval Conversational Question Answering

Recent studies on Question Answering (QA) and Conversational QA (ConvQA) emphasize the role of retrieval: a system first retrieves evidence from a large collection and then extracts answers. This open-retrieval ConvQA setting typically…

Information Retrieval · Computer Science 2021-03-04 Chen Qu, Liu Yang, Cen Chen, W. Bruce Croft, Kalpesh Krishna, Mohit Iyyer

Weak-Attention Suppression For Transformer Based Speech Recognition

Transformers, originally proposed for natural language processing (NLP) tasks, have recently achieved great success in automatic speech recognition (ASR). However, adjacent acoustic units (i.e., frames) are highly correlated, and…

Audio and Speech Processing · Electrical Eng. & Systems 2020-05-20 Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer

Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism

Diagnostic and intervention methodologies for skill assessment of autism typically require a clinician repetitively initiating several stimuli and recording the child's response. In this paper, we propose to automate the response…

Computer Vision and Pattern Recognition · Computer Science 2020-01-30 Prashant Pandey, Prathosh AP, Manu Kohli, Josh Pritchard

Weakly-Supervised Referring Video Object Segmentation through Text Supervision

Referring video object segmentation (RVOS) aims to segment the target instance in a video, referred by a text expression. Conventional approaches are mostly supervised learning, requiring expensive pixel-level mask annotations. To tackle…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Miaojing Shi, Jun Huang, Zijie Yue, Hanli Wang

Multilingual Answer Sentence Reranking via Automatically Translated Data

We present a study on the design of multilingual Answer Sentence Selection (AS2) models, which are a core component of modern Question Answering (QA) systems. The main idea is to transfer data, created from one resource rich language, e.g.,…

Computation and Language · Computer Science 2021-02-23 Thuy Vu, Alessandro Moschitti

Lifting Weak Supervision To Structured Prediction

Weak supervision (WS) is a rich set of techniques that produce pseudolabels by aggregating easily obtained but potentially noisy label estimates from a variety of sources. WS is theoretically well understood for binary classification, where…

Machine Learning · Computer Science 2022-11-28 Harit Vishwakarma, Nicholas Roberts, Frederic Sala

Universalizing Weak Supervision

Weak supervision (WS) frameworks are a popular way to bypass hand-labeling large datasets for training data-hungry models. These approaches synthesize multiple noisy but cheaply-acquired estimates of labels into a set of high-quality…

Machine Learning · Computer Science 2023-11-30 Changho Shin, Winfred Li, Harit Vishwakarma, Nicholas Roberts, Frederic Sala

From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data

Recent progress in speech recognition has relied on models trained on vast amounts of labeled data. However, classroom Automatic Speech Recognition (ASR) faces the real-world challenge of abundant weak transcripts paired with only a small…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-24 Ahmed Adel Attia, Dorottya Demszky, Jing Liu, Carol Espy-Wilson

WSL-DS: Weakly Supervised Learning with Distant Supervision for Query Focused Multi-Document Abstractive Summarization

In the Query Focused Multi-Document Summarization (QF-MDS) task, a set of documents and a query are given where the goal is to generate a summary from these documents based on the given query. However, one major challenge for this task is…

Computation and Language · Computer Science 2020-11-04 Md Tahmid Rahman Laskar, Enamul Hoque, Jimmy Xiangji Huang

WRENCH: A Comprehensive Benchmark for Weak Supervision

Recent Weak Supervision (WS) approaches have had widespread success in easing the bottleneck of labeling training data for machine learning by synthesizing labels from multiple potentially noisy supervision sources. However, proper…

Machine Learning · Computer Science 2021-10-12 Jieyu Zhang, Yue Yu, Yinghao Li, Yujing Wang, Yaming Yang, Mao Yang, Alexander Ratner

Weakly Supervised Construction of ASR Systems with Massive Video Data

Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts. Although several…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-22 Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang

A Weakly Supervised Approach for Classifying Stance in Twitter Replies

Conversations on social media (SM) are increasingly being used to investigate social issues on the web, such as online harassment and rumor spread. For such issues, a common thread of research uses adversarial reactions, e.g., replies…

Computation and Language · Computer Science 2021-03-15 Sumeet Kumar, Ramon Villa Cox, Matthew Babcock, Kathleen M. Carley

Learning to Selectively Learn for Weakly-supervised Paraphrase Generation

Paraphrase generation is a longstanding NLP task that has diverse applications for downstream NLP tasks. However, the effectiveness of existing efforts predominantly relies on large amounts of golden labeled data. Though unsupervised…

Computation and Language · Computer Science 2021-09-28 Kaize Ding, Dingcheng Li, Alexander Hanbo Li, Xing Fan, Chenlei Guo, Yang Liu, Huan Liu