Related papers: Active Learning for Multilingual Semantic Parser

The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning

Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by…

Computation and Language · Computer Science 2023-05-23 Zhuang Li , Lizhen Qu , Philip R. Cohen , Raj V. Tumuluri , Gholamreza Haffari

MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators

Prompting has recently been shown as a promising approach for applying pre-trained language models to perform downstream tasks. We present Multi-Stage Prompting (MSP), a simple and automatic approach for leveraging pre-trained language…

Computation and Language · Computer Science 2022-03-18 Zhixing Tan , Xiangwen Zhang , Shuo Wang , Yang Liu

Active Learning for Interactive Neural Machine Translation of Data Streams

We study the application of active learning techniques to the translation of unbounded data streams via interactive neural machine translation. The main idea is to select, from an unbounded stream of source sentences, those worth to be…

Computation and Language · Computer Science 2018-10-26 Álvaro Peris , Francisco Casacuberta

Multilingual Neural Semantic Parsing for Low-Resourced Languages

Multilingual semantic parsing is a cost-effective method that allows a single model to understand different languages. However, researchers face a great imbalance of availability of training data, with English being resource rich, and other…

Computation and Language · Computer Science 2021-06-15 Menglin Xia , Emilio Monti

Maximally Separated Active Learning

Active Learning aims to optimize performance while minimizing annotation costs by selecting the most informative samples from an unlabelled pool. Traditional uncertainty sampling often leads to sampling bias by choosing similar uncertain…

Machine Learning · Computer Science 2024-11-27 Tejaswi Kasarla , Abhishek Jha , Faye Tervoort , Rita Cucchiara , Pascal Mettes

Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data

Active learning (AL) is a principled strategy to reduce annotation cost in data-hungry deep learning. However, existing AL algorithms focus almost exclusively on unimodal data, overlooking the substantial annotation burden in multimodal…

Machine Learning · Computer Science 2026-04-24 Jiancheng Zhang , Yinglun Zhu

Active Mining Sample Pair Semantics for Image-text Matching

Recently, commonsense learning has been a hot topic in image-text matching. Although it can describe more graphic correlations, commonsense learning still has some shortcomings: 1) The existing methods are based on triplet semantic…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Yongfeng Chena , Jin Liua , Zhijing Yang , Ruihan Chena , Junpeng Tan

Active Learning for Neural Machine Translation

The machine translation mechanism translates texts automatically between different natural languages, and Neural Machine Translation (NMT) has gained attention for its rational context analysis and fluent translation accuracy. However,…

Computation and Language · Computer Science 2023-01-03 Neeraj Vashistha , Kriti Singh , Ramakant Shakya

Bootstrapping Multilingual Semantic Parsers using Large Language Models

Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific…

Computation and Language · Computer Science 2023-02-14 Abhijeet Awasthi , Nitish Gupta , Bidisha Samanta , Shachi Dave , Sunita Sarawagi , Partha Talukdar

Phrase-level Active Learning for Neural Machine Translation

Neural machine translation (NMT) is sensitive to domain shift. In this paper, we address this problem in an active learning setting where we can spend a given budget on translating in-domain data, and gradually fine-tune a pre-trained…

Computation and Language · Computer Science 2021-06-23 Junjie Hu , Graham Neubig

Semantic Parsing in Limited Resource Conditions

This thesis explores challenges in semantic parsing, specifically focusing on scenarios with limited data and computational resources. It offers solutions using techniques like automatic data curation, knowledge transfer, active learning,…

Computation and Language · Computer Science 2023-09-15 Zhuang Li

Extend and Explain: Interpreting Very Long Language Models

While Transformer language models (LMs) are state-of-the-art for information extraction, long text introduces computational challenges requiring suboptimal preprocessing steps or alternative model architectures. Sparse attention LMs can…

Computation and Language · Computer Science 2022-12-01 Joel Stremmel , Brian L. Hill , Jeffrey Hertzberg , Jaime Murillo , Llewelyn Allotey , Eran Halperin

One Semantic Parser to Parse Them All: Sequence to Sequence Multi-Task Learning on Semantic Parsing Datasets

Semantic parsers map natural language utterances to meaning representations. The lack of a single standard for meaning representations led to the creation of a plethora of semantic parsing datasets. To unify different datasets and train a…

Computation and Language · Computer Science 2021-06-15 Marco Damonte , Emilio Monti

Onception: Active Learning with Expert Advice for Real World Machine Translation

Active learning can play an important role in low-resource settings (i.e., where annotated data is scarce), by selecting which instances may be more worthy to annotate. Most active learning approaches for Machine Translation assume the…

Computation and Language · Computer Science 2022-03-15 Vânia Mendonça , Ricardo Rei , Luisa Coheur , Alberto Sardinha

Semantic Segmentation with Active Semi-Supervised Learning

Using deep learning, we now have the ability to create exceptionally good semantic segmentation systems; however, collecting the prerequisite pixel-wise annotations for training images remains expensive and time-consuming. Therefore, it…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Aneesh Rangnekar , Christopher Kanan , Matthew Hoffman

Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs without Real Data Replay

Continual Semantic Parsing (CSP) aims to train parsers to convert natural language questions into SQL across tasks with limited annotated examples, adapting to the real-world scenario of dynamically updated databases. Previous studies…

Computation and Language · Computer Science 2024-12-11 Ruiheng Liu , Jinyu Zhang , Yanqi Song , Yu Zhang , Bailong Yang

Testing the Assumptions of Active Learning for Translation Tasks with Few Samples

Active learning (AL) is a training paradigm for selecting unlabeled samples for annotation to improve model performance on a test set, which is useful when only a limited number of samples can be annotated. These algorithms often work by…

Computation and Language · Computer Science 2026-04-13 Lorenzo Jaime Yu Flores , Cesare Spinoso di-Piano , Ori Ernst , David Ifeoluwa Adelani , Jackie Chi Kit Cheung

Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information

We investigate the following question for machine translation (MT): can we develop a single universal MT model to serve as the common seed and obtain derivative and improved models on arbitrary language pairs? We propose mRASP, an approach…

Computation and Language · Computer Science 2021-01-25 Zehui Lin , Xiao Pan , Mingxuan Wang , Xipeng Qiu , Jiangtao Feng , Hao Zhou , Lei Li

Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models

Recently, Large language models (LLMs) with in-context learning have demonstrated remarkable potential in handling neural machine translation. However, existing evidence shows that LLMs are prompt-sensitive and it is sub-optimal to apply…

Computation and Language · Computer Science 2025-01-06 Lei Tang , Jinghui Qin , Wenxuan Ye , Hao Tan , Zhijing Yang

Active learning for medical code assignment

Machine Learning (ML) is widely used to automatically extract meaningful information from Electronic Health Records (EHR) to support operational, clinical, and financial decision-making. However, ML models require a large number of…

Machine Learning · Computer Science 2021-04-14 Martha Dais Ferreira , Michal Malyska , Nicola Sahar , Riccardo Miotto , Fernando Paulovich , Evangelos Milios