Related papers: Towards Zero-Label Language Learning

Generate, Annotate, and Learn: NLP with Synthetic Text

This paper studies the use of language models as a source of synthetic unlabeled text for NLP. We formulate a general framework called "generate, annotate, and learn (GAL)" to take advantage of synthetic text within knowledge…

Machine Learning · Computer Science 2022-06-01 Xuanli He, Islam Nassar, Jamie Kiros, Gholamreza Haffari, Mohammad Norouzi

Jointly Improving Language Understanding and Generation with Quality-Weighted Weak Supervision of Automatic Labeling

Neural natural language generation (NLG) and understanding (NLU) models are data-hungry and require massive amounts of annotated data to be competitive. Recent frameworks address this bottleneck with generative models that synthesize weak…

Computation and Language · Computer Science 2021-02-09 Ernie Chang, Vera Demberg, Alex Marin

Unsupervised Natural Language Generation with Denoising Autoencoders

Generating text from structured data is important for various tasks such as question answering and dialog systems. We show that in at least one domain, without any supervision and only based on unlabeled text, we are able to build a Natural…

Computation and Language · Computer Science 2018-08-28 Markus Freitag, Scott Roy

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

Natural Language Generation (NLG) accepts input data in the form of images, videos, or text and generates corresponding natural language text as output. Existing NLG methods mainly adopt a supervised approach and rely heavily on coupled…

Computation and Language · Computer Science 2024-06-04 Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton

Semi-Supervised Neural Text Generation by Joint Learning of Natural Language Generation and Natural Language Understanding Models

In Natural Language Generation (NLG), End-to-End (E2E) systems trained through deep learning have recently gained a strong interest. Such deep models need a large amount of carefully annotated data to reach satisfactory performance.…

Computation and Language · Computer Science 2019-10-09 Raheel Qader, François Portet, Cyril Labbé

Adaptive Self-training for Few-shot Neural Sequence Labeling

Sequence labeling is an important technique employed for many Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER), slot tagging for dialog systems and semantic parsing. Large-scale pre-trained language models…

Computation and Language · Computer Science 2020-12-14 Yaqing Wang, Subhabrata Mukherjee, Haoda Chu, Yuancheng Tu, Ming Wu, Jing Gao, Ahmed Hassan Awadallah

Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization

Training language models to learn from human instructions for zero-shot cross-task generalization has attracted much attention in NLP communities. Recently, instruction tuning (IT), which fine-tunes a pre-trained language model on a massive…

Computation and Language · Computer Science 2022-10-18 Yuxian Gu, Pei Ke, Xiaoyan Zhu, Minlie Huang

Improving and Simplifying Pattern Exploiting Training

Recently, pre-trained language models (LMs) have achieved strong performance when fine-tuned on difficult benchmarks like SuperGLUE. However, performance can suffer when there are very few labeled examples available for fine-tuning. Pattern…

Computation and Language · Computer Science 2021-09-29 Derek Tam, Rakesh R Menon, Mohit Bansal, Shashank Srivastava, Colin Raffel

ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters

We tackle the problem of zero-shot cross-lingual transfer in NLP tasks via the use of language adapters (LAs). Most of the earlier works have explored training with adapter of a single source (often English), and testing either using the…

Computation and Language · Computer Science 2023-10-26 Vipul Rathore, Rajdeep Dhingra, Parag Singla, Mausam

Predictions For Pre-training Language Models

Language model pre-training has proven to be useful in many language understanding tasks. In this paper, we investigate whether it is still helpful to add the self-training method in the pre-training step and the fine-tuning step. Towards…

Computation and Language · Computer Science 2023-02-17 Tong Guo

Meta-learning for Few-shot Natural Language Processing: A Survey

Few-shot natural language processing (NLP) refers to NLP tasks that are accompanied with merely a handful of labeled examples. This is a real-world challenge that an AI system must learn to handle. Usually we rely on collecting more…

Computation and Language · Computer Science 2020-07-21 Wenpeng Yin

Towards Practical Few-shot Federated NLP

Transformer-based pre-trained models have emerged as the predominant solution for natural language processing (NLP). Fine-tuning such pre-trained models for downstream tasks often requires a considerable amount of labeled private data. In…

Computation and Language · Computer Science 2023-08-22 Dongqi Cai, Yaozong Wu, Haitao Yuan, Shangguang Wang, Felix Xiaozhu Lin, Mengwei Xu

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

Pretrained language models (PLMs) have demonstrated remarkable performance in various natural language processing tasks: Unidirectional PLMs (e.g., GPT) are well known for their superior text generation capabilities; bidirectional PLMs…

Computation and Language · Computer Science 2022-10-13 Yu Meng, Jiaxin Huang, Yu Zhang, Jiawei Han

AUGNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation

Natural Language Generation (NLG) is a key component in a task-oriented dialogue system, which converts the structured meaning representation (MR) to the natural language. For large-scale conversational systems, where it is common to have…

Computation and Language · Computer Science 2021-06-11 Xinnuo Xu, Guoyin Wang, Young-Bum Kim, Sungjin Lee

Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning

There is a rising interest in further exploring the zero-shot learning potential of large pre-trained language models (PLMs). A new paradigm called data-generation-based zero-shot learning has achieved impressive success. In this paradigm,…

Computation and Language · Computer Science 2023-02-28 Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Weizhong Zhang, Xiaodan Liang, Zhenguo Li, Lingpeng Kong

Pre-trained Language Models Can be Fully Zero-Shot Learners

How can we extend a pre-trained model to many language understanding tasks, without labeled or additional unlabeled data? Pre-trained language models (PLMs) have been effective for a wide range of NLP tasks. However, existing approaches…

Computation and Language · Computer Science 2023-05-29 Xuandong Zhao, Siqi Ouyang, Zhiguo Yu, Ming Wu, Lei Li

Prompt Consistency for Zero-Shot Task Generalization

One of the most impressive results of recent NLP history is the ability of pre-trained language models to solve new tasks in a zero-shot setting. To achieve this, NLP tasks are framed as natural language prompts, generating a response…

Computation and Language · Computer Science 2022-12-29 Chunting Zhou, Junxian He, Xuezhe Ma, Taylor Berg-Kirkpatrick, Graham Neubig

Pre-Trained Vision-Language Models as Partial Annotators

Pre-trained vision-language models learn massive data to model unified representations of images and natural languages, which can be widely applied to downstream machine learning tasks. In addition to zero-shot inference, in order to better…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Qian-Wei Wang, Yuqiu Xie, Letian Zhang, Zimo Liu, Shu-Tao Xia

Zero-Shot Grammar Competency Estimation Using Large Language Model Generated Pseudo Labels

Grammar competency estimation is essential for assessing linguistic proficiency in both written and spoken language; however, the spoken modality presents additional challenges due to its spontaneous, unstructured, and disfluent nature.…

Computation and Language · Computer Science 2025-11-18 Sourya Dipta Das, Shubham Kumar, Kuldeep Yadav

ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval

With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks. Different from prior works that generate training data with billion-scale natural language generation (NLG) models,…

Computation and Language · Computer Science 2023-05-19 Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, Chao Zhang