English
Related papers

Related papers: Blank Language Models

200 papers

Language models have become the backbone of today's AI systems. However, their predominant left-to-right generation limits the use of bidirectional context, which is essential for tasks that involve filling text in the middle. We propose…

Computation and Language · Computer Science 2023-10-17 Tianxiao Shen , Hao Peng , Ruoqi Shen , Yao Fu , Zaid Harchaoui , Yejin Choi

We present a simple approach for text infilling, the task of predicting missing spans of text at any position in a document. While infilling could enable rich functionality especially for writing assistance tools, more attention has been…

Computation and Language · Computer Science 2020-09-14 Chris Donahue , Mina Lee , Percy Liang

Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities. However, controlling attributes of the generated language (e.g. switching topic or sentiment) is difficult without…

Computation and Language · Computer Science 2020-03-04 Sumanth Dathathri , Andrea Madotto , Janice Lan , Jane Hung , Eric Frank , Piero Molino , Jason Yosinski , Rosanne Liu

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti , Samuel Belkadi , Lifeng Han , Goran Nenadic

There have been various types of pretraining architectures including autoencoding models (e.g., BERT), autoregressive models (e.g., GPT), and encoder-decoder models (e.g., T5). However, none of the pretraining frameworks performs the best…

Computation and Language · Computer Science 2022-03-18 Zhengxiao Du , Yujie Qian , Xiao Liu , Ming Ding , Jiezhong Qiu , Zhilin Yang , Jie Tang

Yes! In the present-day documenting and preserving endangered languages, the application of Large Language Models (LLMs) presents a promising approach. This paper explores how LLMs, particularly through in-context learning, can assist in…

Computation and Language · Computer Science 2024-12-17 Piyapath T Spencer , Nanthipat Kongborrirak

The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While…

Computation and Language · Computer Science 2022-07-04 Daphne Ippolito , Liam Dugan , Emily Reif , Ann Yuan , Andy Coenen , Chris Callison-Burch

Text Generation aims to produce plausible and readable text in a human language from input data. The resurgence of deep learning has greatly advanced this field, in particular, with the help of neural generation models based on pre-trained…

Computation and Language · Computer Science 2022-05-17 Junyi Li , Tianyi Tang , Wayne Xin Zhao , Jian-Yun Nie , Ji-Rong Wen

Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to capture implied rules from…

Computation and Language · Computer Science 2024-07-08 Fan Zhang , Kebing Jin , Hankz Hankui Zhuo

Although large language models (LLMs) have been touted for their ability to generate natural-sounding text, there are growing concerns around possible negative effects of LLMs such as data memorization, bias, and inappropriate language.…

Machine Learning · Computer Science 2023-05-10 Michael Kuchnik , Virginia Smith , George Amvrosiadis

Text generation has become one of the most important yet challenging tasks in natural language processing (NLP). The resurgence of deep learning has greatly advanced this field by neural generation models, especially the paradigm of…

Computation and Language · Computer Science 2021-05-26 Junyi Li , Tianyi Tang , Wayne Xin Zhao , Ji-Rong Wen

Large pretrained models are showing increasingly better performance in reasoning and planning tasks across different modalities, opening the possibility to leverage them for complex sequential decision making problems. In this paper, we…

Artificial Intelligence · Computer Science 2024-10-10 Martin Klissarov , Devon Hjelm , Alexander Toshev , Bogdan Mazoure

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo

Business Process Management (BPM) aims to improve organizational activities and their outcomes by managing the underlying processes. To achieve this, it is often necessary to consider information from various sources, including unstructured…

Computation and Language · Computer Science 2023-07-20 Michael Grohs , Luka Abb , Nourhan Elsayed , Jana-Rebecca Rehse

This article describes a novel language task, the Blackbird Language Matrices (BLM) task, inspired by intelligence tests, and illustrates the BLM datasets, their construction and benchmarking, and targeted experiments on chunking and…

Computation and Language · Computer Science 2026-02-25 Paola Merlo , Chunyang Jiang , Giuseppe Samo , Vivi Nastase

Prompt engineering is a new paradigm for enhancing the performance of trained neural network models. For optimizing text-style prompts, existing methods usually individually operate small portions of a text step by step, which either breaks…

Computation and Language · Computer Science 2023-10-03 Yujian Betterest Li , Kai Wu

Large Language Models (LLMs) are reshaping unsupervised learning by offering an unprecedented ability to perform text clustering based on their deep semantic understanding. However, their direct application is fundamentally limited by a…

Computation and Language · Computer Science 2026-04-08 Yuanjie Zhu , Liangwei Yang , Ke Xu , Weizhi Zhang , Zihe Song , Jindong Wang , Philip S. Yu

The emergence of Large Language Models (LLMs) with increasingly sophisticated natural language understanding and generative capabilities has sparked interest in the Agent-based Modelling (ABM) community. With their ability to summarize,…

A common training approach for language models involves using a large-scale language model to expand a human-provided dataset, which is subsequently used for model training.This method significantly reduces training costs by eliminating the…

Computation and Language · Computer Science 2025-07-09 Minghang Zhu , Shen Gao , Zhengliang Shi , Jiabao Fang , Pengjie Ren , Zhaochun Ren , Zhumin Chen , Shuo Shang

Automated planning is concerned with developing efficient algorithms to generate plans or sequences of actions to achieve a specific goal in a given environment. Emerging Large Language Models (LLMs) can answer questions, write high-quality…

‹ Prev 1 2 3 10 Next ›