Related papers: Blank Language Models

FiLM: Fill-in Language Models for Any-Order Generation

Language models have become the backbone of today's AI systems. However, their predominant left-to-right generation limits the use of bidirectional context, which is essential for tasks that involve filling text in the middle. We propose…

Computation and Language · Computer Science 2023-10-17 Tianxiao Shen, Hao Peng, Ruoqi Shen, Yao Fu, Zaid Harchaoui, Yejin Choi

Enabling Language Models to Fill in the Blanks

We present a simple approach for text infilling, the task of predicting missing spans of text at any position in a document. While infilling could enable rich functionality especially for writing assistance tools, more attention has been…

Computation and Language · Computer Science 2020-09-14 Chris Donahue, Mina Lee, Percy Liang

Plug and Play Language Models: A Simple Approach to Controlled Text Generation

Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities. However, controlling attributes of the generated language (e.g. switching topic or sentiment) is difficult without…

Computation and Language · Computer Science 2020-03-04 Sumanth Dathathri, Andrea Madotto, Janice Lan, Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

Exploration of Masked and Causal Language Modelling for Text Generation

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

GLM: General Language Model Pretraining with Autoregressive Blank Infilling

There have been various types of pretraining architectures including autoencoding models (e.g., BERT), autoregressive models (e.g., GPT), and encoder-decoder models (e.g., T5). However, none of the pretraining frameworks performs the best…

Computation and Language · Computer Science 2022-03-18 Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang

Can LLMs Help Create Grammar?: Automating Grammar Creation for Endangered Languages with In-Context Learning

Yes! In the present-day documenting and preserving endangered languages, the application of Large Language Models (LLMs) presents a promising approach. This paper explores how LLMs, particularly through in-context learning, can assist in…

Computation and Language · Computer Science 2024-12-17 Piyapath T Spencer, Nanthipat Kongborrirak

The Case for a Single Model that can Both Generate Continuations and Fill in the Blank

The task of inserting text into a specified position in a passage, known as fill in the blank (FitB), is useful for a variety of applications where writers interact with a natural language generation (NLG) system to craft text. While…

Computation and Language · Computer Science 2022-07-04 Daphne Ippolito, Liam Dugan, Emily Reif, Ann Yuan, Andy Coenen, Chris Callison-Burch

Pretrained Language Models for Text Generation: A Survey

Text Generation aims to produce plausible and readable text in a human language from input data. The resurgence of deep learning has greatly advanced this field, in particular, with the help of neural generation models based on pre-trained…

Computation and Language · Computer Science 2022-05-17 Junyi Li, Tianyi Tang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

Planning with Logical Graph-based Language Model for Instruction Generation

Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to capture implied rules from…

Computation and Language · Computer Science 2024-07-08 Fan Zhang, Kebing Jin, Hankz Hankui Zhuo

Validating Large Language Models with ReLM

Although large language models (LLMs) have been touted for their ability to generate natural-sounding text, there are growing concerns around possible negative effects of LLMs such as data memorization, bias, and inappropriate language.…

Machine Learning · Computer Science 2023-05-10 Michael Kuchnik, Virginia Smith, George Amvrosiadis

Pretrained Language Models for Text Generation: A Survey

Text generation has become one of the most important yet challenging tasks in natural language processing (NLP). The resurgence of deep learning has greatly advanced this field by neural generation models, especially the paradigm of…

Computation and Language · Computer Science 2021-05-26 Junyi Li, Tianyi Tang, Wayne Xin Zhao, Ji-Rong Wen

On the Modeling Capabilities of Large Language Models for Sequential Decision Making

Large pretrained models are showing increasingly better performance in reasoning and planning tasks across different modalities, opening the possibility to leverage them for complex sequential decision making problems. In this paper, we…

Artificial Intelligence · Computer Science 2024-10-10 Martin Klissarov, Devon Hjelm, Alexander Toshev, Bogdan Mazoure

An Overview on Language Models: Recent Developments and Outlook

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei, Yun-Cheng Wang, Bin Wang, C.-C. Jay Kuo

Large Language Models can accomplish Business Process Management Tasks

Business Process Management (BPM) aims to improve organizational activities and their outcomes by managing the underlying processes. To achieve this, it is often necessary to consider information from various sources, including unstructured…

Computation and Language · Computer Science 2023-07-20 Michael Grohs, Luka Abb, Nourhan Elsayed, Jana-Rebecca Rehse

Blackbird Language Matrices: A Framework to Investigate the Linguistic Competence of Language Models

This article describes a novel language task, the Blackbird Language Matrices (BLM) task, inspired by intelligence tests, and illustrates the BLM datasets, their construction and benchmarking, and targeted experiments on chunking and…

Computation and Language · Computer Science 2026-02-25 Paola Merlo, Chunyang Jiang, Giuseppe Samo, Vivi Nastase

SPELL: Semantic Prompt Evolution based on a LLM

Prompt engineering is a new paradigm for enhancing the performance of trained neural network models. For optimizing text-style prompts, existing methods usually individually operate small portions of a text step by step, which either breaks…

Computation and Language · Computer Science 2023-10-03 Yujian Betterest Li, Kai Wu

LLM-MemCluster: Empowering Large Language Models with Dynamic Memory for Text Clustering

Large Language Models (LLMs) are reshaping unsupervised learning by offering an unprecedented ability to perform text clustering based on their deep semantic understanding. However, their direct application is fundamentally limited by a…

Computation and Language · Computer Science 2026-04-08 Yuanjie Zhu, Liangwei Yang, Ke Xu, Weizhi Zhang, Zihe Song, Jindong Wang, Philip S. Yu

Large Language Models for Agent-Based Modelling: Current and possible uses across the modelling cycle

The emergence of Large Language Models (LLMs) with increasingly sophisticated natural language understanding and generative capabilities has sparked interest in the Agent-based Modelling (ABM) community. With their ability to summarize,…

Multiagent Systems · Computer Science 2025-07-09 Loïs Vanhée, Melania Borit, Peer-Olaf Siebers, Roger Cremades, Christopher Frantz, Önder Gürcan, František Kalvas, Denisa Reshef Kera, Vivek Nallur, Kavin Narasimhan, Martin Neumann

Evolution without Large Models: Training Language Model with Task Principles

A common training approach for language models involves using a large-scale language model to expand a human-provided dataset, which is subsequently used for model training. This method significantly reduces training costs by eliminating the…

Computation and Language · Computer Science 2025-07-09 Minghang Zhu, Shen Gao, Zhengliang Shi, Jiabao Fang, Pengjie Ren, Zhaochun Ren, Zhumin Chen, Shuo Shang

Understanding the Capabilities of Large Language Models for Automated Planning

Automated planning is concerned with developing efficient algorithms to generate plans or sequences of actions to achieve a specific goal in a given environment. Emerging Large Language Models (LLMs) can answer questions, write high-quality…

Artificial Intelligence · Computer Science 2023-05-26 Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Biplav Srivastava, Lior Horesh, Francesco Fabiano, Andrea Loreggia