Related papers: An Efficient Algorithm for Surface Generation

Example-Based Optimization of Surface-Generation Tables

A method is given that "inverts" a logic grammar and displays it from the point of view of the logical form, rather than from that of the word string. LR-compiling techniques are used to allow a recursive-descent generation algorithm to…

cmp-lg · Computer Science 2008-02-03 Christer Samuelsson

Interfacing Constraint-Based Grammars and Generation Algorithms

Constraint-based grammars can, in principle, serve as the major linguistic knowledge source for both parsing and generation. Surface generation starts from input semantics representations that may vary across grammars. For many declarative…

Computation and Language · Computer Science 2007-05-23 Stephan Busemann

Sentence Semantic Regression for Text Generation

Recall the classical text generation works, the generation framework can be briefly divided into two phases: idea reasoning and surface realization. The target of idea reasoning is to figure out the main idea which will be…

Computation and Language · Computer Science 2021-08-09 Wei Wang, Piji Li, Hai-Tao Zheng

Practical LR Parser Generation

Parsing is a fundamental building block in modern compilers, and for industrial programming languages, it is a surprisingly involved task. There are known approaches to generate parsers automatically, but the prevailing consensus is that…

Formal Languages and Automata Theory · Computer Science 2022-09-20 Joe Zimmerman

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to the benefits of scaling compute during…

Computation and Language · Computer Science 2024-11-21 Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaid Harchaoui

What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment

We propose an end-to-end, domain-independent neural encoder-aligner-decoder model for selective generation, i.e., the joint task of content selection and surface realization. Our model first encodes a full set of over-determined database…

Computation and Language · Computer Science 2016-01-12 Hongyuan Mei, Mohit Bansal, Matthew R. Walter

Beyond [CLS] through Ranking by Generation

Generative models for Information Retrieval, where ranking of documents is viewed as the task of generating a query from a document's language model, were very successful in various IR tasks in the past. However, with the advent of modern…

Computation and Language · Computer Science 2020-10-08 Cicero Nogueira dos Santos, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang

Retrieval-Generation Synergy Augmented Large Language Models

Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly divided into two…

Computation and Language · Computer Science 2023-10-10 Zhangyin Feng, Xiaocheng Feng, Dezhi Zhao, Maojin Yang, Bing Qin

DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation

Recent advancements in Large Language Models (LLMs) have significantly improved their performance across various Natural Language Processing (NLP) tasks. However, LLMs still struggle with generating non-factual responses due to limitations…

Computation and Language · Computer Science 2024-09-10 Taeho Hwang, Soyeong Jeong, Sukmin Cho, SeungYoon Han, Jong C. Park

MGR: Multi-generator Based Rationalization

Rationalization is to employ a generator and a predictor to construct a self-explaining NLP model in which the generator selects a subset of human-intelligible pieces of the input text to the following predictor. However, rationalization…

Machine Learning · Computer Science 2023-07-25 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Xinyang Li, Yuankai Zhang, Yang Qiu

Unifying Inference-Time Planning Language Generation

A line of work in planning uses LLM not to generate a plan, but to generate a formal representation in some planning language, which can be input into a symbolic solver to deterministically find a plan. While showing improved trust and…

Computation and Language · Computer Science 2025-10-08 Prabhu Prakash Kagitha, Bo Sun, Ishan Desai, Andrew Zhu, Cassie Huang, Manling Li, Ziyang Li, Li Zhang

A Survey on Retrieval-Augmented Text Generation for Large Language Models

Retrieval-Augmented Generation (RAG) merges retrieval methods with deep learning advancements to address the static limitations of large language models (LLMs) by enabling the dynamic integration of up-to-date external information. This…

Information Retrieval · Computer Science 2026-05-19 Yizheng Huang, Jimmy Huang

Reranking Laws for Language Generation: A Communication-Theoretic Perspective

To ensure large language models (LLMs) are used safely, one must reduce their propensity to hallucinate or to generate unacceptable answers. A simple and often used strategy is to first let the LLM generate multiple hypotheses and then…

Computation and Language · Computer Science 2025-02-12 António Farinhas, Haau-Sing Li, André F. T. Martins

Reflection-Window Decoding: Text Generation with Selective Refinement

The autoregressive decoding for text generation in large language models (LLMs), while widely used, is inherently suboptimal due to the lack of a built-in mechanism to perform refinement and/or correction of the generated content. In this…

Computation and Language · Computer Science 2025-06-03 Zeyu Tang, Zhenhao Chen, Xiangchen Song, Loka Li, Yunlong Deng, Yifan Shen, Guangyi Chen, Peter Spirtes, Kun Zhang

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model

Autoregressive models for text sometimes generate repetitive and low-quality output because errors accumulate during the steps of generation. This issue is often attributed to exposure bias - the difference between how a model is trained,…

Computation and Language · Computer Science 2024-03-26 Yizhe Zhang, Jiatao Gu, Zhuofeng Wu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly

Two-level, Many-Paths Generation

Large-scale natural language generation requires the integration of vast amounts of knowledge: lexical, grammatical, and conceptual. A robust generator must be able to operate well even when pieces of knowledge are missing. It must also be…

cmp-lg · Computer Science 2008-02-03 Kevin Knight, Vasileios Hatzivassiloglou

Compact non-left-recursive grammars using the selective left-corner transform and factoring

The left-corner transform removes left-recursion from (probabilistic) context-free grammars and unification grammars, permitting simple top-down parsing techniques to be used. Unfortunately the grammars produced by the standard left-corner…

Computation and Language · Computer Science 2007-05-23 Mark Johnson, Brian Roark

ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation

While Retrieval-Augmented Generation (RAG) systems enhance Large Language Models (LLMs) by incorporating external knowledge, they still face persistent challenges in retrieval inefficiency and the inability of LLMs to filter out irrelevant…

Computation and Language · Computer Science 2025-02-13 Ruobing Yao, Yifei Zhang, Shuang Song, Yuhua Liu, Neng Gao, Chenyang Tu

Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation

Retrieval-Augmented Generation (RAG) systems leverage Large Language Models (LLMs) to generate accurate and reliable responses that are grounded in retrieved context. However, LLMs often generate inconsistent outputs for semantically…

Computation and Language · Computer Science 2025-10-17 Xujun Peng, Anoop Kumar, Jingyu Wu, Parker Glenn, Daben Liu

Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models

Sentence embedding is essential for many NLP tasks, with contrastive learning methods achieving strong performance using annotated datasets like NLI. Yet, the reliance on manual labels limits scalability. Recent studies leverage large…

Computation and Language · Computer Science 2025-06-05 Liyang He, Chenglong Liu, Rui Li, Zhenya Huang, Shulan Ruan, Jun Zhou, Enhong Chen