Related papers: $\texttt{SEM-CTRL}$: Semantically Controlled Decod…

Learning and Enforcing Context-Sensitive Control for LLMs

Controlling the output of Large Language Models (LLMs) through context-sensitive constraints has emerged as a promising approach to overcome the limitations of Context-Free Grammars (CFGs) in guaranteeing generation validity. However, such…

Computation and Language · Computer Science 2026-04-14 Mohammad Albinhassan , Pranava Madhyastha , Mark Law , Alessandra Russo

Constrained Decoding of Diffusion LLMs with Context-Free Grammars

Large language models (LLMs) have shown promising performance across diverse domains. Many practical applications of LLMs, such as code completion and structured data extraction, require adherence to syntactic constraints specified by a…

Machine Learning · Computer Science 2025-08-18 Niels Mündler , Jasper Dekoninck , Martin Vechev

Semantic Compression With Large Language Models

The rise of large language models (LLMs) is revolutionizing information retrieval, question answering, summarization, and code generation tasks. However, in addition to confidently presenting factually inaccurate information at times (known…

Artificial Intelligence · Computer Science 2023-04-26 Henry Gilbert , Michael Sandborn , Douglas C. Schmidt , Jesse Spencer-Smith , Jules White

Extending Context Window of Large Language Models via Semantic Compression

Transformer-based Large Language Models (LLMs) often impose limitations on the length of the text input to ensure the generation of fluent and relevant responses. This constraint restricts their applicability in scenarios involving long…

Computation and Language · Computer Science 2023-12-18 Weizhi Fei , Xueyan Niu , Pingyi Zhou , Lu Hou , Bo Bai , Lei Deng , Wei Han

LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models

Semantic role labeling (SRL) is a crucial task of natural language processing (NLP). Although generative decoder-based large language models (LLMs) have achieved remarkable success across various NLP tasks, they still lag behind…

Computation and Language · Computer Science 2025-06-09 Xinxin Li , Huiyao Chen , Chengjun Liu , Jing Li , Meishan Zhang , Jun Yu , Min Zhang

Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning

Large language models (LLMs) are increasingly deployed for understanding large codebases, but whether they understand operational semantics of long code context or rely on pattern matching shortcuts remains unclear. We distinguish between…

Computation and Language · Computer Science 2026-04-21 Adam Štorek , Mukur Gupta , Samira Hajizadeh , Prashast Srivastava , Suman Jana

SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition

While Large Language Models (LLMs) have achieved remarkable success in a wide range of applications, their performance often degrades in complex reasoning tasks. In this work, we introduce SELT (Self-Evaluation LLM Tree Search), a novel…

Computation and Language · Computer Science 2025-06-10 Mengsong Wu , Di Zhang , Yuqiang Li , Dongzhan Zhou , Wenliang Chen

Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text

Large Language Models (LLMs) have demonstrated remarkable performance in various NLP tasks, including semantic parsing, which translates natural language into formal code representations. However, the reverse process, translating code into…

Computation and Language · Computer Science 2025-02-11 Ali Al-Lawati , Jason Lucas , Prasenjit Mitra

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

A wide range of LM applications require generating text that conforms to syntactic or semantic constraints. Imposing such constraints can be naturally framed as probabilistic conditioning, but exact generation from the resulting…

Computation and Language · Computer Science 2025-04-22 João Loula , Benjamin LeBrun , Li Du , Ben Lipkin , Clemente Pasti , Gabriel Grand , Tianyu Liu , Yahya Emara , Marjorie Freedman , Jason Eisner , Ryan Cotterell , Vikash Mansinghka , Alexander K. Lew , Tim Vieira , Timothy J. O'Donnell

Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness

The applications of large language models (LLMs) have been widely spread across all domains. However, the basic abilities such as the controllability of LLMs are still limited. To address this, we propose "Self-controller", a novel agentic…

Computation and Language · Computer Science 2024-10-02 Xiao Peng , Xufan Geng

Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method

Controlled text generation (CTG) seeks to guide large language model (LLM) output to produce text that conforms to desired criteria. The current study presents a novel CTG algorithm that enforces adherence toward specific rhetorical…

Computation and Language · Computer Science 2024-02-12 Joshua Zingale , Jugal Kalita

Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL

Large Language Models (LLMs) play a crucial role in capturing structured semantics to enhance language understanding, improve interpretability, and reduce bias. Nevertheless, an ongoing controversy exists over the extent to which LLMs can…

Computation and Language · Computer Science 2024-05-13 Ning Cheng , Zhaohui Yan , Ziming Wang , Zhijie Li , Jiaming Yu , Zilong Zheng , Kewei Tu , Jinan Xu , Wenjuan Han

Beyond Position Bias: Shifting Context Compression from Position-Driven to Semantic-Driven

Large Language Models (LLMs) have demonstrated exceptional performance across diverse tasks. However, their deployment in long-context scenarios faces high computational overhead and information redundancy. While soft prompt compression has…

Computation and Language · Computer Science 2026-05-12 Jiwei Tang , Zhijing Huang , Xinyu Zhang , Chen Jason Zhang , Jianxing Yu , Libin Zheng , Rui Meng , Jian Yin

Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding

Large Language Models (LLMs) have demonstrated a powerful ability for text generation. However, achieving optimal results with a given prompt or instruction can be challenging, especially for billion-sized models. Additionally, undesired…

Computation and Language · Computer Science 2024-10-07 Lifu Tu , Semih Yavuz , Jin Qu , Jiacheng Xu , Rui Meng , Caiming Xiong , Yingbo Zhou

Evaluating Self-Supervised Speech Models via Text-Based LLMS

Self-Supervised Learning (SSL) has gained traction for its ability to learn rich representations with low labeling costs, applicable across diverse downstream tasks. However, assessing the downstream-task performance remains challenging due…

Sound · Computer Science 2025-10-07 Takashi Maekaku , Keita Goto , Jinchuan Tian , Yusuke Shinohara , Shinji Watanabe

Resource-Efficient Adaptation of Large Language Models for Text Embeddings via Prompt Engineering and Contrastive Fine-tuning

Large Language Models (LLMs) have become a cornerstone in Natural Language Processing (NLP), achieving impressive performance in text generation. Their token-level representations capture rich, human-aligned semantics. However, pooling…

Computation and Language · Computer Science 2025-09-25 Benedikt Roth , Stephan Rappensperger , Tianming Qiu , Hamza Imamović , Julian Wörmann , Hao Shen

Semantic Probabilistic Control of Language Models

Semantic control entails steering LM generations towards satisfying subtle non-lexical constraints, e.g., toxicity, sentiment, or politeness, attributes that can be captured by a sequence-level verifier. It can thus be viewed as sampling…

Machine Learning · Computer Science 2025-05-06 Kareem Ahmed , Catarina G Belem , Padhraic Smyth , Sameer Singh

Large-Language-Model Enabled Semantic Communication Systems

Large language models (LLMs) have recently demonstrated state-of-the-art performance across various natural language processing (NLP) tasks, achieving near-human levels in multiple language understanding challenges and aligning closely with…

Signal Processing · Electrical Eng. & Systems 2025-07-08 Zhenyi Wang , Li Zou , Shengyun Wei , Kai Li , Feifan Liao , Haibo Mi , Rongxuan Lai

Projectional Decoding: Towards Semantic-Aware LLM Generation

Large language models (LLMs) are increasingly used to generate software artifacts across many software engineering (SE) tasks, yet ensuring the semantic validity of these artifacts remains a fundamental challenge. Existing constrained…

Software Engineering · Computer Science 2026-05-29 Boqi Chen , José Antonio Hernández López , Aren A. Babikian

Think in Sentences: Explicit Sentence Boundaries Enhance Language Model's Capabilities

Researchers have explored different ways to improve large language models (LLMs)' capabilities via dummy token insertion in contexts. However, existing works focus solely on the dummy tokens themselves, but fail to leverage the inherent…

Computation and Language · Computer Science 2026-04-16 Zhichen Liu , Yongyuan Li , Yang Xu