English
Related papers

Related papers: Decoding Decoded: Understanding Hyperparameter Eff…

200 papers

Decoding methods play an indispensable role in converting language models from next-token predictors into practical task solvers. Prior research on decoding methods, primarily focusing on task-specific models, may not extend to the current…

Computation and Language · Computer Science 2024-10-10 Chufan Shi , Haoran Yang , Deng Cai , Zhisong Zhang , Yifan Wang , Yujiu Yang , Wai Lam

Large Language Models (LLMs) rely on various decoding strategies to generate text, and these choices can significantly affect output quality. In healthcare, where accuracy is critical, the impact of decoding strategies remains…

Computation and Language · Computer Science 2025-08-20 Oriana Presacan , Alireza Nik , Vajira Thambawita , Bogdan Ionescu , Michael Riegler

As Large Language Models (LLMs) become increasingly prevalent, their generated outputs are proliferating across the web, risking a future where machine-generated content dilutes human-authored text. Since online data is the primary resource…

Computation and Language · Computer Science 2025-09-23 George Drayson , Emine Yilmaz , Vasileios Lampos

When generating text from probabilistic models, the chosen decoding strategy has a profound effect on the resulting text. Yet the properties elicited by various decoding strategies do not always transfer across natural language generation…

Computation and Language · Computer Science 2022-03-30 Gian Wiher , Clara Meister , Ryan Cotterell

Decoding strategies manipulate the probability distribution underlying the output of a language model and can therefore affect both generation quality and its uncertainty. In this study, we investigate the impact of decoding strategies on…

Computation and Language · Computer Science 2025-09-23 Wataru Hashimoto , Hidetaka Kamigaito , Taro Watanabe

In the realm of Large Language Models (LLMs), users commonly employ diverse decoding strategies and adjust hyperparameters to control the generated text. However, a critical question emerges: Are LLMs conscious of the existence of these…

Computation and Language · Computer Science 2024-02-20 Siyin Wang , Shimin Li , Tianxiang Sun , Jinlan Fu , Qinyuan Cheng , Jiasheng Ye , Junjie Ye , Xipeng Qiu , Xuanjing Huang

Constrained decoding approaches aim to control the meaning or style of text generated by the pre-trained large language models (LLMs or also PLMs) for various tasks at inference time. However, these methods often guide plausible…

Computation and Language · Computer Science 2025-05-06 Chen Xu , Tian Lan , Yu Ji , Changlong Yu , Wei Wang , Jun Gao , Qunxi Dong , Kun Qian , Piji Li , Wei Bi , Bin Hu

Open-ended text generation has become a prominent task in natural language processing due to the rise of powerful (large) language models. However, evaluating the quality of these models and the employed decoding strategies remains…

Computation and Language · Computer Science 2025-06-18 Esteban Garces Arias , Hannah Blocher , Julian Rodemann , Meimingwei Li , Christian Heumann , Matthias Aßenmacher

Large language models (LLMs) have shown remarkable ability to generate code, yet their outputs often violate syntactic or semantic constraints when guided only through natural language prompts. We introduce TreeCoder, the most general and…

Machine Learning · Computer Science 2026-04-27 Henrijs Princis , Arindam Sharma , Cristina David

Large Language Models (LLMs) are now capable of generating highly fluent, human-like text. They enable many applications, but also raise concerns such as large scale spam, phishing, or academic misuse. While much work has focused on…

Computation and Language · Computer Science 2026-04-16 Swati Rallapalli , Shannon Gallagher , Ronald Yurko , Tyler Brooks , Chuck Loughin , Michele Sezgin , Violet Turri

Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive…

Computation and Language · Computer Science 2024-06-06 Haozhe Ji , Pei Ke , Hongning Wang , Minlie Huang

How can small-scale large language models (LLMs) efficiently utilize the supervision of LLMs to improve their generative quality? This question has been well studied in scenarios where there is no restriction on the number of LLM…

Computation and Language · Computer Science 2024-10-04 Hyunjong Ok , Jegwang Ryu , Jaeho Lee

Large Language Models (LLMs) have demonstrated a powerful ability for text generation. However, achieving optimal results with a given prompt or instruction can be challenging, especially for billion-sized models. Additionally, undesired…

Computation and Language · Computer Science 2024-10-07 Lifu Tu , Semih Yavuz , Jin Qu , Jiacheng Xu , Rui Meng , Caiming Xiong , Yingbo Zhou

This study investigates semantic uncertainty in large language model (LLM) outputs across different decoding methods, focusing on emerging techniques like speculative sampling and chain-of-thought (CoT) decoding. Through experiments on…

Computation and Language · Computer Science 2025-06-24 Darius Foodeei , Simin Fan , Martin Jaggi

Advancements in natural language generation (NLG) and large language models (LLMs) have led to proficient text generation in various tasks. However, integrating intricate constraints into neural text generation, due to LLMs' opacity,…

Computation and Language · Computer Science 2024-03-22 Xiang Chen , Xiaojun Wan

Recent progress in large language models (LLMs) has demonstrated the ability to learn and leverage Internet-scale knowledge through pre-training with autoregressive models. Unfortunately, applying such models to settings with embodied…

Existing Large Language Models (LLMs) generate text through unidirectional autoregressive decoding methods to respond to various user queries. These methods tend to consider token selection in a simple sequential manner, making it easy to…

Computation and Language · Computer Science 2024-05-28 Ziqin Luo , Haixia Han , Haokun Zhao , Guochao Jiang , Chengyu Du , Tingyun Li , Jiaqing Liang , Deqing Yang , Yanghua Xiao

Large language models (LLMs) exhibit impressive natural language capabilities but suffer from hallucination -- generating content ungrounded in the realities of training data. Recent work has focused on decoding techniques to improve…

Computation and Language · Computer Science 2024-04-16 Souvik Das , Lifeng Jin , Linfeng Song , Haitao Mi , Baolin Peng , Dong Yu

Existing large language models (LLMs) are known for generating "hallucinated" content, namely a fabricated text of plausibly looking, yet unfounded, facts. To identify when these hallucination scenarios occur, we examine the properties of…

Computation and Language · Computer Science 2023-09-06 Mohamed Akrout

This paper introduces the counter-intuitive generalization results of overfitting pre-trained large language models (LLMs) on very small datasets. In the setting of open-ended text generation, it is well-documented that LLMs tend to…

Computation and Language · Computer Science 2025-02-27 Fredrik Carlsson , Fangyu Liu , Daniel Ward , Murathan Kurfali , Joakim Nivre
‹ Prev 1 2 3 10 Next ›