English
Related papers

Related papers: Contrastive Decoding: Open-ended Text Generation a…

200 papers

Contrastive Decoding (CD) has emerged as an effective inference-time strategy for enhancing open-ended text generation by exploiting the divergence in output probabilities between a large expert language model and a smaller amateur model.…

Computation and Language · Computer Science 2025-07-30 Jaydip Sen , Subhasis Dasgupta , Hetvi Waghela

Contrastive decoding (CD) (Li et al., 2023) improves the next-token distribution of a large expert language model (LM) using a small amateur LM. Although CD is applied to various LMs and domains to enhance open-ended text generation, it is…

Computation and Language · Computer Science 2024-11-05 Haw-Shiuan Chang , Nanyun Peng , Mohit Bansal , Anil Ramakrishna , Tagyoung Chung

Ensuring that large language models (LMs) are fair, robust and useful requires an understanding of how different modifications to their inputs impact the model's behaviour. In the context of open-text generation tasks, however, such an…

Computation and Language · Computer Science 2023-05-15 Gal Yona , Or Honovich , Itay Laish , Roee Aharoni

Large language models~(LLMs) exhibit exceptional performance in language tasks, yet their auto-regressive inference is limited due to high computational requirements and is sub-optimal due to the exposure bias. Inspired by speculative…

Computation and Language · Computer Science 2024-03-14 Hongyi Yuan , Keming Lu , Fei Huang , Zheng Yuan , Chang Zhou

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k-$sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias , Julian Rodemann , Meimingwei Li , Christian Heumann , Matthias Aßenmacher

In the study, we empirically compare the two recently proposed decoding methods, i.e. Contrastive Search (CS) and Contrastive Decoding (CD), for open-ended text generation. The automatic evaluation results suggest that, while CS performs…

Computation and Language · Computer Science 2022-11-22 Yixuan Su , Jialu Xu

Large language models (LLMs) are trained on huge amounts of textual data, and concerns have been raised that the limits of such data may soon be reached. A potential solution is to train on synthetic data sampled from LLMs. In this work, we…

Computation and Language · Computer Science 2025-10-10 Jannek Ulm , Kevin Du , Vésteinn Snæbjarnarson

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or…

Computation and Language · Computer Science 2024-05-07 Zheng Zhao , Emilio Monti , Jens Lehmann , Haytham Assem

When using large language models (LLMs) in knowledge-intensive tasks, such as open-domain question answering, external context can bridge the gap between external knowledge and the LLMs' parametric knowledge. Recent research has been…

Computation and Language · Computer Science 2024-10-08 Youna Kim , Hyuhng Joon Kim , Cheonbok Park , Choonghyun Park , Hyunsoo Cho , Junyeob Kim , Kang Min Yoo , Sang-goo Lee , Taeuk Kim

We propose a straightforward approach called Distillation Contrastive Decoding (DCD) to enhance the reasoning capabilities of Large Language Models (LLMs) during inference. In contrast to previous approaches that relied on smaller amateur…

Computation and Language · Computer Science 2024-08-26 Phuc Phan , Hieu Tran , Long Phan

Recent work on test-time scaling for large language model (LLM) reasoning typically assumes that allocating more inference-time computation uniformly improves correctness. However, prior studies show that reasoning uncertainty is highly…

Computation and Language · Computer Science 2026-02-23 Lexiang Tang , Weihao Gao , Bingchen Zhao , Lu Ma , Qiao jin , Bang Yang , Yuexian Zou

The generation of toxic content by large language models (LLMs) remains a critical challenge for the safe deployment of language technology. We propose a novel framework for implicit knowledge editing and controlled text generation by…

Computation and Language · Computer Science 2025-06-02 Tassilo Klein , Moin Nabi

Text generation is of great importance to many natural language processing applications. However, maximization-based decoding methods (e.g. beam search) of neural language models often lead to degenerate solutions -- the generated text is…

Computation and Language · Computer Science 2022-09-27 Yixuan Su , Tian Lan , Yan Wang , Dani Yogatama , Lingpeng Kong , Nigel Collier

We demonstrate that Contrastive Decoding -- a simple, computationally light, and training-free text generation method proposed by Li et al 2022 -- achieves large out-of-the-box improvements over greedy decoding on a variety of reasoning…

Computation and Language · Computer Science 2023-10-02 Sean O'Brien , Mike Lewis

The synergistic mechanism based on Speculative Decoding (SD) has garnered considerable attention as a simple yet effective approach for accelerating the inference of large language models (LLMs). Nonetheless, the high rejection rates…

Computation and Language · Computer Science 2025-06-04 Chuanghao Ding , Jiaping Wang , Ziqing Yang , Xiaoliang Wang , Dahua Lin , Cam-Tu Nguyen , Fei Tan

While Contrastive Decoding (CD) has proven effective at enhancing Large Audio Language Models (LALMs), the underlying mechanisms driving its success and the comparative efficacy of different strategies remain unclear. This study…

Sound · Computer Science 2026-03-11 Tzu-Quan Lin , Wei-Ping Huang , Yi-Cheng Lin , Hung-yi Lee

Although multimodal large language models (MLLMs) exhibit remarkable reasoning capabilities on complex multimodal understanding tasks, they still suffer from the notorious hallucination issue: generating outputs misaligned with obvious…

Machine Learning · Computer Science 2025-11-04 Wei Chen , Xin Yan , Bin Wen , Fan Yang , Tingting Gao , Di Zhang , Long Chen

Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their success, LVLMs still…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Sicong Leng , Hang Zhang , Guanzheng Chen , Xin Li , Shijian Lu , Chunyan Miao , Lidong Bing

Although Large Language Models (LLMs) excel in reasoning and generation for language tasks, they are not specifically designed for multimodal challenges. Training Multimodal Large Language Models (MLLMs), however, is resource-intensive and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yuqi Pang , Bowen Yang , Haoqin Tu , Yun Cao , Zeyu Zhang

Large language models (LLMs) have demonstrated exceptional proficiency in language understanding. However, when LLMs align their outputs with deceptive and/or misleading prompts, the generated responses could deviate from the de facto…

Computation and Language · Computer Science 2025-09-03 Zixuan Shangguan , Yanjie Dong , Lanjun Wang , Xiaoyi Fan , Victor C. M. Leung , Xiping Hu
‹ Prev 1 2 3 10 Next ›