Related papers: Contrastive Decoding: Open-ended Text Generation a…

Multi-Amateur Contrastive Decoding for Text Generation

Contrastive Decoding (CD) has emerged as an effective inference-time strategy for enhancing open-ended text generation by exploiting the divergence in output probabilities between a large expert language model and a smaller amateur model.…

Computation and Language · Computer Science 2025-07-30 Jaydip Sen, Subhasis Dasgupta, Hetvi Waghela

Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM

Contrastive decoding (CD) (Li et al., 2023) improves the next-token distribution of a large expert language model (LM) using a small amateur LM. Although CD is applied to various LMs and domains to enhance open-ended text generation, it is…

Computation and Language · Computer Science 2024-11-05 Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung

Surfacing Biases in Large Language Models using Contrastive Input Decoding

Ensuring that large language models (LMs) are fair, robust and useful requires an understanding of how different modifications to their inputs impact the model's behaviour. In the context of open-text generation tasks, however, such an…

Computation and Language · Computer Science 2023-05-15 Gal Yona, Or Honovich, Itay Laish, Roee Aharoni

Speculative Contrastive Decoding

Large language models (LLMs) exhibit exceptional performance in language tasks, yet their auto-regressive inference is limited due to high computational requirements and is sub-optimal due to the exposure bias. Inspired by speculative…

Computation and Language · Computer Science 2024-03-14 Hongyi Yuan, Keming Lu, Fei Huang, Zheng Yuan, Chang Zhou

Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k$-sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation

In the study, we empirically compare the two recently proposed decoding methods, i.e. Contrastive Search (CS) and Contrastive Decoding (CD), for open-ended text generation. The automatic evaluation results suggest that, while CS performs…

Computation and Language · Computer Science 2022-11-22 Yixuan Su, Jialu Xu

Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling

Large language models (LLMs) are trained on huge amounts of textual data, and concerns have been raised that the limits of such data may soon be reached. A potential solution is to train on synthetic data sampled from LLMs. In this work, we…

Computation and Language · Computer Science 2025-10-10 Jannek Ulm, Kevin Du, Vésteinn Snæbjarnarson

Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or…

Computation and Language · Computer Science 2024-05-07 Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

Adaptive Contrastive Decoding in Retrieval-Augmented Generation for Handling Noisy Contexts

When using large language models (LLMs) in knowledge-intensive tasks, such as open-domain question answering, external context can bridge the gap between external knowledge and the LLMs' parametric knowledge. Recent research has been…

Computation and Language · Computer Science 2024-10-08 Youna Kim, Hyuhng Joon Kim, Cheonbok Park, Choonghyun Park, Hyunsoo Cho, Junyeob Kim, Kang Min Yoo, Sang-goo Lee, Taeuk Kim

Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation

We propose a straightforward approach called Distillation Contrastive Decoding (DCD) to enhance the reasoning capabilities of Large Language Models (LLMs) during inference. In contrast to previous approaches that relied on smaller amateur…

Computation and Language · Computer Science 2024-08-26 Phuc Phan, Hieu Tran, Long Phan

Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning

Recent work on test-time scaling for large language model (LLM) reasoning typically assumes that allocating more inference-time computation uniformly improves correctness. However, prior studies show that reasoning uncertainty is highly…

Computation and Language · Computer Science 2026-02-23 Lexiang Tang, Weihao Gao, Bingchen Zhao, Lu Ma, Qiao Jin, Bang Yang, Yuexian Zou

Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models

The generation of toxic content by large language models (LLMs) remains a critical challenge for the safe deployment of language technology. We propose a novel framework for implicit knowledge editing and controlled text generation by…

Computation and Language · Computer Science 2025-06-02 Tassilo Klein, Moin Nabi

A Contrastive Framework for Neural Text Generation

Text generation is of great importance to many natural language processing applications. However, maximization-based decoding methods (e.g. beam search) of neural language models often lead to degenerate solutions -- the generated text is…

Computation and Language · Computer Science 2022-09-27 Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier

Contrastive Decoding Improves Reasoning in Large Language Models

We demonstrate that Contrastive Decoding -- a simple, computationally light, and training-free text generation method proposed by Li et al. (2022) -- achieves large out-of-the-box improvements over greedy decoding on a variety of reasoning…

Computation and Language · Computer Science 2023-10-02 Sean O'Brien, Mike Lewis

Consultant Decoding: Yet Another Synergistic Mechanism

The synergistic mechanism based on Speculative Decoding (SD) has garnered considerable attention as a simple yet effective approach for accelerating the inference of large language models (LLMs). Nonetheless, the high rejection rates…

Computation and Language · Computer Science 2025-06-04 Chuanghao Ding, Jiaping Wang, Ziqing Yang, Xiaoliang Wang, Dahua Lin, Cam-Tu Nguyen, Fei Tan

How Contrastive Decoding Enhances Large Audio Language Models?

While Contrastive Decoding (CD) has proven effective at enhancing Large Audio Language Models (LALMs), the underlying mechanisms driving its success and the comparative efficacy of different strategies remain unclear. This study…

Sound · Computer Science 2026-03-11 Tzu-Quan Lin, Wei-Ping Huang, Yi-Cheng Lin, Hung-yi Lee

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Although multimodal large language models (MLLMs) exhibit remarkable reasoning capabilities on complex multimodal understanding tasks, they still suffer from the notorious hallucination issue: generating outputs misaligned with obvious…

Machine Learning · Computer Science 2025-11-04 Wei Chen, Xin Yan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Long Chen

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their success, LVLMs still…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning

Although Large Language Models (LLMs) excel in reasoning and generation for language tasks, they are not specifically designed for multimodal challenges. Training Multimodal Large Language Models (MLLMs), however, is resource-intensive and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yuqi Pang, Bowen Yang, Haoqin Tu, Yun Cao, Zeyu Zhang

Exploring and Mitigating Fawning Hallucinations in Large Language Models

Large language models (LLMs) have demonstrated exceptional proficiency in language understanding. However, when LLMs align their outputs with deceptive and/or misleading prompts, the generated responses could deviate from the de facto…

Computation and Language · Computer Science 2025-09-03 Zixuan Shangguan, Yanjie Dong, Lanjun Wang, Xiaoyi Fan, Victor C. M. Leung, Xiping Hu