Related papers: Contextual Temperature for Language Modeling

On the Role of Temperature Sampling in Test-Time Scaling

Large language models (LLMs) can improve reasoning at inference time through test-time scaling (TTS), where multiple reasoning traces are generated and the best one is selected. Prior work shows that increasing the number of samples K…

Artificial Intelligence · Computer Science 2025-10-06 Yuheng Wu , Azalia Mirhoseini , Thierry Tambe

Optimizing Temperature for Language Models with Multi-Sample Inference

Multi-sample aggregation strategies, such as majority voting and best-of-N sampling, are widely used in contemporary large language models (LLMs) to enhance predictive accuracy across various tasks. A key challenge in this process is…

Machine Learning · Computer Science 2025-06-17 Weihua Du , Yiming Yang , Sean Welleck

The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling

Temperature scaling is a simple method that allows to control the uncertainty of probabilistic models. It is mostly used in two contexts: improving the calibration of classifiers and tuning the stochasticity of large language models (LLMs).…

Machine Learning · Statistics 2026-05-28 Pierre-Alexandre Mattei , Bruno Loureiro

Fine-tune your Classifier: Finding Correlations With Temperature

Temperature is a widely used hyperparameter in various tasks involving neural networks, such as classification or metric learning, whose choice can have a direct impact on the model performance. Most of existing works select its value using…

Machine Learning · Computer Science 2022-10-19 Benjamin Chamand , Olivier Risser-Maroix , Camille Kurtz , Philippe Joly , Nicolas Loménie

Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"?

A wide body of evidence shows that human language processing difficulty is predicted by the information-theoretic measure surprisal, a word's negative log probability in context. However, it is still unclear how to best estimate these…

Computation and Language · Computer Science 2024-07-04 Tong Liu , Iza Škrjanec , Vera Demberg

Temperature as a Meta-Policy: Adaptive Temperature in LLM Reinforcement Learning

Temperature is a crucial hyperparameter in large language models (LLMs), controlling the trade-off between exploration and exploitation during text generation. High temperatures encourage diverse but noisy outputs, while low temperatures…

Machine Learning · Computer Science 2026-02-13 Haoran Dang , Cuiling Lan , Hai Wan , Xibin Zhao , Yan Lu

Adaptive Decoding via Latent Preference Optimization

During language model decoding, it is known that using higher temperature sampling gives more creative responses, while lower temperatures are more factually accurate. However, such models are commonly applied to general instruction…

Computation and Language · Computer Science 2024-11-15 Shehzaad Dhuliawala , Ilia Kulikov , Ping Yu , Asli Celikyilmaz , Jason Weston , Sainbayar Sukhbaatar , Jack Lanchantin

Estimating Text Temperature with Language Models

Autoregressive language models typically use temperature parameter at inference to shape the probability distribution and control the randomness of the text generated. After the text was generated, this parameter can be estimated using…

Computation and Language · Computer Science 2026-03-17 Nikolay Mikhaylovskiy

Improving Semantic Uncertainty Quantification in Language Model Question-Answering via Token-Level Temperature Scaling

Calibration is central to reliable semantic uncertainty quantification, yet prior work has largely focused on discrimination, neglecting calibration. As calibration and discrimination capture distinct aspects of uncertainty, focusing on…

Machine Learning · Computer Science 2026-04-09 Tom A. Lamb , Desi R. Ivanova , Philip H. S. Torr , Tim G. J. Rudner

Control the Temperature: Selective Sampling for Diverse and High-Quality LLM Outputs

Diversity is an essential metric for evaluating the creativity of outputs generated by language models. Temperature-based sampling is a common strategy to increase diversity. However, for tasks that require high precision, e.g.,…

Machine Learning · Computer Science 2025-10-03 Sergey Troshin , Wafaa Mohammed , Yan Meng , Christof Monz , Antske Fokkens , Vlad Niculae

Contextual Information and Specific Language Models for Spoken Language Understanding

In this paper we explain how contextual expectations are generated and used in the task-oriented spoken language understanding system Dialogos. The hard task of recognizing spontaneous speech on the telephone may greatly benefit from the…

cmp-lg · Computer Science 2007-05-23 Paolo Baggia , Morena Danieli , Elisabetta Gerbino , Loreta M. Moisa , Cosmin Popovici

Improving Diversity in Language Models: When Temperature Fails, Change the Loss

Increasing diversity in language models is a challenging yet essential objective. A common approach is to raise the decoding temperature. In this work, we investigate this approach through a simplistic yet common case to provide insights…

Computation and Language · Computer Science 2025-08-14 Alexandre Verine , Florian Le Bronnec , Kunhao Zheng , Alexandre Allauzen , Yann Chevaleyre , Benjamin Negrevergne

$T^2$ of Thoughts: Temperature Tree Elicits Reasoning in Large Language Models

Large Language Models (LLMs) have emerged as powerful tools in artificial intelligence, especially in complex decision-making scenarios, but their static problem-solving strategies often limit their adaptability to dynamic environments. We…

Computation and Language · Computer Science 2025-02-18 Chengkun Cai , Xu Zhao , Yucheng Du , Haoliang Liu , Lei Li

Learning by Distilling Context

Language models significantly benefit from context tokens, such as prompts or scratchpads. They perform better when prompted with informative instructions, and they acquire new reasoning capabilities by generating a scratch-pad before…

Computation and Language · Computer Science 2022-10-03 Charlie Snell , Dan Klein , Ruiqi Zhong

Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation

Speculative decoding stands as a pivotal technique to expedite inference in autoregressive (large) language models. This method employs a smaller draft model to speculate a block of tokens, which the target model then evaluates for…

Computation and Language · Computer Science 2024-10-15 Siru Ouyang , Shuohang Wang , Minhao Jiang , Ming Zhong , Donghan Yu , Jiawei Han , Yelong Shen

Thermometer: Towards Universal Calibration for Large Language Models

We consider the issue of calibration in large language models (LLM). Recent studies have found that common interventions such as instruction tuning often result in poorly calibrated LLMs. Although calibration is well-explored in traditional…

Machine Learning · Computer Science 2024-06-28 Maohao Shen , Subhro Das , Kristjan Greenewald , Prasanna Sattigeri , Gregory Wornell , Soumya Ghosh

Understanding temperature tuning in energy-based models

Generative models of complex systems often require post-hoc parameter adjustments to produce useful outputs. For example, energy-based models for protein design are sampled at an artificially low ''temperature'' to generate novel,…

Quantitative Methods · Quantitative Biology 2025-12-11 Peter W Fields , Vudtiwat Ngampruetikorn , David J Schwab , Stephanie E Palmer

Learning Contextually-Adaptive Rewards via Calibrated Features

A key challenge in reward learning from human input is that desired agent behavior often changes based on context. For example, a robot must adapt to avoid a stove once it becomes hot. We observe that while high-level preferences (e.g.,…

Robotics · Computer Science 2026-01-14 Alexandra Forsey-Smerek , Julie Shah , Andreea Bobu

Modeling Coherence for Discourse Neural Machine Translation

Discourse coherence plays an important role in the translation of one text. However, the previous reported models most focus on improving performance over individual sentence while ignoring cross-sentence links and dependencies, which…

Computation and Language · Computer Science 2018-11-15 Hao Xiong , Zhongjun He , Hua Wu , Haifeng Wang

Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation

Transformer models using segment-based processing have been an effective architecture for simultaneous speech translation. However, such models create a context mismatch between training and inference environments, hindering potential…

Computation and Language · Computer Science 2023-07-06 Matthew Raffel , Drew Penney , Lizhong Chen