Related papers: Layer-Stack Temperature Scaling

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

It is now well known that neural networks can be wrong with high confidence in their predictions, leading to poor calibration. The most common post-hoc approach to compensate for this is to perform temperature scaling, which adjusts the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Tom Joy, Francesco Pinto, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

Bin-wise Temperature Scaling (BTS): Improvement in Confidence Calibration Performance through Simple Scaling Techniques

The prediction reliability of neural networks is important in many applications. Specifically, in safety-critical domains, such as cancer prediction or autonomous driving, a reliable confidence estimate for the model's prediction is critical for the…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Byeongmoon Ji, Hyemin Jung, Jihyeun Yoon, Kyungyul Kim, Younghak Shin

On the Role of Temperature Sampling in Test-Time Scaling

Large language models (LLMs) can improve reasoning at inference time through test-time scaling (TTS), where multiple reasoning traces are generated and the best one is selected. Prior work shows that increasing the number of samples K…

Artificial Intelligence · Computer Science 2025-10-06 Yuheng Wu, Azalia Mirhoseini, Thierry Tambe

Attended Temperature Scaling: A Practical Approach for Calibrating Deep Neural Networks

Recently, Deep Neural Networks (DNNs) have been achieving impressive results on a wide range of tasks. However, they suffer from being poorly calibrated. In decision-making applications, such as autonomous driving or medical diagnosis, the…

Machine Learning · Computer Science 2019-05-10 Azadeh Sadat Mozafari, Hugo Siqueira Gomes, Wilson Leão, Steeven Janny, Christian Gagné

Adaptive Temperature Scaling for Robust Calibration of Deep Neural Networks

In this paper, we study the post-hoc calibration of modern neural networks, a problem that has drawn a lot of attention in recent years. Many calibration methods of varying complexity have been proposed for the task, but there is no…

Machine Learning · Computer Science 2022-08-02 Sergio A. Balanya, Juan Maroñas, Daniel Ramos

Unsupervised Temperature Scaling: An Unsupervised Post-Processing Calibration Method of Deep Networks

The strong performance of deep learning is undeniable, with impressive results over a wide range of tasks. However, the output confidence of these models is usually not well-calibrated, which can be an issue for applications where…

Computer Vision and Pattern Recognition · Computer Science 2019-06-11 Azadeh Sadat Mozafari, Hugo Siqueira Gomes, Wilson Leão, Christian Gagné

Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration

We address the problem of uncertainty calibration and introduce a novel calibration method, Parameterized Temperature Scaling (PTS). Standard deep neural networks typically yield uncalibrated predictions, which can be transformed into…

Machine Learning · Computer Science 2022-09-20 Christian Tomani, Daniel Cremers, Florian Buettner

The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling

Temperature scaling is a simple method for controlling the uncertainty of probabilistic models. It is mostly used in two contexts: improving the calibration of classifiers and tuning the stochasticity of large language models (LLMs).…

Machine Learning · Statistics 2026-05-28 Pierre-Alexandre Mattei, Bruno Loureiro

Revisiting LARS for Large Batch Training Generalization of Neural Networks

This paper explores Large Batch Training techniques using layer-wise adaptive scaling ratio (LARS) across diverse settings, uncovering insights. LARS algorithms with warm-up tend to be trapped in sharp minimizers early on due to redundant…

Machine Learning · Computer Science 2024-08-28 Khoi Do, Duong Nguyen, Hoa Nguyen, Long Tran-Thanh, Nguyen-Hoang Tran, Quoc-Viet Pham

On the Limitations of Temperature Scaling for Distributions with Overlaps

Despite the impressive generalization capabilities of deep neural networks, they have been repeatedly shown to be overconfident when they are wrong. Fixing this issue is known as model calibration, and has consequently received much…

Machine Learning · Computer Science 2024-02-15 Muthu Chidambaram, Rong Ge

Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration

Research interest in the robustness of deep neural networks against domain shifts has been rapidly increasing in recent years. Most existing works, however, focus on improving the accuracy of the model, not the calibration performance…

Machine Learning · Computer Science 2024-02-26 Wonjeong Choi, Jungwuk Park, Dong-Jun Han, Younghyun Park, Jaekyun Moon

Long Horizon Temperature Scaling

Temperature scaling is a popular technique for tuning the sharpness of a model distribution. It is used extensively for sampling likely generations and calibrating model uncertainty, and even features as a controllable parameter to many…

Machine Learning · Computer Science 2023-10-03 Andy Shih, Dorsa Sadigh, Stefano Ermon

GETS: Ensemble Temperature Scaling for Calibration in Graph Neural Networks

Graph Neural Networks deliver strong classification results but often suffer from poor calibration performance, leading to overconfidence or underconfidence. This is particularly problematic in high stakes applications where accurate…

Machine Learning · Computer Science 2025-03-03 Dingyi Zhuang, Chonghe Jiang, Yunhan Zheng, Shenhao Wang, Jinhua Zhao

Large Batch Training Does Not Need Warmup

Training deep neural networks using a large batch size has shown promising results and benefits many real-world applications. However, the optimizer converges slowly at early epochs and there is a gap between large-batch deep learning…

Machine Learning · Computer Science 2020-02-06 Zhouyuan Huo, Bin Gu, Heng Huang

Neural Clamping: Joint Input Perturbation and Temperature Scaling for Neural Network Calibration

Neural network calibration is an essential task in deep learning to ensure consistency between the confidence of model prediction and the true correctness likelihood. In this paper, we propose a new post-processing calibration method called…

Machine Learning · Computer Science 2024-07-26 Yung-Chen Tang, Pin-Yu Chen, Tsung-Yi Ho

DATS: Distance-Aware Temperature Scaling for Calibrated Class-Incremental Learning

Continual Learning (CL) has recently been gaining attention for its ability to enable a single model to learn incrementally from a sequence of new classes. In this scenario, it is important to keep consistent predictive performance…

Machine Learning · Computer Science 2025-09-26 Giuseppe Serra, Florian Buettner

Calibrating Language Models with Adaptive Temperature Scaling

The effectiveness of large language models (LLMs) is not only measured by their ability to generate accurate outputs but also by their calibration: how well their confidence scores reflect the probability of their outputs being correct.…

Machine Learning · Computer Science 2024-10-01 Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn

On Calibration of Modern Neural Networks

Confidence calibration -- the problem of predicting probability estimates representative of the true correctness likelihood -- is important for classification models in many applications. We discover that modern neural networks, unlike…

Machine Learning · Computer Science 2017-08-04 Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger

Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration

Although deep neural networks yield high classification accuracy given sufficient training data, their predictions are typically overconfident or under-confident, i.e., the prediction confidences cannot truly reflect the accuracy. Post-hoc…

Computer Vision and Pattern Recognition · Computer Science 2024-02-15 Jiexin Wang, Jiahao Chen, Bing Su

Successive Refinement in Large-Scale Computation: Advancing Model Inference Applications

Modern computationally-intensive applications often operate under time constraints, necessitating acceleration methods and distribution of computational workloads across multiple entities. However, the outcome is either achieved within the…

Information Theory · Computer Science 2024-02-13 Homa Esfahanizadeh, Alejandro Cohen, Shlomo Shamai, Muriel Medard