English
Related papers

Related papers: Layer-Stack Temperature Scaling

200 papers

It is now well known that neural networks can be wrong with high confidence in their predictions, leading to poor calibration. The most common post-hoc approach to compensate for this is to perform temperature scaling, which adjusts the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Tom Joy , Francesco Pinto , Ser-Nam Lim , Philip H. S. Torr , Puneet K. Dokania

The prediction reliability of neural networks is important in many applications. Specifically, in safety-critical domains, such as cancer prediction or autonomous driving, a reliable confidence of model's prediction is critical for the…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Byeongmoon Ji , Hyemin Jung , Jihyeun Yoon , Kyungyul Kim , Younghak Shin

Large language models (LLMs) can improve reasoning at inference time through test-time scaling (TTS), where multiple reasoning traces are generated and the best one is selected. Prior work shows that increasing the number of samples K…

Artificial Intelligence · Computer Science 2025-10-06 Yuheng Wu , Azalia Mirhoseini , Thierry Tambe

Recently, Deep Neural Networks (DNNs) have been achieving impressive results on wide range of tasks. However, they suffer from being well-calibrated. In decision-making applications, such as autonomous driving or medical diagnosing, the…

Machine Learning · Computer Science 2019-05-10 Azadeh Sadat Mozafari , Hugo Siqueira Gomes , Wilson Leão , Steeven Janny , Christian Gagné

In this paper, we study the post-hoc calibration of modern neural networks, a problem that has drawn a lot of attention in recent years. Many calibration methods of varying complexity have been proposed for the task, but there is no…

Machine Learning · Computer Science 2022-08-02 Sergio A. Balanya , Juan Maroñas , Daniel Ramos

The great performances of deep learning are undeniable, with impressive results over a wide range of tasks. However, the output confidence of these models is usually not well-calibrated, which can be an issue for applications where…

Computer Vision and Pattern Recognition · Computer Science 2019-06-11 Azadeh Sadat Mozafari , Hugo Siqueira Gomes , Wilson Leão , Christian Gagné

We address the problem of uncertainty calibration and introduce a novel calibration method, Parametrized Temperature Scaling (PTS). Standard deep neural networks typically yield uncalibrated predictions, which can be transformed into…

Machine Learning · Computer Science 2022-09-20 Christian Tomani , Daniel Cremers , Florian Buettner

Temperature scaling is a simple method that allows to control the uncertainty of probabilistic models. It is mostly used in two contexts: improving the calibration of classifiers and tuning the stochasticity of large language models (LLMs).…

Machine Learning · Statistics 2026-05-28 Pierre-Alexandre Mattei , Bruno Loureiro

This paper explores Large Batch Training techniques using layer-wise adaptive scaling ratio (LARS) across diverse settings, uncovering insights. LARS algorithms with warm-up tend to be trapped in sharp minimizers early on due to redundant…

Machine Learning · Computer Science 2024-08-28 Khoi Do , Duong Nguyen , Hoa Nguyen , Long Tran-Thanh , Nguyen-Hoang Tran , Quoc-Viet Pham

Despite the impressive generalization capabilities of deep neural networks, they have been repeatedly shown to be overconfident when they are wrong. Fixing this issue is known as model calibration, and has consequently received much…

Machine Learning · Computer Science 2024-02-15 Muthu Chidambaram , Rong Ge

Research interests in the robustness of deep neural networks against domain shifts have been rapidly increasing in recent years. Most existing works, however, focus on improving the accuracy of the model, not the calibration performance…

Machine Learning · Computer Science 2024-02-26 Wonjeong Choi , Jungwuk Park , Dong-Jun Han , Younghyun Park , Jaekyun Moon

Temperature scaling is a popular technique for tuning the sharpness of a model distribution. It is used extensively for sampling likely generations and calibrating model uncertainty, and even features as a controllable parameter to many…

Machine Learning · Computer Science 2023-10-03 Andy Shih , Dorsa Sadigh , Stefano Ermon

Graph Neural Networks deliver strong classification results but often suffer from poor calibration performance, leading to overconfidence or underconfidence. This is particularly problematic in high stakes applications where accurate…

Machine Learning · Computer Science 2025-03-03 Dingyi Zhuang , Chonghe Jiang , Yunhan Zheng , Shenhao Wang , Jinhua Zhao

Training deep neural networks using a large batch size has shown promising results and benefits many real-world applications. However, the optimizer converges slowly at early epochs and there is a gap between large-batch deep learning…

Machine Learning · Computer Science 2020-02-06 Zhouyuan Huo , Bin Gu , Heng Huang

Neural network calibration is an essential task in deep learning to ensure consistency between the confidence of model prediction and the true correctness likelihood. In this paper, we propose a new post-processing calibration method called…

Machine Learning · Computer Science 2024-07-26 Yung-Chen Tang , Pin-Yu Chen , Tsung-Yi Ho

Continual Learning (CL) is recently gaining increasing attention for its ability to enable a single model to learn incrementally from a sequence of new classes. In this scenario, it is important to keep consistent predictive performance…

Machine Learning · Computer Science 2025-09-26 Giuseppe Serra , Florian Buettner

The effectiveness of large language models (LLMs) is not only measured by their ability to generate accurate outputs but also by their calibration-how well their confidence scores reflect the probability of their outputs being correct.…

Machine Learning · Computer Science 2024-10-01 Johnathan Xie , Annie S. Chen , Yoonho Lee , Eric Mitchell , Chelsea Finn

Confidence calibration -- the problem of predicting probability estimates representative of the true correctness likelihood -- is important for classification models in many applications. We discover that modern neural networks, unlike…

Machine Learning · Computer Science 2017-08-04 Chuan Guo , Geoff Pleiss , Yu Sun , Kilian Q. Weinberger

Although deep neural networks yield high classification accuracy given sufficient training data, their predictions are typically overconfident or under-confident, i.e., the prediction confidences cannot truly reflect the accuracy. Post-hoc…

Computer Vision and Pattern Recognition · Computer Science 2024-02-15 Jiexin Wang , Jiahao Chen , Bing Su

Modern computationally-intensive applications often operate under time constraints, necessitating acceleration methods and distribution of computational workloads across multiple entities. However, the outcome is either achieved within the…

Information Theory · Computer Science 2024-02-13 Homa Esfahanizadeh , Alejandro Cohen , Shlomo Shamai , Muriel Medard
‹ Prev 1 2 3 10 Next ›