Related papers: On Testing Equal Conditional Predictive Ability Un…

Using Proxies to Improve Forecast Evaluation

Comparative evaluation of forecasts of statistical functionals relies on comparing averaged losses of competing forecasts after the realization of the quantity $Y$, on which the functional is based, has been observed. Motivated by…

Methodology · Statistics 2022-11-28 Hajo Holzmann , Bernhard Klar

Testing Forecast Accuracy of Expectiles and Quantiles with the Extremal Consistent Loss Functions

Forecast evaluations aim to choose an accurate forecast for making decisions by using loss functions. However, different loss functions often generate different ranking results for forecasts, which complicates the task of comparisons. In…

Applications · Statistics 2018-07-17 Yu-Min Yen , Tso-Jung Yen

Risk Guarantees for End-to-End Prediction and Optimization Processes

Prediction models are often employed in estimating parameters of optimization models. Despite the fact that in an end-to-end view, the real goal is to achieve good optimization performance, the prediction performance is measured on its own.…

Optimization and Control · Mathematics 2021-01-01 Nam Ho-Nguyen , Fatma Kılınç-Karzan

Measures of predictive accuracy, miscalibration and discrimination

We study the evaluation of real-valued point predictors under the decision-theoretic framework of mean-consistent loss functions given by the Bregman divergences. We first derive a new version of Murphy's decomposition of the expected loss…

Methodology · Statistics 2026-05-14 Łukasz Delong , Mario Wüthrich

Expressive Losses for Verified Robustness via Convex Combinations

In order to train networks for verified adversarial robustness, it is common to over-approximate the worst-case loss over perturbation regions, resulting in networks that attain verifiability at the expense of standard performance. As shown…

Machine Learning · Computer Science 2024-03-19 Alessandro De Palma , Rudy Bunel , Krishnamurthy Dvijotham , M. Pawan Kumar , Robert Stanforth , Alessio Lomuscio

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Generalization Certificates for Adversarially Robust Bayesian Linear Regression

Adversarial robustness of machine learning models is critical to ensuring reliable performance under data perturbations. Recent progress has been on point estimators, and this paper considers distributional predictors. First, using the link…

Machine Learning · Computer Science 2025-02-21 Mahalakshmi Sabanayagam , Russell Tsuchida , Cheng Soon Ong , Debarghya Ghoshdastidar

Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss

A loss function measures the discrepancy between the true values and their estimated fits, for a given instance of data. In classification problems, a loss function is said to be proper if a minimizer of the expected loss is the true…

Information Theory · Computer Science 2020-01-03 Amichai Painsky , Gregory W. Wornell

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to…

Machine Learning · Computer Science 2019-05-16 Chen Huang , Shuangfei Zhai , Walter Talbott , Miguel Angel Bautista , Shih-Yu Sun , Carlos Guestrin , Josh Susskind

Order-Sensitivity and Equivariance of Scoring Functions

The relative performance of competing point forecasts is usually measured in terms of loss or scoring functions. It is widely accepted that these scoring function should be strictly consistent in the sense that the expected score is…

Statistics Theory · Mathematics 2019-04-08 Tobias Fissler , Johanna F. Ziegel

Binary Losses for Density Ratio Estimation

Estimating the ratio of two probability densities from a finite number of observations is a central machine learning problem. A common approach is to construct estimators using binary classifiers that distinguish observations from the two…

Machine Learning · Computer Science 2025-01-28 Werner Zellinger

The Prediction Advantage: A Universally Meaningful Performance Measure for Classification and Regression

We introduce the Prediction Advantage (PA), a novel performance measure for prediction functions under any loss function (e.g., classification or regression). The PA is defined as the performance advantage relative to the Bayesian risk…

Machine Learning · Computer Science 2017-05-30 Ran El-Yaniv , Yonatan Geifman , Yair Wiener

Controlled abstention neural networks for identifying skillful predictions for regression problems

The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to…

Machine Learning · Computer Science 2022-01-05 Elizabeth A. Barnes , Randal J. Barnes

Total Loss Functions for Measuring the Accuracy of Nonnegative Cross-Sectional Predictions

The total loss function associated with a set of cross-sectional predictions, that is, estimates or forecasts, summarizes the set's overall accuracy. Its arguments are the individual cross-sectional units' loss functions. Under general…

Methodology · Statistics 2025-07-22 Charles D. Coleman

Get Global Guarantees: On the Probabilistic Nature of Perturbation Robustness

In safety-critical deep learning applications, robustness measures the ability of neural models that handle imperceptible perturbations in input data, which may lead to potential safety hazards. Existing pre-deployment robustness assessment…

Machine Learning · Computer Science 2025-08-27 Wenchuan Mu , Kwan Hui Lim

A Curriculum View of Robust Loss Functions

Robust loss functions are designed to combat the adverse impacts of label noise, whose robustness is typically supported by theoretical bounds agnostic to the training dynamics. However, these bounds may fail to characterize the empirical…

Machine Learning · Computer Science 2023-05-04 Zebin Ou , Yue Zhang

Predictive Multiplicity in Probabilistic Classification

Machine learning models are often used to inform real world risk assessment tasks: predicting consumer default risk, predicting whether a person suffers from a serious illness, or predicting a person's risk to appear in court. Given…

Machine Learning · Computer Science 2023-06-27 Jamelle Watson-Daniels , David C. Parkes , Berk Ustun

Robust and Adaptive Functional Logistic Regression

We introduce and study a family of robust estimators for the functional logistic regression model whose robustness automatically adapts to the data thereby leading to estimators with high efficiency in clean data and a high degree of…

Methodology · Statistics 2023-05-03 Ioannis Kalogridis

Evaluating Adversarial Robustness with Expected Viable Performance

We introduce a metric for evaluating the robustness of a classifier, with particular attention to adversarial perturbations, in terms of expected functionality with respect to possible adversarial perturbations. A classifier is assumed to…

Machine Learning · Computer Science 2023-09-19 Ryan McCoppin , Colin Dawson , Sean M. Kennedy , Leslie M. Blaha

Aligning the Evaluation of Probabilistic Predictions with Downstream Value

Every prediction is ultimately used in a downstream task. Consequently, evaluating prediction quality is more meaningful when considered in the context of its downstream use. Metrics based solely on predictive performance often diverge from…

Machine Learning · Computer Science 2025-08-26 Novin Shahroudi , Viacheslav Komisarenko , Meelis Kull