Related papers: Learning Effective Loss Functions Efficiently

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to…

Machine Learning · Computer Science 2019-05-16 Chen Huang , Shuangfei Zhai , Walter Talbott , Miguel Angel Bautista , Shih-Yu Sun , Carlos Guestrin , Josh Susskind

Learning Gradient Descent: Better Generalization and Longer Horizons

Training deep neural networks is a highly nontrivial task, involving carefully selecting appropriate training algorithms, scheduling step sizes and tuning other hyperparameters. Trying different combinations can be quite labor-intensive and…

Machine Learning · Computer Science 2017-06-13 Kaifeng Lv , Shunhua Jiang , Jian Li

Meta-Learning via Learned Loss

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process,…

Machine Learning · Computer Science 2021-01-20 Sarah Bechtle , Artem Molchanov , Yevgen Chebotar , Edward Grefenstette , Ludovic Righetti , Gaurav Sukhatme , Franziska Meier

Meta-Learning Adaptive Loss Functions

Loss function learning is a new meta-learning paradigm that aims to automate the essential task of designing a loss function for a machine learning model. Existing techniques for loss function learning have shown promising results, often…

Machine Learning · Computer Science 2025-10-14 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Meta-Learning Loss Functions for Deep Neural Networks

Humans can often quickly and efficiently solve complex new learning tasks given only a small set of examples. In contrast, modern artificially intelligent systems often require thousands or millions of observations in order to solve even…

Machine Learning · Computer Science 2025-05-08 Christian Raymond

Training Deep Neural Networks via Direct Loss Minimization

Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization…

Machine Learning · Computer Science 2016-06-03 Yang Song , Alexander G. Schwing , Richard S. Zemel , Raquel Urtasun

Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization

As the complexity of neural network models has grown, it has become increasingly important to optimize their design automatically through metalearning. Methods for discovering hyperparameters, topologies, and learning rate schedules have…

Machine Learning · Computer Science 2020-04-28 Santiago Gonzalez , Risto Miikkulainen

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Optimal Learning

This paper studies the problem of learning an unknown function $f$ from given data about $f$. The learning problem is to give an approximation $\hat f$ to $f$ that predicts the values of $f$ away from the data. There are numerous settings…

Machine Learning · Computer Science 2023-06-27 Peter Binev , Andrea Bonito , Ronald DeVore , Guergana Petrova

Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a…

Neural and Evolutionary Computing · Computer Science 2024-03-05 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Sparse learning of maximum likelihood model for optimization of complex loss function

Traditional machine learning methods usually minimize a simple loss function to learn a predictive model, and then use a complex performance measure to measure the prediction performance. However, minimizing a simple loss function cannot…

Machine Learning · Computer Science 2015-11-19 Ning Zhang , Prathamesh Chandrasekar

Neuro-Optimization: Learning Objective Functions Using Neural Networks

Mathematical optimization is widely used in various research fields. With a carefully-designed objective function, mathematical optimization can be quite helpful in solving many problems. However, objective functions are usually…

Machine Learning · Computer Science 2019-05-27 Younghan Jeon , Minsik Lee , Jin Young Choi

Cost Function Estimation Using Inverse Reinforcement Learning with Minimal Observations

We present an iterative inverse reinforcement learning algorithm to infer optimal cost functions in continuous spaces. Based on a popular maximum entropy criteria, our approach iteratively finds a weight improvement step and proposes a…

Machine Learning · Computer Science 2025-05-14 Sarmad Mehrdad , Avadesh Meduri , Ludovic Righetti

Minimizing the Maximal Loss: How and Why?

A commonly used learning rule is to approximately minimize the \emph{average} loss over the training set. Other learning algorithms, such as AdaBoost and hard-SVM, aim at minimizing the \emph{maximal} loss over the training set. The average…

Machine Learning · Computer Science 2016-05-24 Shai Shalev-Shwartz , Yonatan Wexler

Efficient Learning of Model Weights via Changing Features During Training

In this paper, we propose a machine learning model, which dynamically changes the features during training. Our main motivation is to update the model in a small content during the training process with replacing less descriptive features…

Machine Learning · Computer Science 2020-02-24 Marcell Beregi-Kovács , Ágnes Baran , András Hajdu

Continual Learning by Asymmetric Loss Approximation with Single-Side Overestimation

Catastrophic forgetting is a critical challenge in training deep neural networks. Although continual learning has been investigated as a countermeasure to the problem, it often suffers from the requirements of additional network components…

Machine Learning · Computer Science 2019-10-23 Dongmin Park , Seokil Hong , Bohyung Han , Kyoung Mu Lee

Metric-Optimized Example Weights

Real-world machine learning applications often have complex test metrics, and may have training and test data that are not identically distributed. Motivated by known connections between complex test metrics and cost-weighted learning, we…

Machine Learning · Statistics 2019-06-18 Sen Zhao , Mahdi Milani Fard , Harikrishna Narasimhan , Maya Gupta

Sufficiently Accurate Model Learning for Planning

Data driven models of dynamical systems help planners and controllers to provide more precise and accurate motions. Most model learning algorithms will try to minimize a loss function between the observed data and the model's predictions.…

Artificial Intelligence · Computer Science 2021-02-12 Clark Zhang , Santiago Paternain , Alejandro Ribeiro

Designing a Robust, Bounded, and Smooth Loss Function for Improved Supervised Learning

The loss function is crucial to machine learning, especially in supervised learning frameworks. It is a fundamental component that controls the behavior and general efficacy of learning algorithms. However, despite their widespread use,…

Machine Learning · Computer Science 2026-02-09 Soumi Mahato , Lineesh M. C

Towards Optimal Learning of Language Models

This work studies the general principles of improving the learning of language models (LMs), which aims at reducing the necessary training steps for achieving superior performance. Specifically, we present a theory for the optimal learning…

Computation and Language · Computer Science 2024-03-05 Yuxian Gu , Li Dong , Yaru Hao , Qingxiu Dong , Minlie Huang , Furu Wei