Related papers: Improved Training Speed, Accuracy, and Data Utiliz…

Optimizing Loss Functions Through Multivariate Taylor Polynomial Parameterization

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. Loss functions are a type of metaknowledge that is crucial to effective training of DNNs, however, their…

Machine Learning · Computer Science 2020-10-05 Santiago Gonzalez , Risto Miikkulainen

Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a…

Neural and Evolutionary Computing · Computer Science 2024-03-05 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Effective Regularization Through Loss-Function Metalearning

Evolutionary computation can be used to optimize several different aspects of neural network architectures. For instance, the TaylorGLO method discovers novel, customized loss functions, resulting in improved performance, faster training,…

Machine Learning · Computer Science 2025-06-12 Santiago Gonzalez , Xin Qiu , Risto Miikkulainen

Meta-Learning via Learned Loss

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process,…

Machine Learning · Computer Science 2021-01-20 Sarah Bechtle , Artem Molchanov , Yevgen Chebotar , Edward Grefenstette , Ludovic Righetti , Gaurav Sukhatme , Franziska Meier

Meta-Learning Loss Functions for Deep Neural Networks

Humans can often quickly and efficiently solve complex new learning tasks given only a small set of examples. In contrast, modern artificially intelligent systems often require thousands or millions of observations in order to solve even…

Machine Learning · Computer Science 2025-05-08 Christian Raymond

Meta-Learning Adaptive Loss Functions

Loss function learning is a new meta-learning paradigm that aims to automate the essential task of designing a loss function for a machine learning model. Existing techniques for loss function learning have shown promising results, often…

Machine Learning · Computer Science 2025-10-14 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Next Generation Loss Function for Image Classification

Neural networks are trained by minimizing a loss function that defines the discrepancy between the predicted model output and the target value. The selection of the loss function is crucial to achieve task-specific behaviour and highly…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Shakhnaz Akhmedova , Nils Körber

Learning Symbolic Model-Agnostic Loss Functions via Meta-Learning

In this paper, we develop upon the emerging topic of loss function learning, which aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a new meta-learning…

Machine Learning · Computer Science 2024-07-02 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Evolving parametrized Loss for Image Classification Learning on Small Datasets

This paper proposes a meta-learning approach to evolving a parametrized loss function, which is called Meta-Loss Network (MLN), for training the image classification learning on small datasets. In our approach, the MLN is embedded in the…

Artificial Intelligence · Computer Science 2023-10-31 Zhaoyang Hai , Xiabi Liu

Generating meta-learning tasks to evolve parametric loss for classification learning

The field of meta-learning has seen a dramatic rise in interest in recent years. In existing meta-learning approaches, learning tasks for training meta-models are usually collected from public datasets, which brings the difficulty of…

Machine Learning · Computer Science 2021-11-23 Zhaoyang Hai , Xiabi Liu , Yuchen Ren , Nouman Q. Soomro

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to…

Machine Learning · Computer Science 2019-05-16 Chen Huang , Shuangfei Zhai , Walter Talbott , Miguel Angel Bautista , Shih-Yu Sun , Carlos Guestrin , Josh Susskind

Learning Effective Loss Functions Efficiently

We consider the problem of learning a loss function which, when minimized over a training dataset, yields a model that approximately minimizes a validation error metric. Though learning an optimal loss function is NP-hard, we present an…

Machine Learning · Computer Science 2019-07-02 Matthew Streeter

Optimizing Neural Networks through Activation Function Discovery and Automatic Weight Initialization

Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can…

Machine Learning · Computer Science 2023-04-10 Garrett Bingham

Random Linear Projections Loss for Hyperplane-Based Optimization in Neural Networks

Advancing loss function design is pivotal for optimizing neural network training and performance. This work introduces Random Linear Projections (RLP) loss, a novel approach that enhances training efficiency by leveraging geometric…

Machine Learning · Computer Science 2024-06-03 Shyam Venkatasubramanian , Ahmed Aloui , Vahid Tarokh

Evolutionary Optimization of Deep Learning Activation Functions

The choice of activation function can have a large effect on the performance of a neural network. While there have been some attempts to hand-engineer novel activation functions, the Rectified Linear Unit (ReLU) remains the most…

Machine Learning · Computer Science 2020-04-14 Garrett Bingham , William Macke , Risto Miikkulainen

Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning

Large Language Models (LLMs) have demonstrated impressive performance across various tasks. However, current training approaches combine standard cross-entropy loss with extensive data, human feedback, or ad hoc methods to enhance…

Computation and Language · Computer Science 2024-12-16 Daniele Rege Cambrin , Giuseppe Gallipoli , Irene Benedetto , Luca Cagliero , Paolo Garza

Celo: Training Versatile Learned Optimizers on a Compute Diet

Learned optimization has emerged as a promising alternative to hand-crafted optimizers, with the potential to discover stronger learned update rules that enable faster, hyperparameter-free training of neural networks. A critical element for…

Machine Learning · Computer Science 2025-06-23 Abhinav Moudgil , Boris Knyazev , Guillaume Lajoie , Eugene Belilovsky

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Data optimization for large batch distributed training of deep neural networks

Distributed training in deep learning (DL) is common practice as data and models grow. The current practice for distributed training of deep neural networks faces the challenges of communication bottlenecks when operating at scale, and…

Machine Learning · Computer Science 2020-12-21 Shubhankar Gahlot , Junqi Yin , Mallikarjun Shankar

AutoLoss: Learning Discrete Schedules for Alternate Optimization

Many machine learning problems involve iteratively and alternately optimizing different task objectives with respect to different sets of parameters. Appropriately scheduling the optimization of a task objective or a set of parameters is…

Machine Learning · Computer Science 2018-10-08 Haowen Xu , Hao Zhang , Zhiting Hu , Xiaodan Liang , Ruslan Salakhutdinov , Eric Xing