Related papers: Optimizing Loss Functions Through Multivariate Tay…

Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization

As the complexity of neural network models has grown, it has become increasingly important to optimize their design automatically through metalearning. Methods for discovering hyperparameters, topologies, and learning rate schedules have…

Machine Learning · Computer Science 2020-04-28 Santiago Gonzalez , Risto Miikkulainen

Effective Regularization Through Loss-Function Metalearning

Evolutionary computation can be used to optimize several different aspects of neural network architectures. For instance, the TaylorGLO method discovers novel, customized loss functions, resulting in improved performance, faster training,…

Machine Learning · Computer Science 2025-06-12 Santiago Gonzalez , Xin Qiu , Risto Miikkulainen

Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a…

Neural and Evolutionary Computing · Computer Science 2024-03-05 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Regularized Evolutionary Population-Based Training

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. At the same time, network regularization has been recognized as a crucial dimension to effective training of…

Neural and Evolutionary Computing · Computer Science 2021-07-22 Jason Liang , Santiago Gonzalez , Hormoz Shahrzad , Risto Miikkulainen

PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

Cross-entropy loss and focal loss are the most common choices when training deep neural networks for classification problems. Generally speaking, however, a good loss function can take on much more flexible forms, and should be tailored for…

Computer Vision and Pattern Recognition · Computer Science 2022-05-12 Zhaoqi Leng , Mingxing Tan , Chenxi Liu , Ekin Dogus Cubuk , Xiaojie Shi , Shuyang Cheng , Dragomir Anguelov

Meta-Learning Loss Functions for Deep Neural Networks

Humans can often quickly and efficiently solve complex new learning tasks given only a small set of examples. In contrast, modern artificially intelligent systems often require thousands or millions of observations in order to solve even…

Machine Learning · Computer Science 2025-05-08 Christian Raymond

Meta-Learning via Learned Loss

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process,…

Machine Learning · Computer Science 2021-01-20 Sarah Bechtle , Artem Molchanov , Yevgen Chebotar , Edward Grefenstette , Ludovic Righetti , Gaurav Sukhatme , Franziska Meier

Generating meta-learning tasks to evolve parametric loss for classification learning

The field of meta-learning has seen a dramatic rise in interest in recent years. In existing meta-learning approaches, learning tasks for training meta-models are usually collected from public datasets, which brings the difficulty of…

Machine Learning · Computer Science 2021-11-23 Zhaoyang Hai , Xiabi Liu , Yuchen Ren , Nouman Q. Soomro

Task-based Loss Functions in Computer Vision: A Comprehensive Review

Loss functions are at the heart of deep learning, shaping how models learn and perform across diverse tasks. They are used to quantify the difference between predicted outputs and ground truth labels, guiding the optimization process to…

Machine Learning · Computer Science 2025-09-11 Omar Elharrouss , Yasir Mahmood , Yassine Bechqito , Mohamed Adel Serhani , Elarbi Badidi , Jamal Riffi , Hamid Tairi

Theoretical Characterization of the Generalization Performance of Overfitted Meta-Learning

Meta-learning has arisen as a successful method for improving training performance by training over many similar tasks, especially with deep neural networks (DNNs). However, the theoretical understanding of when and why overparameterized…

Machine Learning · Computer Science 2023-04-11 Peizhong Ju , Yingbin Liang , Ness B. Shroff

Meta Continual Learning

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio , Dong-Yeon Cho , Daejoong Kim , Jiwon Kim

Semi-Supervised Learning Enabled by Multiscale Deep Neural Network Inversion

Deep Neural Networks (DNNs) provide state-of-the-art solutions in several difficult machine perceptual tasks. However, their performance relies on the availability of a large set of labeled training data, which limits the breadth of their…

Machine Learning · Computer Science 2018-03-01 Randall Balestriero , Herve Glotin , Richard Baraniuk

Evolving parametrized Loss for Image Classification Learning on Small Datasets

This paper proposes a meta-learning approach to evolving a parametrized loss function, which is called Meta-Loss Network (MLN), for training the image classification learning on small datasets. In our approach, the MLN is embedded in the…

Artificial Intelligence · Computer Science 2023-10-31 Zhaoyang Hai , Xiabi Liu

Meta-Learning Adaptive Loss Functions

Loss function learning is a new meta-learning paradigm that aims to automate the essential task of designing a loss function for a machine learning model. Existing techniques for loss function learning have shown promising results, often…

Machine Learning · Computer Science 2025-10-14 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

On the relation between Loss Functions and T-Norms

Deep learning has been shown to achieve impressive results in several domains like computer vision and natural language processing. A key element of this success has been the development of new loss functions, like the popular cross-entropy…

Machine Learning · Computer Science 2019-07-19 Francesco Giannini , Giuseppe Marra , Michelangelo Diligenti , Marco Maggini , Marco Gori

Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting

Current deep neural networks (DNNs) can easily overfit to biased training data with corrupted labels or class imbalance. Sample re-weighting strategy is commonly used to alleviate this issue by designing a weighting function mapping from…

Machine Learning · Computer Science 2019-09-30 Jun Shu , Qi Xie , Lixuan Yi , Qian Zhao , Sanping Zhou , Zongben Xu , Deyu Meng

Training Deep Neural Networks via Direct Loss Minimization

Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization…

Machine Learning · Computer Science 2016-06-03 Yang Song , Alexander G. Schwing , Richard S. Zemel , Raquel Urtasun

Deep Divergence-Based Approach to Clustering

A promising direction in deep learning research consists in learning representations and simultaneously discovering cluster structure in unlabeled data by optimizing a discriminative loss function. As opposed to supervised deep learning,…

Machine Learning · Statistics 2019-02-14 Michael Kampffmeyer , Sigurd Løkse , Filippo M. Bianchi , Lorenzo Livi , Arnt-Børre Salberg , Robert Jenssen

Domain Generalization through Meta-Learning: A Survey

Deep neural networks (DNNs) have revolutionized artificial intelligence but often lack performance when faced with out-of-distribution (OOD) data, a common scenario due to the inevitable domain shifts in real-world applications. This…

Machine Learning · Computer Science 2024-08-23 Arsham Gholamzadeh Khoee , Yinan Yu , Robert Feldt

DL101 Neural Network Outputs and Loss Functions

The loss function used to train a neural network is strongly connected to its output layer from a statistical point of view. This technical report analyzes common activation functions for a neural network output layer, like linear, sigmoid,…

Machine Learning · Computer Science 2025-11-10 Fernando Berzal