English
Related papers

Related papers: Improved Training Speed, Accuracy, and Data Utiliz…

200 papers

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. Loss functions are a type of metaknowledge that is crucial to effective training of DNNs, however, their…

Machine Learning · Computer Science 2020-10-05 Santiago Gonzalez , Risto Miikkulainen

In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a…

Neural and Evolutionary Computing · Computer Science 2024-03-05 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Evolutionary computation can be used to optimize several different aspects of neural network architectures. For instance, the TaylorGLO method discovers novel, customized loss functions, resulting in improved performance, faster training,…

Machine Learning · Computer Science 2025-06-12 Santiago Gonzalez , Xin Qiu , Risto Miikkulainen

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process,…

Humans can often quickly and efficiently solve complex new learning tasks given only a small set of examples. In contrast, modern artificially intelligent systems often require thousands or millions of observations in order to solve even…

Machine Learning · Computer Science 2025-05-08 Christian Raymond

Loss function learning is a new meta-learning paradigm that aims to automate the essential task of designing a loss function for a machine learning model. Existing techniques for loss function learning have shown promising results, often…

Machine Learning · Computer Science 2025-10-14 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

Neural networks are trained by minimizing a loss function that defines the discrepancy between the predicted model output and the target value. The selection of the loss function is crucial to achieve task-specific behaviour and highly…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Shakhnaz Akhmedova , Nils Körber

In this paper, we develop upon the emerging topic of loss function learning, which aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a new meta-learning…

Machine Learning · Computer Science 2024-07-02 Christian Raymond , Qi Chen , Bing Xue , Mengjie Zhang

This paper proposes a meta-learning approach to evolving a parametrized loss function, which is called Meta-Loss Network (MLN), for training the image classification learning on small datasets. In our approach, the MLN is embedded in the…

Artificial Intelligence · Computer Science 2023-10-31 Zhaoyang Hai , Xiabi Liu

The field of meta-learning has seen a dramatic rise in interest in recent years. In existing meta-learning approaches, learning tasks for training meta-models are usually collected from public datasets, which brings the difficulty of…

Machine Learning · Computer Science 2021-11-23 Zhaoyang Hai , Xiabi Liu , Yuchen Ren , Nouman Q. Soomro

In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to…

Machine Learning · Computer Science 2019-05-16 Chen Huang , Shuangfei Zhai , Walter Talbott , Miguel Angel Bautista , Shih-Yu Sun , Carlos Guestrin , Josh Susskind

We consider the problem of learning a loss function which, when minimized over a training dataset, yields a model that approximately minimizes a validation error metric. Though learning an optimal loss function is NP-hard, we present an…

Machine Learning · Computer Science 2019-07-02 Matthew Streeter

Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can…

Machine Learning · Computer Science 2023-04-10 Garrett Bingham

Advancing loss function design is pivotal for optimizing neural network training and performance. This work introduces Random Linear Projections (RLP) loss, a novel approach that enhances training efficiency by leveraging geometric…

Machine Learning · Computer Science 2024-06-03 Shyam Venkatasubramanian , Ahmed Aloui , Vahid Tarokh

The choice of activation function can have a large effect on the performance of a neural network. While there have been some attempts to hand-engineer novel activation functions, the Rectified Linear Unit (ReLU) remains the most…

Machine Learning · Computer Science 2020-04-14 Garrett Bingham , William Macke , Risto Miikkulainen

Large Language Models (LLMs) have demonstrated impressive performance across various tasks. However, current training approaches combine standard cross-entropy loss with extensive data, human feedback, or ad hoc methods to enhance…

Computation and Language · Computer Science 2024-12-16 Daniele Rege Cambrin , Giuseppe Gallipoli , Irene Benedetto , Luca Cagliero , Paolo Garza

Learned optimization has emerged as a promising alternative to hand-crafted optimizers, with the potential to discover stronger learned update rules that enable faster, hyperparameter-free training of neural networks. A critical element for…

Machine Learning · Computer Science 2025-06-23 Abhinav Moudgil , Boris Knyazev , Guillaume Lajoie , Eugene Belilovsky

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Distributed training in deep learning (DL) is common practice as data and models grow. The current practice for distributed training of deep neural networks faces the challenges of communication bottlenecks when operating at scale, and…

Machine Learning · Computer Science 2020-12-21 Shubhankar Gahlot , Junqi Yin , Mallikarjun Shankar

Many machine learning problems involve iteratively and alternately optimizing different task objectives with respect to different sets of parameters. Appropriately scheduling the optimization of a task objective or a set of parameters is…

Machine Learning · Computer Science 2018-10-08 Haowen Xu , Hao Zhang , Zhiting Hu , Xiaodan Liang , Ruslan Salakhutdinov , Eric Xing
‹ Prev 1 2 3 10 Next ›