Related papers: Regularized Evolutionary Population-Based Training

Population Based Training of Neural Networks

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In…

Machine Learning · Computer Science 2017-11-29 Max Jaderberg , Valentin Dalibard , Simon Osindero , Wojciech M. Czarnecki , Jeff Donahue , Ali Razavi , Oriol Vinyals , Tim Green , Iain Dunning , Karen Simonyan , Chrisantha Fernando , Koray Kavukcuoglu

Optimizing Loss Functions Through Multivariate Taylor Polynomial Parameterization

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. Loss functions are a type of metaknowledge that is crucial to effective training of DNNs, however, their…

Machine Learning · Computer Science 2020-10-05 Santiago Gonzalez , Risto Miikkulainen

Regularization Learning Networks: Deep Learning for Tabular Datasets

Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight…

Machine Learning · Statistics 2018-10-25 Ira Shavitt , Eran Segal

Training Dynamical Binary Neural Networks with Equilibrium Propagation

Equilibrium Propagation (EP) is an algorithm intrinsically adapted to the training of physical networks, thanks to the local updates of weights given by the internal dynamics of the system. However, the construction of such a hardware…

Neural and Evolutionary Computing · Computer Science 2021-04-20 Jérémie Laydevant , Maxence Ernoult , Damien Querlioz , Julie Grollier

Population-Based Evolution Optimizes a Meta-Learning Objective

Meta-learning models, or models that learn to learn, have been a long-desired target for their ability to quickly solve new tasks. Traditional meta-learning methods can require expensive inner and outer loops, thus there is demand for…

Neural and Evolutionary Computing · Computer Science 2021-03-12 Kevin Frans , Olaf Witkowski

Effective Regularization Through Loss-Function Metalearning

Evolutionary computation can be used to optimize several different aspects of neural network architectures. For instance, the TaylorGLO method discovers novel, customized loss functions, resulting in improved performance, faster training,…

Machine Learning · Computer Science 2025-06-12 Santiago Gonzalez , Xin Qiu , Risto Miikkulainen

Learning Sparse & Ternary Neural Networks with Entropy-Constrained Trained Ternarization (EC2T)

Deep neural networks (DNN) have shown remarkable success in a variety of machine learning applications. The capacity of these models (i.e., number of parameters), endows them with expressive power and allows them to reach the desired…

Machine Learning · Computer Science 2022-04-12 Arturo Marban , Daniel Becking , Simon Wiedemann , Wojciech Samek

Learning Regularization Parameters of Inverse Problems via Deep Neural Networks

In this work, we describe a new approach that uses deep neural networks (DNN) to obtain regularization parameters for solving inverse problems. We consider a supervised learning approach, where a network is trained to approximate the…

Numerical Analysis · Mathematics 2021-04-15 Babak Maboudi Afkham , Julianne Chung , Matthias Chung

Reimplementation of Learning to Reweight Examples for Robust Deep Learning

Deep neural networks (DNNs) have been used to create models for many complex analysis problems like image recognition and medical diagnosis. DNNs are a popular tool within machine learning due to their ability to model complex patterns and…

Machine Learning · Computer Science 2024-05-14 Parth Patil , Ben Boardley , Jack Gardner , Emily Loiselle , Deerajkumar Parthipan

Towards Making Deep Transfer Learning Never Hurt

Transfer learning have been frequently used to improve deep neural network training through incorporating weights of pre-trained networks as the starting-point of optimization for regularization. While deep transfer learning can usually…

Machine Learning · Computer Science 2019-11-19 Ruosi Wan , Haoyi Xiong , Xingjian Li , Zhanxing Zhu , Jun Huan

Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning

Hyperparameter optimization plays a key role in the machine learning domain. Its significance is especially pronounced in reinforcement learning (RL), where agents continuously interact with and adapt to their environments, requiring…

Machine Learning · Computer Science 2024-04-24 Hui Bai , Ran Cheng

Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting

Current deep neural networks (DNNs) can easily overfit to biased training data with corrupted labels or class imbalance. Sample re-weighting strategy is commonly used to alleviate this issue by designing a weighting function mapping from…

Machine Learning · Computer Science 2019-09-30 Jun Shu , Qi Xie , Lixuan Yi , Qian Zhao , Sanping Zhou , Zongben Xu , Deyu Meng

Meta-Learning with Network Pruning

Meta-learning is a powerful paradigm for few-shot learning. Although with remarkable success witnessed in many applications, the existing optimization based meta-learning models with over-parameterized neural networks have been evidenced to…

Machine Learning · Computer Science 2020-07-23 Hongduan Tian , Bo Liu , Xiao-Tong Yuan , Qingshan Liu

Function Norms and Regularization in Deep Networks

Deep neural networks (DNNs) have become increasingly important due to their excellent empirical performance on a wide range of problems. However, regularization is generally achieved by indirect means, largely due to the complex set of…

Machine Learning · Computer Science 2018-07-02 Amal Rannen Triki , Maxim Berman , Matthew B. Blaschko

Neural Network Training Techniques Regularize Optimization Trajectory: An Empirical Study

Modern deep neural network (DNN) trainings utilize various training techniques, e.g., nonlinear activation functions, batch normalization, skip-connections, etc. Despite their effectiveness, it is still mysterious how they help accelerate…

Machine Learning · Computer Science 2024-03-05 Cheng Chen , Junjie Yang , Yi Zhou

Iterated Population Based Training with Task-Agnostic Restarts

Hyperparameter Optimization (HPO) can lift the burden of tuning hyperparameters (HPs) of neural networks. HPO algorithms from the Population Based Training (PBT) family are efficient thanks to dynamically adjusting HPs every few steps of…

Machine Learning · Computer Science 2025-11-13 Alexander Chebykin , Tanja Alderliesten , Peter A. N. Bosman

Dynamic Continual Learning: Harnessing Parameter Uncertainty for Improved Network Adaptation

When fine-tuning Deep Neural Networks (DNNs) to new data, DNNs are prone to overwriting network parameters required for task-specific functionality on previously learned tasks, resulting in a loss of performance on those tasks. We propose…

Machine Learning · Computer Science 2025-01-22 Christopher Angelini , Nidhal Bouaynaya

SoftTarget Regularization: An Effective Technique to Reduce Over-Fitting in Neural Networks

Deep neural networks are learning models with a very high capacity and therefore prone to over-fitting. Many regularization techniques such as Dropout, DropConnect, and weight decay all attempt to solve the problem of over-fitting by…

Machine Learning · Computer Science 2016-12-06 Armen Aghajanyan

Efficient Continual Learning in Neural Networks with Embedding Regularization

Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered…

Machine Learning · Computer Science 2020-06-25 Jary Pomponi , Simone Scardapane , Vincenzo Lomonaco , Aurelio Uncini

On Optimizing Deep Convolutional Neural Networks by Evolutionary Computing

Optimization for deep networks is currently a very active area of research. As neural networks become deeper, the ability in manually optimizing the network becomes harder. Mini-batch normalization, identification of effective respective…

Neural and Evolutionary Computing · Computer Science 2018-08-07 M. U. B. Dias , D. D. N. De Silva , S. Fernando