Related papers: Distributed Evolution Strategies Using TPUs for Me…

Meta-Learning with Network Pruning

Meta-learning is a powerful paradigm for few-shot learning. Although with remarkable success witnessed in many applications, the existing optimization based meta-learning models with over-parameterized neural networks have been evidenced to…

Machine Learning · Computer Science 2020-07-23 Hongduan Tian , Bo Liu , Xiao-Tong Yuan , Qingshan Liu

Meta Reinforcement Learning with Distribution of Exploration Parameters Learned by Evolution Strategies

In this paper, we propose a novel meta-learning method in a reinforcement learning setting, based on evolution strategies (ES), exploration in parameter space and deterministic policy gradients. ES methods are easy to parallelize, which is…

Machine Learning · Computer Science 2019-05-09 Yiming Shen , Kehan Yang , Yufeng Yuan , Simon Cheng Liu

Multiobjective Evolutionary Pruning of Deep Neural Networks with Transfer Learning for improving their Performance and Robustness

Evolutionary Computation algorithms have been used to solve optimization problems in relation with architectural, hyper-parameter or training configuration, forging the field known today as Neural Architecture Search. These algorithms have…

Neural and Evolutionary Computing · Computer Science 2024-02-06 Javier Poyatos , Daniel Molina , Aitor Martínez , Javier Del Ser , Francisco Herrera

ProtoDA: Efficient Transfer Learning for Few-Shot Intent Classification

Practical sequence classification tasks in natural language processing often suffer from low training data availability for target classes. Recent works towards mitigating this problem have focused on transfer learning using embeddings…

Computation and Language · Computer Science 2021-01-29 Manoj Kumar , Varun Kumar , Hadrien Glaude , Cyprien delichy , Aman Alok , Rahul Gupta

Tensor-Based Backpropagation in Neural Networks with Non-Sequential Input

Neural networks have been able to achieve groundbreaking accuracy at tasks conventionally considered only doable by humans. Using stochastic gradient descent, optimization in many dimensions is made possible, albeit at a relatively high…

Machine Learning · Computer Science 2017-07-17 Hirsh R. Agarwal , Andrew Huang

Efficient Training Convolutional Neural Networks on Edge Devices with Gradient-pruned Sign-symmetric Feedback Alignment

With the prosperity of mobile devices, the distributed learning approach enabling model training with decentralized data has attracted wide research. However, the lack of training capability for edge devices significantly limits the energy…

Machine Learning · Computer Science 2021-05-14 Ziyang Hong , C. Patrick Yue

EvoPruneDeepTL: An Evolutionary Pruning Model for Transfer Learning based Deep Neural Networks

In recent years, Deep Learning models have shown a great performance in complex optimization problems. They generally require large training datasets, which is a limitation in most practical cases. Transfer learning allows importing the…

Neural and Evolutionary Computing · Computer Science 2024-02-06 Javier Poyatos , Daniel Molina , Aritz. D. Martinez , Javier Del Ser , Francisco Herrera

Distribution of the search of evolutionary product unit neural networks for classification

This paper deals with the distributed processing in the search for an optimum classification model using evolutionary product unit neural networks. For this distributed search we used a cluster of computers. Our objective is to obtain a…

Neural and Evolutionary Computing · Computer Science 2012-05-16 A. J. Tallón-Ballesteros , P. A. Gutiérrez-Peña , C. Hervás-Martínez

Meta-Learning for Few-Shot Time Series Classification

Deep neural networks (DNNs) have achieved state-of-the-art results on time series classification (TSC) tasks. In this work, we focus on leveraging DNNs in the often-encountered practical scenario where access to labeled training data is…

Machine Learning · Computer Science 2021-03-05 Jyoti Narwariya , Pankaj Malhotra , Lovekesh Vig , Gautam Shroff , Vishnu Tv

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters

Distributed training techniques have been widely deployed in large-scale deep neural networks (DNNs) training on dense-GPU clusters. However, on public cloud clusters, due to the moderate inter-connection bandwidth between instances,…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-21 Shaohuai Shi , Xianhao Zhou , Shutao Song , Xingyao Wang , Zilin Zhu , Xue Huang , Xinan Jiang , Feihu Zhou , Zhenyu Guo , Liqiang Xie , Rui Lan , Xianbin Ouyang , Yan Zhang , Jieqian Wei , Jing Gong , Weiliang Lin , Ping Gao , Peng Meng , Xiaomin Xu , Chenyang Guo , Bo Yang , Zhibo Chen , Yongjian Wu , Xiaowen Chu

A Hybrid Method for Training Convolutional Neural Networks

Artificial Intelligence algorithms have been steadily increasing in popularity and usage. Deep Learning, allows neural networks to be trained using huge datasets and also removes the need for human extracted features, as it automates the…

Neural and Evolutionary Computing · Computer Science 2020-05-11 Vasco Lopes , Paulo Fazendeiro

Distributed Deep Learning using Stochastic Gradient Staleness

Despite the notable success of deep neural networks (DNNs) in solving complex tasks, the training process still remains considerable challenges. A primary obstacle is the substantial time required for training, particularly as high…

Machine Learning · Computer Science 2025-09-09 Viet Hoang Pham , Hyo-Sung Ahn

Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text Classification

Meta-learning has emerged as a trending technique to tackle few-shot text classification and achieved state-of-the-art performance. However, existing solutions heavily rely on the exploitation of lexical features and their distributional…

Computation and Language · Computer Science 2021-07-27 ChengCheng Han , Zeqiu Fan , Dongxiang Zhang , Minghui Qiu , Ming Gao , Aoying Zhou

Meta-training with Demonstration Retrieval for Efficient Few-shot Learning

Large language models show impressive results on few-shot NLP tasks. However, these models are memory and computation-intensive. Meta-training allows one to leverage smaller models for few-shot generalization in a domain-general and…

Computation and Language · Computer Science 2023-07-04 Aaron Mueller , Kanika Narang , Lambert Mathias , Qifan Wang , Hamed Firooz

A Distribution-Dependent Analysis of Meta-Learning

A key problem in the theory of meta-learning is to understand how the task distributions influence transfer risk, the expected error of a meta-learner on a new task drawn from the unknown task distribution. In this paper, focusing on fixed…

Machine Learning · Statistics 2021-06-15 Mikhail Konobeev , Ilja Kuzborskij , Csaba Szepesvári

Transfer Learning based Dynamic Multiobjective Optimization Algorithms

One of the major distinguishing features of the dynamic multiobjective optimization problems (DMOPs) is the optimization objectives will change over time, thus tracking the varying Pareto-optimal front becomes a challenge. One of the…

Neural and Evolutionary Computing · Computer Science 2017-11-21 Min Jiang , Zhongqiang Huang , Liming Qiu , Wenzhen Huang , Gary G. Yen

Arbitrary Order Meta-Learning with Simple Population-Based Evolution

Meta-learning, the notion of learning to learn, enables learning systems to quickly and flexibly solve new tasks. This usually involves defining a set of outer-loop meta-parameters that are then used to update a set of inner-loop…

Machine Learning · Computer Science 2023-03-17 Chris Lu , Sebastian Towers , Jakob Foerster

Regularized Evolutionary Population-Based Training

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. At the same time, network regularization has been recognized as a crucial dimension to effective training of…

Neural and Evolutionary Computing · Computer Science 2021-07-22 Jason Liang , Santiago Gonzalez , Hormoz Shahrzad , Risto Miikkulainen

Exascale Deep Learning for Scientific Inverse Problems

We introduce novel communication strategies in synchronous distributed Deep Learning consisting of decentralized gradient reduction orchestration and computational graph-aware grouping of gradient tensors. These new techniques produce an…

Machine Learning · Computer Science 2019-09-26 Nouamane Laanait , Joshua Romero , Junqi Yin , M. Todd Young , Sean Treichler , Vitalii Starchenko , Albina Borisevich , Alex Sergeev , Michael Matheson