Related papers: Making Scalable Meta Learning Practical

Deep Learning At Scale and At Ease

Recently, deep learning techniques have enjoyed success in various multimedia applications, such as image classification and multi-modal data analysis. Large deep learning models are developed for learning rich representations of complex…

Machine Learning · Computer Science 2016-03-28 Wei Wang , Gang Chen , Haibo Chen , Tien Tuan Anh Dinh , Jinyang Gao , Beng Chin Ooi , Kian-Lee Tan , Sheng Wang

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in…

Machine Learning · Computer Science 2016-10-31 Jack W Rae , Jonathan J Hunt , Tim Harley , Ivo Danihelka , Andrew Senior , Greg Wayne , Alex Graves , Timothy P Lillicrap

Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization

This paper proposes Meta-SAGE, a novel approach for improving the scalability of deep reinforcement learning models for combinatorial optimization (CO) tasks. Our method adapts pre-trained models to larger-scale problems in test time by…

Machine Learning · Computer Science 2023-06-08 Jiwoo Son , Minsu Kim , Hyeonah Kim , Jinkyoo Park

Meta Additive Model: Interpretable Sparse Learning With Auto Weighting

Sparse additive models have attracted much attention in high-dimensional data analysis due to their flexible representation and strong interpretability. However, most existing models are limited to single-level learning under the…

Machine Learning · Computer Science 2026-04-23 Xuelin Zhang , Xinyue Liu , Lingjuan Wu , Hong Chen

Tractable Uncertainty-Aware Meta-Learning

Meta-learning is a popular approach for learning new tasks with limited data by leveraging the commonalities among different tasks. However, meta-learned models can perform poorly when context data is too limited, or when data is drawn from…

Machine Learning · Computer Science 2026-04-10 Young-Jin Park , Cesar Almecija , Apoorva Sharma , Navid Azizan

SAMBA: A Trainable Segmentation Web-App with Smart Labelling

Segmentation is the assigning of a semantic class to every pixel in an image and is a prerequisite for various statistical analysis tasks in materials science, like phase quantification, physics simulations or morphological…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ronan Docherty , Isaac Squires , Antonis Vamvakeros , Samuel J. Cooper

Data Distribution as a Lever for Guiding Optimizers Toward Superior Generalization in LLMs

Can modifying the training data distribution guide optimizers toward solutions with improved generalization when training large language models (LLMs)? In this work, we theoretically analyze an in-context linear regression model with…

Machine Learning · Computer Science 2026-02-03 Tushaar Gangavarapu , Jiping Li , Christopher Vattheuer , Zhangyang Wang , Baharan Mirzasoleiman

LESA: Learnable LLM Layer Scaling-Up

Training Large Language Models (LLMs) from scratch requires immense computational resources, making it prohibitively expensive. Model scaling-up offers a promising solution by leveraging the parameters of smaller models to create larger…

Machine Learning · Computer Science 2025-02-20 Yifei Yang , Zouying Cao , Xinbei Ma , Yao Yao , Libo Qin , Zhi Chen , Hai Zhao

Transfer Learning for Algorithm Recommendation

Meta-Learning is a subarea of Machine Learning that aims to take advantage of prior knowledge to learn faster and with fewer data [1]. There are different scenarios where meta-learning can be applied, and one of the most common is algorithm…

Machine Learning · Computer Science 2019-10-17 Gean Trindade Pereira , Moisés dos Santos , Edesio Alcobaça , Rafael Mantovani , André Carvalho

Scalable Bayesian Meta-Learning through Generalized Implicit Gradients

Meta-learning owns unique effectiveness and swiftness in tackling emerging tasks with limited data. Its broad applicability is revealed by viewing it as a bi-level optimization problem. The resultant algorithmic viewpoint however, faces…

Machine Learning · Computer Science 2023-12-22 Yilang Zhang , Bingcong Li , Shijian Gao , Georgios B. Giannakis

SAMOSA: Sharpness Aware Minimization for Open Set Active learning

Modern machine learning solutions require extensive data collection where labeling remains costly. To reduce this burden, open set active learning approaches aim to select informative samples from a large pool of unlabeled data that…

Machine Learning · Computer Science 2025-10-27 Young In Kim , Andrea Agiollo , Rajiv Khanna

Learning where to learn: Gradient sparsity in meta and continual learning

Finding neural network weights that generalize well from small datasets is difficult. A promising approach is to learn a weight initialization such that a small number of weight changes results in low generalization error. We show that this…

Machine Learning · Computer Science 2021-10-28 Johannes von Oswald , Dominic Zhao , Seijin Kobayashi , Simon Schug , Massimo Caccia , Nicolas Zucchet , João Sacramento

Learning with Limited Samples -- Meta-Learning and Applications to Communication Systems

Deep learning has achieved remarkable success in many machine learning tasks such as image classification, speech recognition, and game playing. However, these breakthroughs are often difficult to translate into real-world engineering…

Machine Learning · Computer Science 2022-10-07 Lisha Chen , Sharu Theresa Jose , Ivana Nikoloska , Sangwoo Park , Tianyi Chen , Osvaldo Simeone

Provable Meta-Learning of Linear Representations

Meta-learning, or learning-to-learn, seeks to design algorithms that can utilize previous experience to rapidly learn new skills or adapt to new environments. Representation learning -- a key tool for performing meta-learning -- learns a…

Machine Learning · Computer Science 2022-01-04 Nilesh Tripuraneni , Chi Jin , Michael I. Jordan

A Meta Reinforcement Learning-based Approach for Self-Adaptive System

A self-learning adaptive system (SLAS) uses machine learning to enable and enhance its adaptability. Such systems are expected to perform well in dynamic situations. For learning high-performance adaptation policy, some assumptions must be…

Software Engineering · Computer Science 2021-05-12 Mingyue Zhang , Jialong Li , Haiyan Zhao , Kenji Tei , Shinichi Honiden , Zhi Jin

On sensitivity of meta-learning to support data

Meta-learning algorithms are widely used for few-shot learning. For example, image recognition systems that readily adapt to unseen classes after seeing only a few labeled examples. Despite their success, we show that modern meta-learning…

Machine Learning · Computer Science 2021-10-28 Mayank Agarwal , Mikhail Yurochkin , Yuekai Sun

Self-Attention Meta-Learner for Continual Learning

Continual learning aims to provide intelligent agents capable of learning multiple tasks sequentially with neural networks. One of its main challenging, catastrophic forgetting, is caused by the neural networks non-optimal ability to learn…

Machine Learning · Computer Science 2021-01-29 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Distributed Learning in Heterogeneous Environment: federated learning with adaptive aggregation and computation reduction

Although federated learning has achieved many breakthroughs recently, the heterogeneous nature of the learning environment greatly limits its performance and hinders its real-world applications. The heterogeneous data, time-varying wireless…

Machine Learning · Computer Science 2023-02-22 Jingxin Li , Toktam Mahmoodi , Hak-Keung Lam

Meta-Learning with Implicit Gradients

A core capability of intelligent systems is the ability to quickly learn new tasks by drawing on prior experience. Gradient (or optimization) based meta-learning has recently emerged as an effective approach for few-shot learning. In this…

Machine Learning · Computer Science 2019-09-11 Aravind Rajeswaran , Chelsea Finn , Sham Kakade , Sergey Levine

mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient…

Machine Learning · Statistics 2023-10-03 Kayhan Behdin , Qingquan Song , Aman Gupta , Sathiya Keerthi , Ayan Acharya , Borja Ocejo , Gregory Dexter , Rajiv Khanna , David Durfee , Rahul Mazumder