English
Related papers

Related papers: Making Scalable Meta Learning Practical

200 papers

Recently, deep learning techniques have enjoyed success in various multimedia applications, such as image classification and multi-modal data analysis. Large deep learning models are developed for learning rich representations of complex…

Machine Learning · Computer Science 2016-03-28 Wei Wang , Gang Chen , Haibo Chen , Tien Tuan Anh Dinh , Jinyang Gao , Beng Chin Ooi , Kian-Lee Tan , Sheng Wang

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in…

Machine Learning · Computer Science 2016-10-31 Jack W Rae , Jonathan J Hunt , Tim Harley , Ivo Danihelka , Andrew Senior , Greg Wayne , Alex Graves , Timothy P Lillicrap

This paper proposes Meta-SAGE, a novel approach for improving the scalability of deep reinforcement learning models for combinatorial optimization (CO) tasks. Our method adapts pre-trained models to larger-scale problems in test time by…

Machine Learning · Computer Science 2023-06-08 Jiwoo Son , Minsu Kim , Hyeonah Kim , Jinkyoo Park

Sparse additive models have attracted much attention in high-dimensional data analysis due to their flexible representation and strong interpretability. However, most existing models are limited to single-level learning under the…

Machine Learning · Computer Science 2026-04-23 Xuelin Zhang , Xinyue Liu , Lingjuan Wu , Hong Chen

Meta-learning is a popular approach for learning new tasks with limited data by leveraging the commonalities among different tasks. However, meta-learned models can perform poorly when context data is too limited, or when data is drawn from…

Machine Learning · Computer Science 2026-04-10 Young-Jin Park , Cesar Almecija , Apoorva Sharma , Navid Azizan

Segmentation is the assigning of a semantic class to every pixel in an image and is a prerequisite for various statistical analysis tasks in materials science, like phase quantification, physics simulations or morphological…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ronan Docherty , Isaac Squires , Antonis Vamvakeros , Samuel J. Cooper

Can modifying the training data distribution guide optimizers toward solutions with improved generalization when training large language models (LLMs)? In this work, we theoretically analyze an in-context linear regression model with…

Machine Learning · Computer Science 2026-02-03 Tushaar Gangavarapu , Jiping Li , Christopher Vattheuer , Zhangyang Wang , Baharan Mirzasoleiman

Training Large Language Models (LLMs) from scratch requires immense computational resources, making it prohibitively expensive. Model scaling-up offers a promising solution by leveraging the parameters of smaller models to create larger…

Machine Learning · Computer Science 2025-02-20 Yifei Yang , Zouying Cao , Xinbei Ma , Yao Yao , Libo Qin , Zhi Chen , Hai Zhao

Meta-Learning is a subarea of Machine Learning that aims to take advantage of prior knowledge to learn faster and with fewer data [1]. There are different scenarios where meta-learning can be applied, and one of the most common is algorithm…

Machine Learning · Computer Science 2019-10-17 Gean Trindade Pereira , Moisés dos Santos , Edesio Alcobaça , Rafael Mantovani , André Carvalho

Meta-learning owns unique effectiveness and swiftness in tackling emerging tasks with limited data. Its broad applicability is revealed by viewing it as a bi-level optimization problem. The resultant algorithmic viewpoint however, faces…

Machine Learning · Computer Science 2023-12-22 Yilang Zhang , Bingcong Li , Shijian Gao , Georgios B. Giannakis

Modern machine learning solutions require extensive data collection where labeling remains costly. To reduce this burden, open set active learning approaches aim to select informative samples from a large pool of unlabeled data that…

Machine Learning · Computer Science 2025-10-27 Young In Kim , Andrea Agiollo , Rajiv Khanna

Finding neural network weights that generalize well from small datasets is difficult. A promising approach is to learn a weight initialization such that a small number of weight changes results in low generalization error. We show that this…

Deep learning has achieved remarkable success in many machine learning tasks such as image classification, speech recognition, and game playing. However, these breakthroughs are often difficult to translate into real-world engineering…

Machine Learning · Computer Science 2022-10-07 Lisha Chen , Sharu Theresa Jose , Ivana Nikoloska , Sangwoo Park , Tianyi Chen , Osvaldo Simeone

Meta-learning, or learning-to-learn, seeks to design algorithms that can utilize previous experience to rapidly learn new skills or adapt to new environments. Representation learning -- a key tool for performing meta-learning -- learns a…

Machine Learning · Computer Science 2022-01-04 Nilesh Tripuraneni , Chi Jin , Michael I. Jordan

A self-learning adaptive system (SLAS) uses machine learning to enable and enhance its adaptability. Such systems are expected to perform well in dynamic situations. For learning high-performance adaptation policy, some assumptions must be…

Software Engineering · Computer Science 2021-05-12 Mingyue Zhang , Jialong Li , Haiyan Zhao , Kenji Tei , Shinichi Honiden , Zhi Jin

Meta-learning algorithms are widely used for few-shot learning. For example, image recognition systems that readily adapt to unseen classes after seeing only a few labeled examples. Despite their success, we show that modern meta-learning…

Machine Learning · Computer Science 2021-10-28 Mayank Agarwal , Mikhail Yurochkin , Yuekai Sun

Continual learning aims to provide intelligent agents capable of learning multiple tasks sequentially with neural networks. One of its main challenging, catastrophic forgetting, is caused by the neural networks non-optimal ability to learn…

Machine Learning · Computer Science 2021-01-29 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Although federated learning has achieved many breakthroughs recently, the heterogeneous nature of the learning environment greatly limits its performance and hinders its real-world applications. The heterogeneous data, time-varying wireless…

Machine Learning · Computer Science 2023-02-22 Jingxin Li , Toktam Mahmoodi , Hak-Keung Lam

A core capability of intelligent systems is the ability to quickly learn new tasks by drawing on prior experience. Gradient (or optimization) based meta-learning has recently emerged as an effective approach for few-shot learning. In this…

Machine Learning · Computer Science 2019-09-11 Aravind Rajeswaran , Chelsea Finn , Sham Kakade , Sergey Levine

Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient…

‹ Prev 1 2 3 10 Next ›