Related papers: Reevaluating Meta-Learning Optimization Algorithms…

200 papers

Neural Ordinary Differential Equations (NODEs) often struggle to adapt to new dynamic behaviors caused by parameter changes in the underlying physical system, even when these dynamics are similar to previously observed behaviors. This…

Machine Learning · Computer Science 2025-09-30 Roussel Desmond Nzoyem, David A. W. Barton, Tom Deakin

Self-modulating mechanisms introduce dynamic adaptation capabilities within language models through contextual realignment strategies that influence token embedding trajectories across extended sequences. Contextual Flux is explored as an…

Computation and Language · Computer Science 2025-08-11 Henry Evidail, Zachary Mountebank, Alistair Hathersage, Peter Stanhope, Basil Ravenscroft, Tobias Waddingham

At the core of self-supervised learning for vision is the idea of learning invariant or equivariant representations with respect to a set of data transformations. This approach, however, introduces strong inductive biases, which can render…

Machine Learning · Computer Science 2024-05-29 Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka

In-context learning (ICL) has transformed the use of large language models (LLMs) for NLP tasks, enabling few-shot learning by conditioning on labeled examples without finetuning. Despite its effectiveness, ICL is prone to errors,…

Computation and Language · Computer Science 2025-03-21 Mario Sanz-Guerrero, Katharina von der Wense

Contextually Entangled Gradient Mapping (CEGM) introduces a new approach to gradient optimization, redefining the relationship between contextual embeddings and gradient updates to enhance semantic coherence and reasoning capabilities in…

Machine Learning · Computer Science 2025-08-11 Colin Sisate, Alistair Goldfinch, Vincent Waterstone, Sebastian Kingsley, Mariana Blackthorn

Scaling language models to longer contexts is essential for capturing rich dependencies across extended discourse. However, naïve context extension imposes significant computational and memory burdens, often resulting in inefficiencies…

Computation and Language · Computer Science 2026-02-03 Wenhao Li, Bangcheng Sun, Weihao Ye, Tianyi Zhang, Daohai Yu, Fei Chao, Rongrong Ji

Context-aware processing mechanisms have increasingly become a critical area of exploration for improving the semantic and contextual capabilities of language generation models. The Context-Aware Semantic Recomposition Mechanism (CASRM) was…

Computation and Language · Computer Science 2025-03-27 Richard Katrix, Quentin Carroway, Rowan Hawkesbury, Matthias Heathfield

In-Context Learning (ICL) allows Large Language Models (LLMs) to adapt to new tasks with just a few examples, but their predictions often suffer from systematic biases, leading to unstable performance in classification. While calibration…

Machine Learning · Statistics 2026-03-05 Korel Gundem, Juncheng Dong, Dennis Zhang, Vahid Tarokh, Zhengling Qi

Convolutional neural networks (CNNs) have been shown to be state-of-the-art models for visual cortical neurons. Cortical neurons in the primary visual cortex are sensitive to contextual information mediated by extensive horizontal and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

We present a gradient-based meta-learning framework for rapid adaptation of neural state-space models (NSSMs) for black-box system identification. When applicable, we also incorporate domain-specific physical constraints to improve the…

Large Language Models (LLMs) possess remarkable generalization capabilities but struggle with multi-task adaptation, particularly in balancing knowledge retention with task-specific specialization. Conventional fine-tuning methods suffer…

Artificial Intelligence · Computer Science 2025-10-21 Dayan Pan, Zhaoyang Fu, Jingyuan Wang, Xiao Han, Yue Zhu, Xiangyu Zhao

We propose CAVIA for meta-learning, a simple extension to MAML that is less prone to meta-overfitting, easier to parallelise, and more interpretable. CAVIA partitions the model parameters into two parts: context parameters that serve as…

Machine Learning · Computer Science 2019-06-11 Luisa M Zintgraf, Kyriacos Shiarlis, Vitaly Kurin, Katja Hofmann, Shimon Whiteson

Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few samples in dynamic environments. Such a feat is achieved through dynamic representations in an agent's policy network (obtained via reasoning…

Neural and Evolutionary Computing · Computer Science 2022-04-27 Eseoghene Ben-Iwhiwhu, Jeffery Dick, Nicholas A. Ketz, Praveen K. Pilly, Andrea Soltoggio

Multimodal in-context learning (ICL) is becoming a key capability that allows large vision-language models (LVLMs) to adapt to novel tasks without parameter updates, which expands their usefulness in many real-world applications. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Yanshu Li, Jianjiang Yang, Ziteng Yang, Bozheng Li, Ligong Han, Hongyang He, Zhengtao Yao, Yingjie Victor Chen, Songlin Fei, Dongfang Liu, Ruixiang Tang

The Neural Contextual Reinforcement Framework introduces an innovative approach to enhancing the logical coherence and structural consistency of text generated by large language models. Leveraging reinforcement learning principles, the…

Computation and Language · Computer Science 2025-08-11 Marcus Irvin, William Cooper, Edward Hughes, Jessica Morgan, Christopher Hamilton

Neural Fields (NF) have gained prominence as a versatile framework for complex data representation. This work unveils a new problem setting termed Meta-Continual Learning of Neural Fields (MCL-NF) and introduces a novel strategy that…

Artificial Intelligence · Computer Science 2026-02-24 Seungyoon Woo, Junhyeog Yun, Gunhee Kim

Practical recommender systems experience a cold-start problem when observed user-item interactions in the history are insufficient. Meta learning, especially gradient based one, can be adopted to tackle this problem by learning initial…

Information Retrieval · Computer Science 2021-11-01 Xidong Feng, Chen Chen, Dong Li, Mengchen Zhao, Jianye Hao, Jun Wang

Biased regularization and fine-tuning are two recent meta-learning approaches. They have been shown to be effective to tackle distributions of tasks, in which the tasks' target vectors are all close to a common meta-parameter vector.…

Machine Learning · Computer Science 2020-08-26 Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

Modern self-supervised learning algorithms typically enforce persistency of instance representations across views. While being very effective on learning holistic image and video representations, such an objective becomes sub-optimal for…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

Learning continually from a stream of non-i.i.d. data is an open challenge in deep learning, even more so when working in resource-constrained environments such as embedded devices. Visual models that are continually updated through…

Artificial Intelligence · Computer Science 2025-07-30 Clea Rebillard, Julio Hurtado, Andrii Krutsylo, Lucia Passaro, Vincenzo Lomonaco