Related papers: Generalization in Multimodal Language Learning fro…

Memorize or generalize? Searching for a compositional RNN in a haystack

Neural networks are very powerful learning systems, but they do not readily generalize from one task to the other. This is partly due to the fact that they do not learn in a compositional way, that is, by discovering skills that are shared…

Artificial Intelligence · Computer Science 2018-07-27 Adam Liška , Germán Kruszewski , Marco Baroni

Improving Systematic Generalization Through Modularity and Augmentation

Systematic generalization is the ability to combine known parts into novel meaning; an important aspect of efficient human learning, but a weakness of neural network learning. In this work, we investigate how two well-known modeling…

Artificial Intelligence · Computer Science 2022-02-23 Laura Ruis , Brenden Lake

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu , Peter Shaw , Tal Linzen , Fei Sha

Scaling can lead to compositional generalization

Can neural networks systematically capture discrete, compositional task structure despite their continuous, distributed nature? The impressive capabilities of large-scale neural networks suggest that the answer to this question is yes.…

Machine Learning · Computer Science 2025-10-27 Florian Redhardt , Yassir Akram , Simon Schug

Meta-Learning to Compositionally Generalize

Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural…

Computation and Language · Computer Science 2021-06-30 Henry Conklin , Bailin Wang , Kenny Smith , Ivan Titov

On The Specialization of Neural Modules

A number of machine learning models have been proposed with the goal of achieving systematic generalization: the ability to reason about new situations by combining aspects of previous experiences. These models leverage compositional…

Machine Learning · Computer Science 2024-09-24 Devon Jarvis , Richard Klein , Benjamin Rosman , Andrew M. Saxe

Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process,…

Machine Learning · Computer Science 2023-10-31 Yi Ren , Samuel Lavoie , Mikhail Galkin , Danica J. Sutherland , Aaron Courville

Propositional Logic for Probing Generalization in Neural Networks

The extent to which neural networks are able to acquire and represent symbolic rules remains a key topic of research and debate. Much current work focuses on the impressive capabilities of large language models, as well as their often…

Machine Learning · Computer Science 2025-06-11 Anna Langedijk , Jaap Jumelet , Willem Zuidema

Towards Understanding the Relationship between In-context Learning and Compositional Generalization

According to the principle of compositional generalization, the meaning of a complex expression can be understood as a function of the meaning of its parts and of how they are combined. This principle is crucial for human language…

Computation and Language · Computer Science 2024-03-19 Sungjun Han , Sebastian Padó

Distant generalization by feedforward neural networks

This paper discusses the notion of generalization of training samples over long distances in the input space of a feedforward neural network. Such a generalization might occur in various ways, that differ in how great the contribution of…

Neural and Evolutionary Computing · Computer Science 2007-06-13 Artur Rataj

A Study of Compositional Generalization in Neural Models

Compositional and relational learning is a hallmark of human intelligence, but one which presents challenges for neural models. One difficulty in the development of such models is the lack of benchmarks with clear compositional and…

Machine Learning · Computer Science 2020-07-09 Tim Klinger , Dhaval Adjodah , Vincent Marois , Josh Joseph , Matthew Riemer , Alex 'Sandy' Pentland , Murray Campbell

Compositional Generalization from First Principles

Leveraging the compositional nature of our world to expedite learning and facilitate generalization is a hallmark of human perception. In machine learning, on the other hand, achieving compositional generalization has proven to be an…

Machine Learning · Computer Science 2023-07-13 Thaddäus Wiedemer , Prasanna Mayilvahanan , Matthias Bethge , Wieland Brendel

Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models

Humans can learn structural properties about a word from minimal experience, and deploy their learned syntactic representations uniformly in different grammatical contexts. We assess the ability of modern neural language models to reproduce…

Computation and Language · Computer Science 2020-10-13 Ethan Wilcox , Peng Qian , Richard Futrell , Ryosuke Kohita , Roger Levy , Miguel Ballesteros

Generalization Through the Lens of Learning Dynamics

A machine learning (ML) system must learn not only to match the output of a target function on a training set, but also to generalize to novel situations in order to yield accurate predictions at deployment. In most practical applications,…

Machine Learning · Computer Science 2022-12-13 Clare Lyle

On Evaluating Multilingual Compositional Generalization with Translated Datasets

Compositional generalization allows efficient learning and human-like inductive biases. Since most research investigating compositional generalization in NLP is done on English, important questions remain underexplored. Do the necessary…

Computation and Language · Computer Science 2023-06-21 Zi Wang , Daniel Hershcovich

Generalizing in the Real World with Representation Learning

Machine learning (ML) formalizes the problem of getting computers to learn from experience as optimization of performance according to some metric(s) on a set of data examples. This is in contrast to requiring behaviour specified in advance…

Machine Learning · Computer Science 2022-10-19 Tegan Maharaj

Revisit Systematic Generalization via Meaningful Learning

Humans can systematically generalize to novel compositions of existing concepts. Recent studies argue that neural networks appear inherently ineffective in such cognitive capacity, leading to a pessimistic view and a lack of attention to…

Computation and Language · Computer Science 2022-10-19 Ning Shi , Boxin Wang , Wei Wang , Xiangyu Liu , Zhouhan Lin

Training Multimodal Systems for Classification with Multiple Objectives

We learn about the world from a diverse range of sensory information. Automated systems lack this ability as investigation has centred on processing information presented in a single form. Adapting architectures to learn from multiple…

Machine Learning · Computer Science 2020-10-27 Jason Armitage , Shramana Thakur , Rishi Tripathi , Jens Lehmann , Maria Maleshkova

Improving Compositional Generalization in Classification Tasks via Structure Annotations

Compositional generalization is the ability to generalize systematically to a new data distribution by combining known components. Although humans seem to have a great ability to generalize compositionally, state-of-the-art neural models…

Machine Learning · Computer Science 2021-06-22 Juyong Kim , Pradeep Ravikumar , Joshua Ainslie , Santiago Ontañón

Analyzing the Inner Workings of Transformers in Compositional Generalization

The compositional generalization abilities of neural models have been sought after for human-like linguistic competence. The popular method to evaluate such abilities is to assess the models' input-output behavior. However, that does not…

Computation and Language · Computer Science 2025-02-24 Ryoma Kumon , Hitomi Yanaka