Related papers: Understanding Robust Generalization in Learning Re…

Memorize or generalize? Searching for a compositional RNN in a haystack

Neural networks are very powerful learning systems, but they do not readily generalize from one task to the other. This is partly due to the fact that they do not learn in a compositional way, that is, by discovering skills that are shared…

Artificial Intelligence · Computer Science 2018-07-27 Adam Liška , Germán Kruszewski , Marco Baroni

Generalization in Multimodal Language Learning from Simulation

Neural networks can be powerful function approximators, which are able to model high-dimensional feature distributions from a subset of examples drawn from the target distribution. Naturally, they perform well at generalizing within the…

Machine Learning · Computer Science 2021-08-06 Aaron Eisermann , Jae Hee Lee , Cornelius Weber , Stefan Wermter

Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization

Compositional generalization refers to correctly interpret novel combinations of known primitives, which remains a major challenge. Existing approaches often rely on supervised fine-tuning, which encourages models to imitate target outputs.…

Machine Learning · Computer Science 2026-05-07 Xiyan Fu , Wei Liu

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu , Peter Shaw , Tal Linzen , Fei Sha

Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks

Systematic compositionality is the ability to recombine meaningful units with regular and predictable outcomes, and it's seen as key to humans' capacity for generalization in language. Recent work has studied systematic compositionality in…

Computation and Language · Computer Science 2018-07-20 João Loula , Marco Baroni , Brenden M. Lake

Linguistic generalization and compositionality in modern artificial neural networks

In the last decade, deep artificial neural networks have achieved astounding performance in many natural language processing tasks. Given the high productivity of language, these models must possess effective generalization abilities. It is…

Computation and Language · Computer Science 2019-06-27 Marco Baroni

Revisit Systematic Generalization via Meaningful Learning

Humans can systematically generalize to novel compositions of existing concepts. Recent studies argue that neural networks appear inherently ineffective in such cognitive capacity, leading to a pessimistic view and a lack of attention to…

Computation and Language · Computer Science 2022-10-19 Ning Shi , Boxin Wang , Wei Wang , Xiangyu Liu , Zhouhan Lin

Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process,…

Machine Learning · Computer Science 2023-10-31 Yi Ren , Samuel Lavoie , Mikhail Galkin , Danica J. Sutherland , Aaron Courville

Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented…

Computation and Language · Computer Science 2021-10-20 Yen-Ling Kuo , Boris Katz , Andrei Barbu

Meta-Learning to Compositionally Generalize

Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural…

Computation and Language · Computer Science 2021-06-30 Henry Conklin , Bailin Wang , Kenny Smith , Ivan Titov

Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks

Humans can understand and produce new utterances effortlessly, thanks to their compositional skills. Once a person learns the meaning of a new verb "dax," he or she can immediately understand the meaning of "dax twice" or "sing and dax." In…

Computation and Language · Computer Science 2018-06-08 Brenden M. Lake , Marco Baroni

Improving Compositional Generalization in Classification Tasks via Structure Annotations

Compositional generalization is the ability to generalize systematically to a new data distribution by combining known components. Although humans seem to have a great ability to generalize compositionally, state-of-the-art neural models…

Machine Learning · Computer Science 2021-06-22 Juyong Kim , Pradeep Ravikumar , Joshua Ainslie , Santiago Ontañón

Compositional Generalization for Primitive Substitutions

Compositional generalization is a basic mechanism in human language learning, but current neural networks lack such ability. In this paper, we conduct fundamental research for encoding compositionality in neural networks. Conventional…

Computation and Language · Computer Science 2019-10-08 Yuanpeng Li , Liang Zhao , Jianyu Wang , Joel Hestness

Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Recurrent neural networks (RNNs) for reinforcement learning (RL) have shown distinct advantages, e.g., solving memory-dependent tasks and meta-learning. However, little effort has been spent on improving RNN architectures and on…

Machine Learning · Computer Science 2019-11-27 Dongqi Han , Kenji Doya , Jun Tani

Learning to Recombine and Resample Data for Compositional Generalization

Flexible neural sequence models outperform grammar- and automaton-based counterparts on a variety of tasks. However, neural models perform poorly in settings requiring compositional generalization beyond the training data -- particularly to…

Computation and Language · Computer Science 2021-06-09 Ekin Akyürek , Afra Feyza Akyürek , Jacob Andreas

Real-World Compositional Generalization with Disentangled Sequence-to-Sequence Learning

Compositional generalization is a basic mechanism in human language learning, which current neural networks struggle with. A recently proposed Disentangled sequence-to-sequence model (Dangle) shows promising generalization capability by…

Computation and Language · Computer Science 2022-12-13 Hao Zheng , Mirella Lapata

Compositional generalization in a deep seq2seq model by separating syntax and semantics

Standard methods in deep learning for natural language processing fail to capture the compositional structure of human language that allows for systematic generalization outside of the training distribution. However, human learners readily…

Machine Learning · Computer Science 2019-05-27 Jake Russin , Jason Jo , Randall C. O'Reilly , Yoshua Bengio

Revisiting the Compositional Generalization Abilities of Neural Sequence Models

Compositional generalization is a fundamental trait in humans, allowing us to effortlessly combine known phrases to form novel sentences. Recent works have claimed that standard seq-to-seq models severely lack the ability to compositionally…

Computation and Language · Computer Science 2022-03-16 Arkil Patel , Satwik Bhattamishra , Phil Blunsom , Navin Goyal

Learning Robust Representations of Text

Deep neural networks have achieved remarkable results across many language processing tasks, however these methods are highly sensitive to noise and adversarial attacks. We present a regularization based method for limiting network…

Computation and Language · Computer Science 2016-09-21 Yitong Li , Trevor Cohn , Timothy Baldwin

Improving Systematic Generalization Through Modularity and Augmentation

Systematic generalization is the ability to combine known parts into novel meaning; an important aspect of efficient human learning, but a weakness of neural network learning. In this work, we investigate how two well-known modeling…

Artificial Intelligence · Computer Science 2022-02-23 Laura Ruis , Brenden Lake