English
Related papers

Related papers: Improving Compositional Generalization in Semantic…

200 papers

Pretrained large generative language models have shown great performance on many tasks, but exhibit low compositional generalization abilities. Scaling such models has been shown to improve their performance on various NLP tasks even just…

Computation and Language · Computer Science 2022-11-17 Arian Hosseini , Ankit Vani , Dzmitry Bahdanau , Alessandro Sordoni , Aaron Courville

Compositional generalization is the ability to generalize systematically to a new data distribution by combining known components. Although humans seem to have a great ability to generalize compositionally, state-of-the-art neural models…

Machine Learning · Computer Science 2021-06-22 Juyong Kim , Pradeep Ravikumar , Joshua Ainslie , Santiago Ontañón

Chain-of-Thought (CoT) training has markedly advanced the reasoning capabilities of large language models (LLMs), yet the mechanisms by which CoT training enhances generalization remain inadequately understood. In this work, we demonstrate…

Machine Learning · Computer Science 2026-02-13 Xinhao Yao , Ruifeng Ren , Yun Liao , Lizhong Ding , Yong Liu

Although neural sequence-to-sequence models have been successfully applied to semantic parsing, they fail at compositional generalization, i.e., they are unable to systematically generalize to unseen compositions of seen components.…

Computation and Language · Computer Science 2021-09-10 Hao Zheng , Mirella Lapata

Despite their strong performance on many tasks, pre-trained language models have been shown to struggle on out-of-distribution compositional generalization. Meanwhile, recent work has shown considerable improvements on many NLP tasks from…

Computation and Language · Computer Science 2022-10-26 Linlu Qiu , Peter Shaw , Panupong Pasupat , Tianze Shi , Jonathan Herzig , Emily Pitler , Fei Sha , Kristina Toutanova

Compositional generalization refers to the ability to generalize to novel combinations of previously observed words and syntactic structures. Since it is regarded as a desired property of neural models, recent work has assessed…

Computation and Language · Computer Science 2025-04-07 Ryoma Kumon , Daiki Matsuoka , Hitomi Yanaka

A rapidly growing body of research on compositional generalization investigates the ability of a semantic parser to dynamically recombine linguistic elements seen in training into unseen sequences. We present a systematic comparison of…

Computation and Language · Computer Science 2022-02-25 Pia Weißenhorn , Yuekun Yao , Lucia Donatelli , Alexander Koller

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily. However, existing neural network based models have been proven to be extremely deficient in…

Artificial Intelligence · Computer Science 2020-10-27 Qian Liu , Shengnan An , Jian-Guang Lou , Bei Chen , Zeqi Lin , Yan Gao , Bin Zhou , Nanning Zheng , Dongmei Zhang

Compositional understanding is crucial for human intelligence, yet it remains unclear whether contemporary vision models exhibit it. The dominant machine learning paradigm is built on the premise that scaling data and model sizes will…

Machine Learning · Computer Science 2025-07-10 Arnas Uselis , Andrea Dittadi , Seong Joon Oh

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu , Peter Shaw , Tal Linzen , Fei Sha

Leveraging the compositional nature of our world to expedite learning and facilitate generalization is a hallmark of human perception. In machine learning, on the other hand, achieving compositional generalization has proven to be an…

Machine Learning · Computer Science 2023-07-13 Thaddäus Wiedemer , Prasanna Mayilvahanan , Matthias Bethge , Wieland Brendel

According to the principle of compositional generalization, the meaning of a complex expression can be understood as a function of the meaning of its parts and of how they are combined. This principle is crucial for human language…

Computation and Language · Computer Science 2024-03-19 Sungjun Han , Sebastian Padó

Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process,…

Machine Learning · Computer Science 2023-10-31 Yi Ren , Samuel Lavoie , Mikhail Galkin , Danica J. Sutherland , Aaron Courville

Compositional generalization is the ability of a model to generalize to complex, previously unseen types of combinations of entities from just having seen the primitives. This type of generalization is particularly relevant to the semantic…

Computation and Language · Computer Science 2024-04-23 Amogh Mannekote

Compositional generalization refers to a model's capability to generalize to newly composed input data based on the data components observed during training. It has triggered a series of compositional generalization analysis on different…

Computation and Language · Computer Science 2022-09-07 Yunshi Lan , Lei Wang , Jing Jiang , Ee-Peng Lim

Grounded language models use external sources of information, such as knowledge graphs, to meet some of the general challenges associated with pre-training. By extending previous work on compositional generalization in semantic parsing, we…

Computation and Language · Computer Science 2024-06-10 Sondre Wold , Étienne Simon , Lucas Georges Gabriel Charpentier , Egor V. Kostylev , Erik Velldal , Lilja Øvrelid

In-context learning has shown great success in i.i.d semantic parsing splits, where the training and test sets are drawn from the same distribution. In this setup, models are typically prompted with demonstrations that are similar to the…

Computation and Language · Computer Science 2023-06-27 Itay Levy , Ben Bogin , Jonathan Berant

Large language models (LLMs) such as GPT-4 sometimes appear to be creative, solving novel tasks often with a few demonstrations in the prompt. These tasks require the models to generalize on distributions different from those from training…

Computation and Language · Computer Science 2024-12-31 Jiajun Song , Zhuoyan Xu , Yiqiao Zhong

Learning representations that generalize to novel compositions of known concepts is crucial for bridging the gap between human and machine perception. One prominent effort is learning object-centric representations, which are widely…

Machine Learning · Computer Science 2024-11-13 Thaddäus Wiedemer , Jack Brady , Alexander Panfilov , Attila Juhos , Matthias Bethge , Wieland Brendel

Neural networks can be powerful function approximators, which are able to model high-dimensional feature distributions from a subset of examples drawn from the target distribution. Naturally, they perform well at generalizing within the…

Machine Learning · Computer Science 2021-08-06 Aaron Eisermann , Jae Hee Lee , Cornelius Weber , Stefan Wermter
‹ Prev 1 2 3 10 Next ›