Related papers: Learning to Substitute Components for Compositiona…

Learning to Substitute Spans towards Improving Compositional Generalization

Despite the rising prevalence of neural sequence models, recent empirical evidences suggest their deficiency in compositional generalization. One of the current de-facto solutions to this problem is compositional data augmentation, aiming…

Computation and Language · Computer Science 2023-06-06 Zhaoyi Li , Ying Wei , Defu Lian

Improving Compositional Generalization with Latent Structure and Data Augmentation

Generic unstructured neural networks have been shown to struggle on out-of-distribution compositional generalization. Compositional data augmentation via example recombination has transferred some prior knowledge about compositionality to…

Computation and Language · Computer Science 2022-05-06 Linlu Qiu , Peter Shaw , Panupong Pasupat , Paweł Krzysztof Nowak , Tal Linzen , Fei Sha , Kristina Toutanova

Composition-Incremental Learning for Compositional Generalization

Compositional generalization has achieved substantial progress in computer vision on pre-collected training data. Nonetheless, real-world data continually emerges, with possible compositions being nearly infinite, long-tailed, and not…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Zhen Li , Yuwei Wu , Chenchen Jing , Che Sun , Chuanhao Li , Yunde Jia

Compositionality as Lexical Symmetry

In tasks like semantic parsing, instruction following, and question answering, standard deep networks fail to generalize compositionally from small datasets. Many existing approaches overcome this limitation with model architectures that…

Computation and Language · Computer Science 2023-07-06 Ekin Akyürek , Jacob Andreas

Context Parametrization with Compositional Adapters

Large language models (LLMs) often seamlessly adapt to new tasks through in-context learning (ICL) or supervised fine-tuning (SFT). However, ICL is inefficient when handling many demonstrations, and SFT incurs training overhead while…

Computation and Language · Computer Science 2026-01-30 Josip Jukić , Martin Tutek , Jan Šnajder

Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment

In text-to-SQL tasks -- as in much of NLP -- compositional generalization is a major challenge: neural networks struggle with compositional generalization where training and test distributions differ. However, most recent attempts to…

Computation and Language · Computer Science 2022-05-05 Yujian Gan , Xinyun Chen , Qiuping Huang , Matthew Purver

Good-Enough Compositional Data Augmentation

We propose a simple data augmentation protocol aimed at providing a compositional inductive bias in conditional and unconditional sequence models. Under this protocol, synthetic training examples are constructed by taking real training…

Computation and Language · Computer Science 2020-05-20 Jacob Andreas

Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning

Systematic generalization refers to the capacity to understand and generate novel combinations from known components. Despite recent progress by large language models (LLMs) across various domains, these models often fail to extend their…

Artificial Intelligence · Computer Science 2026-02-27 Philipp Mondorf , Shijia Zhou , Monica Riedler , Barbara Plank

Compositional Generalization for Primitive Substitutions

Compositional generalization is a basic mechanism in human language learning, but current neural networks lack such ability. In this paper, we conduct fundamental research for encoding compositionality in neural networks. Conventional…

Computation and Language · Computer Science 2019-10-08 Yuanpeng Li , Liang Zhao , Jianyu Wang , Joel Hestness

Compositional Generalization by Learning Analytical Expressions

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily. However, existing neural network based models have been proven to be extremely deficient in…

Artificial Intelligence · Computer Science 2020-10-27 Qian Liu , Shengnan An , Jian-Guang Lou , Bei Chen , Zeqi Lin , Yan Gao , Bin Zhou , Nanning Zheng , Dongmei Zhang

Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability

Large language models (LLMs) have emerged as powerful tools for many AI problems and exhibit remarkable in-context learning (ICL) capabilities. Compositional ability, solving unseen complex tasks that combine two or more simple tasks, is an…

Computation and Language · Computer Science 2024-08-13 Zhuoyan Xu , Zhenmei Shi , Yingyu Liang

Learning Composable Chains-of-Thought

A common approach for teaching large language models (LLMs) to reason is to train on chain-of-thought (CoT) traces of in-distribution reasoning problems, but such annotated data is costly to obtain for every problem of interest. We want…

Computation and Language · Computer Science 2025-05-29 Fangcong Yin , Zeyu Leo Liu , Liu Leqi , Xi Ye , Greg Durrett

Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process,…

Machine Learning · Computer Science 2023-10-31 Yi Ren , Samuel Lavoie , Mikhail Galkin , Danica J. Sutherland , Aaron Courville

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu , Peter Shaw , Tal Linzen , Fei Sha

LLM Augmented LLMs: Expanding Capabilities through Composition

Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains. However, due to their monolithic structure, it is challenging and expensive to…

Machine Learning · Computer Science 2024-01-05 Rachit Bansal , Bidisha Samanta , Siddharth Dalmia , Nitish Gupta , Shikhar Vashishth , Sriram Ganapathy , Abhishek Bapna , Prateek Jain , Partha Talukdar

CDS: Knowledge Component-Driven Data Synthesis Guided by Cognitive Diagnosis Theory

Large Language Models (LLMs) have achieved significant advancements, but the increasing complexity of tasks and higher performance demands highlight the need for continuous improvement. Some approaches utilize synthetic data generated by…

Artificial Intelligence · Computer Science 2025-06-23 Haokun Zhao , Jinyi Han , Jiaqing Liang , Yanghua Xiao , Xiaojun Meng , Jiansheng Wei

Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

In-Context Learning (ICL) allows Large Language Models (LLMs) to adapt to new tasks with just a few examples, but their predictions often suffer from systematic biases, leading to unstable performance in classification. While calibration…

Machine Learning · Statistics 2026-03-05 Korel Gundem , Juncheng Dong , Dennis Zhang , Vahid Tarokh , Zhengling Qi

Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures

While mainstream machine learning methods are known to have limited ability to compositionally generalize, new architectures and techniques continue to be proposed to address this limitation. We investigate state-of-the-art techniques and…

Computation and Language · Computer Science 2021-09-23 Daniel Furrer , Marc van Zee , Nathan Scales , Nathanael Schärli

On Evaluating Multilingual Compositional Generalization with Translated Datasets

Compositional generalization allows efficient learning and human-like inductive biases. Since most research investigating compositional generalization in NLP is done on English, important questions remain underexplored. Do the necessary…

Computation and Language · Computer Science 2023-06-21 Zi Wang , Daniel Hershcovich

Composition-Grounded Data Synthesis for Visual Reasoning

Pretrained multi-modal large language models (MLLMs) demonstrate strong performance on diverse multimodal tasks, but remain limited in reasoning capabilities for domains where annotations are difficult to collect. In this work, we focus on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Xinyi Gu , Jiayuan Mao , Zhang-Wei Hong , Zhuoran Yu , Pengyuan Li , Dhiraj Joshi , Rogerio Feris , Zexue He