Related papers: Compositional diversity in visual concept learning

Flexible Compositional Learning of Structured Visual Concepts

Humans are highly efficient learners, with the ability to grasp the meaning of a new concept from just a few examples. Unlike popular computer vision systems, humans can flexibly leverage the compositional structure of the visual world,…

Computer Vision and Pattern Recognition · Computer Science 2021-05-21 Yanli Zhou, Brenden M. Lake

People infer recursive visual concepts from just a few examples

Machine learning has made major advances in categorizing objects in images, yet the best algorithms miss important aspects of how people learn and think about categories. People can learn richer concepts from fewer examples, including…

Machine Learning · Computer Science 2019-07-30 Brenden M. Lake, Steven T. Piantadosi

Compositional learning of functions in humans and machines

The ability to learn and compose functions is foundational to efficient learning and reasoning in humans, enabling flexible generalizations such as creating new dishes from known cooking processes. Beyond sequential chaining of functions,…

Artificial Intelligence · Computer Science 2024-03-20 Yanli Zhou, Brenden M. Lake, Adina Williams

Few-Shot Learning of Visual Compositional Concepts through Probabilistic Schema Induction

The ability to learn new visual concepts from limited examples is a hallmark of human cognition. While traditional category learning models represent each example as an unstructured feature vector, compositional concept learning is thought…

Computer Vision and Pattern Recognition · Computer Science 2025-05-16 Andrew Jun Lee, Taylor Webb, Trevor Bihl, Keith Holyoak, Hongjing Lu

Learning Compositional Representations for Few-Shot Recognition

One of the key limitations of modern deep learning approaches lies in the amount of data required to train them. Humans, by contrast, can learn to recognize novel categories from just a few examples. Instrumental to this rapid learning…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Pavel Tokmakov, Yu-Xiong Wang, Martial Hebert

Human-like compositional learning of visually-grounded concepts using synthetic environments

The compositional structure of language enables humans to decompose complex phrases and map them to novel visual concepts, showcasing flexible intelligence. While several algorithms exhibit compositionality, they fail to elucidate how…

Computer Vision and Pattern Recognition · Computer Science 2025-04-10 Zijun Lin, M Ganesh Kumar, Cheston Tan

A causal view of compositional zero-shot recognition

People easily recognize new visual categories that are new combinations of known components. This compositional generalization capacity is critical for learning in real-world domains like vision and language because the long tail of new…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Yuval Atzmon, Felix Kreuk, Uri Shalit, Gal Chechik

A Survey on Compositional Learning of AI Models: Theoretical and Experimental Practices

Compositional learning, mastering the ability to combine basic concepts and construct more intricate ones, is crucial for human cognition, especially in human language comprehension and visual perception. This notion is tightly connected to…

Artificial Intelligence · Computer Science 2024-11-22 Sania Sinha, Tanawan Premsri, Parisa Kordjamshidi

Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks

Human action is naturally compositional: humans can easily recognize and perform actions with objects that are different from those used in training demonstrations. In this paper, we study the compositionality of action by looking into the…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Joanna Materzynska, Tete Xiao, Roei Herzig, Huijuan Xu, Xiaolong Wang, Trevor Darrell

Human few-shot learning of compositional instructions

People learn in fast and flexible ways that have not been emulated by machines. Once a person learns a new verb "dax," he or she can effortlessly understand how to "dax twice," "walk and dax," or "dax vigorously." There have been striking…

Computation and Language · Computer Science 2019-05-14 Brenden M. Lake, Tal Linzen, Marco Baroni

Compositional Program Generation for Few-Shot Systematic Generalization

Compositional generalization is a key ability of humans that enables us to learn new concepts from only a handful of examples. Neural machine learning models, including the now ubiquitous Transformers, struggle to generalize in this way, and…

Machine Learning · Computer Science 2024-01-19 Tim Klinger, Luke Liu, Soham Dan, Maxwell Crouse, Parikshit Ram, Alexander Gray

Neural Algebra of Classifiers

The world is fundamentally compositional, so it is natural to think of visual recognition as the recognition of basic visual primitives that are composed according to well-defined rules. This strategy allows us to recognize unseen complex…

Computer Vision and Pattern Recognition · Computer Science 2018-01-29 Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

Compositional Scene Understanding through Inverse Generative Modeling

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang, Justin Dauwels, Yilun Du

Iterated Learning Improves Compositionality in Large Vision-Language Models

A fundamental characteristic common to both human vision and natural language is their compositional nature. Yet, despite the performance gains contributed by large vision and language pretraining, recent investigations find that most-if…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna

Development of Compositionality and Generalization through Interactive Learning of Language and Action of Robots

Humans excel at applying learned behavior to unlearned situations. A crucial component of this generalization behavior is our ability to compose/decompose a whole into reusable parts, an attribute known as compositionality. One of the…

Artificial Intelligence · Computer Science 2024-07-24 Prasanna Vijayaraghavan, Jeffrey Frederic Queisser, Sergio Verduzco Flores, Jun Tani

Where and Who? Automatic Semantic-Aware Person Composition

Image compositing is a method used to generate realistic yet fake imagery by inserting contents from one image to another. Previous work in compositing has focused on improving appearance compatibility of a user selected foreground segment…

Graphics · Computer Science 2017-12-05 Fuwen Tan, Crispin Bernier, Benjamin Cohen, Vicente Ordonez, Connelly Barnes

Concepts, Properties and an Approach for Compositional Generalization

Compositional generalization is the capacity to recognize and imagine a large number of novel combinations from known components. It is key to human intelligence, but current neural networks generally lack this ability. This report…

Artificial Intelligence · Computer Science 2021-02-09 Yuanpeng Li

CtD: Composition through Decomposition in Emergent Communication

Compositionality is a cognitive mechanism that allows humans to systematically combine known concepts in novel ways. This study demonstrates how artificial neural agents acquire and utilize compositional generalization to describe…

Artificial Intelligence · Computer Science 2026-01-16 Boaz Carmeli, Ron Meir, Yonatan Belinkov

Learning Compositional Visual Concepts with Mutual Consistency

Compositionality of semantic concepts in image synthesis and analysis is appealing as it can help in decomposing known and generatively recomposing unknown data. For instance, we may learn concepts of changing illumination, geometry or…

Computer Vision and Pattern Recognition · Computer Science 2018-03-29 Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Peter C. Doerschuk

A Benchmark for Compositional Visual Reasoning

A fundamental component of human vision is our ability to parse complex visual scenes and judge the relations between their constituent objects. AI benchmarks for visual reasoning have driven rapid progress in recent years with…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Aimen Zerroug, Mohit Vaishnav, Julien Colin, Sebastian Musslick, Thomas Serre