Related papers: Enhancing Compositional Generalization via Composi…

Weakly-supervised Compositional Feature Aggregation for Few-shot Recognition

Learning from a few examples is a challenging task for machine learning. While recent progress has been made for this problem, most of the existing methods ignore the compositionality in visual concept representation (e.g. objects are built…

Computer Vision and Pattern Recognition · Computer Science 2019-06-13 Ping Hu, Ximeng Sun, Kate Saenko, Stan Sclaroff

Compositional Feature Augmentation for Unbiased Scene Graph Generation

Scene Graph Generation (SGG) aims to detect all the visual relation triplets $\langle\texttt{sub}, \texttt{pred}, \texttt{obj}\rangle$ in a given image. With the emergence of various advanced techniques for better utilizing both the intrinsic and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Lin Li, Guikun Chen, Jun Xiao, Yi Yang, Chunping Wang, Long Chen

Consistency of Compositional Generalization across Multiple Levels

Compositional generalization is the capability of a model to understand novel compositions composed of seen concepts. There are multiple levels of novel compositions including phrase-phrase level, phrase-word level, and word-word level.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Chuanhao Li, Zhen Li, Chenchen Jing, Xiaomeng Fan, Wenbo Ye, Yuwei Wu, Yunde Jia

Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models

How to usefully encode compositional task structure has long been a core challenge in AI. Recent work in chain of thought prompting has shown that for very large neural language models (LMs), explicitly demonstrating the inferential steps…

Computation and Language · Computer Science 2022-10-25 Victor S. Bursztyn, David Demeter, Doug Downey, Larry Birnbaum

Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models

Foundation models like CLIP allow zero-shot transfer on various tasks without additional training data. Yet, the zero-shot performance is less competitive than a fully supervised one. Thus, to enhance the performance, fine-tuning and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-08 Beier Zhu, Kaihua Tang, Qianru Sun, Hanwang Zhang

Real-World Compositional Generalization with Disentangled Sequence-to-Sequence Learning

Compositional generalization is a basic mechanism in human language learning, which current neural networks struggle with. A recently proposed Disentangled sequence-to-sequence model (Dangle) shows promising generalization capability by…

Computation and Language · Computer Science 2022-12-13 Hao Zheng, Mirella Lapata

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu, Peter Shaw, Tal Linzen, Fei Sha

Generalized Coarse-to-Fine Visual Recognition with Progressive Training

Computer vision is difficult, partly because the desired mathematical function connecting input and output data is often complex, fuzzy and thus hard to learn. Coarse-to-fine (C2F) learning is a promising direction, but it remains unclear…

Computer Vision and Pattern Recognition · Computer Science 2019-04-17 Xutong Ren, Lingxi Xie, Chen Wei, Siyuan Qiao, Chi Su, Jiaying Liu, Qi Tian, Elliot K. Fishman, Alan L. Yuille

CAFA: Class-Aware Feature Alignment for Test-Time Adaptation

Despite recent advancements in deep learning, deep neural networks continue to suffer from performance degradation when applied to new data that differs from training data. Test-time adaptation (TTA) aims to address this challenge by…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Sanghun Jung, Jungsoo Lee, Nanhee Kim, Amirreza Shaban, Byron Boots, Jaegul Choo

Causality-based Dual-Contrastive Learning Framework for Domain Generalization

Domain Generalization (DG) is essentially a sub-branch of out-of-distribution generalization, which trains models from multiple source domains and generalizes to unseen target domains. Recently, some domain generalization algorithms have…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Zining Chen, Weiqiu Wang, Zhicheng Zhao, Aidong Men

InFusionLayer: a CFA-based ensemble tool to generate new classifiers for learning and modeling

Ensemble learning is a well established body of methods for machine learning to enhance predictive performance by combining multiple algorithms/models. Combinatorial Fusion Analysis (CFA) has provided method and practice for combining…

Machine Learning · Computer Science 2026-03-12 Eric Roginek, Jingyan Xu, D. Frank Hsu

Learning Invariant Representation via Contrastive Feature Alignment for Clutter Robust SAR Target Recognition

Deep neural networks (DNNs) have freed synthetic aperture radar automatic target recognition (SAR ATR) from expertise-based feature design and demonstrated superiority over conventional solutions. There has been shown the unique…

Computer Vision and Pattern Recognition · Computer Science 2023-04-05 Bowen Peng, Jianyue Xie, Bo Peng, Li Liu

Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization

Compositional generalization refers to correctly interpreting novel combinations of known primitives, which remains a major challenge. Existing approaches often rely on supervised fine-tuning, which encourages models to imitate target outputs.…

Machine Learning · Computer Science 2026-05-07 Xiyan Fu, Wei Liu

Revisiting Compositionality in Dual-Encoder Vision-Language Models: The Role of Inference

Dual-encoder Vision-Language Models (VLMs) such as CLIP are often characterized as bag-of-words systems due to their poor performance on compositional benchmarks. We argue that this limitation may stem less from deficient representations…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Imanol Miranda, Ander Salaberria, Eneko Agirre, Gorka Azkune

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Classifier-Free Guidance (CFG) is a critical technique for enhancing the sample quality of visual generative models. However, in autoregressive (AR) multi-modal generation, CFG introduces design inconsistencies between language and visual…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Huayu Chen, Hang Su, Peize Sun, Jun Zhu

Dual-stream Feature Augmentation for Domain Generalization

Domain generalization (DG) task aims to learn a robust model from source domains that could handle the out-of-distribution (OOD) issue. In order to improve the generalization ability of the model in unseen domains, increasing the diversity…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Shanshan Wang, ALuSi, Xun Yang, Ke Xu, Huibin Tan, Xingyi Zhang

CFA: Class-wise Calibrated Fair Adversarial Training

Adversarial training has been widely acknowledged as the most effective method to improve the adversarial robustness against adversarial examples for Deep Neural Networks (DNNs). So far, most existing works focus on enhancing the overall…

Machine Learning · Computer Science 2023-03-28 Zeming Wei, Yifei Wang, Yiwen Guo, Yisen Wang

Scalable Evaluation and Neural Models for Compositional Generalization

Compositional generalization, a key open challenge in modern machine learning, requires models to predict unknown combinations of known concepts. However, assessing compositional generalization remains a fundamental challenge due to the lack…

Machine Learning · Computer Science 2025-11-06 Giacomo Camposampiero, Pietro Barbiero, Michael Hersche, Roger Wattenhofer, Abbas Rahimi

Cross Contrasting Feature Perturbation for Domain Generalization

Domain generalization (DG) aims to learn a robust model from source domains that generalize well on unseen target domains. Recent studies focus on generating novel domain samples or features to diversify distributions complementary to…

Computer Vision and Pattern Recognition · Computer Science 2023-08-17 Chenming Li, Daoan Zhang, Wenjian Huang, Jianguo Zhang

Generative Compositional Augmentations for Scene Graph Prediction

Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky