Related papers: Evaluating Morphological Compositional Generalizat…

200 papers

Compositional generalization is the ability of a model to generalize to complex, previously unseen types of combinations of entities from just having seen the primitives. This type of generalization is particularly relevant to the semantic…

Computation and Language · Computer Science 2024-04-23 Amogh Mannekote

Large language models (LLMs) have emerged as powerful tools for many AI problems and exhibit remarkable in-context learning (ICL) capabilities. Compositional ability, solving unseen complex tasks that combine two or more simple tasks, is an…

Computation and Language · Computer Science 2024-08-13 Zhuoyan Xu, Zhenmei Shi, Yingyu Liang

Large language models (LLMs) have demonstrated remarkable performances on a wide range of natural language tasks. Yet, LLMs' successes have been largely restricted to tasks concerning words, sentences, or documents, and it remains…

Computation and Language · Computer Science 2024-07-24 Andrew Shin, Kunitake Kaneko

Human cognition exhibits systematic compositionality, the algebraic ability to generate infinite novel combinations from finite learned components, which is the key to understanding and reasoning about complex logic. In this work, we…

Computation and Language · Computer Science 2024-10-11 Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang

Large Language Models (LLMs) are known for their remarkable ability to generate synthesized 'knowledge', such as text documents, music, images, etc. However, there is a huge gap between LLM's and human capabilities for understanding…

Computation and Language · Computer Science 2024-08-14 Vladimir Cherkassky, Eng Hock Lee

Large language models (LLMs) are very performant connectionist systems, but do they exhibit more compositionality? More importantly, is that part of why they perform so well? We present empirical analyses across four LLM families (12…

Computation and Language · Computer Science 2025-05-21 Ruchira Dhar, Anders Søgaard

Large Pre-trained Language Models (PLM) have become the most desirable starting point in the field of NLP, as they have become remarkably good at solving many individual tasks. Despite such success, in this paper, we argue that current…

Computation and Language · Computer Science 2023-03-07 Hangyeol Yu, Myeongho Jeong, Jamin Shin, Hyeongdon Moon, Juneyoung Park, Seungtaek Choi

Large Language Models (LLMs), originally developed for natural language processing (NLP), have demonstrated the potential to generalize across modalities and domains. With their in-context learning (ICL) capabilities, LLMs can perform…

Artificial Intelligence · Computer Science 2025-08-26 Nikolaos Pavlidis, Vasilis Perifanis, Symeon Symeonidis, Pavlos S. Efraimidis

Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities…

Grammatical features across human languages show intriguing correlations often attributed to learning biases in humans. However, empirical evidence has been limited to experiments with highly simplified artificial languages, and whether…

Computation and Language · Computer Science 2025-02-19 Tianyang Xu, Tatsuki Kuribayashi, Yohei Oseki, Ryan Cotterell, Alex Warstadt

Large language models (LLMs) that fluently converse with humans are a reality - but do LLMs experience human-like processing difficulties? We systematically compare human and LLM sentence comprehension across seven challenging linguistic…

Computation and Language · Computer Science 2025-10-17 Samuel Joseph Amouyal, Aya Meltzer-Asscher, Jonathan Berant

What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these…

Computation and Language · Computer Science 2024-06-04 Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

While LLMs have emerged as performant architectures for reasoning tasks, their compositional generalization capabilities have been questioned. In this work, we introduce a Compositional Generalization Challenge for Graph-based Commonsense…

Computation and Language · Computer Science 2024-10-10 Xiyan Fu, Anette Frank

Systematic generalization refers to the capacity to understand and generate novel combinations from known components. Despite recent progress by large language models (LLMs) across various domains, these models often fail to extend their…

Artificial Intelligence · Computer Science 2026-02-27 Philipp Mondorf, Shijia Zhou, Monica Riedler, Barbara Plank

As large language models (LLMs) become increasingly advanced, their ability to exhibit compositional generalization -- the capacity to combine learned skills in novel ways not encountered during training -- has garnered significant…

Computation and Language · Computer Science 2025-01-22 Haoyu Zhao, Simran Kaur, Dingli Yu, Anirudh Goyal, Sanjeev Arora

Recent advances in the performance of large language models (LLMs) have sparked debate over whether, given sufficient training, high-level human abilities emerge in such generic forms of artificial intelligence (AI). Despite the exceptional…

Computation and Language · Computer Science 2024-01-18 Nicholas Ichien, Dušan Stamenković, Keith J. Holyoak

The meaning of complex phrases in natural language is composed of their individual components. The task of compositional generalization evaluates a model's ability to understand new combinations of components. Previous studies trained…

Computation and Language · Computer Science 2023-12-14 Min Zhang, Jianfeng He, Shuo Lei, Murong Yue, Linhang Wang, Chang-Tien Lu

Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022. LLMs' ability of general-purpose language understanding and…

Computation and Language · Computer Science 2025-03-25 Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher, Xavier Amatriain, Jianfeng Gao

Human languages have evolved to be structured through repeated language learning and use. These processes introduce biases that operate during language acquisition and shape linguistic systems toward communicative efficiency. In this paper,…

Computation and Language · Computer Science 2024-12-16 Tom Kouwenhoven, Max Peeperkorn, Tessa Verhoef

Large Language Models (LLMs) are recruited in applications that span from clinical assistance and legal support to question answering and education. Their success in specialized tasks has led to the claim that they possess human-like…

Computation and Language · Computer Science 2024-07-10 Vittoria Dentella, Fritz Guenther, Elliot Murphy, Gary Marcus, Evelina Leivada