English
Related papers

Related papers: Easy Problems That LLMs Get Wrong

200 papers

Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them…

Large Language Models (LLMs) are transformative not only for daily activities but also for engineering tasks. However, current evaluations of LLMs in engineering exhibit two critical shortcomings: (i) the reliance on simplified use cases,…

Artificial Intelligence · Computer Science 2025-05-21 Rene Heesch , Sebastian Eilermann , Alexander Windmann , Alexander Diedrich , Philipp Rosenthal , Oliver Niggemann

Despite their outstanding performance, large language models (LLMs) suffer notorious flaws related to their preference for simple, surface-level textual relations over full semantic complexity of the problem. This proposal investigates a…

Computation and Language · Computer Science 2022-06-20 Michal Štefánik

This paper provides a primer on Large Language Models (LLMs) and identifies their strengths, limitations, applications and research directions. It is intended to be useful to those in academia and industry who are interested in gaining an…

Computation and Language · Computer Science 2024-12-09 Sandra Johnson , David Hyland-Wood

The planning ability of Large Language Models (LLMs) has garnered increasing attention in recent years due to their remarkable capacity for multi-step reasoning and their ability to generalize across a wide range of domains. While some…

Artificial Intelligence · Computer Science 2025-02-19 Mohamed Aghzal , Erion Plaku , Gregory J. Stein , Ziyu Yao

Large Language Models (LLMs) excel in generating personalized content and facilitating interactive dialogues, showcasing their remarkable aptitude for a myriad of applications. However, their capabilities in reasoning and providing…

Computation and Language · Computer Science 2024-02-16 Min Zhang , Sato Takumi , Jack Zhang , Jun Wang

The rapid rise in popularity of Large Language Models (LLMs) with emerging capabilities has spurred public curiosity to evaluate and compare different LLMs, leading many researchers to propose their own LLM benchmarks. Noticing preliminary…

Artificial Intelligence · Computer Science 2025-05-15 Timothy R. McIntosh , Teo Susnjak , Nalin Arachchilage , Tong Liu , Paul Watters , Malka N. Halgamuge

Large language models (LLMs) regularly demonstrate new and impressive performance on a wide range of language, knowledge, and reasoning benchmarks. Such rapid progress has led many commentators to argue that LLM general cognitive…

Computation and Language · Computer Science 2025-02-21 James Fodor

Large language models (LLMs) have achieved remarkable performance in language understanding and generation tasks by leveraging vast amounts of online texts. Unlike conventional models, LLMs can adapt to new domains through prompt…

Artificial Intelligence · Computer Science 2024-06-18 Ming Cheung

Large language models (LLMs) have recently been applied to forecasting tasks, with some works claiming these systems match or exceed human performance. In this paper, we argue that, as a community, we should be careful about such…

Machine Learning · Computer Science 2025-06-03 Daniel Paleka , Shashwat Goel , Jonas Geiping , Florian Tramèr

The rapid advancement of Large Language Models (LLMs) has opened new possibilities in Multi-Robot Systems (MRS), enabling enhanced communication, task allocation and planning, and human-robot interaction. Unlike traditional single-robot and…

Robotics · Computer Science 2026-05-05 Peihan Li , Zijian An , Shams Abrar , Lifeng Zhou

The development and evaluation of Large Language Models (LLMs) has primarily focused on their task-solving capabilities, with recent models even surpassing human performance in some areas. However, this focus often neglects whether…

Computation and Language · Computer Science 2025-07-29 Yanzhu Guo , Guokan Shang , Chloé Clavel

Large Language Models (LLMs) are advancing at an amazing speed and have become indispensable across academia, industry, and daily applications. To keep pace with the status quo, this survey probes the core challenges that the rise of LLMs…

Problem-solving has been a fundamental driver of human progress in numerous domains. With advancements in artificial intelligence, Large Language Models (LLMs) have emerged as powerful tools capable of tackling complex problems across…

Machine Learning · Computer Science 2025-05-07 Da Zheng , Lun Du , Junwei Su , Yuchen Tian , Yuqi Zhu , Jintian Zhang , Lanning Wei , Ningyu Zhang , Huajun Chen

Large language models (LLMs) have shown strong performance on mathematical reasoning under well-defined conditions. However, real-world engineering problems involve uncertainty, context, and open-ended settings that extend beyond symbolic…

Artificial Intelligence · Computer Science 2026-05-05 Xiyuan Zhou , Xinlei Wang , Yirui He , Yang Wu , Ruixi Zou , Yuheng Cheng , Yulu Xie , Wenxuan Liu , Huan Zhao , Yan Xu , Jinjin Gu , Junhua Zhao

Intrigued by the claims of emergent reasoning capabilities in LLMs trained on general web corpora, in this paper, we set out to investigate their planning capabilities. We aim to evaluate (1) how good LLMs are by themselves in generating…

Artificial Intelligence · Computer Science 2023-02-15 Karthik Valmeekam , Sarath Sreedharan , Matthew Marquez , Alberto Olmo , Subbarao Kambhampati

This paper investigates the mathematical reasoning capabilities of large language models (LLMs) using 50 newly constructed high-school-level word problems. Unlike prior studies that focus solely on answer correctness, we rigorously analyze…

Artificial Intelligence · Computer Science 2025-02-24 Johan Boye , Birger Moell

While state-of-the-art large language models (LLMs) demonstrate advanced reasoning capabilities-achieving remarkable performance on challenging competitive math and coding benchmarks-they also frequently fail on tasks that are easy for…

Computation and Language · Computer Science 2025-07-11 Alan Malek , Jiawei Ge , Nevena Lazic , Chi Jin , András György , Csaba Szepesvári

Large language models (LLM) have revolutionized the processing of natural language. Although first benchmarks of the process modeling abilities of LLM are promising, it is currently under debate to what extent an LLM can generate good…

Computation and Language · Computer Science 2025-03-19 Peter Fettke , Constantin Houy

Large language models (LLMs) demonstrate considerable potential in various natural language tasks but face significant challenges in mathematical reasoning, particularly in executing precise, multi-step logic. However, current evaluation…

Computation and Language · Computer Science 2025-05-22 Tiasa Singha Roy , Aditeya Baral , Ayush Rajesh Jhaveri , Yusuf Baig
‹ Prev 1 2 3 10 Next ›