Related papers: Large Language Models can Learn Rules

200 papers

Since the advent of Large Language Models (LLMs), efforts have largely focused on improving their instruction-following and deductive reasoning abilities, leaving open the question of whether these models can truly discover new knowledge.…

Computation and Language · Computer Science 2025-10-31 Kaiyu He, Zhiyu Chen

Large language models (LLMs) often struggle with complex mathematical tasks and are prone to "hallucinating" incorrect answers due to their reliance on statistical patterns. This limitation is further amplified in average Small Language Models (SLMs) with…

Large language models (LLMs) increasingly help people solve problems, from debugging code to repairing machinery. This process requires generating plausible hypotheses from partial descriptions, then updating them as more information…

Machine Learning · Computer Science 2026-05-08 Hua-Dong Xiong

Large language models (LLMs) have shown incredible performance in completing various real-world tasks. The current paradigm of knowledge learning for LLMs is mainly based on learning from examples, in which LLMs learn the internal rule…

Computation and Language · Computer Science 2024-12-17 Wenkai Yang, Yankai Lin, Jie Zhou, Ji-Rong Wen

With the advent of Large Language Models (LLMs), generating rule-based data for real-world applications has become more accessible. Due to the inherent ambiguity of natural language and the complexity of rule sets, especially in long…

Computation and Language · Computer Science 2025-04-21 Teng Wang, Zhenqi He, Wing-Yin Yu, Xiaojin Fu, Xiongwei Han

Large language models (LLMs) demonstrate their promise in tackling complicated practical challenges by combining action-based policies with chain of thought (CoT) reasoning. Having high-quality prompts on hand, however, is vital to the…

Machine Learning · Computer Science 2024-03-01 Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang

This paper investigates the logical reasoning capabilities of large language models (LLMs). For a precisely defined yet tractable formulation, we choose the conceptually simple but technically complex task of constructing proofs in Boolean…

Machine Learning · Computer Science 2025-04-30 Yuan Xia, Akanksha Atrey, Fadoua Khmaissia, Kedar S. Namjoshi

This work presents a novel systematic methodology to analyse the capabilities and limitations of Large Language Models (LLMs) with feedback from a formal inference engine, on logic theory induction. The analysis is complexity-graded w.r.t.…

Computation and Language · Computer Science 2025-01-15 João Pedro Gandarela, Danilo S. Carvalho, André Freitas

Chain-of-Thought (CoT) prompting can dramatically improve the multi-step reasoning abilities of large language models (LLMs). CoT explicitly encourages the LLM to generate intermediate rationales for solving a problem, by providing a series…

Computation and Language · Computer Science 2023-06-02 Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu, Luke Zettlemoyer, Huan Sun

Large language models (LLMs) have shown remarkable reasoning capabilities given chain-of-thought prompts (examples with intermediate reasoning steps). Existing benchmarks measure reasoning ability indirectly, by evaluating accuracy on…

Computation and Language · Computer Science 2023-03-03 Abulhair Saparov, He He

Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small…

Computation and Language · Computer Science 2024-08-30 Emmy Liu, Graham Neubig, Jacob Andreas

Combining different forms of prompts with pre-trained large language models has yielded remarkable results on reasoning tasks (e.g. Chain-of-Thought prompting). However, along with testing on more complex reasoning, these methods also…

Computation and Language · Computer Science 2024-05-14 Yitian Li, Jidong Tian, Hao He, Yaohui Jin

Instruction tuning -- supervised fine-tuning using instruction-response pairs -- is a key step in making pre-trained large language models (LLMs) instructable. Meanwhile, LLMs perform multitask learning during their pre-training, acquiring…

Computation and Language · Computer Science 2025-09-16 Seokhyun An, Minji Kim, Hyounghun Kim

The emergence of Large Language Models (LLMs) has achieved tremendous success in the field of Natural Language Processing owing to diverse training paradigms that empower LLMs to effectively capture intricate linguistic patterns and…

Information Retrieval · Computer Science 2024-07-04 Lemei Zhang, Peng Liu, Yashar Deldjoo, Yong Zheng, Jon Atle Gulla

Modern large language models (LLMs) are capable of interpreting input strings as instructions, or prompts, and carrying out tasks based on them. Unlike traditional learners, LLMs cannot use back-propagation to obtain feedback, and condition…

Computation and Language · Computer Science 2026-03-17 Adrian de Wynter, Xun Wang, Qilong Gu, Si-Qing Chen

The capabilities and limitations of Large Language Models have been sketched out in great detail in recent years, providing an intriguing yet conflicting picture. On the one hand, LLMs demonstrate a general ability to solve problems. On the…

Large language models (LLMs) have been able to perform various forms of reasoning tasks in a wide range of scenarios, but are they truly engaging in task abstraction and rule-based reasoning beyond mere memorization? To answer this…

Machine Learning · Computer Science 2025-12-09 Guanyu Chen, Peiyang Wang, Yizhou Jiang, Yuqian Liu, Chujie Zhao, Ying Fang, Tianren Zhang, Feng Chen

While recent advancements in large language models (LLMs) bring us closer to achieving artificial general intelligence, the question persists: Do LLMs truly understand language, or do they merely mimic comprehension through pattern…

Computation and Language · Computer Science 2023-11-15 Houquan Zhou, Yang Hou, Zhenghua Li, Xuebin Wang, Zhefeng Wang, Xinyu Duan, Min Zhang

Large Language Models (LLMs) are known for their remarkable ability to generate synthesized 'knowledge', such as text documents, music, images, etc. However, there is a huge gap between LLMs' and humans' capabilities for understanding…

Computation and Language · Computer Science 2024-08-14 Vladimir Cherkassky, Eng Hock Lee

Although Large Language Models (LLMs) are showing impressive performance on a wide range of Natural Language Processing tasks, researchers have found that they still have limited ability to conduct induction. Recent works mainly adopt…

Computation and Language · Computer Science 2024-03-12 Wangtao Sun, Haotian Xu, Xuanqing Yu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu