Related papers: Reasoning Like Program Executors

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Algorithmic reasoning refers to the ability to understand the complex patterns behind the problem and decompose them into a sequence of reasoning steps towards the solution. Such nature of algorithmic reasoning makes it a challenge for…

Computation and Language · Computer Science 2024-04-04 Hyungjoo Chae , Yeonghyeon Kim , Seungone Kim , Kai Tzu-iunn Ong , Beong-woo Kwak , Moohyeon Kim , Seonghwan Kim , Taeyoon Kwon , Jiwan Chung , Youngjae Yu , Jinyoung Yeo

PoET: A generative model of protein families as sequences-of-sequences

Generative protein language models are a natural way to design new proteins with desired functions. However, current models are either difficult to direct to produce a protein from a specific family of interest, or must be trained on a…

Quantitative Methods · Quantitative Biology 2024-01-08 Timothy F. Truong , Tristan Bepler

Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments

Multi-step reasoning is essential for large language models (LLMs), yet multilingual performance remains challenging. While Chain-of-Thought (CoT) prompting improves reasoning, it struggles with non-English languages due to the entanglement…

Computation and Language · Computer Science 2025-05-23 Patomporn Payoungkhamdee , Pume Tuchinda , Jinheon Baek , Samuel Cahyawijaya , Can Udomcharoenchaikit , Potsawee Manakul , Peerat Limkonchotiwat , Ekapol Chuangsuwanich , Sarana Nutanong

POEM: Interactive Prompt Optimization for Enhancing Multimodal Reasoning of Large Language Models

Large language models (LLMs) have exhibited impressive abilities for multimodal content comprehension and reasoning with proper prompting in zero- or few-shot settings. Despite the proliferation of interactive systems developed to support…

Human-Computer Interaction · Computer Science 2024-10-01 Jianben He , Xingbo Wang , Shiyi Liu , Guande Wu , Claudio Silva , Huamin Qu

Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning. A recent step in this direction…

Neural and Evolutionary Computing · Computer Science 2020-04-14 Rui Wang , Joel Lehman , Aditya Rawal , Jiale Zhi , Yulun Li , Jeff Clune , Kenneth O. Stanley

Enhancing Chain of Thought Prompting in Large Language Models via Reasoning Patterns

Chain of Thought (CoT) prompting can encourage language models to engage in multi-step logical reasoning. The quality of the provided demonstrations significantly influences the success of downstream inference tasks. Current unsupervised…

Computation and Language · Computer Science 2025-05-27 Yufeng Zhang , Xuepeng Wang , Lingxiang Wu , Jinqiao Wang

Large Language Models Are Reasoning Teachers

Recent works have shown that chain-of-thought (CoT) prompting can elicit language models to solve complex reasoning tasks, step-by-step. However, prompt-based CoT methods are dependent on very large models such as GPT-3 175B which are…

Computation and Language · Computer Science 2023-06-14 Namgyu Ho , Laura Schmid , Se-Young Yun

Language Models as Inductive Reasoners

Inductive reasoning is a core component of human intelligence. In the past research of inductive reasoning within computer science, formal language is used as representations of knowledge (facts and rules, more specifically). However,…

Computation and Language · Computer Science 2024-02-06 Zonglin Yang , Li Dong , Xinya Du , Hao Cheng , Erik Cambria , Xiaodong Liu , Jianfeng Gao , Furu Wei

Evaluating Prompting and Execution-Based Methods for Deterministic Computation in LLMs

Large Language Models (LLMs) have demonstrated strong capabilities in natural language understanding and reasoning. However, their ability to perform exact, deterministic computation remains unclear. In this work, we systematically evaluate…

Artificial Intelligence · Computer Science 2026-05-08 Hongkun Yu

MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning

Program-of-Thought (PoT), which aims to use programming language instead of natural language as an intermediate step in reasoning, is an important way for LLMs to solve mathematical problems. Since different programming languages excel in…

Computation and Language · Computer Science 2024-12-18 Nianqi Li , Zujie Liang , Siyu Yuan , Jiaqing Liang , Feng Wei , Yanghua Xiao

Program of Equations Thoughts to Solve Algebra Word Problems

Solving algebraic word problems (AWPs) has recently emerged as an important natural language processing task. Recently, large language models (LLMs) have demonstrated powerful mathematical capabilities, and the Chain-of-Thought technique,…

Artificial Intelligence · Computer Science 2025-07-02 Yunze Lin

Pre-training Language Models for Comparative Reasoning

Comparative reasoning is a process of comparing objects, concepts, or entities to draw conclusions, which constitutes a fundamental cognitive ability. In this paper, we propose a novel framework to pre-train language models for enhancing…

Computation and Language · Computer Science 2023-11-29 Mengxia Yu , Zhihan Zhang , Wenhao Yu , Meng Jiang

Reasoning Capabilities of Large Language Models on Dynamic Tasks

Large language models excel on static benchmarks, but their ability as self-learning agents in dynamic environments remains unclear. We evaluate three prompting strategies: self-reflection, heuristic mutation, and planning across dynamic…

Artificial Intelligence · Computer Science 2025-08-12 Annie Wong , Thomas Bäck , Aske Plaat , Niki van Stein , Anna V. Kononova

Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts

Program of Thoughts (PoT) is an approach characterized by its executable intermediate steps, which ensure the accuracy of the logical calculations in the reasoning process. Currently, PoT primarily uses Python. However, relying solely on a…

Computation and Language · Computer Science 2024-11-19 Xianzhen Luo , Qingfu Zhu , Zhiming Zhang , Libo Qin , Xuanyu Zhang , Qing Yang , Dongliang Xu , Wanxiang Che

Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning

Large Language Models (LLMs) are primarily trained on high-resource natural languages, limiting their effectiveness in low-resource settings and in tasks requiring deep logical reasoning. This research introduces Rosetta-PL, a benchmark…

Computation and Language · Computer Science 2025-05-06 Shaun Baek , Shaun Esua-Mensah , Cyrus Tsui , Sejan Vigneswaralingam , Abdullah Alali , Michael Lu , Vasu Sharma , Sean O'Brien , Kevin Zhu

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

Recently, there has been significant progress in teaching language models to perform step-by-step reasoning to solve complex numerical reasoning tasks. Chain-of-thoughts prompting (CoT) is by far the state-of-art method for these tasks. CoT…

Computation and Language · Computer Science 2023-10-24 Wenhu Chen , Xueguang Ma , Xinyi Wang , William W. Cohen

Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation

As language models accelerate scientific research by automating hypothesis generation and implementation, a new bottleneck emerges: evaluating and filtering hundreds of AI-generated ideas without exhaustive experimentation. We ask whether…

Machine Learning · Computer Science 2026-05-22 Srujan P Mule , Aniketh Garikaparthi , Manasi Patwardhan

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

Large Language Models (LLMs) have been shown to achieve breakthrough performance on complex logical reasoning tasks. Nevertheless, most existing research focuses on employing formal language to guide LLMs to derive reliable reasoning paths,…

Computation and Language · Computer Science 2025-05-23 Jin Jiang , Jianing Wang , Yuchen Yan , Yang Liu , Jianhua Zhu , Mengdi Zhang , Xunliang Cai , Liangcai Gao

POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation

State-of-the-art visual generative AI tools hold immense potential to assist users in the early ideation stages of creative tasks -- offering the ability to generate (rather than search for) novel and unprecedented (instead of existing)…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Evans Xu Han , Alice Qian Zhang , Haiyi Zhu , Hong Shen , Paul Pu Liang , Jane Hsieh

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

To enhance the multi-step reasoning capabilities of large language models, researchers have extensively explored prompting methods, notably the Chain-of-Thought (CoT) method which explicitly elicits human-like rationales. However, they have…

Computation and Language · Computer Science 2024-04-19 Zhiheng Xi , Senjie Jin , Yuhao Zhou , Rui Zheng , Songyang Gao , Tao Gui , Qi Zhang , Xuanjing Huang