Related papers: Eliciting Better Multilingual Structured Reasoning…

Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting

Large language models (LLMs) demonstrate impressive multilingual capability, but their performance varies substantially across different languages. In this work, we introduce a simple yet effective method, called cross-lingual-thought…

Computation and Language · Computer Science 2023-10-24 Haoyang Huang , Tianyi Tang , Dongdong Zhang , Wayne Xin Zhao , Ting Song , Yan Xia , Furu Wei

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Language Models of Code are Few-Shot Commonsense Learners

We address the general task of structured commonsense reasoning: given a natural language input, the goal is to generate a graph such as an event -- or a reasoning-graph. To employ large language models (LMs) for this task, existing…

Computation and Language · Computer Science 2022-12-07 Aman Madaan , Shuyan Zhou , Uri Alon , Yiming Yang , Graham Neubig

MindMerger: Efficient Boosting LLM Reasoning in non-English Languages

Reasoning capabilities are crucial for Large Language Models (LLMs), yet a notable gap exists between English and non-English languages. To bridge this disparity, some works fine-tune LLMs to relearn reasoning capabilities in non-English…

Computation and Language · Computer Science 2024-05-28 Zixian Huang , Wenhao Zhu , Gong Cheng , Lei Li , Fei Yuan

Not All Code Is Equal: A Data-Centric Study of Code Complexity and LLM Reasoning

Large Language Models (LLMs) increasingly exhibit strong reasoning abilities, often attributed to their capacity to generate chain-of-thought-style intermediate reasoning. Recent work suggests that exposure to code can further enhance these…

Machine Learning · Computer Science 2026-01-30 Lukas Twist , Shu Yang , Hanqi Yan , Jingzhi Gong , Di Wang , Helen Yannakoudakis , Jie M. Zhang

Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts

With the advent of Large Language Models (LLMs), generating rule-based data for real-world applications has become more accessible. Due to the inherent ambiguity of natural language and the complexity of rule sets, especially in long…

Computation and Language · Computer Science 2025-04-21 Teng Wang , Zhenqi He , Wing-Yin Yu , Xiaojin Fu , Xiongwei Han

Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks

Large language models (LLMs) have shown impressive promise in code generation, yet their progress remains limited by the shortage of large-scale datasets that are both diverse and well-aligned with human reasoning. Most existing resources…

Machine Learning · Computer Science 2025-10-28 Amal Abed , Ivan Lukic , Jörg K. H. Franke , Frank Hutter

The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights

Bridging the significant gap between large language model's English and non-English performance presents a great challenge. While some previous studies attempt to mitigate this gap with translated training data, the recently proposed…

Computation and Language · Computer Science 2024-11-07 Wenhao Zhu , Shujian Huang , Fei Yuan , Cheng Chen , Jiajun Chen , Alexandra Birch

On Code-Induced Reasoning in LLMs

Code data has been shown to enhance the reasoning capabilities of large language models (LLMs), but it remains unclear which aspects of code are most responsible. We investigate this question with a systematic, data-centric framework. We…

Computation and Language · Computer Science 2025-10-03 Abdul Waheed , Zhen Wu , Carolyn Rosé , Daphne Ippolito

When Models Reason in Your Language: Controlling Thinking Language Comes at the Cost of Accuracy

Recent Large Reasoning Models (LRMs) with thinking traces have shown strong performance on English reasoning tasks. However, their ability to think in other languages is less studied. This capability is as important as answer accuracy for…

Computation and Language · Computer Science 2025-12-12 Jirui Qi , Shan Chen , Zidi Xiong , Raquel Fernández , Danielle S. Bitterman , Arianna Bisazza

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations

Existing research predominantly focuses on developing powerful language learning models (LLMs) for mathematical reasoning within monolingual languages, with few explorations in preserving efficacy in a multilingual context. To bridge this…

Computation and Language · Computer Science 2024-10-17 Nuo Chen , Zinan Zheng , Ning Wu , Ming Gong , Dongmei Zhang , Jia Li

Demystifying Multilingual Chain-of-Thought in Process Reward Modeling

Large language models (LLMs) are designed to perform a wide range of tasks. To improve their ability to solve complex problems requiring multi-step reasoning, recent research leverages process reward modeling to provide fine-grained…

Computation and Language · Computer Science 2025-09-29 Weixuan Wang , Minghao Wu , Barry Haddow , Alexandra Birch

XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs

Large Language Models (LLMs) have achieved remarkable success in natural language tasks, yet understanding their reasoning processes remains a significant challenge. We address this by introducing XplainLLM, a dataset accompanying an…

Computation and Language · Computer Science 2024-09-24 Zichen Chen , Jianda Chen , Ambuj Singh , Misha Sra

mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models

Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a…

Computation and Language · Computer Science 2024-07-11 Huiyuan Lai , Malvina Nissim

SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment

Despite the significant improvements achieved by large language models (LLMs) in English reasoning tasks, these models continue to struggle with multilingual reasoning. Recent studies leverage a full-parameter and two-stage training…

Computation and Language · Computer Science 2025-01-08 Yuchun Fan , Yongyu Mu , Yilin Wang , Lei Huang , Junhao Ruan , Bei Li , Tong Xiao , Shujian Huang , Xiaocheng Feng , Jingbo Zhu

Steering Large Language Models between Code Execution and Textual Reasoning

While a lot of recent research focuses on enhancing the textual reasoning capabilities of Large Language Models (LLMs) by optimizing the multi-agent framework or reasoning chains, several benchmark tasks can be solved with 100\% success…

Computation and Language · Computer Science 2025-03-04 Yongchao Chen , Harsh Jhamtani , Srinagesh Sharma , Chuchu Fan , Chi Wang

What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features

Large Reasoning Models (LRMs) still exhibit large performance gaps between English and other languages, yet much current work assumes these gaps can be closed simply by making reasoning in every language resemble English reasoning. This…

Computation and Language · Computer Science 2026-04-07 Dayeon Ki , Kevin Duh , Marine Carpuat

Unlocking Multilingual Reasoning Capability of LLMs and LVLMs through Representation Engineering

Large Language Models (LLMs) and Large Vision-Language Models (LVLMs) demonstrate strong reasoning capabilities, yet their performance in English significantly outperforms that in low-resource languages, raising fairness concerns in…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Qiming Li , Xiaocheng Feng , Yixuan Ma , Zekai Ye , Ruihan Chen , Xiachong Feng , Bing Qin

ReasonXL: Shifting LLM Reasoning Language Without Sacrificing Performance

Despite advances in multilingual capabilities, most large language models (LLMs) remain English-centric in their training and, crucially, in their production of reasoning traces. Even when tasked with non-English problems, these models…

Computation and Language · Computer Science 2026-04-15 Daniil Gurgurov , Tom Röhr , Sebastian von Rohrscheidt , Josef van Genabith , Alexander Löser , Simon Ostermann

Crosslingual Reasoning through Test-Time Scaling

Reasoning capabilities of large language models are primarily studied for English, even when pretrained models are multilingual. In this work, we investigate to what extent English reasoning finetuning with long chain-of-thoughts (CoTs) can…

Computation and Language · Computer Science 2025-05-09 Zheng-Xin Yong , M. Farid Adilazuarda , Jonibek Mansurov , Ruochen Zhang , Niklas Muennighoff , Carsten Eickhoff , Genta Indra Winata , Julia Kreutzer , Stephen H. Bach , Alham Fikri Aji