Related papers: Competitive Programming with Large Reasoning Model…

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Enabling Large Language Models (LLMs) to handle a wider range of complex tasks (e.g., coding, math) has drawn great attention from many researchers. As LLMs continue to evolve, merely increasing the number of model parameters yields…

Computation and Language · Computer Science 2024-10-24 Siwei Wu , Zhongyuan Peng , Xinrun Du , Tuney Zheng , Minghao Liu , Jialong Wu , Jiachen Ma , Yizhi Li , Jian Yang , Wangchunshu Zhou , Qunshu Lin , Junbo Zhao , Zhaoxiang Zhang , Wenhao Huang , Ge Zhang , Chenghua Lin , J. H. Liu

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Language has long been conceived as an essential tool for human reasoning. The breakthrough of Large Language Models (LLMs) has sparked significant research interest in leveraging these models to tackle complex reasoning tasks. Researchers…

Artificial Intelligence · Computer Science 2025-01-24 Fengli Xu , Qianyue Hao , Zefang Zong , Jingwei Wang , Yunke Zhang , Jingyi Wang , Xiaochong Lan , Jiahui Gong , Tianjian Ouyang , Fanjin Meng , Chenyang Shao , Yuwei Yan , Qinglong Yang , Yiwen Song , Sijian Ren , Xinyuan Hu , Yu Li , Jie Feng , Chen Gao , Yong Li

On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability

Recent advancements in Large Language Models (LLMs) have showcased their ability to perform complex reasoning tasks, but their effectiveness in planning remains underexplored. In this study, we evaluate the planning capabilities of OpenAI's…

Artificial Intelligence · Computer Science 2024-10-15 Kevin Wang , Junbo Li , Neel P. Bhatt , Yihan Xi , Qiang Liu , Ufuk Topcu , Zhangyang Wang

Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training

Recent advancements in reasoning-focused language models such as OpenAI's O1 and DeepSeek-R1 have shown that scaling test-time computation-through chain-of-thought reasoning and iterative exploration-can yield substantial improvements on…

Machine Learning · Computer Science 2025-07-18 Mingjie Liu , Shizhe Diao , Jian Hu , Ximing Lu , Xin Dong , Hao Zhang , Alexander Bukharin , Shaokun Zhang , Jiaqi Zeng , Makesh Narsimhan Sreedhar , Gerald Shen , David Mosallanezhad , Di Zhang , Jonas Yang , June Yang , Oleksii Kuchaiev , Guilin Liu , Zhiding Yu , Pavlo Molchanov , Yejin Choi , Jan Kautz , Yi Dong

Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models

Competitive programming has become a rigorous benchmark for evaluating the reasoning and problem-solving capabilities of large language models (LLMs). The International Olympiad in Informatics (IOI) stands out as one of the most prestigious…

Machine Learning · Computer Science 2026-04-16 Mehrzad Samadi , Aleksander Ficek , Sean Narenthiran , Siddhartha Jain , Wasi Uddin Ahmad , Somshubra Majumdar , Vahid Noroozi , Boris Ginsburg

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Currently OpenAI o1 sparks a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding -- which are…

Computation and Language · Computer Science 2024-11-26 Yu Zhao , Huifeng Yin , Bo Zeng , Hao Wang , Tianqi Shi , Chenyang Lyu , Longyue Wang , Weihua Luo , Kaifu Zhang

Large Linguistic Models: Investigating LLMs' metalinguistic abilities

The performance of large language models (LLMs) has recently improved to the point where models can perform well on many language tasks. We show here that--for the first time--the models can also generate valid metalinguistic analyses of…

Computation and Language · Computer Science 2025-07-14 Gašper Beguš , Maksymilian Dąbkowski , Ryan Rhodes

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

OpenAI o1 represents a significant milestone in Artificial Inteiligence, which achieves expert-level performances on many challanging tasks that require strong reasoning ability.OpenAI has claimed that the main techinique behinds o1 is the…

Artificial Intelligence · Computer Science 2024-12-19 Zhiyuan Zeng , Qinyuan Cheng , Zhangyue Yin , Bo Wang , Shimin Li , Yunhua Zhou , Qipeng Guo , Xuanjing Huang , Xipeng Qiu

Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1

The ability to plan a course of action that achieves a desired state of affairs has long been considered a core competence of intelligent agents and has been an integral part of AI research since its inception. With the advent of large…

Artificial Intelligence · Computer Science 2024-10-04 Karthik Valmeekam , Kaya Stechly , Atharva Gundawar , Subbarao Kambhampati

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

In this technical report, we introduce OpenR, an open-source framework designed to integrate key components for enhancing the reasoning capabilities of large language models (LLMs). OpenR unifies data acquisition, reinforcement learning…

Artificial Intelligence · Computer Science 2024-10-15 Jun Wang , Meng Fang , Ziyu Wan , Muning Wen , Jiachen Zhu , Anjie Liu , Ziqin Gong , Yan Song , Lei Chen , Lionel M. Ni , Linyi Yang , Ying Wen , Weinan Zhang

Enhancing LLM Reasoning with Reward-guided Tree Search

Recently, test-time scaling has garnered significant attention from the research community, largely due to the substantial advancements of the o1 model released by OpenAI. By allocating more computational resources during the inference…

Computation and Language · Computer Science 2025-01-03 Jinhao Jiang , Zhipeng Chen , Yingqian Min , Jie Chen , Xiaoxue Cheng , Jiapeng Wang , Yiru Tang , Haoxiang Sun , Jia Deng , Wayne Xin Zhao , Zheng Liu , Dong Yan , Jian Xie , Zhongyuan Wang , Ji-Rong Wen

Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities

Recent advancements in Large Reasoning Models (LRMs), such as OpenAI's o1/o3 and DeepSeek-R1, have demonstrated remarkable performance in specialized reasoning tasks through human-like deliberative thinking and long chain-of-thought…

Artificial Intelligence · Computer Science 2025-11-20 Weixiang Zhao , Xingyu Sui , Jiahe Guo , Yulin Hu , Yang Deng , Yanyan Zhao , Xuda Zhi , Yongbo Huang , Hao He , Wanxiang Che , Ting Liu , Bing Qin

A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1

OpenAI o1 has shown that applying reinforcement learning to integrate reasoning steps directly during inference can significantly improve a model's reasoning capabilities. This result is exciting as the field transitions from the…

Artificial Intelligence · Computer Science 2025-02-18 Jun Wang

REL: Working out is all you need

Recent developments, particularly OpenAI's O1 model, have demonstrated the remarkable potential of Large Language Models (LLMs) for complex reasoning tasks. Through analysis of O1's outputs and provided sample Chain-of-Thought (CoT)…

Artificial Intelligence · Computer Science 2024-12-09 Toby Simonds , Jey Han Lau , Chaithanya Bandi

Mathematical Computation and Reasoning Errors by Large Language Models

Large Language Models (LLMs) are increasingly utilized in AI-driven educational instruction and assessment, particularly within mathematics education. The capability of LLMs to generate accurate answers and detailed solutions for math…

Artificial Intelligence · Computer Science 2025-08-15 Liang Zhang , Edith Aurora Graf

Logical Reasoning in Large Language Models: A Survey

With the emergence of advanced reasoning models like OpenAI o3 and DeepSeek-R1, large language models (LLMs) have demonstrated remarkable reasoning capabilities. However, their ability to perform rigorous logical reasoning remains an open…

Artificial Intelligence · Computer Science 2025-02-14 Hanmeng Liu , Zhizhang Fu , Mengru Ding , Ruoxi Ning , Chaoli Zhang , Xiaozhang Liu , Yue Zhang

Towards Concise and Adaptive Thinking in Large Reasoning Models: A Survey

Large reasoning models (LRMs) like OpenAI o1 and DeepSeek R1 have demonstrated impressive performance on complex reasoning tasks like mathematics and programming with long Chain-of-Thought (CoT) reasoning sequences (slow-thinking), compared…

Artificial Intelligence · Computer Science 2025-07-15 Jason Zhu , Hongyu Li

Large Reasoning Models are not thinking straight: on the unreliability of thinking trajectories

Large Language Models (LLMs) trained via Reinforcement Learning (RL) have recently achieved impressive results on reasoning benchmarks. Yet, growing evidence shows that these models often generate longer but ineffective chains of thought…

Machine Learning · Computer Science 2025-07-02 Jhouben Cuesta-Ramirez , Samuel Beaussant , Mehdi Mounsif

Enhancing Math Reasoning in Small-sized LLMs via Preview Difficulty-Aware Intervention

Reinforcement learning scaling enhances the reasoning capabilities of large language models, with reinforcement learning serving as the key technique to draw out complex reasoning. However, key technical details of state-of-the-art…

Machine Learning · Computer Science 2025-08-05 Xinhan Di , JoyJiaoW

Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights

We examine the reasoning and planning capabilities of large language models (LLMs) in solving complex tasks. Recent advances in inference-time techniques demonstrate the potential to enhance LLM reasoning without additional training by…

Artificial Intelligence · Computer Science 2025-02-19 Shubham Parashar , Blake Olson , Sambhav Khurana , Eric Li , Hongyi Ling , James Caverlee , Shuiwang Ji