Related papers: Competitive Programming with Large Reasoning Model…

200 papers

Enabling Large Language Models (LLMs) to handle a wider range of complex tasks (e.g., coding, math) has drawn great attention from many researchers. As LLMs continue to evolve, merely increasing the number of model parameters yields…

Language has long been conceived as an essential tool for human reasoning. The breakthrough of Large Language Models (LLMs) has sparked significant research interest in leveraging these models to tackle complex reasoning tasks. Researchers…

Recent advancements in Large Language Models (LLMs) have showcased their ability to perform complex reasoning tasks, but their effectiveness in planning remains underexplored. In this study, we evaluate the planning capabilities of OpenAI's…

Artificial Intelligence · Computer Science 2024-10-15 Kevin Wang , Junbo Li , Neel P. Bhatt , Yihan Xi , Qiang Liu , Ufuk Topcu , Zhangyang Wang

Recent advancements in reasoning-focused language models such as OpenAI's O1 and DeepSeek-R1 have shown that scaling test-time computation, through chain-of-thought reasoning and iterative exploration, can yield substantial improvements on…

Competitive programming has become a rigorous benchmark for evaluating the reasoning and problem-solving capabilities of large language models (LLMs). The International Olympiad in Informatics (IOI) stands out as one of the most prestigious…

Currently OpenAI o1 sparks a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding -- which are…

Computation and Language · Computer Science 2024-11-26 Yu Zhao , Huifeng Yin , Bo Zeng , Hao Wang , Tianqi Shi , Chenyang Lyu , Longyue Wang , Weihua Luo , Kaifu Zhang

The performance of large language models (LLMs) has recently improved to the point where models can perform well on many language tasks. We show here that--for the first time--the models can also generate valid metalinguistic analyses of…

Computation and Language · Computer Science 2025-07-14 Gašper Beguš , Maksymilian Dąbkowski , Ryan Rhodes

OpenAI o1 represents a significant milestone in Artificial Intelligence, which achieves expert-level performance on many challenging tasks that require strong reasoning ability. OpenAI has claimed that the main technique behind o1 is the…

Artificial Intelligence · Computer Science 2024-12-19 Zhiyuan Zeng , Qinyuan Cheng , Zhangyue Yin , Bo Wang , Shimin Li , Yunhua Zhou , Qipeng Guo , Xuanjing Huang , Xipeng Qiu

The ability to plan a course of action that achieves a desired state of affairs has long been considered a core competence of intelligent agents and has been an integral part of AI research since its inception. With the advent of large…

Artificial Intelligence · Computer Science 2024-10-04 Karthik Valmeekam , Kaya Stechly , Atharva Gundawar , Subbarao Kambhampati

In this technical report, we introduce OpenR, an open-source framework designed to integrate key components for enhancing the reasoning capabilities of large language models (LLMs). OpenR unifies data acquisition, reinforcement learning…

Artificial Intelligence · Computer Science 2024-10-15 Jun Wang , Meng Fang , Ziyu Wan , Muning Wen , Jiachen Zhu , Anjie Liu , Ziqin Gong , Yan Song , Lei Chen , Lionel M. Ni , Linyi Yang , Ying Wen , Weinan Zhang

Recently, test-time scaling has garnered significant attention from the research community, largely due to the substantial advancements of the o1 model released by OpenAI. By allocating more computational resources during the inference…

Recent advancements in Large Reasoning Models (LRMs), such as OpenAI's o1/o3 and DeepSeek-R1, have demonstrated remarkable performance in specialized reasoning tasks through human-like deliberative thinking and long chain-of-thought…

Artificial Intelligence · Computer Science 2025-11-20 Weixiang Zhao , Xingyu Sui , Jiahe Guo , Yulin Hu , Yang Deng , Yanyan Zhao , Xuda Zhi , Yongbo Huang , Hao He , Wanxiang Che , Ting Liu , Bing Qin

OpenAI o1 has shown that applying reinforcement learning to integrate reasoning steps directly during inference can significantly improve a model's reasoning capabilities. This result is exciting as the field transitions from the…

Artificial Intelligence · Computer Science 2025-02-18 Jun Wang

Recent developments, particularly OpenAI's O1 model, have demonstrated the remarkable potential of Large Language Models (LLMs) for complex reasoning tasks. Through analysis of O1's outputs and provided sample Chain-of-Thought (CoT)…

Artificial Intelligence · Computer Science 2024-12-09 Toby Simonds , Jey Han Lau , Chaithanya Bandi

Large Language Models (LLMs) are increasingly utilized in AI-driven educational instruction and assessment, particularly within mathematics education. The capability of LLMs to generate accurate answers and detailed solutions for math…

Artificial Intelligence · Computer Science 2025-08-15 Liang Zhang , Edith Aurora Graf

With the emergence of advanced reasoning models like OpenAI o3 and DeepSeek-R1, large language models (LLMs) have demonstrated remarkable reasoning capabilities. However, their ability to perform rigorous logical reasoning remains an open…

Artificial Intelligence · Computer Science 2025-02-14 Hanmeng Liu , Zhizhang Fu , Mengru Ding , Ruoxi Ning , Chaoli Zhang , Xiaozhang Liu , Yue Zhang

Large reasoning models (LRMs) like OpenAI o1 and DeepSeek R1 have demonstrated impressive performance on complex reasoning tasks like mathematics and programming with long Chain-of-Thought (CoT) reasoning sequences (slow-thinking), compared…

Artificial Intelligence · Computer Science 2025-07-15 Jason Zhu , Hongyu Li

Large Language Models (LLMs) trained via Reinforcement Learning (RL) have recently achieved impressive results on reasoning benchmarks. Yet, growing evidence shows that these models often generate longer but ineffective chains of thought…

Machine Learning · Computer Science 2025-07-02 Jhouben Cuesta-Ramirez , Samuel Beaussant , Mehdi Mounsif

Reinforcement learning scaling enhances the reasoning capabilities of large language models, with reinforcement learning serving as the key technique to draw out complex reasoning. However, key technical details of state-of-the-art…

Machine Learning · Computer Science 2025-08-05 Xinhan Di , JoyJiaoW

We examine the reasoning and planning capabilities of large language models (LLMs) in solving complex tasks. Recent advances in inference-time techniques demonstrate the potential to enhance LLM reasoning without additional training by…

Artificial Intelligence · Computer Science 2025-02-19 Shubham Parashar , Blake Olson , Sambhav Khurana , Eric Li , Hongyi Ling , James Caverlee , Shuiwang Ji