Related papers: o1-Coder: an o1 Replication for Coding

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Currently OpenAI o1 sparks a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding -- which are…

Computation and Language · Computer Science 2024-11-26 Yu Zhao , Huifeng Yin , Bo Zeng , Hao Wang , Tianqi Shi , Chenyang Lyu , Longyue Wang , Weihua Luo , Kaifu Zhang

A Case Study of Web App Coding with OpenAI Reasoning Models

This paper presents a case study of coding tasks by the latest reasoning models of OpenAI, i.e. o1-preview and o1-mini, in comparison with other frontier models. The o1 models deliver SOTA results for WebApp1K, a single-task benchmark. To…

Software Engineering · Computer Science 2024-09-24 Yi Cui

Enhancing LLM Reasoning with Reward-guided Tree Search

Recently, test-time scaling has garnered significant attention from the research community, largely due to the substantial advancements of the o1 model released by OpenAI. By allocating more computational resources during the inference…

Computation and Language · Computer Science 2025-01-03 Jinhao Jiang , Zhipeng Chen , Yingqian Min , Jie Chen , Xiaoxue Cheng , Jiapeng Wang , Yiru Tang , Haoxiang Sun , Jia Deng , Wayne Xin Zhao , Zheng Liu , Dong Yan , Jian Xie , Zhongyuan Wang , Ji-Rong Wen

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Recently, slow-thinking reasoning systems, such as o1, have demonstrated remarkable capabilities in solving complex reasoning tasks. These systems typically engage in an extended thinking process before responding to a query, allowing them…

Artificial Intelligence · Computer Science 2024-12-24 Yingqian Min , Zhipeng Chen , Jinhao Jiang , Jie Chen , Jia Deng , Yiwen Hu , Yiru Tang , Jiapeng Wang , Xiaoxue Cheng , Huatong Song , Wayne Xin Zhao , Zheng Liu , Zhongyuan Wang , Ji-Rong Wen

Graph-O1: Monte Carlo Tree Search with Reinforcement Learning for Text-Attributed Graph Reasoning

Text-attributed graphs, where nodes and edges contain rich textual information, are widely used across diverse domains. A central challenge in this setting is question answering, which requires jointly leveraging unstructured…

Computation and Language · Computer Science 2025-12-23 Lihui Liu

REL: Working out is all you need

Recent developments, particularly OpenAI's O1 model, have demonstrated the remarkable potential of Large Language Models (LLMs) for complex reasoning tasks. Through analysis of O1's outputs and provided sample Chain-of-Thought (CoT)…

Artificial Intelligence · Computer Science 2024-12-09 Toby Simonds , Jey Han Lau , Chaithanya Bandi

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Large language models (LLMs) for code have become indispensable in various domains, including code generation, reasoning tasks and agent systems. While open-access code LLMs are increasingly approaching the performance levels of proprietary…

Computation and Language · Computer Science 2025-03-21 Siming Huang , Tianhao Cheng , J. K. Liu , Jiaran Hao , Liuyihan Song , Yang Xu , J. Yang , Jiaheng Liu , Chenchen Zhang , Linzheng Chai , Ruifeng Yuan , Zhaoxiang Zhang , Jie Fu , Qian Liu , Ge Zhang , Zili Wang , Yuan Qi , Yinghui Xu , Wei Chu

CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning

To accelerate software development, much research has been performed to help people understand and reuse the huge amount of available code resources. Two important tasks have been widely studied: code retrieval, which aims to retrieve code…

Software Engineering · Computer Science 2019-04-02 Ziyu Yao , Jayavardhan Reddy Peddamail , Huan Sun

RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation

As text and code resources have expanded, large-scale pre-trained models have shown promising capabilities in code generation tasks, typically employing supervised fine-tuning with problem statement-program pairs. However, increasing model…

Computation and Language · Computer Science 2025-04-10 Nathanaël Beau , Benoît Crabbé

R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning

Practical guidance on training Large Language Models (LLMs) to leverage Code Interpreter across diverse tasks remains lacking. We present R1-Code-Interpreter, an extension of a text-only LLM trained via multi-turn supervised fine-tuning…

Artificial Intelligence · Computer Science 2026-03-05 Yongchao Chen , Yueying Liu , Junwei Zhou , Yilun Hao , Jingquan Wang , Yang Zhang , Na Li , Chuchu Fan

O1 Replication Journey: A Strategic Progress Report -- Part 1

This paper introduces a pioneering approach to artificial intelligence research, embodied in our O1 Replication Journey. In response to the announcement of OpenAI's groundbreaking O1 model, we embark on a transparent, real-time exploration…

Artificial Intelligence · Computer Science 2024-10-28 Yiwei Qin , Xuefeng Li , Haoyang Zou , Yixiu Liu , Shijie Xia , Zhen Huang , Yixin Ye , Weizhe Yuan , Hector Liu , Yuanzhi Li , Pengfei Liu

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Enabling Large Language Models (LLMs) to handle a wider range of complex tasks (e.g., coding, math) has drawn great attention from many researchers. As LLMs continue to evolve, merely increasing the number of model parameters yields…

Computation and Language · Computer Science 2024-10-24 Siwei Wu , Zhongyuan Peng , Xinrun Du , Tuney Zheng , Minghao Liu , Jialong Wu , Jiachen Ma , Yizhi Li , Jian Yang , Wangchunshu Zhou , Qunshu Lin , Junbo Zhao , Zhaoxiang Zhang , Wenhao Huang , Ge Zhang , Chenghua Lin , J. H. Liu

Seed-CTS: Unleashing the Power of Tree Search for Superior Performance in Competitive Coding Tasks

Competition-level code generation tasks pose significant challenges for current state-of-the-art large language models (LLMs). For example, on the LiveCodeBench-Hard dataset, models such as O1-Mini and O1-Preview achieve pass@1 rates of…

Artificial Intelligence · Computer Science 2024-12-31 Hao Wang , Boyi Liu , Yufeng Zhang , Jie Chen

Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning

Cell type annotation is a key task in analyzing the heterogeneity of single-cell RNA sequencing data. Although recent foundation models automate this process, they typically annotate cells independently, without considering batch-level…

Computation and Language · Computer Science 2025-06-04 Yin Fang , Qiao Jin , Guangzhi Xiong , Bowen Jin , Xianrui Zhong , Siru Ouyang , Aidong Zhang , Jiawei Han , Zhiyong Lu

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Tool learning has emerged as a crucial capability for large language models (LLMs) to solve complex real-world tasks through interaction with external tools. Existing approaches face significant challenges, including reliance on…

Computation and Language · Computer Science 2025-06-02 Hanxing Ding , Shuchang Tao , Liang Pang , Zihao Wei , Jinyang Gao , Bolin Ding , Huawei Shen , Xueqi Cheng

OCoR: An Overlapping-Aware Code Retriever

Code retrieval helps developers reuse the code snippet in the open-source projects. Given a natural language description, code retrieval aims to search for the most relevant code among a set of code. Existing state-of-the-art approaches…

Computation and Language · Computer Science 2020-08-21 Qihao Zhu , Zeyu Sun , Xiran Liang , Yingfei Xiong , Lu Zhang

Competitive Programming with Large Reasoning Models

We show that reinforcement learning applied to large language models (LLMs) significantly boosts performance on complex coding and reasoning tasks. Additionally, we compare two general-purpose reasoning models - OpenAI o1 and an early…

Machine Learning · Computer Science 2025-02-20 OpenAI , Ahmed El-Kishky , Alexander Wei , Andre Saraiva , Borys Minaiev , Daniel Selsam , David Dohan , Francis Song , Hunter Lightman , Ignasi Clavera , Jakub Pachocki , Jerry Tworek , Lorenz Kuhn , Lukasz Kaiser , Mark Chen , Max Schwarzer , Mostafa Rohaninejad , Nat McAleese , o3 contributors , Oleg Mürk , Rhythm Garg , Rui Shu , Szymon Sidor , Vineet Kosaraju , Wenda Zhou

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

In this technical report, we introduce OpenR, an open-source framework designed to integrate key components for enhancing the reasoning capabilities of large language models (LLMs). OpenR unifies data acquisition, reinforcement learning…

Artificial Intelligence · Computer Science 2024-10-15 Jun Wang , Meng Fang , Ziyu Wan , Muning Wen , Jiachen Zhu , Anjie Liu , Ziqin Gong , Yan Song , Lei Chen , Lionel M. Ni , Linyi Yang , Ying Wen , Weinan Zhang

O1 Embedder: Let Retrievers Think Before Action

The growing power of large language models (LLMs) has revolutionized how people access and utilize information. Notably, the LLMs excel at performing fine-grained data representation, which facilitates precise retrieval of information. They…

Computation and Language · Computer Science 2025-02-13 Ruiran Yan , Zheng Liu , Defu Lian

Large Language Models as Code Executors: An Exploratory Study

The capabilities of Large Language Models (LLMs) have significantly evolved, extending from natural language processing to complex tasks like code understanding and generation. We expand the scope of LLMs' capabilities to a broader context,…

Computation and Language · Computer Science 2024-10-11 Chenyang Lyu , Lecheng Yan , Rui Xing , Wenxi Li , Younes Samih , Tianbo Ji , Longyue Wang