Related papers: KAT-Coder Technical Report

KAT-Coder-V2 Technical Report

We present KAT-Coder-V2, an agentic coding model developed by the KwaiKAT team at Kuaishou. KAT-Coder-V2 adopts a "Specialize-then-Unify" paradigm that decomposes agentic coding into five expert domains - SWE, WebCoding, Terminal,…

Computation and Language · Computer Science 2026-03-31 Fengxiang Li , Han Zhang , Haoyang Huang , Jinghui Wang , Jinhua Hao , Kun Yuan , Mengtong Li , Minglei Zhang , Pengcheng Xu , Wenhao Zhuang , Yizhen Shao , Zongxian Feng , Can Tang , Chao Wang , Chengxiao Tong , Fan Yang , Gang Xiong , Haixuan Gao , Han Gao , Hao Wang , Haochen Liu , Hongliang Sun , Jiabao Li , Jingwen Chang , Jun Du , Junyi Peng , Leizhen Cui , Meimei Jing , Mingqi Wu , Shangpeng Yan , Shaotong Qi , Suzhe Xu , Wenxuan Zhao , Xianda Sun , Xuan Xie , Yanbo Wang , Yao Xia , Yinghan Cui , Yingpeng Chen , Yong Wang , Yuze Shi , Zhiwei Shen , Ziyu Wang , Ming Sun , Lin Ye , Bin Chen

IQuest-Coder-V1 Technical Report

In this report, we introduce the IQuest-Coder-V1 series-(7B/14B/40B/40B-Loop), a new family of code large language models (LLMs). Moving beyond static code representations, we propose the code-flow multi-stage training paradigm, which…

Artificial Intelligence · Computer Science 2026-03-18 Jian Yang , Wei Zhang , Shawn Guo , Zhengmao Ye , Lin Jing , Shark Liu , Yizhi Li , Jiajun Wu , Cening Liu , X. Ma , Yuyang Song , Siwei Wu , Yuwen Li , L. Liao , T. Zheng , Ziling Huang , Zelong Huang , Che Liu , Yan Xing , Renyuan Li , Qingsong Cai , Hanxu Yan , Siyue Wang , Shikai Li , Jason Klein Liu , An Huang , Yongsheng Kang , Jinxing Zhang , Chuan Hao , Haowen Wang , Weicheng Gu , Ran Tao , Mingjie Tang , Peihao Wu , Jianzhou Wang , Xianglong Liu , Weifeng Lv , Bryan Dai

Dual-Phase LLM Reasoning: Self-Evolved Mathematical Frameworks

In recent years, large language models (LLMs) have demonstrated significant potential in complex reasoning tasks like mathematical problem-solving. However, existing research predominantly relies on reinforcement learning (RL) frameworks…

Machine Learning · Computer Science 2026-01-12 ShaoZhen Liu , Xinting Huang , Houwen Peng , Xin Chen , Xinyang Song , Qi Li , Zhenan Sun

Scaling Agents via Continual Pre-training

Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches building upon general-purpose foundation models…

Computation and Language · Computer Science 2025-09-17 Liangcai Su , Zhen Zhang , Guangyu Li , Zhuo Chen , Chenxi Wang , Maojia Song , Xinyu Wang , Kuan Li , Jialong Wu , Xuanzhong Chen , Zile Qiao , Zhongwang Zhang , Huifeng Yin , Shihao Cai , Runnan Fang , Zhengwei Tao , Wenbiao Yin , Chenxiong Qian , Yong Jiang , Pengjun Xie , Fei Huang , Jingren Zhou

daVinci-Dev: Agent-native Mid-training for Software Engineering

Recently, the frontier of Large Language Model (LLM) capabilities has shifted from single-turn code generation to agentic software engineering-a paradigm where models autonomously navigate, edit, and test complex repositories. While…

Software Engineering · Computer Science 2026-01-28 Ji Zeng , Dayuan Fu , Tiantian Mi , Yumin Zhuang , Yaxing Huang , Xuefeng Li , Lyumanshan Ye , Muhang Xie , Qishuo Hua , Zhen Huang , Mohan Jiang , Hanning Wang , Jifan Lin , Yang Xiao , Jie Sun , Yunze Wu , Pengfei Liu

AdaCoder: An Adaptive Planning and Multi-Agent Framework for Function-Level Code Generation

Recently, researchers have proposed many multi-agent frameworks for function-level code generation, which aim to improve software development productivity by automatically generating function-level source code based on task descriptions. A…

Software Engineering · Computer Science 2025-04-08 Yueheng Zhu , Chao Liu , Xuan He , Xiaoxue Ren , Zhongxin Liu , Ruwei Pan , Hongyu Zhang

EFT-CoT: A Multi-Agent Chain-of-Thought Framework for Emotion-Focused Therapy

The use of large language models (LLMs) for Mental Health Question Answering (MHQA) offers a promising way to alleviate shortages in mental health resources. However, prior work has mainly relied on Cognitive Behavioral Therapy (CBT) and…

Computation and Language · Computer Science 2026-03-10 Lanqing Du , Yunong Li , YuJie Long , Shihong Chen

AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation

The advancement of natural language processing (NLP) has been significantly boosted by the development of transformer-based large language models (LLMs). These models have revolutionized NLP tasks, particularly in code generation, aiding…

Computation and Language · Computer Science 2024-05-27 Dong Huang , Jie M. Zhang , Michael Luck , Qingwen Bu , Yuhao Qing , Heming Cui

KAT-V1: Kwai-AutoThink Technical Report

We present Kwaipilot-AutoThink (KAT), an open-source 40B large language model developed to address the overthinking problem in reasoning-intensive tasks, where an automatic thinking training paradigm is proposed to dynamically switch…

Computation and Language · Computer Science 2025-07-22 Zizheng Zhan , Ken Deng , Huaixi Tang , Wen Xiang , Kun Wu , Weihao Li , Wenqiang Zhu , Jingxuan Xu , Lecheng Huang , Zongxian Feng , Shaojie Wang , Shangpeng Yan , Xuxing Chen , Jiaheng Liu , Zhongyuan Peng , Zuchen Gao , Haoyang Huang , Xiaojiang Zhang , Jinghui Wang , Zheng Lin , Mengtong Li , Huiming Wang , Ziqi Zhan , Yanan Wu , Yuanxing Zhang , Jian Yang , Guang Chen , Haotian Zhang , Bin Chen , Bing Yu

MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning

Large Language Models (LLMs), enhanced through agent tuning, have demonstrated remarkable capabilities in Chain-of-Thought (CoT) and tool utilization, significantly surpassing the performance of standalone models. However, the multimodal…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Tianhong Gao , Yannian Fu , Weiqun Wu , Haixiao Yue , Shanshan Liu , Gang Zhang

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents

Training large language models (LLMs) to reason via reinforcement learning (RL) significantly improves their problem-solving capabilities. In agentic settings, existing methods like ReAct prompt LLMs to explicitly plan before every action;…

Artificial Intelligence · Computer Science 2026-02-18 Davide Paglieri , Bartłomiej Cupiał , Jonathan Cook , Ulyana Piterbarg , Jens Tuyls , Edward Grefenstette , Jakob Nicolaus Foerster , Jack Parker-Holder , Tim Rocktäschel

TOOLCAD: Exploring Tool-Using Large Language Models in Text-to-CAD Generation with Reinforcement Learning

Computer-Aided Design (CAD) is an expert-level task that relies on long-horizon reasoning and coherent modeling actions. Large Language Models (LLMs) have shown remarkable advancements in enabling language agents to tackle real-world tasks.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Yifei Gong , Xing Wu , Wenda Liu , Kang Tu

MapCoder-Lite: Distilling Multi-Agent Coding into a Single Small LLM

Large language models (LLMs) have advanced code generation from single-function tasks to competitive-programming problems, but existing multi-agent solutions either rely on costly large-scale (>30B) models or collapse when downsized to…

Computation and Language · Computer Science 2026-02-05 Woongkyu Lee , Junhee Cho , Jungwook Choi

Analyzing and Internalizing Complex Policy Documents for LLM Agents

Large Language Model (LLM)-based agentic systems rely on in-context policy documents encoding diverse business rules. As requirements grow, these documents expand rapidly, causing high computational overhead. This motivates developing…

Artificial Intelligence · Computer Science 2025-10-14 Jiateng Liu , Zhenhailong Wang , Xiaojiang Huang , Yingjie Li , Xing Fan , Xiang Li , Chenlei Guo , Ruhi Sarikaya , Heng Ji

CAT-LM: Training Language Models on Aligned Code And Tests

Testing is an integral part of the software development process. Yet, writing tests is time-consuming and therefore often neglected. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for…

Software Engineering · Computer Science 2023-10-04 Nikitha Rao , Kush Jain , Uri Alon , Claire Le Goues , Vincent J. Hellendoorn

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Large Reasoning Models (LRMs) like o3 and DeepSeek-R1 have achieved remarkable progress in reasoning tasks with long cot. However, they remain computationally inefficient and struggle with accuracy when solving problems requiring complex…

Artificial Intelligence · Computer Science 2026-03-03 Haipeng Luo , Huawen Feng , Qingfeng Sun , Can Xu , Kai Zheng , Yufei Wang , Tao Yang , Han Hu , Yansong Tang

Agentic Reasoning for Large Language Models

Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world settings, they struggle in open-ended and…

Artificial Intelligence · Computer Science 2026-01-21 Tianxin Wei , Ting-Wei Li , Zhining Liu , Xuying Ning , Ze Yang , Jiaru Zou , Zhichen Zeng , Ruizhong Qiu , Xiao Lin , Dongqi Fu , Zihao Li , Mengting Ai , Duo Zhou , Wenxuan Bao , Yunzhe Li , Gaotang Li , Cheng Qian , Yu Wang , Xiangru Tang , Yin Xiao , Liri Fang , Hui Liu , Xianfeng Tang , Yuji Zhang , Chi Wang , Jiaxuan You , Heng Ji , Hanghang Tong , Jingrui He

Visual Agentic Reinforcement Fine-Tuning

A key trend in Large Reasoning Models (e.g., OpenAI's o3) is the native agentic ability to use external tools such as web browsers for searching and writing/executing code for image manipulation to think with images. In the open-source…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Ziyu Liu , Yuhang Zang , Yushan Zou , Zijian Liang , Xiaoyi Dong , Yuhang Cao , Haodong Duan , Dahua Lin , Jiaqi Wang

The Path Ahead for Agentic AI: Challenges and Opportunities

The evolution of Large Language Models (LLMs) from passive text generators to autonomous, goal-driven systems represents a fundamental shift in artificial intelligence. This chapter examines the emergence of agentic AI systems that…

Artificial Intelligence · Computer Science 2026-01-07 Nadia Sibai , Yara Ahmed , Serry Sibaee , Sawsan AlHalawani , Adel Ammar , Wadii Boulila

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Large Language Models (LLMs) can extend their parameter knowledge limits by adopting the Tool-Integrated Reasoning (TIR) paradigm. However, existing LLM-based agent training framework often focuses on answers' accuracy, overlooking specific…

Artificial Intelligence · Computer Science 2026-01-21 Yifei Chen , Guanting Dong , Zhicheng Dou