English
Related papers

Related papers: KAT-Coder Technical Report

200 papers

We present KAT-Coder-V2, an agentic coding model developed by the KwaiKAT team at Kuaishou. KAT-Coder-V2 adopts a "Specialize-then-Unify" paradigm that decomposes agentic coding into five expert domains - SWE, WebCoding, Terminal,…

In this report, we introduce the IQuest-Coder-V1 series-(7B/14B/40B/40B-Loop), a new family of code large language models (LLMs). Moving beyond static code representations, we propose the code-flow multi-stage training paradigm, which…

In recent years, large language models (LLMs) have demonstrated significant potential in complex reasoning tasks like mathematical problem-solving. However, existing research predominantly relies on reinforcement learning (RL) frameworks…

Machine Learning · Computer Science 2026-01-12 ShaoZhen Liu , Xinting Huang , Houwen Peng , Xin Chen , Xinyang Song , Qi Li , Zhenan Sun

Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches building upon general-purpose foundation models…

Recently, the frontier of Large Language Model (LLM) capabilities has shifted from single-turn code generation to agentic software engineering-a paradigm where models autonomously navigate, edit, and test complex repositories. While…

Recently, researchers have proposed many multi-agent frameworks for function-level code generation, which aim to improve software development productivity by automatically generating function-level source code based on task descriptions. A…

Software Engineering · Computer Science 2025-04-08 Yueheng Zhu , Chao Liu , Xuan He , Xiaoxue Ren , Zhongxin Liu , Ruwei Pan , Hongyu Zhang

The use of large language models (LLMs) for Mental Health Question Answering (MHQA) offers a promising way to alleviate shortages in mental health resources. However, prior work has mainly relied on Cognitive Behavioral Therapy (CBT) and…

Computation and Language · Computer Science 2026-03-10 Lanqing Du , Yunong Li , YuJie Long , Shihong Chen

The advancement of natural language processing (NLP) has been significantly boosted by the development of transformer-based large language models (LLMs). These models have revolutionized NLP tasks, particularly in code generation, aiding…

Computation and Language · Computer Science 2024-05-27 Dong Huang , Jie M. Zhang , Michael Luck , Qingwen Bu , Yuhao Qing , Heming Cui

We present Kwaipilot-AutoThink (KAT), an open-source 40B large language model developed to address the overthinking problem in reasoning-intensive tasks, where an automatic thinking training paradigm is proposed to dynamically switch…

Large Language Models (LLMs), enhanced through agent tuning, have demonstrated remarkable capabilities in Chain-of-Thought (CoT) and tool utilization, significantly surpassing the performance of standalone models. However, the multimodal…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Tianhong Gao , Yannian Fu , Weiqun Wu , Haixiao Yue , Shanshan Liu , Gang Zhang

Training large language models (LLMs) to reason via reinforcement learning (RL) significantly improves their problem-solving capabilities. In agentic settings, existing methods like ReAct prompt LLMs to explicitly plan before every action;…

Computer-Aided Design (CAD) is an expert-level task that relies on long-horizon reasoning and coherent modeling actions. Large Language Models (LLMs) have shown remarkable advancements in enabling language agents to tackle real-world tasks.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Yifei Gong , Xing Wu , Wenda Liu , Kang Tu

Large language models (LLMs) have advanced code generation from single-function tasks to competitive-programming problems, but existing multi-agent solutions either rely on costly large-scale (>30B) models or collapse when downsized to…

Computation and Language · Computer Science 2026-02-05 Woongkyu Lee , Junhee Cho , Jungwook Choi

Large Language Model (LLM)-based agentic systems rely on in-context policy documents encoding diverse business rules. As requirements grow, these documents expand rapidly, causing high computational overhead. This motivates developing…

Artificial Intelligence · Computer Science 2025-10-14 Jiateng Liu , Zhenhailong Wang , Xiaojiang Huang , Yingjie Li , Xing Fan , Xiang Li , Chenlei Guo , Ruhi Sarikaya , Heng Ji

Testing is an integral part of the software development process. Yet, writing tests is time-consuming and therefore often neglected. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for…

Software Engineering · Computer Science 2023-10-04 Nikitha Rao , Kush Jain , Uri Alon , Claire Le Goues , Vincent J. Hellendoorn

Large Reasoning Models (LRMs) like o3 and DeepSeek-R1 have achieved remarkable progress in reasoning tasks with long cot. However, they remain computationally inefficient and struggle with accuracy when solving problems requiring complex…

Artificial Intelligence · Computer Science 2026-03-03 Haipeng Luo , Huawen Feng , Qingfeng Sun , Can Xu , Kai Zheng , Yufei Wang , Tao Yang , Han Hu , Yansong Tang

Reasoning is a fundamental cognitive process underlying inference, problem-solving, and decision-making. While large language models (LLMs) demonstrate strong reasoning capabilities in closed-world settings, they struggle in open-ended and…

A key trend in Large Reasoning Models (e.g., OpenAI's o3) is the native agentic ability to use external tools such as web browsers for searching and writing/executing code for image manipulation to think with images. In the open-source…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Ziyu Liu , Yuhang Zang , Yushan Zou , Zijian Liang , Xiaoyi Dong , Yuhang Cao , Haodong Duan , Dahua Lin , Jiaqi Wang

The evolution of Large Language Models (LLMs) from passive text generators to autonomous, goal-driven systems represents a fundamental shift in artificial intelligence. This chapter examines the emergence of agentic AI systems that…

Artificial Intelligence · Computer Science 2026-01-07 Nadia Sibai , Yara Ahmed , Serry Sibaee , Sawsan AlHalawani , Adel Ammar , Wadii Boulila

Large Language Models (LLMs) can extend their parameter knowledge limits by adopting the Tool-Integrated Reasoning (TIR) paradigm. However, existing LLM-based agent training framework often focuses on answers' accuracy, overlooking specific…

Artificial Intelligence · Computer Science 2026-01-21 Yifei Chen , Guanting Dong , Zhicheng Dou
‹ Prev 1 2 3 10 Next ›