Related papers: Qwen3-Coder-Next Technical Report

Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling

Despite the proliferation of powerful agentic models, the lack of critical post-training details hinders the development of strong counterparts in the open-source community. In this study, we present a comprehensive and fully open-source…

Artificial Intelligence · Computer Science 2025-11-11 Qi Wang , Hongzhi Zhang , Jia Fu , Kai Fu , Yahui Liu , Tinghai Zhang , Chenxi Sun , Gangwei Jiang , Jingyi Tang , Xingguang Ji , Yang Yue , Jingyuan Zhang , Fuzheng Zhang , Kun Gai , Guorui Zhou

Qwen3 Technical Report

In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes…

Computation and Language · Computer Science 2025-05-15 An Yang , Anfeng Li , Baosong Yang , Beichen Zhang , Binyuan Hui , Bo Zheng , Bowen Yu , Chang Gao , Chengen Huang , Chenxu Lv , Chujie Zheng , Dayiheng Liu , Fan Zhou , Fei Huang , Feng Hu , Hao Ge , Haoran Wei , Huan Lin , Jialong Tang , Jian Yang , Jianhong Tu , Jianwei Zhang , Jianxin Yang , Jiaxi Yang , Jing Zhou , Jingren Zhou , Junyang Lin , Kai Dang , Keqin Bao , Kexin Yang , Le Yu , Lianghao Deng , Mei Li , Mingfeng Xue , Mingze Li , Pei Zhang , Peng Wang , Qin Zhu , Rui Men , Ruize Gao , Shixuan Liu , Shuang Luo , Tianhao Li , Tianyi Tang , Wenbiao Yin , Xingzhang Ren , Xinyu Wang , Xinyu Zhang , Xuancheng Ren , Yang Fan , Yang Su , Yichang Zhang , Yinger Zhang , Yu Wan , Yuqiong Liu , Zekun Wang , Zeyu Cui , Zhenru Zhang , Zhipeng Zhou , Zihan Qiu

An Empirical Study of Qwen3 Quantization

The Qwen series has emerged as a leading family of open-source Large Language Models (LLMs), demonstrating remarkable capabilities in natural language understanding tasks. With the recent release of Qwen3, which exhibits superior…

Machine Learning · Computer Science 2025-05-06 Xingyu Zheng , Yuye Li , Haoran Chu , Yue Feng , Xudong Ma , Jie Luo , Jinyang Guo , Haotong Qin , Michele Magno , Xianglong Liu

CWM: An Open-Weights LLM for Research on Code Generation with World Models

We release Code World Model (CWM), a 32-billion-parameter open-weights LLM, to advance research on code generation with world models. To improve code understanding beyond what can be learned from training on static code alone, we mid-train…

Software Engineering · Computer Science 2025-10-13 FAIR CodeGen team , Jade Copet , Quentin Carbonneaux , Gal Cohen , Jonas Gehring , Jacob Kahn , Jannik Kossen , Felix Kreuk , Emily McMilin , Michel Meyer , Yuxiang Wei , David Zhang , Kunhao Zheng , Jordi Armengol-Estapé , Pedram Bashiri , Maximilian Beck , Pierre Chambon , Abhishek Charnalia , Chris Cummins , Juliette Decugis , Zacharias V. Fisches , François Fleuret , Fabian Gloeckle , Alex Gu , Michael Hassid , Daniel Haziza , Badr Youbi Idrissi , Christian Keller , Rahul Kindi , Hugh Leather , Gallil Maimon , Aram Markosyan , Francisco Massa , Pierre-Emmanuel Mazaré , Vegard Mella , Naila Murray , Keyur Muzumdar , Peter O'Hearn , Matteo Pagliardini , Dmitrii Pedchenko , Tal Remez , Volker Seeker , Marco Selvi , Oren Sultan , Sida Wang , Luca Wehrstedt , Ori Yoran , Lingming Zhang , Taco Cohen , Yossi Adi , Gabriel Synnaeve

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models. Leveraging the Qwen3 LLMs'…

Computation and Language · Computer Science 2025-06-12 Yanzhao Zhang , Mingxin Li , Dingkun Long , Xin Zhang , Huan Lin , Baosong Yang , Pengjun Xie , An Yang , Dayiheng Liu , Junyang Lin , Fei Huang , Jingren Zhou

Qwen2.5-Coder Technical Report

In this report, we introduce the Qwen2.5-Coder series, a significant upgrade from its predecessor, CodeQwen1.5. This series includes six models: Qwen2.5-Coder-(0.5B/1.5B/3B/7B/14B/32B). As a code-specific model, Qwen2.5-Coder is built upon…

Computation and Language · Computer Science 2024-11-13 Binyuan Hui , Jian Yang , Zeyu Cui , Jiaxi Yang , Dayiheng Liu , Lei Zhang , Tianyu Liu , Jiajun Zhang , Bowen Yu , Keming Lu , Kai Dang , Yang Fan , Yichang Zhang , An Yang , Rui Men , Fei Huang , Bo Zheng , Yibo Miao , Shanghaoran Quan , Yunlong Feng , Xingzhang Ren , Xuancheng Ren , Jingren Zhou , Junyang Lin

LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning

General Large Language Models (LLMs) excel in reasoning, but those enhanced for translation struggle with reasoning tasks. To address this, we propose a novel translationenhanced recipe that begins with instruct models and applies…

Computation and Language · Computer Science 2025-10-13 Changjiang Gao , Zixian Huang , Jingyang Gong , Shujian Huang , Lei Li , Fei Yuan

CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability

Evaluating and improving the security capabilities of code agents requires high-quality, executable vulnerability tasks. However, existing works rely on costly, unscalable manual reproduction and suffer from outdated data distributions. To…

Cryptography and Security · Computer Science 2026-05-19 Xianzhen Luo , Jingyuan Zhang , Shiqi Zhou , Rain Huang , Chuan Xiao , Qingfu Zhu , Zhiyuan Ma , Xing Yue , Yang Yue , Wencong Zeng , Wanxiang Che

Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures

Transformer-based language models have recently been at the forefront of active research in text generation. However, these models' advances come at the price of prohibitive training costs, with parameter counts in the billions and compute…

Computation and Language · Computer Science 2025-02-04 Gabriel Lindenmaier , Sean Papay , Sebastian Padó

Evaluating and Achieving Controllable Code Completion in Code LLM

Code completion has become a central task, gaining significant attention with the rise of large language model (LLM)-based tools in software engineering. Although recent advances have greatly improved LLMs' code completion abilities,…

Software Engineering · Computer Science 2026-01-23 Jiajun Zhang , Zeyu Cui , Lei Zhang , Jian Yang , Jiaxi Yang , Qiang Liu , Zilei Wang , Binyuan Hui , Liang Wang , Junyang Lin

LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text

As large language models (LLMs) are increasingly used in legal applications, current evaluation benchmarks tend to focus mainly on factual accuracy while largely neglecting important linguistic quality aspects such as clarity, coherence,…

Computation and Language · Computer Science 2025-11-11 Li yunhan , Wu gengshen

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

Large language models with billions of parameters are often over-provisioned: many layers contribute little unique information yet dominate the memory and energy footprint during inference. We present LieQ Layer-wise information…

Machine Learning · Computer Science 2025-12-30 He Xiao , Qingyao Yang , Dirui Xie , Wendong Xu , Zunhai Su , Runming yang , Wenyong Zhou , Haobo Liu , Zhengwu Liu , Ngai Wong

SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?

Can large language model agents develop industry-level mobile applications? We introduce \textbf{SWE-Bench Mobile}, a benchmark for evaluating coding agents on realistic software engineering tasks derived from a production iOS codebase.…

Software Engineering · Computer Science 2026-02-11 Muxin Tian , Zhe Wang , Blair Yang , Zhenwei Tang , Kunlun Zhu , Honghua Dong , Hanchen Li , Xinni Xie , Guangjing Wang , Jiaxuan You

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Agentic repository-level code understanding is essential for automating complex software engineering tasks, yet the field lacks reliable benchmarks. Existing evaluations often overlook the long tail topics and rely on popular repositories…

Software Engineering · Computer Science 2026-03-18 Songcheng Cai , Zhiheng Lyu , Yuansheng Ni , Xiangchao Chen , Baichuan Zhou , Shenzhe Zhu , Yi Lu , Haozhe Wang , Chi Ruan , Benjamin Schneider , Weixu Zhang , Xiang Li , Andy Zheng , Yuyu Zhang , Ping Nie , Wenhu Chen

Qwen2 Technical Report

This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range…

Computation and Language · Computer Science 2024-09-11 An Yang , Baosong Yang , Binyuan Hui , Bo Zheng , Bowen Yu , Chang Zhou , Chengpeng Li , Chengyuan Li , Dayiheng Liu , Fei Huang , Guanting Dong , Haoran Wei , Huan Lin , Jialong Tang , Jialin Wang , Jian Yang , Jianhong Tu , Jianwei Zhang , Jianxin Ma , Jianxin Yang , Jin Xu , Jingren Zhou , Jinze Bai , Jinzheng He , Junyang Lin , Kai Dang , Keming Lu , Keqin Chen , Kexin Yang , Mei Li , Mingfeng Xue , Na Ni , Pei Zhang , Peng Wang , Ru Peng , Rui Men , Ruize Gao , Runji Lin , Shijie Wang , Shuai Bai , Sinan Tan , Tianhang Zhu , Tianhao Li , Tianyu Liu , Wenbin Ge , Xiaodong Deng , Xiaohuan Zhou , Xingzhang Ren , Xinyu Zhang , Xipin Wei , Xuancheng Ren , Xuejing Liu , Yang Fan , Yang Yao , Yichang Zhang , Yu Wan , Yunfei Chu , Yuqiong Liu , Zeyu Cui , Zhenru Zhang , Zhifang Guo , Zhihao Fan

Scaling Open-Ended Reasoning to Predict the Future

High-stakes decision making involves reasoning under uncertainty about the future. In this work, we train language models to make predictions on open-ended forecasting questions. To scale up training data, we synthesize novel forecasting…

Machine Learning · Computer Science 2026-01-06 Nikhil Chandak , Shashwat Goel , Ameya Prabhu , Moritz Hardt , Jonas Geiping

Qwen2.5 Technical Report

In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been significantly improved during both the pre-training and…

Computation and Language · Computer Science 2025-01-06 Qwen , : , An Yang , Baosong Yang , Beichen Zhang , Binyuan Hui , Bo Zheng , Bowen Yu , Chengyuan Li , Dayiheng Liu , Fei Huang , Haoran Wei , Huan Lin , Jian Yang , Jianhong Tu , Jianwei Zhang , Jianxin Yang , Jiaxi Yang , Jingren Zhou , Junyang Lin , Kai Dang , Keming Lu , Keqin Bao , Kexin Yang , Le Yu , Mei Li , Mingfeng Xue , Pei Zhang , Qin Zhu , Rui Men , Runji Lin , Tianhao Li , Tianyi Tang , Tingyu Xia , Xingzhang Ren , Xuancheng Ren , Yang Fan , Yang Su , Yichang Zhang , Yu Wan , Yuqiong Liu , Zeyu Cui , Zhenru Zhang , Zihan Qiu

FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

Agents powered by large language models (LLMs) are increasingly adopted in the software industry, contributing code as collaborators or even autonomous developers. As their presence grows, it becomes important to assess the current…

Software Engineering · Computer Science 2026-02-12 Qixing Zhou , Jiacheng Zhang , Haiyang Wang , Rui Hao , Jiahe Wang , Minghao Han , Yuxue Yang , Shuzhe Wu , Feiyang Pan , Lue Fan , Dandan Tu , Zhaoxiang Zhang

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

We present Nanbeige4.1-3B, a unified generalist language model that simultaneously achieves strong agentic behavior, code generation, and general reasoning with only 3B parameters. To the best of our knowledge, it is the first open-source…

Artificial Intelligence · Computer Science 2026-02-17 Chen Yang , Guangyue Peng , Jiaying Zhu , Ran Le , Ruixiang Feng , Tao Zhang , Xiyun Xu , Yang Song , Yiming Jia , Yuntao Wen , Yunzhi Xu , Zekai Wang , Zhenwei An , Zhicong Sun , Zongchao Chen

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

The rapid development of large language models has revolutionized code intelligence in software development. However, the predominance of closed-source models has restricted extensive research and development. To address this, we introduce…

Software Engineering · Computer Science 2024-01-29 Daya Guo , Qihao Zhu , Dejian Yang , Zhenda Xie , Kai Dong , Wentao Zhang , Guanting Chen , Xiao Bi , Y. Wu , Y. K. Li , Fuli Luo , Yingfei Xiong , Wenfeng Liang