Related papers: TeleChat Technical Report

Tele-FLM Technical Report

Large language models (LLMs) have showcased profound capabilities in language understanding and generation, facilitating a wide array of applications. However, there is a notable paucity of detailed, open-sourced methodologies on…

Computation and Language · Computer Science 2024-04-26 Xiang Li , Yiqun Yao , Xin Jiang , Xuezhi Fang , Chao Wang , Xinzhang Liu , Zihan Wang , Yu Zhao , Xin Wang , Yuyao Huang , Shuangyong Song , Yongxiang Li , Zheng Zhang , Bo Zhao , Aixin Sun , Yequan Wang , Zhongjiang He , Zhongyuan Wang , Xuelong Li , Tiejun Huang

Llama 2: Open Foundation and Fine-Tuned Chat Models

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for…

Computation and Language · Computer Science 2023-07-20 Hugo Touvron , Louis Martin , Kevin Stone , Peter Albert , Amjad Almahairi , Yasmine Babaei , Nikolay Bashlykov , Soumya Batra , Prajjwal Bhargava , Shruti Bhosale , Dan Bikel , Lukas Blecher , Cristian Canton Ferrer , Moya Chen , Guillem Cucurull , David Esiobu , Jude Fernandes , Jeremy Fu , Wenyin Fu , Brian Fuller , Cynthia Gao , Vedanuj Goswami , Naman Goyal , Anthony Hartshorn , Saghar Hosseini , Rui Hou , Hakan Inan , Marcin Kardas , Viktor Kerkez , Madian Khabsa , Isabel Kloumann , Artem Korenev , Punit Singh Koura , Marie-Anne Lachaux , Thibaut Lavril , Jenya Lee , Diana Liskovich , Yinghai Lu , Yuning Mao , Xavier Martinet , Todor Mihaylov , Pushkar Mishra , Igor Molybog , Yixin Nie , Andrew Poulton , Jeremy Reizenstein , Rashi Rungta , Kalyan Saladi , Alan Schelten , Ruan Silva , Eric Michael Smith , Ranjan Subramanian , Xiaoqing Ellen Tan , Binh Tang , Ross Taylor , Adina Williams , Jian Xiang Kuan , Puxin Xu , Zheng Yan , Iliyan Zarov , Yuchen Zhang , Angela Fan , Melanie Kambadur , Sharan Narang , Aurelien Rodriguez , Robert Stojnic , Sergey Edunov , Thomas Scialom

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Fine-tuning on instruction data has been widely validated as an effective practice for implementing chat language models like ChatGPT. Scaling the diversity and quality of such data, although straightforward, stands a great chance of…

Computation and Language · Computer Science 2023-05-24 Ning Ding , Yulin Chen , Bokai Xu , Yujia Qin , Zhi Zheng , Shengding Hu , Zhiyuan Liu , Maosong Sun , Bowen Zhou

TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving

The increasing adoption of artificial intelligence in telecommunications has raised interest in the capability of Large Language Models (LLMs) to address domain-specific, mathematically intensive tasks. Although recent advancements have…

Artificial Intelligence · Computer Science 2025-06-13 Vincenzo Colle , Mohamed Sana , Nicola Piovesan , Antonio De Domenico , Fadhel Ayed , Merouane Debbah

Baichuan 2: Open Large-scale Language Models

Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most…

Computation and Language · Computer Science 2025-04-18 Aiyuan Yang , Bin Xiao , Bingning Wang , Borong Zhang , Ce Bian , Chao Yin , Chenxu Lv , Da Pan , Dian Wang , Dong Yan , Fan Yang , Fei Deng , Feng Wang , Feng Liu , Guangwei Ai , Guosheng Dong , Haizhou Zhao , Hang Xu , Haoze Sun , Hongda Zhang , Hui Liu , Jiaming Ji , Jian Xie , JunTao Dai , Kun Fang , Lei Su , Liang Song , Lifeng Liu , Liyun Ru , Luyao Ma , Mang Wang , Mickel Liu , MingAn Lin , Nuolan Nie , Peidong Guo , Ruiyang Sun , Tao Zhang , Tianpeng Li , Tianyu Li , Wei Cheng , Weipeng Chen , Xiangrong Zeng , Xiaochuan Wang , Xiaoxi Chen , Xin Men , Xin Yu , Xuehai Pan , Yanjun Shen , Yiding Wang , Yiyu Li , Youxin Jiang , Yuchen Gao , Yupeng Zhang , Zenan Zhou , Zhiying Wu

A Survey of Large Language Models

Language is essentially a complex, intricate system of human expressions governed by grammatical rules. It poses a significant challenge to develop capable AI algorithms for comprehending and grasping a language. As a major approach,…

Computation and Language · Computer Science 2026-03-19 Wayne Xin Zhao , Kun Zhou , Junyi Li , Tianyi Tang , Xiaolei Wang , Yupeng Hou , Yingqian Min , Beichen Zhang , Junjie Zhang , Zican Dong , Yifan Du , Chen Yang , Yushuo Chen , Zhipeng Chen , Jinhao Jiang , Ruiyang Ren , Yifan Li , Xinyu Tang , Zikang Liu , Peiyu Liu , Jian-Yun Nie , Ji-Rong Wen

YuLan: An Open-source Large Language Model

Large language models (LLMs) have become the foundation of many applications, leveraging their extensive capabilities in processing and understanding natural language. While many open-source LLMs have been released with technical reports,…

Computation and Language · Computer Science 2024-07-01 Yutao Zhu , Kun Zhou , Kelong Mao , Wentong Chen , Yiding Sun , Zhipeng Chen , Qian Cao , Yihan Wu , Yushuo Chen , Feng Wang , Lei Zhang , Junyi Li , Xiaolei Wang , Lei Wang , Beichen Zhang , Zican Dong , Xiaoxue Cheng , Yuhan Chen , Xinyu Tang , Yupeng Hou , Qiangqiang Ren , Xincheng Pang , Shufang Xie , Wayne Xin Zhao , Zhicheng Dou , Jiaxin Mao , Yankai Lin , Ruihua Song , Jun Xu , Xu Chen , Rui Yan , Zhewei Wei , Di Hu , Wenbing Huang , Ze-Feng Gao , Yueguo Chen , Weizheng Lu , Ji-Rong Wen

Xmodel-LM Technical Report

We introduce Xmodel-LM, a compact and efficient 1.1B language model pre-trained on around 2 trillion tokens. Trained on our self-built dataset (Xdata), which balances Chinese and English corpora based on downstream task optimization,…

Computation and Language · Computer Science 2024-11-20 Yichuan Wang , Yang Liu , Yu Yan , Qun Wang , Xucheng Huang , Ling Jiang

TeleQnA: A Benchmark Dataset to Assess Large Language Models Telecommunications Knowledge

We introduce TeleQnA, the first benchmark dataset designed to evaluate the knowledge of Large Language Models (LLMs) in telecommunications. Comprising 10,000 questions and answers, this dataset draws from diverse sources, including…

Information Theory · Computer Science 2023-10-24 Ali Maatouk , Fadhel Ayed , Nicola Piovesan , Antonio De Domenico , Merouane Debbah , Zhi-Quan Luo

Technical Report of TeleChat2, TeleChat2.5 and T1

We introduce the latest series of TeleChat models: \textbf{TeleChat2}, \textbf{TeleChat2.5}, and \textbf{T1}, offering a significant upgrade over their predecessor, TeleChat. Despite minimal changes to the model architecture, the new series…

Computation and Language · Computer Science 2025-07-30 Zihan Wang , Xinzhang Liu , Yitong Yao , Chao Wang , Yu Zhao , Zhihao Yang , Wenmin Deng , Kaipeng Jia , Jiaxin Peng , Yuyao Huang , Sishi Xiong , Zhuo Jiang , Kaidong Yu , Xiaohui Hu , Fubei Yao , Ruiyu Fang , Zhuoru Jiang , Ruiting Song , Qiyi Xie , Rui Xue , Xuewei He , Yanlei Xue , Zhu Yuan , Zhaoxi Zhang , Zilu Huang , Shiquan Wang , Xin Wang , Hanming Wu , Mingyuan Wang , Xufeng Zhan , Yuhan Sun , Zhaohu Xing , Yuhao Jiang , Bingkai Yang , Shuangyong Song , Yongxiang Li , Zhongjiang He , Xuelong Li

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

Large Language Models (LLMs) have seen great advance in both academia and industry, and their popularity results in numerous open-source frameworks and techniques in accelerating LLM pre-training, fine-tuning, and inference. Training and…

Performance · Computer Science 2023-12-04 Longteng Zhang , Xiang Liu , Zeyu Li , Xinglin Pan , Peijie Dong , Ruibo Fan , Rui Guo , Xin Wang , Qiong Luo , Shaohuai Shi , Xiaowen Chu

Training Data for Large Language Model

In 2022, with the release of ChatGPT, large-scale language models gained widespread attention. ChatGPT not only surpassed previous models in terms of parameters and the scale of its pretraining corpus but also achieved revolutionary…

Artificial Intelligence · Computer Science 2024-11-13 Yiming Ju , Huanhuan Ma

Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality

Large Language Models (LLMs) are becoming integral to modern software development workflows, assisting developers with code generation, API explanation, and iterative problem-solving through natural language conversations. Despite…

Software Engineering · Computer Science 2025-09-15 Suzhen Zhong , Ying Zou , Bram Adams

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

In this study, we introduce CT-LLM, a 2B large language model (LLM) that illustrates a pivotal shift towards prioritizing the Chinese language in developing LLMs. Uniquely initiated from scratch, CT-LLM diverges from the conventional…

Computation and Language · Computer Science 2024-09-16 Xinrun Du , Zhouliang Yu , Songyang Gao , Ding Pan , Yuyang Cheng , Ziyang Ma , Ruibin Yuan , Xingwei Qu , Jiaheng Liu , Tianyu Zheng , Xinchen Luo , Guorui Zhou , Wenhu Chen , Ge Zhang

Qwen Technical Report

Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans. In this work, we introduce Qwen, the first installment…

Computation and Language · Computer Science 2023-09-29 Jinze Bai , Shuai Bai , Yunfei Chu , Zeyu Cui , Kai Dang , Xiaodong Deng , Yang Fan , Wenbin Ge , Yu Han , Fei Huang , Binyuan Hui , Luo Ji , Mei Li , Junyang Lin , Runji Lin , Dayiheng Liu , Gao Liu , Chengqiang Lu , Keming Lu , Jianxin Ma , Rui Men , Xingzhang Ren , Xuancheng Ren , Chuanqi Tan , Sinan Tan , Jianhong Tu , Peng Wang , Shijie Wang , Wei Wang , Shengguang Wu , Benfeng Xu , Jin Xu , An Yang , Hao Yang , Jian Yang , Shusheng Yang , Yang Yao , Bowen Yu , Hongyi Yuan , Zheng Yuan , Jianwei Zhang , Xingxuan Zhang , Yichang Zhang , Zhenru Zhang , Chang Zhou , Jingren Zhou , Xiaohuan Zhou , Tianhang Zhu

52B to 1T: Lessons Learned via Tele-FLM Series

Large Language Models (LLMs) represent a significant stride toward Artificial General Intelligence. As scaling laws underscore the potential of increasing model sizes, the academic community has intensified its investigations into LLMs with…

Computation and Language · Computer Science 2024-07-04 Xiang Li , Yiqun Yao , Xin Jiang , Xuezhi Fang , Chao Wang , Xinzhang Liu , Zihan Wang , Yu Zhao , Xin Wang , Yuyao Huang , Shuangyong Song , Yongxiang Li , Zheng Zhang , Bo Zhao , Aixin Sun , Yequan Wang , Zhongjiang He , Zhongyuan Wang , Xuelong Li , Tiejun Huang

A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets

The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recently. However, their evaluation in the benchmark academic datasets remains under-explored due to the difficulty of evaluating the generative…

Computation and Language · Computer Science 2023-07-07 Md Tahmid Rahman Laskar , M Saiful Bari , Mizanur Rahman , Md Amran Hossen Bhuiyan , Shafiq Joty , Jimmy Xiangji Huang

Tele-LLMs: A Series of Specialized Large Language Models for Telecommunications

The emergence of large language models (LLMs) has significantly impacted various fields, from natural language processing to sectors like medicine and finance. However, despite their rapid proliferation, the applications of LLMs in…

Information Theory · Computer Science 2025-05-06 Ali Maatouk , Kenny Chirino Ampudia , Rex Ying , Leandros Tassiulas

A Comparative Study of Code Generation using ChatGPT 3.5 across 10 Programming Languages

Large Language Models (LLMs) are advanced Artificial Intelligence (AI) systems that have undergone extensive training using large datasets in order to understand and produce language that closely resembles that of humans. These models have…

Software Engineering · Computer Science 2023-08-10 Alessio Buscemi

ChatGPT or A Silent Everywhere Helper: A Survey of Large Language Models

Large Language Models (LLMs) have revo lutionized natural language processing Natural Language Processing (NLP), with Chat Generative Pre-trained Transformer (ChatGPT) standing out as a notable exampledue to its advanced capabilities and…

Computation and Language · Computer Science 2025-03-25 Azim Akhtarshenas , Afshin Dini , Navid Ayoobi