Artificial Intelligence · Computer Science
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Liang Zeng, Liangjun Zhong, Liang Zhao, Tianwen Wei +8
2024-07-18
Computation and Language · Computer Science
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Longhui Yu, Weisen Jiang, Han Shi, Jincheng Yu +6
2024-05-06
Computation and Language · Computer Science
Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks
Avinash Anand, Mohit Gupta, Kritarth Prasad, Navya Singla +4
2024-04-23
Computation and Language · Computer Science
Skywork: A More Open Bilingual Foundation Model
Tianwen Wei, Liang Zhao, Lichang Zhang, Bo Zhu +26
2023-10-31
Computation and Language · Computer Science
KwaiYiiMath: Technical Report
Jiayi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu +17
2023-10-20
Artificial Intelligence · Computer Science
TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving
Vincenzo Colle, Mohamed Sana, Nicola Piovesan, Antonio De Domenico +2
2025-06-13
Computation and Language · Computer Science
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye +1
2024-06-05
Computation and Language · Computer Science
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
Wen Yang, Chong Li, Jiajun Zhang, Chengqing Zong
2023-11-22
Computation and Language · Computer Science
Benchmarking Large Language Models for Math Reasoning Tasks
Kathrin Seßler, Yao Rong, Emek Gözlüklü, Enkelejda Kasneci
2024-12-20
Computation and Language · Computer Science
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Shuai Peng, Di Fu, Liangcai Gao, Xiuqin Zhong +2
2024-09-04
Computation and Language · Computer Science
PolyLM: An Open Source Polyglot Large Language Model
Xiangpeng Wei, Haoran Wei, Huan Lin, Tianhao Li +14
2023-07-13
Artificial Intelligence · Computer Science
RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics
Jie Zhang, Cezara Petrui, Kristina Nikolić, Florian Tramèr
2025-10-21
Computation and Language · Computer Science
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Yan Liu, Renren Jin, Ling Shi, Zheng Yao +1
2024-09-09
Computation and Language · Computer Science
STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis
Wenbin Li, Di Yao, Ruibo Zhao, Wenjie Chen +6
2024-06-28
Computation and Language · Computer Science
Large Language Models for Mathematicians
Simon Frieder, Julius Berner, Philipp Petersen, Thomas Lukasiewicz
2024-04-03
Computation and Language · Computer Science
Specializing Smaller Language Models towards Multi-Step Reasoning
Yao Fu, Hao Peng, Litu Ou, Ashish Sabharwal +1
2023-01-31
Computation and Language · Computer Science
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu +9
2024-02-26
Computation and Language · Computer Science
Large Language Models Don't Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective
Anselm R. Strohmaier, Wim Van Dooren, Kathrin Seßler, Brian Greer +1
2025-08-12
Computation and Language · Computer Science
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei
2024-03-06
Computation and Language · Computer Science
Large Language Models for Mathematical Reasoning: Progresses and Challenges
Janice Ahn, Rishu Verma, Renze Lou, Di Liu +2
2024-09-18
Computation and Language · Computer Science
Mathematical Reasoning in Large Language Models: Benchmarks, Architectures, Evaluation, and Open Challenges
Husnain Amjad, Raja Khurram Shahzad, Aamir Shahzad, Mehwish Fatima
2026-05-20
Computation and Language · Computer Science
52B to 1T: Lessons Learned via Tele-FLM Series
Xiang Li, Yiqun Yao, Xin Jiang, Xuezhi Fang +16
2024-07-04
Computation and Language · Computer Science
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
Wentao Liu, Qianjun Pan, Yi Zhang, Zhuo Liu +6
2024-11-04