Machine Learning · Computer Science
FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation
Ruiyi Zhang, Peijia Qin, Qi Cao, Eric Xue +1
2026-02-02
Artificial Intelligence · Computer Science
Process Supervision-Guided Policy Optimization for Code Generation
Ning Dai, Zheng Wu, Renjie Zheng, Ziyun Wei +6
2025-02-05
Computation and Language · Computer Science
Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision
Xingwei Tan, Marco Valentino, Mahmud Akhter, Maria Liakata +1
2025-09-19
Computation and Language · Computer Science
R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She, Junxiao Liu, Yifeng Liu, Jiajun Chen +2
2025-03-28
Artificial Intelligence · Computer Science
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
Lingxiao Du, Fanqing Meng, Zongkai Liu, Zhixiang Zhou +3
2025-06-06
Computation and Language · Computer Science
From Mathematical Reasoning to Code: Generalization of Process Reward Models in Test-Time Scaling
Zhengyu Chen, Yudong Wang, Teng Xiao, Ruochen Zhou +4
2025-06-03
Computation and Language · Computer Science
Towards Robust Process Reward Modeling via Noise-aware Learning
Bin Xie, Bingbing Xu, Xueyun Tian, Yilin Chen +1
2026-01-21
Machine Learning · Computer Science
DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
Qi Cao, Ruiyi Wang, Ruiyi Zhang, Sai Ashish Somayajula +1
2025-11-05
Computation and Language · Computer Science
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou +7
2025-04-08
Computation and Language · Computer Science
Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Qianli Ma, Haotian Zhou, Tingkai Liu, Jianbo Yuan +3
2023-10-17
Artificial Intelligence · Computer Science
GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning
Yao Zhang, Yu Wu, Haowei Zhang, Weiguo Li +5
2025-10-17
Machine Learning · Computer Science
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Amrith Setlur, Chirag Nagpal, Adam Fisch, Xinyang Geng +5
2024-10-11
Machine Learning · Computer Science
Process Reward Models That Think
Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim +4
2025-12-09
Machine Learning · Computer Science
Efficient Process Reward Model Training via Active Learning
Keyu Duan, Zichen Liu, Xin Mao, Tianyu Pang +4
2025-04-16
Computation and Language · Computer Science
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision
Tej Deep Pala, Panshul Sharma, Amir Zadeh, Chuan Li +1
2025-05-27
Computation and Language · Computer Science
Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain
Corentin Royer, Debarun Bhattacharjya, Gaetano Rossiello, Andrea Giovannini +1
2026-03-19
Artificial Intelligence · Computer Science
Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned
Brandon Ong, Tej Deep Pala, Vernon Toh, William Chandra Tjhi +1
2025-10-08
Computation and Language · Computer Science
The Lessons of Developing Process Reward Models in Mathematical Reasoning
Zhenru Zhang, Chujie Zheng, Yangzhen Wu, Beichen Zhang +5
2025-06-06
Computation and Language · Computer Science
FreePRM: Training Process Reward Models Without Ground Truth Process Labels
Lin Sun, Chuang Liu, Xiaofeng Ma, Tao Yang +2
2025-06-05
Machine Learning · Computer Science
Accelerating LLM Reasoning via Early Rejection with Partial Reward Modeling
Seyyed Saeid Cheshmi, Azal Ahmad Khan, Xinran Wang, Zirui Liu +1
2025-08-05
Computation and Language · Computer Science
Process-Supervised Reward Models for Verifying Clinical Note Generation: A Scalable Approach Guided by Domain Expertise
Hanyin Wang, Chufan Gao, Qiping Xu, Bolun Liu +8
2025-09-09
Computation and Language · Computer Science
A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models
Congmin Zheng, Jiachen Zhu, Zhuoying Ou, Yuxiang Chen +7
2026-04-30
Computation and Language · Computer Science
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards
Jaehoon Yun, Jiwoong Sohn, Jungwoo Park, Hyunjae Kim +8
2025-09-23
Artificial Intelligence · Computer Science
RRO: LLM Agent Optimization Through Rising Reward Trajectories
Zilong Wang, Jingfeng Yang, Sreyashi Nag, Samarth Varshney +4
2025-05-28