Related papers: Parallel Test-Time Scaling with Multi-Sequence Ver…

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Test-time scaling for complex reasoning tasks shows that leveraging inference-time compute, by methods such as independently sampling and aggregating multiple solutions, results in significantly better task outcomes. However, a critical…

Computation and Language · Computer Science 2026-03-05 Harman Singh , Xiuyu Li , Kusha Sareen , Monishwaran Maheswaran , Sijun Tan , Xiaoxia Wu , Junxiong Wang , Alpay Ariyak , Qingyang Wu , Samir Khaki , Rishabh Tiwari , Long Lian , Yucheng Lu , Boyi Li , Alane Suhr , Ben Athiwaratkun , Kurt Keutzer

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers

By utilizing more computational resources at test-time, large language models (LLMs) can improve without additional training. One common strategy uses verifiers to evaluate candidate outputs. In this work, we propose a novel scaling…

Artificial Intelligence · Computer Science 2025-02-28 Shalev Lifshitz , Sheila A. McIlraith , Yilun Du

Trust but Verify! A Survey on Verification Design for Test-time Scaling

Test-time scaling (TTS) has emerged as a new frontier for scaling the performance of Large Language Models. In test-time scaling, by using more computational resources during inference, LLMs can improve their reasoning process and task…

Computation and Language · Computer Science 2025-09-10 V Venktesh , Mandeep Rathee , Avishek Anand

Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

Test-time compute can be scaled both sequentially and in parallel. Sequential scaling involves lengthening the generation process, while parallel scaling involves verifying and selecting among multiple candidate outputs. Combining these two…

Artificial Intelligence · Computer Science 2025-10-08 Weihao Zeng , Keqing He , Chuqiao Kuang , Xiaoguang Li , Junxian He

Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning

Large language models (LLMs) struggle with multi-step reasoning, where inference-time scaling has emerged as a promising strategy for performance improvement. Verifier-guided search outperforms repeated sampling when sample size is limited…

Computation and Language · Computer Science 2025-02-04 Fei Yu , Yingru Li , Benyou Wang

SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling

Recent advancements in Large Language Models (LLMs) have created new opportunities to enhance performance on complex reasoning tasks by leveraging test-time computation. However, existing scaling methods have key limitations: parallel…

Artificial Intelligence · Computer Science 2025-12-04 Jiefeng Chen , Jie Ren , Xinyun Chen , Chengrun Yang , Ruoxi Sun , Jinsung Yoon , Sercan Ö Arık

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Speaker Verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-06 Kerlos Atia Abdalmalak , Ascensión Gallardo-Antol'in

Fast Support Vector Machines Using Parallel Adaptive Shrinking on Distributed Systems

Support Vector Machines (SVM), a popular machine learning technique, has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. Whether it is identifying high-risk patients by…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-06-20 Jeyanthi Narasimhan , Abhinav Vishnu , Lawrence Holder , Adolfy Hoisie

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs

Test-time scaling (TTS) has gained widespread attention for enhancing LLM reasoning. Existing approaches such as Best-of-N and majority voting are limited as their performance depends on the quality of candidate responses, making them…

Machine Learning · Computer Science 2026-04-28 Qibin Wang , Pu Zhao , Shaohan Huang , Fangkai Yang , Lu Wang , Furu Wei , Qingwei Lin , Saravan Rajmohan , Dongmei Zhang

Parallel Random Search Algorithm of Constrained Pseudo-Boolean Optimization for Some Distinctive Large-Scale Problems

In this paper, we consider an approach to the parallelizing of the algorithms realizing the modified probability changigng method with adaptation and partial rollback procedure for constrained pseudo-Boolean optimization problems. Existing…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-09-03 Lev Kazakovtsev

Fast Parallel SVM using Data Augmentation

As one of the most popular classifiers, linear SVMs still have challenges in dealing with very large-scale problems, even though linear or sub-linear algorithms have been developed recently on single machines. Parallel computing methods…

Machine Learning · Computer Science 2015-12-25 Hugh Perkins , Minjie Xu , Jun Zhu , Bo Zhang

Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers

Test-time scaling via solution sampling and aggregation has become a key paradigm for improving the reasoning performance of Large Language Models (LLMs). While reward model selection is commonly employed in this approach, it often fails to…

Machine Learning · Computer Science 2025-09-30 Zhicheng Yang , Zhijiang Guo , Yinya Huang , Yongxin Wang , Yiwei Wang , Xiaodan Liang , Jing Tang

Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models

With the rapid advancement of test-time compute search strategies to improve the mathematical problem-solving capabilities of large language models (LLMs), the need for building robust verifiers has become increasingly important. However,…

Computation and Language · Computer Science 2025-03-11 Jung Hyun Lee , June Yong Yang , Byeongho Heo , Dongyoon Han , Kyungsu Kim , Eunho Yang , Kang Min Yoo

SSR: Speculative Parallel Scaling Reasoning in Test-time

Large language models (LLMs) have achieved impressive results on multi-step mathematical reasoning, yet at the cost of high computational overhead. This challenge is particularly acute for test-time scaling methods such as parallel…

Machine Learning · Computer Science 2026-03-24 Yuanlin Chu , Bo Wang , Xiang Liu , Hong Chen , Aiwei Liu , Xuming Hu

MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation

To reduce LLM costs and latency, semantic caching systems must accurately identify when a new prompt matches a cached one. Current methods often rely on simplistic similarity measures, which limit their effectiveness. We introduce…

Information Retrieval · Computer Science 2026-05-26 Ali Noshad , Zishan Zheng , Yinjun Wu

Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks

Test-time scaling (TTS) techniques can improve the performance of large language models (LLMs) at the expense of additional computation and latency. While TTS has proven effective in formal domains such as mathematics and programming, its…

Computation and Language · Computer Science 2025-10-31 Davide Romano , Jonathan Schwarz , Daniele Giofré

Parallel Strategies Selection

We consider the problem of selecting the best variable-value strategy for solving a given problem in constraint programming. We show that the recent Embarrassingly Parallel Search method (EPS) can be used for this purpose. EPS proposes to…

Artificial Intelligence · Computer Science 2016-04-25 Anthony Palmieri , Jean-Charles Régin , Pierre Schaus

Optimizing Sequential Multi-Step Tasks with Parallel LLM Agents

Large language model (LLM)-based multi-agent systems have demonstrated remarkable promise for tackling complex tasks by breaking them down into subtasks that are iteratively planned, executed, observed, and refined. Despite their…

Multiagent Systems · Computer Science 2025-07-15 Enhao Zhang , Erkang Zhu , Gagan Bansal , Adam Fourney , Hussein Mozannar , Jack Gerrits

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

Scaling test-time compute has emerged as a key strategy for enhancing the reasoning capabilities of large language models (LLMs), particularly in tasks like mathematical problem-solving. A traditional approach, Self-Consistency (SC),…

Computation and Language · Computer Science 2025-10-21 Nishad Singhi , Hritik Bansal , Arian Hosseini , Aditya Grover , Kai-Wei Chang , Marcus Rohrbach , Anna Rohrbach

MixML: A Unified Analysis of Weakly Consistent Parallel Learning

Parallelism is a ubiquitous method for accelerating machine learning algorithms. However, theoretical analysis of parallel learning is usually done in an algorithm- and protocol-specific setting, giving little insight about how changes in…

Machine Learning · Computer Science 2020-06-09 Yucheng Lu , Jack Nash , Christopher De Sa