English
Related papers

Related papers: Parallel Test-Time Scaling with Multi-Sequence Ver…

200 papers

Test-time scaling for complex reasoning tasks shows that leveraging inference-time compute, by methods such as independently sampling and aggregating multiple solutions, results in significantly better task outcomes. However, a critical…

By utilizing more computational resources at test-time, large language models (LLMs) can improve without additional training. One common strategy uses verifiers to evaluate candidate outputs. In this work, we propose a novel scaling…

Artificial Intelligence · Computer Science 2025-02-28 Shalev Lifshitz , Sheila A. McIlraith , Yilun Du

Test-time scaling (TTS) has emerged as a new frontier for scaling the performance of Large Language Models. In test-time scaling, by using more computational resources during inference, LLMs can improve their reasoning process and task…

Computation and Language · Computer Science 2025-09-10 V Venktesh , Mandeep Rathee , Avishek Anand

Test-time compute can be scaled both sequentially and in parallel. Sequential scaling involves lengthening the generation process, while parallel scaling involves verifying and selecting among multiple candidate outputs. Combining these two…

Artificial Intelligence · Computer Science 2025-10-08 Weihao Zeng , Keqing He , Chuqiao Kuang , Xiaoguang Li , Junxian He

Large language models (LLMs) struggle with multi-step reasoning, where inference-time scaling has emerged as a promising strategy for performance improvement. Verifier-guided search outperforms repeated sampling when sample size is limited…

Computation and Language · Computer Science 2025-02-04 Fei Yu , Yingru Li , Benyou Wang

Recent advancements in Large Language Models (LLMs) have created new opportunities to enhance performance on complex reasoning tasks by leveraging test-time computation. However, existing scaling methods have key limitations: parallel…

Artificial Intelligence · Computer Science 2025-12-04 Jiefeng Chen , Jie Ren , Xinyun Chen , Chengrun Yang , Ruoxi Sun , Jinsung Yoon , Sercan Ö Arık

Speaker Verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-06 Kerlos Atia Abdalmalak , Ascensión Gallardo-Antol'in

Support Vector Machines (SVM), a popular machine learning technique, has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. Whether it is identifying high-risk patients by…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-06-20 Jeyanthi Narasimhan , Abhinav Vishnu , Lawrence Holder , Adolfy Hoisie

Test-time scaling (TTS) has gained widespread attention for enhancing LLM reasoning. Existing approaches such as Best-of-N and majority voting are limited as their performance depends on the quality of candidate responses, making them…

Machine Learning · Computer Science 2026-04-28 Qibin Wang , Pu Zhao , Shaohan Huang , Fangkai Yang , Lu Wang , Furu Wei , Qingwei Lin , Saravan Rajmohan , Dongmei Zhang

In this paper, we consider an approach to the parallelizing of the algorithms realizing the modified probability changigng method with adaptation and partial rollback procedure for constrained pseudo-Boolean optimization problems. Existing…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-09-03 Lev Kazakovtsev

As one of the most popular classifiers, linear SVMs still have challenges in dealing with very large-scale problems, even though linear or sub-linear algorithms have been developed recently on single machines. Parallel computing methods…

Machine Learning · Computer Science 2015-12-25 Hugh Perkins , Minjie Xu , Jun Zhu , Bo Zhang

Test-time scaling via solution sampling and aggregation has become a key paradigm for improving the reasoning performance of Large Language Models (LLMs). While reward model selection is commonly employed in this approach, it often fails to…

Machine Learning · Computer Science 2025-09-30 Zhicheng Yang , Zhijiang Guo , Yinya Huang , Yongxin Wang , Yiwei Wang , Xiaodan Liang , Jing Tang

With the rapid advancement of test-time compute search strategies to improve the mathematical problem-solving capabilities of large language models (LLMs), the need for building robust verifiers has become increasingly important. However,…

Computation and Language · Computer Science 2025-03-11 Jung Hyun Lee , June Yong Yang , Byeongho Heo , Dongyoon Han , Kyungsu Kim , Eunho Yang , Kang Min Yoo

Large language models (LLMs) have achieved impressive results on multi-step mathematical reasoning, yet at the cost of high computational overhead. This challenge is particularly acute for test-time scaling methods such as parallel…

Machine Learning · Computer Science 2026-03-24 Yuanlin Chu , Bo Wang , Xiang Liu , Hong Chen , Aiwei Liu , Xuming Hu

To reduce LLM costs and latency, semantic caching systems must accurately identify when a new prompt matches a cached one. Current methods often rely on simplistic similarity measures, which limit their effectiveness. We introduce…

Information Retrieval · Computer Science 2026-05-26 Ali Noshad , Zishan Zheng , Yinjun Wu

Test-time scaling (TTS) techniques can improve the performance of large language models (LLMs) at the expense of additional computation and latency. While TTS has proven effective in formal domains such as mathematics and programming, its…

Computation and Language · Computer Science 2025-10-31 Davide Romano , Jonathan Schwarz , Daniele Giofré

We consider the problem of selecting the best variable-value strategy for solving a given problem in constraint programming. We show that the recent Embarrassingly Parallel Search method (EPS) can be used for this purpose. EPS proposes to…

Artificial Intelligence · Computer Science 2016-04-25 Anthony Palmieri , Jean-Charles Régin , Pierre Schaus

Large language model (LLM)-based multi-agent systems have demonstrated remarkable promise for tackling complex tasks by breaking them down into subtasks that are iteratively planned, executed, observed, and refined. Despite their…

Multiagent Systems · Computer Science 2025-07-15 Enhao Zhang , Erkang Zhu , Gagan Bansal , Adam Fourney , Hussein Mozannar , Jack Gerrits

Scaling test-time compute has emerged as a key strategy for enhancing the reasoning capabilities of large language models (LLMs), particularly in tasks like mathematical problem-solving. A traditional approach, Self-Consistency (SC),…

Computation and Language · Computer Science 2025-10-21 Nishad Singhi , Hritik Bansal , Arian Hosseini , Aditya Grover , Kai-Wei Chang , Marcus Rohrbach , Anna Rohrbach

Parallelism is a ubiquitous method for accelerating machine learning algorithms. However, theoretical analysis of parallel learning is usually done in an algorithm- and protocol-specific setting, giving little insight about how changes in…

Machine Learning · Computer Science 2020-06-09 Yucheng Lu , Jack Nash , Christopher De Sa
‹ Prev 1 2 3 10 Next ›