English
Related papers

Related papers: Learning Randomized Reductions

200 papers

Insects, such as fruit flies and honey bees, can solve simple associative learning tasks and learn abstract concepts such as "sameness" and "difference", which is viewed as a higher-order cognitive function and typically thought to depend…

Computer Vision and Pattern Recognition · Computer Science 2021-09-15 Jinyung Hong , Theodore P. Pavlic

Reinforcement learning with verifiable rewards (RLVR) has advanced the reasoning capabilities of large language models. However, existing methods rely solely on outcome rewards, without explicitly optimizing verification or leveraging…

Software Engineering · Computer Science 2025-10-22 Yiyang Jin , Kunzhao Xu , Hang Li , Xueting Han , Yanmin Zhou , Cheng Li , Jing Bai

Machine-learning methods are gradually being adopted in a wide variety of social, economic, and scientific contexts, yet they are notorious for struggling with exact mathematics. A typical example is computer algebra, which includes tasks…

Machine Learning · Computer Science 2024-11-06 Lennart Dabelow , Masahito Ueda

Apart from the high accuracy of machine learning models, what interests many researchers in real-life problems (e.g., fraud detection, credit scoring) is to find hidden patterns in data; particularly when dealing with their challenging…

Entity recognition in Automatic Speech Recognition (ASR) is challenging for rare and domain-specific terms. In domains such as finance, medicine, and air traffic control, these errors are costly. If the entities are entirely absent from the…

Computation and Language · Computer Science 2026-03-18 Abhishek Kumar , Aashraya Sachdeva

In neuroscience, attention has been shown to bidirectionally interact with reinforcement learning (RL) processes. This interaction is thought to support dimensionality reduction of task representations, restricting computations to relevant…

Artificial Intelligence · Computer Science 2020-07-14 Lennart Bramlage , Aurelio Cortese

This paper systematically addresses the challenges of rule retrieval, a crucial yet underexplored area. Vanilla retrieval methods using sparse or dense retrievers to directly search for relevant rules to support downstream reasoning, often…

Computation and Language · Computer Science 2025-05-19 Ziyang Huang , Wangtao Sun , Jun Zhao , Kang Liu

Abstract reasoning is a cornerstone of human intelligence, and replicating it with artificial intelligence (AI) presents an ongoing challenge. This study focuses on efficiently solving Raven's progressive matrices (RPM), a visual test for…

Machine Learning · Computer Science 2024-01-30 Michael Hersche , Francesco di Stefano , Thomas Hofmann , Abu Sebastian , Abbas Rahimi

Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a key method for improving Large Language Models' reasoning capabilities, yet recent evidence suggests it may paradoxically shrink the reasoning boundary rather than…

Artificial Intelligence · Computer Science 2025-10-03 Phuc Minh Nguyen , Chinh D. La , Duy M. H. Nguyen , Nitesh V. Chawla , Binh T. Nguyen , Khoa D. Doan

While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as competitive programming (CP), existing methods predominantly focus on single-attempt settings, overlooking their capacity for…

Artificial Intelligence · Computer Science 2026-04-02 Shaopeng Fu , Xingxing Zhang , Li Dong , Di Wang , Furu Wei

Search agents powered by Large Language Models (LLMs) have demonstrated significant potential in tackling knowledge-intensive tasks. Reinforcement learning (RL) has emerged as a powerful paradigm for training these agents to perform…

Computation and Language · Computer Science 2026-05-11 Shiyu Li , Yang Tang , Yifan Wang , Peiming Li , Xi Chen

Modern industrial applications increasingly demand language models that act as agents, capable of multi-step reasoning and tool use in real-world settings. These tasks are typically performed under strict cost and latency constraints,…

Computation and Language · Computer Science 2026-04-24 Yuanjie Lyu , Chengyu Wang , Haonan Zheng , Yuanhao Yue , Junbing Yan , Ming Wang , Jun Huang

Neural retrievers are effective but brittle: underspecified or ambiguous queries can misdirect ranking even when relevant documents exist. Existing approaches address this brittleness only partially: LLMs rewrite queries without retriever…

Information Retrieval · Computer Science 2026-02-13 Moncef Garouani , Josiane Mothe

Symbolic regression (SR), the automated discovery of mathematical expressions from data, is a cornerstone of scientific inquiry. However, it is often hindered by the combinatorial explosion of the search space and a tendency to overfit.…

Recently, the emergence of agentic RL has showcased that RL could also effectively improve the agentic reasoning ability of LLMs, yet the key design principles and optimal practices remain unclear. In this work, we conduct a comprehensive…

Computation and Language · Computer Science 2025-10-14 Zhaochen Yu , Ling Yang , Jiaru Zou , Shuicheng Yan , Mengdi Wang

Test-time scaling has been shown to substantially improve large language models' (LLMs) mathematical reasoning. However, for a large portion of mathematical corpora, especially theorem proving, RLVR's scalability is limited: intermediate…

Computation and Language · Computer Science 2025-11-24 Zhen Wang , Zhifeng Gao , Guolin Ke

Medical image analysis increasingly relies on large vision-language models (VLMs), yet most systems remain single-pass black boxes that offer limited control over reasoning, safety, and spatial grounding. We propose R^4, an agentic…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Md. Faiyaz Abdullah Sayeedi , Rashedur Rahman , Siam Tahsin Bhuiyan , Sefatul Wasi , Ashraful Islam , Saadia Binte Alam , AKM Mahbubur Rahman

Reinforcement Learning (RL) in games has gained significant momentum in recent years, enabling the creation of different agent behaviors that can transform a player's gaming experience. However, deploying RL agents in production…

Artificial Intelligence · Computer Science 2025-07-01 António Afonso , Iolanda Leite , Alessandro Sestini , Florian Fuchs , Konrad Tollmar , Linus Gisslén

In this work, we propose Reinforced Functional Token Tuning (RFTT), a novel reinforced fine-tuning framework that empowers Large Language Models (LLMs) with self-play learn-to-reason capabilities. Unlike prior prompt-driven reasoning…

Artificial Intelligence · Computer Science 2025-02-20 Kongcheng Zhang , Qi Yao , Baisheng Lai , Jiaxing Huang , Wenkai Fang , Dacheng Tao , Mingli Song , Shunyu Liu

Agentic search has recently emerged as a powerful paradigm, where an agent interleaves multi-step reasoning with on-demand retrieval to solve complex questions. Despite its success, how to design a retriever for agentic search remains…

Information Retrieval · Computer Science 2026-01-22 Wenhan Liu , Xinyu Ma , Yutao Zhu , Yuchen Li , Daiting Shi , Dawei Yin , Zhicheng Dou
‹ Prev 1 2 3 10 Next ›