Related papers: Learning Randomized Reductions

An Insect-Inspired Randomly, Weighted Neural Network with Random Fourier Features For Neuro-Symbolic Relational Learning

Insects, such as fruit flies and honey bees, can solve simple associative learning tasks and learn abstract concepts such as "sameness" and "difference", which is viewed as a higher-order cognitive function and typically thought to depend…

Computer Vision and Pattern Recognition · Computer Science 2021-09-15 Jinyung Hong, Theodore P. Pavlic

ReVeal: Self-Evolving Code Agents via Reliable Self-Verification

Reinforcement learning with verifiable rewards (RLVR) has advanced the reasoning capabilities of large language models. However, existing methods rely solely on outcome rewards, without explicitly optimizing verification or leveraging…

Software Engineering · Computer Science 2025-10-22 Yiyang Jin, Kunzhao Xu, Hang Li, Xueting Han, Yanmin Zhou, Cheng Li, Jing Bai

Symbolic Equation Solving via Reinforcement Learning

Machine-learning methods are gradually being adopted in a wide variety of social, economic, and scientific contexts, yet they are notorious for struggling with exact mathematics. A typical example is computer algebra, which includes tasks…

Machine Learning · Computer Science 2024-11-06 Lennart Dabelow, Masahito Ueda

Self-Reinforcement Attention Mechanism For Tabular Learning

Apart from the high accuracy of machine learning models, what interests many researchers in real-life problems (e.g., fraud detection, credit scoring) is to find hidden patterns in data; particularly when dealing with their challenging…

Machine Learning · Computer Science 2023-05-22 Kodjo Mawuena Amekoe, Mohamed Djallel Dilmi, Hanene Azzag, Mustapha Lebbah, Zaineb Chelly Dagdia, Gregoire Jaffre

RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery

Entity recognition in Automatic Speech Recognition (ASR) is challenging for rare and domain-specific terms. In domains such as finance, medicine, and air traffic control, these errors are costly. If the entities are entirely absent from the…

Computation and Language · Computer Science 2026-03-18 Abhishek Kumar, Aashraya Sachdeva

Attention or memory? Neurointerpretable agents in space and time

In neuroscience, attention has been shown to bidirectionally interact with reinforcement learning (RL) processes. This interaction is thought to support dimensionality reduction of task representations, restricting computations to relevant…

Artificial Intelligence · Computer Science 2020-07-14 Lennart Bramlage, Aurelio Cortese

Improve Rule Retrieval and Reasoning with Self-Induction and Relevance ReEstimate

This paper systematically addresses the challenges of rule retrieval, a crucial yet underexplored area. Vanilla retrieval methods using sparse or dense retrievers to directly search for relevant rules to support downstream reasoning, often…

Computation and Language · Computer Science 2025-05-19 Ziyang Huang, Wangtao Sun, Jun Zhao, Kang Liu

Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architectures

Abstract reasoning is a cornerstone of human intelligence, and replicating it with artificial intelligence (AI) presents an ongoing challenge. This study focuses on efficiently solving Raven's progressive matrices (RPM), a visual test for…

Machine Learning · Computer Science 2024-01-30 Michael Hersche, Francesco di Stefano, Thomas Hofmann, Abu Sebastian, Abbas Rahimi

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models

Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a key method for improving Large Language Models' reasoning capabilities, yet recent evidence suggests it may paradoxically shrink the reasoning boundary rather than…

Artificial Intelligence · Computer Science 2025-10-03 Phuc Minh Nguyen, Chinh D. La, Duy M. H. Nguyen, Nitesh V. Chawla, Binh T. Nguyen, Khoa D. Doan

RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning

While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as competitive programming (CP), existing methods predominantly focus on single-attempt settings, overlooking their capacity for…

Artificial Intelligence · Computer Science 2026-04-02 Shaopeng Fu, Xingxing Zhang, Li Dong, Di Wang, Furu Wei

ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards

Search agents powered by Large Language Models (LLMs) have demonstrated significant potential in tackling knowledge-intensive tasks. Reinforcement learning (RL) has emerged as a powerful paradigm for training these agents to perform…

Computation and Language · Computer Science 2026-05-11 Shiyu Li, Yang Tang, Yifan Wang, Peiming Li, Xi Chen

AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use

Modern industrial applications increasingly demand language models that act as agents, capable of multi-step reasoning and tool use in real-world settings. These tasks are typically performed under strict cost and latency constraints,…

Computation and Language · Computer Science 2026-04-24 Yuanjie Lyu, Chengyu Wang, Haonan Zheng, Yuanhao Yue, Junbing Yan, Ming Wang, Jun Huang

Improving Neural Retrieval with Attribution-Guided Query Rewriting

Neural retrievers are effective but brittle: underspecified or ambiguous queries can misdirect ranking even when relevant documents exist. Existing approaches address this brittleness only partially: LLMs rewrite queries without retriever…

Information Retrieval · Computer Science 2026-02-13 Moncef Garouani, Josiane Mothe

Iterated Agent for Symbolic Regression

Symbolic regression (SR), the automated discovery of mathematical expressions from data, is a cornerstone of scientific inquiry. However, it is often hindered by the combinatorial explosion of the search space and a tendency to overfit.…

Computational Physics · Physics 2025-10-10 Zhuo-Yang Song, Zeyu Cai, Shutao Zhang, Jiashen Wei, Jichen Pan, Shi Qiu, Qing-Hong Cao, Tie-Jiun Hou, Xiaohui Liu, Ming-xing Luo, Hua Xing Zhu

Demystifying Reinforcement Learning in Agentic Reasoning

Recently, the emergence of agentic RL has showcased that RL could also effectively improve the agentic reasoning ability of LLMs, yet the key design principles and optimal practices remain unclear. In this work, we conduct a comprehensive…

Computation and Language · Computer Science 2025-10-14 Zhaochen Yu, Ling Yang, Jiaru Zou, Shuicheng Yan, Mengdi Wang

Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards

Test-time scaling has been shown to substantially improve large language models' (LLMs) mathematical reasoning. However, for a large portion of mathematical corpora, especially theorem proving, RLVR's scalability is limited: intermediate…

Computation and Language · Computer Science 2025-11-24 Zhen Wang, Zhifeng Gao, Guolin Ke

Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging

Medical image analysis increasingly relies on large vision-language models (VLMs), yet most systems remain single-pass black boxes that offer limited control over reasoning, safety, and spatial grounding. We propose R^4, an agentic…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Md. Faiyaz Abdullah Sayeedi, Rashedur Rahman, Siam Tahsin Bhuiyan, Sefatul Wasi, Ashraful Islam, Saadia Binte Alam, AKM Mahbubur Rahman

Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games

Reinforcement Learning (RL) in games has gained significant momentum in recent years, enabling the creation of different agent behaviors that can transform a player's gaming experience. However, deploying RL agents in production…

Artificial Intelligence · Computer Science 2025-07-01 António Afonso, Iolanda Leite, Alessandro Sestini, Florian Fuchs, Konrad Tollmar, Linus Gisslén

Reasoning with Reinforced Functional Token Tuning

In this work, we propose Reinforced Functional Token Tuning (RFTT), a novel reinforced fine-tuning framework that empowers Large Language Models (LLMs) with self-play learn-to-reason capabilities. Unlike prior prompt-driven reasoning…

Artificial Intelligence · Computer Science 2025-02-20 Kongcheng Zhang, Qi Yao, Baisheng Lai, Jiaxing Huang, Wenkai Fang, Dacheng Tao, Mingli Song, Shunyu Liu

Agentic-R: Learning to Retrieve for Agentic Search

Agentic search has recently emerged as a powerful paradigm, where an agent interleaves multi-step reasoning with on-demand retrieval to solve complex questions. Despite its success, how to design a retriever for agentic search remains…

Information Retrieval · Computer Science 2026-01-22 Wenhan Liu, Xinyu Ma, Yutao Zhu, Yuchen Li, Daiting Shi, Dawei Yin, Zhicheng Dou