Related papers: Form Follows Function: Recursive Stem Model

200 papers

Hierarchical Reasoning Model (HRM) is a novel approach using two small neural networks recursing at different frequencies. This biologically inspired method beats large language models (LLMs) on hard puzzle tasks such as Sudoku, Maze, and…

Machine Learning · Computer Science 2025-10-07 Alexia Jolicoeur-Martineau

Tiny Recursive Models (TRM) solve complex reasoning tasks with a fraction of the parameters of modern large language models (LLMs) by iteratively refining a latent state and final answer. While powerful, their deterministic recursion can…

Artificial Intelligence · Computer Science 2026-05-20 Amin Sghaier, Ali Parviz, Alexia Jolicoeur-Martineau

Neural reasoners such as Tiny Recursive Models (TRMs) solve complex problems by combining neural backbones with specialized inference schemes. Such inference schemes have been a central component of stochastic reasoning systems, where…

Machine Learning · Computer Science 2026-03-06 Mieszko Komisarczyk, Saurabh Mathur, Maurice Kraus, Sriraam Natarajan, Kristian Kersting

Current large language models (LLMs) primarily rely on linear sequence generation and massive parameter counts, yet they severely struggle with complex algorithmic reasoning. While recent reasoning architectures, such as the Hierarchical…

Artificial Intelligence · Computer Science 2026-03-25 Vasiliy A. Es'kin, Mikhail E. Smorkalov

How should future neural reasoning systems implement extended computation? Recursive Reasoning Models (RRMs) offer a promising alternative to autoregressive sequence extension by performing iterative latent-state refinement with shared…

Artificial Intelligence · Computer Science 2026-05-21 Junyeob Baek, Mingyu Jo, Minsu Kim, Mengye Ren, Yoshua Bengio, Sungjin Ahn

Tiny Recursive Models (TRMs) have recently demonstrated remarkable performance on ARC-AGI, showing that very small models can compete against large foundation models through a two-step refinement mechanism that updates an internal reasoning…

Machine Learning · Computer Science 2026-03-10 Paulius Rauba, Claudio Fanconi, Mihaela van der Schaar

Tiny Recursive Models (TRM) were proposed as a parameter-efficient alternative to large language models for solving Abstraction and Reasoning Corpus (ARC) style tasks. The original work reports strong performance and suggests that recursive…

Machine Learning · Computer Science 2026-01-12 Antonio Roye-Azar, Santiago Vargas-Naranjo, Dhruv Ghai, Nithin Balamurugan, Rayan Amir

Reasoning, the process of devising and executing complex goal-oriented action sequences, remains a critical challenge in AI. Current large language models (LLMs) primarily employ Chain-of-Thought (CoT) techniques, which suffer from brittle…

Artificial Intelligence · Computer Science 2025-08-05 Guan Wang, Jin Li, Yuhao Sun, Xing Chen, Changling Liu, Yue Wu, Meng Lu, Sen Song, Yasin Abbasi Yadkori

Recent studies show that Large Language Models (LLMs) achieve strong reasoning capabilities through supervised fine-tuning or reinforcement learning. However, a key approach, the Process Reward Model (PRM), suffers from reward hacking,…

Computation and Language · Computer Science 2026-04-10 Teng Wang, Zhangyi Jiang, Zhenqi He, Shenyang Tong, Wenhan Yang, Yanan Zheng, Zeyu Li, Zifan He, Hailei Gong, Zewen Ye, Shengjie Ma, Jianping Zhang

We introduce Tab-TRM (Tabular-Tiny Recursive Model), a network architecture that adapts the recursive latent reasoning paradigm of Tiny Recursive Models (TRMs) to insurance modeling. Drawing inspiration from both the Hierarchical Reasoning…

Machine Learning · Computer Science 2026-01-13 Kishan Padayachy, Ronald Richman, Mario V. Wüthrich

Reasoning problems such as Sudoku and ARC-AGI remain challenging for neural networks. The structured problem solving architecture family of Recurrent Reasoning Models (RRMs), including Hierarchical Reasoning Model (HRM) and Tiny Recursive…

Machine Learning · Computer Science 2026-03-03 Richard Freinschlag, Timo Bertram, Erich Kobler, Andreas Mayr, Günter Klambauer

Hierarchical reasoning model (HRM) achieves extraordinary performance on various reasoning tasks, significantly outperforming large language model-based reasoners. To understand the strengths and potential failure modes of HRM, we conduct a…

Artificial Intelligence · Computer Science 2026-03-24 Zirui Ren, Ziming Liu

Recent advancements in large reasoning models (LRMs) have significantly enhanced language models' capabilities in complex problem-solving by emulating human-like deliberative thinking. However, these models often exhibit overthinking (i.e.,…

Artificial Intelligence · Computer Science 2025-06-19 Weixiang Zhao, Jiahe Guo, Yang Deng, Xingyu Sui, Yulin Hu, Yanyan Zhao, Wanxiang Che, Bing Qin, Tat-Seng Chua, Ting Liu

Large Reasoning Models (LRMs) achieve strong performance on complex tasks through extended chains of thought but suffer from high inference latency due to autoregressive reasoning. Recent work explores using Small Reasoning Models (SRMs) to…

Artificial Intelligence · Computer Science 2026-04-17 Yi Zhao, Yajuan Peng, Cam-Tu Nguyen, Zuchao Li, Xiaoliang Wang, Xiaoming Fu, Hai Zhao

Large reasoning models (LRMs) have recently shown promise in solving complex math problems when optimized with Reinforcement Learning (RL). But conventional approaches rely on outcome-only rewards that provide sparse feedback, resulting in…

Machine Learning · Computer Science 2025-08-01 Tao He, Rongchuan Mu, Lizi Liao, Yixin Cao, Ming Liu, Bing Qin

In nature, the behaviors of many complex systems can be described by parsimonious math equations. Automatically distilling these equations from limited data is cast as a symbolic regression process which hitherto remains a grand challenge.…

Machine Learning · Computer Science 2023-05-25 Yilong Xu, Yang Liu, Hao Sun

Large reasoning models (LRMs) achieve remarkable performance via long reasoning chains, but often incur excessive computational overhead due to redundant reasoning, especially on simple tasks. In this work, we systematically quantify the…

Artificial Intelligence · Computer Science 2025-05-26 Xiaoyun Zhang, Jingqing Ruan, Xing Ma, Yawen Zhu, Haodong Zhao, Hao Li, Jiansong Chen, Ke Zeng, Xunliang Cai

Large language models (LLMs) inevitably make mistakes when performing step-by-step mathematical reasoning. Process Reward Models (PRMs) have emerged as a promising solution by evaluating each reasoning step. However, existing PRMs typically…

Computation and Language · Computer Science 2025-03-28 Shuaijie She, Junxiao Liu, Yifeng Liu, Jiajun Chen, Xin Huang, Shujian Huang

Recursive architectures such as Tiny Recursive Models (TRMs) perform implicit reasoning through iterative latent computation, yet the geometric structure of these reasoning trajectories remains poorly understood. We investigate the…

Machine Learning · Computer Science 2026-04-21 Ege Çakar, Ketan Ali Raghu, Lia Zheng

Reward models play a critical role in guiding large language models toward outputs that align with human expectations. However, an open challenge remains in effectively utilizing test-time compute to enhance reward model performance. In…

Computation and Language · Computer Science 2025-05-21 Jiaxin Guo, Zewen Chi, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei