Related papers: Tiny Autoregressive Recursive Models

200 papers

Tiny Recursive Models (TRM) were proposed as a parameter-efficient alternative to large language models for solving Abstraction and Reasoning Corpus (ARC) style tasks. The original work reports strong performance and suggests that recursive…

Machine Learning · Computer Science 2026-01-12 Antonio Roye-Azar, Santiago Vargas-Naranjo, Dhruv Ghai, Nithin Balamurugan, Rayan Amir

Neural reasoners such as Tiny Recursive Models (TRMs) solve complex problems by combining neural backbones with specialized inference schemes. Such inference schemes have been a central component of stochastic reasoning systems, where…

Machine Learning · Computer Science 2026-03-06 Mieszko Komisarczyk, Saurabh Mathur, Maurice Kraus, Sriraam Natarajan, Kristian Kersting

Hierarchical Reasoning Model (HRM) is a novel approach using two small neural networks recursing at different frequencies. This biologically inspired method beats large language models (LLMs) on hard puzzle tasks such as Sudoku, Maze, and…

Machine Learning · Computer Science 2025-10-07 Alexia Jolicoeur-Martineau

Prior to the close of the 2025 ARC Prize competition, the leading open-source approach, known as TRM (Tiny Recursive Models), involved training a 7M-parameter recursive neural network on augmented variants of ARC tasks. That approach…

Machine Learning · Computer Science 2025-11-06 Ronan Killian McGovern

Recursive reasoning models such as Hierarchical Reasoning Model (HRM) and Tiny Recursive Model (TRM) show that small, weight-shared networks can solve compute-heavy and NP puzzles by iteratively refining latent states, but their training…

Artificial Intelligence · Computer Science 2026-03-18 Navid Hakimi

Tiny Recursive Models (TRM) solve complex reasoning tasks with a fraction of the parameters of modern large language models (LLMs) by iteratively refining a latent state and final answer. While powerful, their deterministic recursion can…

Artificial Intelligence · Computer Science 2026-05-20 Amin Sghaier, Ali Parviz, Alexia Jolicoeur-Martineau

Neural network controllers increasingly demand millions of parameters, and language model approaches push into the billions. For embedded aerospace systems with strict power and latency constraints, this scaling is prohibitive. We present…

Machine Learning · Computer Science 2025-12-19 Amit Jain, Richard Linares

We introduce Tab-TRM (Tabular-Tiny Recursive Model), a network architecture that adapts the recursive latent reasoning paradigm of Tiny Recursive Models (TRMs) to insurance modeling. Drawing inspiration from both the Hierarchical Reasoning…

Machine Learning · Computer Science 2026-01-13 Kishan Padayachy, Ronald Richman, Mario V. Wüthrich

Looped transformers scale computational depth without increasing parameter count by repeatedly applying a shared transformer block and can be used for iterative refinement, where each loop rewrites a full fixed-size prediction in parallel.…

Machine Learning · Computer Science 2026-04-22 Chris Cameron, Wangzheng Wang, Nikita Ivanov, Ashmita Bhattacharyya, Didier Chételat, Yingxue Zhang

The success of deep learning in computer vision has been driven by models of increasing scale, from deep Convolutional Neural Networks (CNN) to large Vision Transformers (ViT). While effective, these architectures are parameter-intensive…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Ange-Clément Akazan, Abdoulaye Koroko, Verlon Roel Mbingui, Choukouriyah Arinloye, Hassan Fifen, Rose Bandolo

Tiny Recursive Models (TRM) achieve strong results on reasoning tasks through iterative refinement of a shared network. We investigate whether these recursive mechanisms transfer to Quality Estimation (QE) for low-resource languages using a…

Computation and Language · Computer Science 2026-03-17 Umar Abubacar, Roman Bauer, Diptesh Kanojia

How should future neural reasoning systems implement extended computation? Recursive Reasoning Models (RRMs) offer a promising alternative to autoregressive sequence extension by performing iterative latent-state refinement with shared…

Artificial Intelligence · Computer Science 2026-05-21 Junyeob Baek, Mingyu Jo, Minsu Kim, Mengye Ren, Yoshua Bengio, Sungjin Ahn

Large Multimodal Models (LMMs) have achieved remarkable success in vision-language tasks, yet their vast parameter counts are often underutilized during both training and inference. In this work, we embrace the idea of looping back to move…

Machine Learning · Computer Science 2026-02-11 Ruihan Xu, Yuting Gao, Lan Wang, Jianing Li, Weihao Chen, Qingpei Guo, Ming Yang, Shiliang Zhang

Modern Large Language Models (LLMs) have shown rapid improvements in reasoning capabilities, driven largely by reinforcement learning (RL) with verifiable rewards. Here, we ask whether these LLMs can self-improve without the need for…

Computation and Language · Computer Science 2026-02-04 Yufan Zhuang, Chandan Singh, Liyuan Liu, Yelong Shen, Dinghuai Zhang, Jingbo Shang, Jianfeng Gao, Weizhu Chen

Large Reasoning Models (LRMs) achieve strong performance on complex tasks through extended chains of thought but suffer from high inference latency due to autoregressive reasoning. Recent work explores using Small Reasoning Models (SRMs) to…

Artificial Intelligence · Computer Science 2026-04-17 Yi Zhao, Yajuan Peng, Cam-Tu Nguyen, Zuchao Li, Xiaoliang Wang, Xiaoming Fu, Hai Zhao

Generative sequence modeling faces a fundamental tension between the expressivity of Transformers and the efficiency of linear sequence models. Existing efficient architectures are theoretically bounded by shallow, single-step linear…

Machine Learning · Computer Science 2026-02-13 Jie Jiang, Ke Cheng, Xin Xu, Mengyang Pang, Tianhao Lu, Jiaheng Li, Yue Liu, Yuan Wang, Jun Zhang, Huan Yu, Zhouchen Lin

Performative prediction is a framework accounting for the shift in the data distribution induced by the prediction of a model deployed in the real world. Ensuring rapid convergence to a stable solution where the data distribution remains…

Machine Learning · Computer Science 2026-01-30 Pedram Khorsandi, Rushil Gupta, Mehrnaz Mofakhami, Simon Lacoste-Julien, Gauthier Gidel

The recursive model index (RMI) has recently been introduced as a machine-learned replacement for traditional indexes over sorted data, achieving remarkably fast lookups. Follow-up work focused on explaining RMI's performance and…

Databases · Computer Science 2021-11-23 Marcel Maltry, Jens Dittrich

Non-autoregressive models greatly improve decoding speed over typical sequence-to-sequence models, but suffer from degraded performance. Infilling and iterative refinement models make up some of this gap by editing the outputs of a…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-28 Ethan A. Chi, Julian Salazar, Katrin Kirchhoff

The rapid progress seen in terms of large-scale generative AI is largely based on the attention mechanism. It is conversely non-trivial to conceive small-scale applications for which attention-based architectures outperform traditional…

Machine Learning · Computer Science 2025-08-07 Claudius Gros