English
Related papers

Related papers: Contextual Memory Trees

200 papers

The Stochastic Context Tree (SCOT) is a useful tool for studying infinite random sequences generated by an m-Markov Chain (m-MC). It captures the phenomenon that the probability distribution of the next state sometimes depends on less than…

Logic in Computer Science · Computer Science 2016-10-28 Tong Zhang

Transformer-based models show their effectiveness across multiple domains and tasks. The self-attention allows to combine information from all sequence elements into context-aware representations. However, global and local information has…

Computation and Language · Computer Science 2022-12-09 Aydar Bulatov , Yuri Kuratov , Mikhail S. Burtsev

Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However,…

Computation and Language · Computer Science 2024-12-11 Dongfang Li , Zetian Sun , Xinshuo Hu , Baotian Hu , Min Zhang

This work introduces the Eigen Memory Tree (EMT), a novel online memory model for sequential learning scenarios. EMTs store data at the leaves of a binary tree and route new samples through the structure using the principal components of…

Machine Learning · Computer Science 2022-11-01 Mark Rucker , Jordan T. Ash , John Langford , Paul Mineiro , Ida Momennejad

Transformer-based large language models (LLM) have been widely used in language processing applications. However, due to the memory constraints of the devices, most of them restrict the context window. Even though recurrent models in…

Computation and Language · Computer Science 2025-02-07 Zifan He , Yingqi Cao , Zongyue Qin , Neha Prakriya , Yizhou Sun , Jason Cong

Recent advancements in large language models have significantly improved their context windows, yet challenges in effective long-term memory management remain. We introduce MemTree, an algorithm that leverages a dynamic, tree-structured…

Computation and Language · Computer Science 2025-03-21 Alireza Rezazadeh , Zichao Li , Wei Wei , Yujia Bao

The quadratic complexity and indefinitely growing key-value (KV) cache of standard Transformers pose a major barrier to long-context processing. To overcome this, we introduce the Collaborative Memory Transformer (CoMeT), a novel…

Machine Learning · Computer Science 2026-04-20 Runsong Zhao , Shilei Liu , Jiwei Tang , Langming Liu , Haibin Chen , Weidong Zhang , Yujin Yuan , Tong Xiao , Jingbo Zhu , Wenbo Su , Bo Zheng

There has been growing interests in recent years from both practical and research perspectives for session-based recommendation tasks as long-term user profiles do not often exist in many real-life recommendation applications. In this case,…

Information Retrieval · Computer Science 2018-06-12 Fei Mi , Boi Faltings

Handling long-range dependencies in neural architectures has remained a persistent challenge due to computational limitations and inefficient contextual retention mechanisms. Tensorial operations have provided a foundation for restructuring…

Computation and Language · Computer Science 2025-08-11 Larin Tonix , Morgana Baskerville , Nathaniel Stourton , Ophelia Tattershall

Fast changing tasks in unpredictable, collaborative environments are typical for medium-small companies, where robotised applications are increasing. Thus, robot programs should be generated in short time with small effort, and the robot…

Robotics · Computer Science 2022-03-18 Oscar Gustavsson , Matteo Iovino , Jonathan Styrud , Christian Smith

This paper presents the Cascaded Metric Tree (CMT) for efficient satisfaction of metric search queries over a dataset of N objects. It provides extra information that permits query algorithms to exploit all distance calculations performed…

Data Structures and Algorithms · Computer Science 2021-12-22 Jeffrey Uhlmann , Miguel R. Zuniga

With a large proportion of people carrying location-aware smartphones, we have an unprecedented platform from which to understand individuals and predict their future actions. This work builds upon the Context Tree data structure that…

Artificial Intelligence · Computer Science 2016-10-06 Alasdair Thomason , Nathan Griffiths , Victor Sanchez

A Concept Tree is a structure for storing knowledge where the trees are stored in a database called a Concept Base. It sits between the highly distributed neural architectures and the distributed information systems, with the intention of…

Artificial Intelligence · Computer Science 2020-04-07 Kieran Greer

Measuring the quality of a generated sequence against a set of references is a central problem in many learning frameworks, be it to compute a score, to assign a reward, or to perform discrimination. Despite great advances in model…

Machine Learning · Computer Science 2020-03-06 Florian Schmidt , Thomas Hofmann

Latent tree learning(LTL) methods learn to parse sentences using only indirect supervision from a downstream task. Recent advances in latent tree learning have made it possible to recover moderately high quality tree structures by training…

Computation and Language · Computer Science 2019-09-24 Phu Mon Htut , Kyunghyun Cho , Samuel R. Bowman

The chain-structured long short-term memory (LSTM) has showed to be effective in a wide range of problems such as speech recognition and machine translation. In this paper, we propose to extend it to tree structures, in which a memory cell…

Computation and Language · Computer Science 2015-03-18 Xiaodan Zhu , Parinaz Sobhani , Hongyu Guo

Given the increasing interest in interpretable machine learning, classification trees have again attracted the attention of the scientific community because of their glass-box structure. These models are usually built using greedy…

Machine Learning · Computer Science 2023-05-16 Tommaso Aldinucci

Tree-based machine learning techniques, such as Decision Trees and Random Forests, are top performers in several domains as they do well with limited training datasets and offer improved interpretability compared to Deep Neural Networks…

Emerging Technologies · Computer Science 2021-10-27 Giacomo Pedretti , Catherine E. Graves , Can Li , Sergey Serebryakov , Xia Sheng , Martin Foltin , Ruibin Mao , John Paul Strachan

Humans can learn concepts or recognize items from just a handful of examples, while machines require many more samples to perform the same task. In this paper, we build a computational model to investigate the possibility of this kind of…

Artificial Intelligence · Computer Science 2016-11-09 Wen-Chieh Fang , Yi-ting Chiang

Memory retention mechanisms play a central role in determining the efficiency of computational architectures designed for processing extended sequences. Conventional methods for token management often impose fixed retention thresholds or…

Computation and Language · Computer Science 2025-03-26 Jonathan Delena , Augustin Moreau , Dominic Ravensdale , Frederick Chatterton
‹ Prev 1 2 3 10 Next ›