Related papers: Contextually Structured Token Dependency Encoding …

Contextual Morphogenesis in Large Language Models: A Novel Approach to Self-Organizing Token Representations

Token representations influence the efficiency and adaptability of language models, yet conventional tokenization strategies impose rigid segmentation boundaries that do not adjust dynamically to evolving contextual relationships. The…

Computation and Language · Computer Science 2025-08-11 Alistair Dombrowski, Beatrix Engelhardt, Dimitri Fairbrother, Henry Evidail

Exploring Contextual Flux in Large Language Models: A Novel Approach to Self-Modulating Semantic Networks

Self-modulating mechanisms introduce dynamic adaptation capabilities within language models through contextual realignment strategies that influence token embedding trajectories across extended sequences. Contextual Flux is explored as an…

Computation and Language · Computer Science 2025-08-11 Henry Evidail, Zachary Mountebank, Alistair Hathersage, Peter Stanhope, Basil Ravenscroft, Tobias Waddingham

Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation

Contextual adaptation in token embeddings plays a central role in determining how well language models maintain coherence and retain semantic relationships over extended text sequences. Static embeddings often impose constraints on lexical…

Computation and Language · Computer Science 2025-03-27 Koinis Vassilis, Godfrey Milbourne, Harriet Featherstone, Xanthe Peverell, Yorick Bletchley, Zachary Montford

Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction

Memory retention challenges in deep neural architectures continue to limit the ability to process and recall extended contextual information. Token dependencies degrade as sequence length increases, leading to a decline in…

Computation and Language · Computer Science 2025-03-26 Frederick Dillon, Gregor Halvorsen, Simon Tattershall, Magnus Rowntree, Gareth Vanderpool

Integrating Dependency Tree Into Self-attention for Sentence Representation

Recent progress on parse tree encoders for sentence representation learning is notable. However, these works mainly encode tree structures recursively, which is not conducive to parallelization. On the other hand, these works rarely take…

Computation and Language · Computer Science 2022-05-10 Junhua Ma, Jiajun Li, Yuxuan Liu, Shangbo Zhou, Xue Li

Structured Convergence in Large Language Model Representations via Hierarchical Latent Space Folding

Token representations in high-dimensional latent spaces often exhibit redundancy, limiting computational efficiency and reducing structural coherence across model layers. Hierarchical latent space folding introduces a structured…

Computation and Language · Computer Science 2025-08-11 Fenella Harcourt, Naderdel Piero, Gilbert Sutherland, Daphne Holloway, Harriet Bracknell, Julian Ormsby

Context-Aware Self-Attention Networks

Self-attention models have shown flexibility in parallel computation and effectiveness in modeling both long- and short-term dependencies. However, they calculate the dependencies between representations without considering the…

Computation and Language · Computer Science 2019-02-18 Baosong Yang, Jian Li, Derek Wong, Lidia S. Chao, Xing Wang, Zhaopeng Tu

Probabilistic Subspace Manifolds for Contextual Inference in Large Language Models

Representing token embeddings as probability distributions over learned manifolds allows for more flexible contextual inference, reducing representational rigidity while enhancing semantic granularity. Comparative evaluations demonstrate…

Computation and Language · Computer Science 2025-04-25 Christopher Nightingale, Dominic Lavington, Jonathan Thistlethwaite, Sebastian Penhaligon, Thomas Belinski, David Boldo

Statistical Coherence Alignment for Large Language Model Representation Learning Through Tensor Field Convergence

Representation learning plays a central role in structuring internal embeddings to capture the statistical properties of language, influencing the coherence and contextual consistency of generated text. Statistical Coherence Alignment is…

Computation and Language · Computer Science 2025-08-11 Jonathan Gale, Godfrey Aldington, Harriet Thistlewood, Thomas Tattershall, Basil Wentworth, Vincent Enoasmo

Contextual Reinforcement in Multimodal Token Compression for Large Language Models

Effective token compression remains a critical challenge for scaling models to handle increasingly complex and diverse datasets. A novel mechanism based on contextual reinforcement is introduced, dynamically adjusting token importance…

Computation and Language · Computer Science 2025-08-11 Naderdel Piero, Zacharias Cromwell, Nathaniel Wainwright, Matthias Nethercott

Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models

The organization of latent token representations plays a crucial role in determining the stability, generalization, and contextual consistency of language models, yet conventional approaches to embedding refinement often rely on parameter…

Computation and Language · Computer Science 2025-03-26 Meiquan Dong, Haoran Liu, Yan Huang, Zixuan Feng, Jianhong Tang, Ruoxi Wang

Patch-Level Tokenization with CNN Encoders and Attention for Improved Transformer Time-Series Forecasting

Transformer-based models have shown strong performance in time-series forecasting by leveraging self-attention to model long-range temporal dependencies. However, their effectiveness depends critically on the quality and structure of input…

Machine Learning · Computer Science 2026-02-11 Saurish Nagrath, Saroj Kumar Panigrahy

Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment

Extended sequence generation often leads to degradation in contextual consistency due to the inability of conventional self-attention mechanisms to effectively retain long-range dependencies. Existing approaches, including memory…

Computation and Language · Computer Science 2025-01-30 Jonathan Teel, Jocasta Cumberbatch, Raphael Benington, Quentin Baskerville

Contextualized word senses: from attention to compositionality

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality

In Transformer architectures, tokens (discrete units derived from raw data) are formed by segmenting inputs into fixed-length chunks. Each token is then mapped to an embedding, enabling parallel attention computations…

Machine Learning · Computer Science 2026-01-14 Zhenglun Kong, Yize Li, Fanhu Zeng, Lei Xin, Shvat Messica, Xue Lin, Pu Zhao, Manolis Kellis, Hao Tang, Marinka Zitnik

Latent Structure Modulation in Large Language Models Through Stochastic Concept Embedding Transitions

Stochastic embedding transitions introduce a probabilistic mechanism for adjusting token representations dynamically during inference, mitigating the constraints imposed through static or deterministic embeddings. A transition framework was…

Computation and Language · Computer Science 2025-08-11 Stefan Whitaker, Colin Sisate, Marcel Windsor, Nikolai Fairweather, Tarquin Goldborough, Oskar Lindenfeld

Latent Reasoning via Sentence Embedding Prediction

Autoregressive language models (LMs) generate one token at a time, yet human reasoning operates over higher-level abstractions: sentences, propositions, and concepts. This contrast raises a central question: Can LMs likewise learn to…

Computation and Language · Computer Science 2025-10-14 Hyeonbin Hwang, Byeongguk Jeon, Seungone Kim, Jiyeon Kim, Hoyeon Chang, Sohee Yang, Seungpil Won, Dohaeng Lee, Youbin Ahn, Minjoon Seo

Structured Token Retention and Computational Memory Paths in Large Language Models

Memory retention mechanisms play a central role in determining the efficiency of computational architectures designed for processing extended sequences. Conventional methods for token management often impose fixed retention thresholds or…

Computation and Language · Computer Science 2025-03-26 Jonathan Delena, Augustin Moreau, Dominic Ravensdale, Frederick Chatterton

On Identifiability in Transformers

In this paper we delve deep into the Transformer architecture by investigating two of its core components: self-attention and contextual embeddings. In particular, we study the identifiability of attention weights and token embeddings, and…

Computation and Language · Computer Science 2020-02-10 Gino Brunner, Yang Liu, Damián Pascual, Oliver Richter, Massimiliano Ciaramita, Roger Wattenhofer

Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation

Transformative innovations in model architectures have introduced hierarchical embedding augmentation as a means to redefine the representation of tokens through multi-level semantic structures, offering enhanced adaptability to complex…

Computation and Language · Computer Science 2025-08-11 Derek Yotheringhay, Alistair Kirkland, Humphrey Kirkbride, Josiah Whitesteeple