English
Related papers

Related papers: Contextually Structured Token Dependency Encoding …

200 papers

Token representations influence the efficiency and adaptability of language models, yet conventional tokenization strategies impose rigid segmentation boundaries that do not adjust dynamically to evolving contextual relationships. The…

Computation and Language · Computer Science 2025-08-11 Alistair Dombrowski , Beatrix Engelhardt , Dimitri Fairbrother , Henry Evidail

Self-modulating mechanisms introduce dynamic adaptation capabilities within language models through contextual realignment strategies that influence token embedding trajectories across extended sequences. Contextual Flux is explored as an…

Computation and Language · Computer Science 2025-08-11 Henry Evidail , Zachary Mountebank , Alistair Hathersage , Peter Stanhope , Basil Ravenscroft , Tobias Waddingham

Contextual adaptation in token embeddings plays a central role in determining how well language models maintain coherence and retain semantic relationships over extended text sequences. Static embeddings often impose constraints on lexical…

Computation and Language · Computer Science 2025-03-27 Koinis Vassilis , Godfrey Milbourne , Harriet Featherstone , Xanthe Peverell , Yorick Bletchley , Zachary Montford

Memory retention challenges in deep neural architectures have ongoing limitations in the ability to process and recall extended contextual information. Token dependencies degrade as sequence length increases, leading to a decline in…

Computation and Language · Computer Science 2025-03-26 Frederick Dillon , Gregor Halvorsen , Simon Tattershall , Magnus Rowntree , Gareth Vanderpool

Recent progress on parse tree encoder for sentence representation learning is notable. However, these works mainly encode tree structures recursively, which is not conducive to parallelization. On the other hand, these works rarely take…

Computation and Language · Computer Science 2022-05-10 Junhua Ma , Jiajun Li , Yuxuan Liu , Shangbo Zhou , Xue Li

Token representations in high-dimensional latent spaces often exhibit redundancy, limiting computational efficiency and reducing structural coherence across model layers. Hierarchical latent space folding introduces a structured…

Computation and Language · Computer Science 2025-08-11 Fenella Harcourt , Naderdel Piero , Gilbert Sutherland , Daphne Holloway , Harriet Bracknell , Julian Ormsby

Self-attention model have shown its flexibility in parallel computation and the effectiveness on modeling both long- and short-term dependencies. However, it calculates the dependencies between representations without considering the…

Computation and Language · Computer Science 2019-02-18 Baosong Yang , Jian Li , Derek Wong , Lidia S. Chao , Xing Wang , Zhaopeng Tu

Representing token embeddings as probability distributions over learned manifolds allows for more flexible contextual inference, reducing representational rigidity while enhancing semantic granularity. Comparative evaluations demonstrate…

Computation and Language · Computer Science 2025-04-25 Christopher Nightingale , Dominic Lavington , Jonathan Thistlethwaite , Sebastian Penhaligon , Thomas Belinski , David Boldo

Representation learning plays a central role in structuring internal embeddings to capture the statistical properties of language, influencing the coherence and contextual consistency of generated text. Statistical Coherence Alignment is…

Computation and Language · Computer Science 2025-08-11 Jonathan Gale , Godfrey Aldington , Harriet Thistlewood , Thomas Tattershall , Basil Wentworth , Vincent Enoasmo

Effective token compression remains a critical challenge for scaling models to handle increasingly complex and diverse datasets. A novel mechanism based on contextual reinforcement is introduced, dynamically adjusting token importance…

Computation and Language · Computer Science 2025-08-11 Naderdel Piero , Zacharias Cromwell , Nathaniel Wainwright , Matthias Nethercott

The organization of latent token representations plays a crucial role in determining the stability, generalization, and contextual consistency of language models, yet conventional approaches to embedding refinement often rely on parameter…

Computation and Language · Computer Science 2025-03-26 Meiquan Dong , Haoran Liu , Yan Huang , Zixuan Feng , Jianhong Tang , Ruoxi Wang

Transformer-based models have shown strong performance in time-series forecasting by leveraging self-attention to model long-range temporal dependencies. However, their effectiveness depends critically on the quality and structure of input…

Machine Learning · Computer Science 2026-02-11 Saurish Nagrath , Saroj Kumar Panigrahy

Extended sequence generation often leads to degradation in contextual consistency due to the inability of conventional self-attention mechanisms to effectively retain long-range dependencies. Existing approaches, including memory…

Computation and Language · Computer Science 2025-01-30 Jonathan Teel , Jocasta Cumberbatch , Raphael Benington , Quentin Baskerville

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very…

Computation and Language · Computer Science 2023-12-04 Pablo Gamallo

In Transformer architectures, tokens\textemdash discrete units derived from raw data\textemdash are formed by segmenting inputs into fixed-length chunks. Each token is then mapped to an embedding, enabling parallel attention computations…

Machine Learning · Computer Science 2026-01-14 Zhenglun Kong , Yize Li , Fanhu Zeng , Lei Xin , Shvat Messica , Xue Lin , Pu Zhao , Manolis Kellis , Hao Tang , Marinka Zitnik

Stochastic embedding transitions introduce a probabilistic mechanism for adjusting token representations dynamically during inference, mitigating the constraints imposed through static or deterministic embeddings. A transition framework was…

Computation and Language · Computer Science 2025-08-11 Stefan Whitaker , Colin Sisate , Marcel Windsor , Nikolai Fairweather , Tarquin Goldborough , Oskar Lindenfeld

Autoregressive language models (LMs) generate one token at a time, yet human reasoning operates over higher-level abstractions - sentences, propositions, and concepts. This contrast raises a central question- Can LMs likewise learn to…

Computation and Language · Computer Science 2025-10-14 Hyeonbin Hwang , Byeongguk Jeon , Seungone Kim , Jiyeon Kim , Hoyeon Chang , Sohee Yang , Seungpil Won , Dohaeng Lee , Youbin Ahn , Minjoon Seo

Memory retention mechanisms play a central role in determining the efficiency of computational architectures designed for processing extended sequences. Conventional methods for token management often impose fixed retention thresholds or…

Computation and Language · Computer Science 2025-03-26 Jonathan Delena , Augustin Moreau , Dominic Ravensdale , Frederick Chatterton

In this paper we delve deep in the Transformer architecture by investigating two of its core components: self-attention and contextual embeddings. In particular, we study the identifiability of attention weights and token embeddings, and…

Computation and Language · Computer Science 2020-02-10 Gino Brunner , Yang Liu , Damián Pascual , Oliver Richter , Massimiliano Ciaramita , Roger Wattenhofer

Transformative innovations in model architectures have introduced hierarchical embedding augmentation as a means to redefine the representation of tokens through multi-level semantic structures, offering enhanced adaptability to complex…

Computation and Language · Computer Science 2025-08-11 Derek Yotheringhay , Alistair Kirkland , Humphrey Kirkbride , Josiah Whitesteeple
‹ Prev 1 2 3 10 Next ›