Related papers: Higher Order Context Transformations

Combinatorial Entropy Encoding

This paper proposes a novel entropy encoding technique for lossless data compression. Representing a message string by its lexicographic index in the permutations of its symbols results in a compressed version matching Shannon entropy of…

Information Theory · Computer Science 2017-03-24 Abu Bakar Siddique

An Iterative Contextualization Algorithm with Second-Order Attention

Combining the representations of the words that make up a sentence into a cohesive whole is difficult, since it needs to account for the order of words, and to establish how the words present relate to each other. The solution we propose…

Computation and Language · Computer Science 2021-03-04 Diego Maupomé , Marie-Jean Meurs

Improving PPM Algorithm Using Dictionaries

We propose a method to improve traditional character-based PPM text compression algorithms. Consider a text file as a sequence of alternating words and non-words, the basic idea of our algorithm is to encode non-words and prefixes of words…

Information Theory · Computer Science 2015-03-17 Yichuan Hu , Jianzhong , Zhang , Farooq Khan , Ying Li

Context-adaptive Entropy Model for End-to-end Optimized Image Compression

We propose a context-adaptive entropy model for use in end-to-end optimized image compression. Our model exploits two types of contexts, bit-consuming contexts and bit-free contexts, distinguished based upon whether additional bit…

Image and Video Processing · Electrical Eng. & Systems 2019-05-07 Jooyoung Lee , Seunghyun Cho , Seung-Kwon Beack

Length Generalization of Causal Transformers without Position Encoding

Generalizing to longer sentences is important for recent Transformer-based language models. Besides algorithms manipulating explicit position features, the success of Transformers without position encodings (NoPE) provides a new way to…

Computation and Language · Computer Science 2024-05-29 Jie Wang , Tao Ji , Yuanbin Wu , Hang Yan , Tao Gui , Qi Zhang , Xuanjing Huang , Xiaoling Wang

Soft Contextual Data Augmentation for Neural Machine Translation

While data augmentation is an important trick to boost the accuracy of deep learning methods in computer vision tasks, its study in natural language tasks is still very limited. In this paper, we present a novel data augmentation method for…

Computation and Language · Computer Science 2019-05-28 Jinhua Zhu , Fei Gao , Lijun Wu , Yingce Xia , Tao Qin , Wengang Zhou , Xueqi Cheng , Tie-Yan Liu

Gradual Forgetting: Logarithmic Compression for Extending Transformer Context Windows

Most approaches to long-context processing increase the complexity of the transformer's internal architecture by integrating mechanisms such as recurrence or auxiliary memory modules. In this work, we introduce an alternative approach that…

Computation and Language · Computer Science 2025-10-28 Billy Dickson , Zoran Tiganj

Towards Zero-Overhead Disambiguation of Deep Priority Conflicts

**Context** Context-free grammars are widely used for language prototyping and implementation. They allow formalizing the syntax of domain-specific or general-purpose programming languages concisely and declaratively. However, the natural…

Programming Languages · Computer Science 2018-03-28 Luís Eduardo de Souza Amorim , Michael J. Steindorfer , Eelco Visser

Context-Aware Learning for Neural Machine Translation

Interest in larger-context neural machine translation, including document-level and multi-modal translation, has been growing. Multiple works have proposed new network architectures or evaluation schemes, but potentially helpful context is…

Computation and Language · Computer Science 2019-03-13 Sébastien Jean , Kyunghyun Cho

Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models

Large language models have shown remarkable performance across a wide range of language tasks, owing to their exceptional capabilities in context modeling. The most commonly used method of context modeling is full self-attention, as seen in…

Computation and Language · Computer Science 2025-06-26 Zhisong Zhang , Yan Wang , Xinting Huang , Tianqing Fang , Hongming Zhang , Chenlong Deng , Shuaiyi Li , Dong Yu

Context Adaptive Extended Chain Coding for Semantic Map Compression

Semantic maps are increasingly utilized in areas such as robotics, autonomous systems, and extended reality, motivating the investigation of efficient compression methods that preserve structured semantic information. This paper studies…

Image and Video Processing · Electrical Eng. & Systems 2026-03-30 Runyu Yang , Junqi Liao , Hyomin Choi , Fabien Racapé , Ivan V. Bajić

Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

Large language models (LLMs) have shown remarkable performance in various natural language processing tasks. However, a primary constraint they face is the context limit, i.e., the maximum number of tokens they can process. Previous works…

Machine Learning · Computer Science 2024-04-17 Woomin Song , Seunghyuk Oh , Sangwoo Mo , Jaehyung Kim , Sukmin Yun , Jung-Woo Ha , Jinwoo Shin

HanoiT: Enhancing Context-aware Translation via Selective Context

Context-aware neural machine translation aims to use the document-level context to improve translation quality. However, not all words in the context are helpful. The irrelevant or trivial words may bring some noise and distract the model…

Computation and Language · Computer Science 2023-04-20 Jian Yang , Yuwei Yin , Shuming Ma , Liqun Yang , Hongcheng Guo , Haoyang Huang , Dongdong Zhang , Yutao Zeng , Zhoujun Li , Furu Wei

Equipping Transformer with Random-Access Reading for Long-Context Understanding

Long-context modeling presents a significant challenge for transformer-based large language models (LLMs) due to the quadratic complexity of the self-attention mechanism and issues with length extrapolation caused by pretraining exclusively…

Computation and Language · Computer Science 2024-05-24 Chenghao Yang , Zi Yang , Nan Hua

New Algorithms and Lower Bounds for Sequential-Access Data Compression

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

Context Normalization Layer with Applications

Normalization is a pre-processing step that converts the data into a more usable representation. As part of the deep neural networks (DNNs), the batch normalization (BN) technique uses normalization to address the problem of internal…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Bilal Faye , Mohamed-Djallel Dilmi , Hanane Azzag , Mustapha Lebbah , Djamel Bouchaffra

Contextually Guided Transformers via Low-Rank Adaptation

Large Language Models (LLMs) based on Transformers excel at text processing, but their reliance on prompts for specialized behavior introduces computational overhead. We propose a modification to a Transformer architecture that eliminates…

Machine Learning · Computer Science 2025-06-09 Andrey Zhmoginov , Jihwan Lee , Max Vladymyrov , Mark Sandler

Context Guided Transformer Entropy Modeling for Video Compression

Conditional entropy models effectively leverage spatio-temporal contexts to reduce video redundancy. However, incorporating temporal context often introduces additional model complexity and increases computational cost. In parallel, many…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Junlong Tong , Wei Zhang , Yaohui Jin , Xiaoyu Shen

Extending Context Window of Large Language Models via Semantic Compression

Transformer-based Large Language Models (LLMs) often impose limitations on the length of the text input to ensure the generation of fluent and relevant responses. This constraint restricts their applicability in scenarios involving long…

Computation and Language · Computer Science 2023-12-18 Weizhi Fei , Xueyan Niu , Pingyi Zhou , Lu Hou , Bo Bai , Lei Deng , Wei Han

On Context Bisimulation for Parameterized Higher-order Processes

This paper studies context bisimulation for higher-order processes, in the presence of parameterization (viz. abstraction). We show that the extension of higher-order processes with process parameterization retains the characterization of…

Logic in Computer Science · Computer Science 2013-10-18 Xian Xu