Related papers: Enlarging Context with Low Cost: Efficient Arithme…

Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning

Context-aware compression techniques have gained increasing attention as model sizes continue to grow, introducing computational bottlenecks that hinder efficient deployment. A structured encoding approach was proposed to selectively…

Computation and Language · Computer Science 2025-02-13 Barnaby Schmitt , Alistair Grosvenor , Matthias Cunningham , Clementine Walsh , Julius Pembrokeshire , Jonathan Teel

ARC-Encoder: learning compressed text representations for large language models

Recent techniques such as retrieval-augmented generation or chain-of-thought reasoning have led to longer contexts and increased inference costs. Context compression techniques can reduce these costs, but the most effective approaches…

Computation and Language · Computer Science 2025-10-24 Hippolyte Pilchen , Edouard Grave , Patrick Pérez

Efficient Convolutional Auto-Encoding via Random Convexification and Frequency-Domain Minimization

The omnipresence of deep learning architectures such as deep convolutional neural networks (CNN)s is fueled by the synergistic combination of ever-increasing labeled datasets and specialized hardware. Despite the indisputable success, the…

Machine Learning · Statistics 2016-11-29 Meshia Cédric Oveneke , Mitchel Aliosha-Perez , Yong Zhao , Dongmei Jiang , Hichem Sahli

Context Adaptive Extended Chain Coding for Semantic Map Compression

Semantic maps are increasingly utilized in areas such as robotics, autonomous systems, and extended reality, motivating the investigation of efficient compression methods that preserve structured semantic information. This paper studies…

Image and Video Processing · Electrical Eng. & Systems 2026-03-30 Runyu Yang , Junqi Liao , Hyomin Choi , Fabien Racapé , Ivan V. Bajić

Efficient and Effective Context-Based Convolutional Entropy Modeling for Image Compression

Precise estimation of the probabilistic structure of natural images plays an essential role in image compression. Despite the recent remarkable success of end-to-end optimized image compression, the latent codes are usually assumed to be…

Image and Video Processing · Electrical Eng. & Systems 2020-06-24 Mu Li , Kede Ma , Jane You , David Zhang , Wangmeng Zuo

SEE: Sememe Entanglement Encoding for Transformer-bases Models Compression

Transformer-based large language models exhibit groundbreaking capabilities, but their storage and computational costs are prohibitively high, limiting their application in resource-constrained scenarios. An effective approach is to…

Machine Learning · Computer Science 2024-12-18 Jing Zhang , Shuzhen Sun , Peng Zhang , Guangxing Cao , Hui Gao , Xindian Ma , Nan Xu , Yuexian Hou

CSC-Unet: A Novel Convolutional Sparse Coding Strategy Based Neural Network for Semantic Segmentation

It is a challenging task to accurately perform semantic segmentation due to the complexity of real picture scenes. Many semantic segmentation methods based on traditional deep learning insufficiently captured the semantic and appearance…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Haitong Tang , Shuang He , Mengduo Yang , Xia Lu , Qin Yu , Kaiyue Liu , Hongjie Yan , Nizhuan Wang

Deep Convolutional AutoEncoder-based Lossy Image Compression

Image compression has been investigated as a fundamental research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and is gradually being used in image compression. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2018-04-26 Zhengxue Cheng , Heming Sun , Masaru Takeuchi , Jiro Katto

In-context Autoencoder for Context Compression in a Large Language Model

We propose the In-context Autoencoder (ICAE), leveraging the power of a large language model (LLM) to compress a long context into short compact memory slots that can be directly conditioned on by the LLM for various purposes. ICAE is first…

Computation and Language · Computer Science 2024-05-10 Tao Ge , Jing Hu , Lei Wang , Xun Wang , Si-Qing Chen , Furu Wei

An Energy-Efficient Edge Computing Paradigm for Convolution-based Image Upsampling

A novel energy-efficient edge computing paradigm is proposed for real-time deep learning-based image upsampling applications. State-of-the-art deep learning solutions for image upsampling are currently trained using either resize or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Ian Colbert , Ken Kreutz-Delgado , Srinjoy Das

Context Cascade Compression: Exploring the Upper Limits of Text Compression

Million-level token inputs in long-context tasks pose significant computational and memory challenges for Large Language Models (LLMs). Recently, DeepSeek-OCR conducted research into the feasibility of Contexts Optical Compression and…

Computation and Language · Computer Science 2025-12-04 Fanfan Liu , Haibo Qiu

Variable Rate Neural Compression for Sparse Detector Data

High-energy large-scale particle colliders generate data at extraordinary rates. Developing real-time high-throughput data compression algorithms to reduce data volume and meet the bandwidth requirement for storage has become increasingly…

Instrumentation and Detectors · Physics 2024-11-20 Yi Huang , Yeonju Go , Jin Huang , Shuhang Li , Xihaier Luo , Thomas Marshall , Joseph Osborn , Christopher Pinkenburg , Yihui Ren , Evgeny Shulga , Shinjae Yoo , Byung-Jun Yoon

Learned Image Compression with Generalized Octave Convolution and Cross-Resolution Parameter Estimation

The application of the context-adaptive entropy model significantly improves the rate-distortion (R-D) performance, in which hyperpriors and autoregressive models are jointly utilized to effectively capture the spatial redundancy of the…

Image and Video Processing · Electrical Eng. & Systems 2022-09-09 Haisheng Fu , Feng Liang

Towards Efficient Scene Understanding via Squeeze Reasoning

Graph-based convolutional model such as non-local block has shown to be effective for strengthening the context modeling ability in convolutional neural networks (CNNs). However, its pixel-wise computational overhead is prohibitive which…

Computer Vision and Pattern Recognition · Computer Science 2021-09-01 Xiangtai Li , Xia Li , Ansheng You , Li Zhang , Guangliang Cheng , Kuiyuan Yang , Yunhai Tong , Zhouchen Lin

TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders

We propose TC-AE, a ViT-based architecture for deep compression autoencoders. Existing methods commonly increase the channel number of latent representations to maintain reconstruction quality under high compression ratios. However, this…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Teng Li , Ziyuan Huang , Cong Chen , Yangfu Li , Yuanhuiyi Lyu , Dandan Zheng , Chunhua Shen , Jun Zhang

Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber Data

High-energy large-scale particle colliders produce data at high speed in the order of 1 terabytes per second in nuclear physics and petabytes per second in high-energy physics. Developing real-time data compression algorithms to reduce such…

Machine Learning · Statistics 2023-10-25 Yi Huang , Yihui Ren , Shinjae Yoo , Jin Huang

Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation

Image paragraph generation is the task of producing a coherent story (usually a paragraph) that describes the visual content of an image. The problem nevertheless is not trivial especially when there are multiple descriptive and diverse…

Computer Vision and Pattern Recognition · Computer Science 2019-08-02 Jing Wang , Yingwei Pan , Ting Yao , Jinhui Tang , Tao Mei

Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Large language models (LLMs) have triggered a new stream of research focusing on compressing the context length to reduce the computational cost while ensuring the retention of helpful information for LLMs to answer the given question.…

Computation and Language · Computer Science 2024-12-20 Barys Liskavets , Maxim Ushakov , Shuvendu Roy , Mark Klibanov , Ali Etemad , Shane Luke

Neural network-based arithmetic coding of intra prediction modes in HEVC

In both H.264 and HEVC, context-adaptive binary arithmetic coding (CABAC) is adopted as the entropy coding method. CABAC relies on manually designed binarization processes as well as handcrafted context models, which may restrict the…

Multimedia · Computer Science 2018-03-30 Rui Song , Dong Liu , Houqiang Li , Feng Wu

Context Tree based Image Contour Coding using A Geometric Prior

If object contours in images are coded efficiently as side information, then they can facilitate advanced image / video coding techniques, such as graph Fourier transform coding or motion prediction of arbitrarily shaped pixel blocks. In…

Multimedia · Computer Science 2016-12-21 Amin Zheng , Gene Cheung , Dinei Florencio