Related papers: Enlarging Context with Low Cost: Efficient Arithme…

200 papers

Context-aware compression techniques have gained increasing attention as model sizes continue to grow, introducing computational bottlenecks that hinder efficient deployment. A structured encoding approach was proposed to selectively…

Computation and Language · Computer Science 2025-02-13 Barnaby Schmitt , Alistair Grosvenor , Matthias Cunningham , Clementine Walsh , Julius Pembrokeshire , Jonathan Teel

Recent techniques such as retrieval-augmented generation or chain-of-thought reasoning have led to longer contexts and increased inference costs. Context compression techniques can reduce these costs, but the most effective approaches…

Computation and Language · Computer Science 2025-10-24 Hippolyte Pilchen , Edouard Grave , Patrick Pérez

The omnipresence of deep learning architectures such as deep convolutional neural networks (CNNs) is fueled by the synergistic combination of ever-increasing labeled datasets and specialized hardware. Despite the indisputable success, the…

Machine Learning · Statistics 2016-11-29 Meshia Cédric Oveneke , Mitchel Aliosha-Perez , Yong Zhao , Dongmei Jiang , Hichem Sahli

Semantic maps are increasingly utilized in areas such as robotics, autonomous systems, and extended reality, motivating the investigation of efficient compression methods that preserve structured semantic information. This paper studies…

Image and Video Processing · Electrical Eng. & Systems 2026-03-30 Runyu Yang , Junqi Liao , Hyomin Choi , Fabien Racapé , Ivan V. Bajić

Precise estimation of the probabilistic structure of natural images plays an essential role in image compression. Despite the recent remarkable success of end-to-end optimized image compression, the latent codes are usually assumed to be…

Image and Video Processing · Electrical Eng. & Systems 2020-06-24 Mu Li , Kede Ma , Jane You , David Zhang , Wangmeng Zuo

Transformer-based large language models exhibit groundbreaking capabilities, but their storage and computational costs are prohibitively high, limiting their application in resource-constrained scenarios. An effective approach is to…

Machine Learning · Computer Science 2024-12-18 Jing Zhang , Shuzhen Sun , Peng Zhang , Guangxing Cao , Hui Gao , Xindian Ma , Nan Xu , Yuexian Hou

It is a challenging task to accurately perform semantic segmentation due to the complexity of real picture scenes. Many semantic segmentation methods based on traditional deep learning insufficiently captured the semantic and appearance…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Haitong Tang , Shuang He , Mengduo Yang , Xia Lu , Qin Yu , Kaiyue Liu , Hongjie Yan , Nizhuan Wang

Image compression has been investigated as a fundamental research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and is gradually being used in image compression. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2018-04-26 Zhengxue Cheng , Heming Sun , Masaru Takeuchi , Jiro Katto

We propose the In-context Autoencoder (ICAE), leveraging the power of a large language model (LLM) to compress a long context into short compact memory slots that can be directly conditioned on by the LLM for various purposes. ICAE is first…

Computation and Language · Computer Science 2024-05-10 Tao Ge , Jing Hu , Lei Wang , Xun Wang , Si-Qing Chen , Furu Wei

A novel energy-efficient edge computing paradigm is proposed for real-time deep learning-based image upsampling applications. State-of-the-art deep learning solutions for image upsampling are currently trained using either resize or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Ian Colbert , Ken Kreutz-Delgado , Srinjoy Das

Million-level token inputs in long-context tasks pose significant computational and memory challenges for Large Language Models (LLMs). Recently, DeepSeek-OCR conducted research into the feasibility of Contexts Optical Compression and…

Computation and Language · Computer Science 2025-12-04 Fanfan Liu , Haibo Qiu

High-energy large-scale particle colliders generate data at extraordinary rates. Developing real-time high-throughput data compression algorithms to reduce data volume and meet the bandwidth requirement for storage has become increasingly…

The application of the context-adaptive entropy model significantly improves the rate-distortion (R-D) performance, in which hyperpriors and autoregressive models are jointly utilized to effectively capture the spatial redundancy of the…

Image and Video Processing · Electrical Eng. & Systems 2022-09-09 Haisheng Fu , Feng Liang

A graph-based convolutional model such as the non-local block has been shown to be effective for strengthening the context modeling ability in convolutional neural networks (CNNs). However, its pixel-wise computational overhead is prohibitive which…

Computer Vision and Pattern Recognition · Computer Science 2021-09-01 Xiangtai Li , Xia Li , Ansheng You , Li Zhang , Guangliang Cheng , Kuiyuan Yang , Yunhai Tong , Zhouchen Lin

We propose TC-AE, a ViT-based architecture for deep compression autoencoders. Existing methods commonly increase the channel number of latent representations to maintain reconstruction quality under high compression ratios. However, this…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Teng Li , Ziyuan Huang , Cong Chen , Yangfu Li , Yuanhuiyi Lyu , Dandan Zheng , Chunhua Shen , Jun Zhang

High-energy large-scale particle colliders produce data at high speed, on the order of 1 terabyte per second in nuclear physics and petabytes per second in high-energy physics. Developing real-time data compression algorithms to reduce such…

Machine Learning · Statistics 2023-10-25 Yi Huang , Yihui Ren , Shinjae Yoo , Jin Huang

Image paragraph generation is the task of producing a coherent story (usually a paragraph) that describes the visual content of an image. The problem nevertheless is not trivial especially when there are multiple descriptive and diverse…

Computer Vision and Pattern Recognition · Computer Science 2019-08-02 Jing Wang , Yingwei Pan , Ting Yao , Jinhui Tang , Tao Mei

Large language models (LLMs) have triggered a new stream of research focusing on compressing the context length to reduce the computational cost while ensuring the retention of helpful information for LLMs to answer the given question.…

Computation and Language · Computer Science 2024-12-20 Barys Liskavets , Maxim Ushakov , Shuvendu Roy , Mark Klibanov , Ali Etemad , Shane Luke

In both H.264 and HEVC, context-adaptive binary arithmetic coding (CABAC) is adopted as the entropy coding method. CABAC relies on manually designed binarization processes as well as handcrafted context models, which may restrict the…

Multimedia · Computer Science 2018-03-30 Rui Song , Dong Liu , Houqiang Li , Feng Wu

If object contours in images are coded efficiently as side information, then they can facilitate advanced image / video coding techniques, such as graph Fourier transform coding or motion prediction of arbitrarily shaped pixel blocks. In…

Multimedia · Computer Science 2016-12-21 Amin Zheng , Gene Cheung , Dinei Florencio