Related papers: Adaptive Transform Coding for Semantic Compression

Toward Textual Transform Coding

Inspired by recent work on compression with and for young humans, the success of transform-based approaches to information processing, and the rise of powerful language-based AI, we propose \emph{textual transform coding}. It shares some of…

Information Theory · Computer Science 2023-05-04 Tsachy Weissman

Perceptual Image Compression with Cooperative Cross-Modal Side Information

The explosion of data has resulted in more and more associated text being transmitted along with images. Inspired by from distributed source coding, many works utilize image side information to enhance image compression. However, existing…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Shiyu Qin , Bin Chen , Yujun Huang , Baoyi An , Tao Dai , Shu-Tao Xia

Efficient Semantic Communication Through Transformer-Aided Compression

Transformers, known for their attention mechanisms, have proven highly effective in focusing on critical elements within complex data. This feature can effectively be used to address the time-varying channels in wireless communication…

Machine Learning · Computer Science 2024-12-03 Matin Mortaheb , Mohammad A. Amir Khojastepour , Sennur Ulukus

Variational Speech Waveform Compression to Catalyze Semantic Communications

We propose a novel neural waveform compression method to catalyze emerging speech semantic communications. By introducing nonlinear transform and variational modeling, we effectively capture the dependencies within speech frames and…

Sound · Computer Science 2022-12-14 Shengshi Yao , Zixuan Xiao , Sixian Wang , Jincheng Dai , Kai Niu , Ping Zhang

Compression Ratio Learning and Semantic Communications for Video Imaging

Camera sensors have been widely used in intelligent robotic systems. Developing camera sensors with high sensing efficiency has always been important to reduce the power, memory, and other related resources. Inspired by recent success on…

Image and Video Processing · Electrical Eng. & Systems 2023-10-11 Bowen Zhang , Zhijin Qin , Geoffrey Ye Li

Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform

We propose a versatile deep image compression network based on Spatial Feature Transform (SFT arXiv:1804.02815), which takes a source image and a corresponding quality map as inputs and produce a compressed image with variable rates. Our…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Myungseo Song , Jinyoung Choi , Bohyung Han

Semantics-Guided Generative Image Compression

Advancements in text-to-image generative AI with large multimodal models are spreading into the field of image compression, creating high-quality representation of images at extremely low bit rates. This work introduces novel components to…

Image and Video Processing · Electrical Eng. & Systems 2025-06-02 Cheng-Lin Wu , Hyomin Choi , Ivan V. Bajić

Lossy Image Compression with Conditional Diffusion Models

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from…

Image and Video Processing · Electrical Eng. & Systems 2024-01-03 Ruihan Yang , Stephan Mandt

Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models

Modern visual generative models acquire rich visual knowledge through large-scale training, yet existing visual representations (such as pixels, latents, or tokens) remain external to the model and cannot directly exploit this knowledge for…

Machine Learning · Computer Science 2026-05-25 Zongyu Guo , Jiajun He , Zhaoyang Jia , Xiaoyi Zhang , Jiahao Li , Xiao Li , Bin Li , José Miguel Hernández-Lobato , Yan Lu

Generative Image Coding with Diffusion Prior

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang , Haotian Wu , Richeng Jin , Deniz Gunduz , Krystian Mikolajczyk

Cross Modal Compression: Towards Human-comprehensible Semantic Compression

Traditional image/video compression aims to reduce the transmission/storage cost with signal fidelity as high as possible. However, with the increasing demand for machine analysis and semantic monitoring in recent years, semantic fidelity…

Image and Video Processing · Electrical Eng. & Systems 2022-09-07 Jiguo Li , Chuanmin Jia , Xinfeng Zhang , Siwei Ma , Wen Gao

Conceptual Compression via Deep Structure and Texture Synthesis

Existing compression methods typically focus on the removal of signal-level redundancies, while the potential and versatility of decomposing visual data into compact conceptual components still lack further study. To this end, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Jianhui Chang , Zhenghui Zhao , Chuanmin Jia , Shiqi Wang , Lingbo Yang , Qi Mao , Jian Zhang , Siwei Ma

Flexible Variable-Rate Image Feature Compression for Edge-Cloud Systems

Feature compression is a promising direction for coding for machines. Existing methods have made substantial progress, but they require designing and training separate neural network models to meet different specifications of compression…

Image and Video Processing · Electrical Eng. & Systems 2024-04-02 Md Adnan Faisal Hossain , Zhihao Duan , Yuning Huang , Fengqing Zhu

Dynamic Kernel-Based Adaptive Spatial Aggregation for Learned Image Compression

Learned image compression methods have shown superior rate-distortion performance and remarkable potential compared to traditional compression methods. Most existing learned approaches use stacked convolution or window-based self-attention…

Image and Video Processing · Electrical Eng. & Systems 2024-01-03 Huairui Wang , Nianxiang Fu , Zhenzhong Chen , Shan Liu

Synonymous Variational Inference for Perceptual Image Compression

Recent contributions of semantic information theory reveal the set-element relationship between semantic and syntactic information, represented as synonymous relationships. In this paper, we propose a synonymous variational inference (SVI)…

Information Theory · Computer Science 2025-05-29 Zijian Liang , Kai Niu , Changshuo Wang , Jin Xu , Ping Zhang

Content Adaptive Optimization for Neural Image Compression

The field of neural image compression has witnessed exciting progress as recently proposed architectures already surpass the established transform coding based approaches. While, so far, research has mainly focused on architecture and model…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Joaquim Campos , Simon Meierhans , Abdelaziz Djelouah , Christopher Schroers

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Traditional image codecs emphasize signal fidelity and human perception, often at the expense of machine vision tasks. Deep learning methods have demonstrated promising coding performance by utilizing rich semantic embeddings optimized for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Sha Guo , Zhuo Chen , Yang Zhao , Ning Zhang , Xiaotong Li , Lingyu Duan

Towards Modality Transferable Visual Information Representation with Optimal Model Compression

Compactly representing the visual signals is of fundamental importance in various image/video-centered applications. Although numerous approaches were developed for improving the image and video coding performance by removing the…

Image and Video Processing · Electrical Eng. & Systems 2020-08-14 Rongqun Lin , Linwei Zhu , Shiqi Wang , Sam Kwong

Efficient Compressed Sensing Based Image Coding by Using Gray Transformation

In recent years, compressed sensing (CS) based image coding has become a hot topic in image processing field. However, since the bit depth required for encoding each CS sample is too large, the compression performance of this paradigm is…

Multimedia · Computer Science 2021-02-03 Bo Zhang , Di Xiao , Lan Wang , Sen Bai , Lei Yang