Related papers: Conditional Latent Coding with Learnable Synthesiz…

Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models

We argue that diffusion models' success in modeling complex distributions is, for the most part, coming from their input conditioning. This paper investigates the representation used to condition diffusion models from the perspective that…

Computer Vision and Pattern Recognition · Computer Science 2026-01-07 Samuel Lavoie , Michael Noukhovitch , Aaron Courville

Causal Contextual Prediction for Learned Image Compression

Over the past several years, we have witnessed impressive progress in the field of learned image compression. Recent learned image codecs are commonly based on autoencoders, that first encode an image into low-dimensional latent…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Zongyu Guo , Zhizheng Zhang , Runsen Feng , Zhibo Chen

Conditional Neural Video Coding with Spatial-Temporal Super-Resolution

This document is an expanded version of a one-page abstract originally presented at the 2024 Data Compression Conference. It describes our proposed method for the video track of the Challenge on Learned Image Compression (CLIC) 2024. Our…

Image and Video Processing · Electrical Eng. & Systems 2024-01-26 Henan Wang , Xiaohan Pan , Runsen Feng , Zongyu Guo , Zhibo Chen

Channel-wise Feature Decorrelation for Enhanced Learned Image Compression

The emerging Learned Compression (LC) replaces the traditional codec modules with Deep Neural Networks (DNN), which are trained end-to-end for rate-distortion performance. This approach is considered as the future of image/video…

Image and Video Processing · Electrical Eng. & Systems 2024-07-08 Farhad Pakdaman , Moncef Gabbouj

Deep Contextual Video Compression

Most of the existing neural video compression methods adopt the predictive coding framework, which first generates the predicted frame and then encodes its residue with the current frame. However, as for compression ratio, predictive coding…

Image and Video Processing · Electrical Eng. & Systems 2021-12-15 Jiahao Li , Bin Li , Yan Lu

Conditional Coding for Flexible Learned Video Compression

This paper introduces a novel framework for end-to-end learned video coding. Image compression is generalized through conditional coding to exploit information from reference frames, allowing to process intra and inter frames with the same…

Image and Video Processing · Electrical Eng. & Systems 2021-04-29 Théo Ladune , Pierrick Philippe , Wassim Hamidouche , Lu Zhang , Olivier Déforges

LCCM-VC: Learned Conditional Coding Modes for Video Compression

End-to-end learning-based video compression has made steady progress over the last several years. However, unlike learning-based image coding, which has already surpassed its handcrafted counterparts, learning-based video coding still has…

Image and Video Processing · Electrical Eng. & Systems 2023-04-20 Hadi Hadizadeh , Ivan V. Bajić

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering…

Image and Video Processing · Electrical Eng. & Systems 2024-01-23 Yichi Zhang , Zhihao Duan , Ming Lu , Dandan Ding , Fengqing Zhu , Zhan Ma

Learned Image Compression with Hierarchical Progressive Context Modeling

Context modeling is essential in learned image compression for accurately estimating the distribution of latents. While recent advanced methods have expanded context modeling capacity, they still struggle to efficiently exploit long-range…

Image and Video Processing · Electrical Eng. & Systems 2025-07-28 Yuqi Li , Haotian Zhang , Li Li , Dong Liu

A Cross Channel Context Model for Latents in Deep Image Compression

This paper presents a cross channel context model for latents in deep image compression. Generally, deep image compression is based on an autoencoder framework, which transforms the original image to latents at the encoder and recovers the…

Image and Video Processing · Electrical Eng. & Systems 2021-03-05 Changyue Ma , Zhao Wang , Ruling Liao , Yan Ye

Deep Learning Logo Detection with Data Expansion by Synthesising Context

Logo detection in unconstrained images is challenging, particularly when only very sparse labelled training images are accessible due to high labelling costs. In this work, we describe a model training image synthesising method capable of…

Computer Vision and Pattern Recognition · Computer Science 2018-03-19 Hang Su , Xiatian Zhu , Shaogang Gong

Latent Programmer: Discrete Latent Codes for Program Synthesis

In many sequence learning tasks, such as program synthesis and document summarization, a key problem is searching over a large space of possible output sequences. We propose to learn representations of the outputs that are specifically…

Machine Learning · Computer Science 2021-08-09 Joey Hong , David Dohan , Rishabh Singh , Charles Sutton , Manzil Zaheer

Compact Latent Representation for Image Compression (CLRIC)

Current image compression models often require separate models for each quality level, making them resource-intensive in terms of both training and storage. To address these limitations, we propose an innovative approach that utilizes…

Image and Video Processing · Electrical Eng. & Systems 2025-09-30 Ayman A. Ameen , Thomas Richter , André Kaup

GOLLIC: Learning Global Context beyond Patches for Lossless High-Resolution Image Compression

Neural-network-based approaches recently emerged in the field of data compression and have already led to significant progress in image compression, especially in achieving a higher compression ratio. In the lossless image compression…

Image and Video Processing · Electrical Eng. & Systems 2022-10-10 Yuan Lan , Liang Qin , Zhaoyi Sun , Yang Xiang , Jie Sun

Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the…

Applications · Statistics 2024-03-25 Haisheng Fu , Feng Liang , Jie Liang , Zhenman Fang , Guohe Zhang , Jingning Han

Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan , Mateen Ulhaq , Hyomin Choi , Fabien Racape

CCF: A Context Compression Framework for Efficient Long-Sequence Language Modeling

Scaling language models to longer contexts is essential for capturing rich dependencies across extended discourse. However, na\"ive context extension imposes significant computational and memory burdens, often resulting in inefficiencies…

Computation and Language · Computer Science 2026-02-03 Wenhao Li , Bangcheng Sun , Weihao Ye , Tianyi Zhang , Daohai Yu , Fei Chao , Rongrong Ji

Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints

Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between…

Computer Vision and Pattern Recognition · Computer Science 2023-04-27 Ganning Zhao , Tingwei Shen , Suya You , C. -C. Jay Kuo

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference

In recent years, compressed domain semantic inference has primarily relied on learned image coding models optimized for mean squared error (MSE). However, MSE-oriented optimization tends to yield latent spaces with limited semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Xu Zhang , Ming Lu , Yan Chen , Zhan Ma

Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Large language models (LLMs) have triggered a new stream of research focusing on compressing the context length to reduce the computational cost while ensuring the retention of helpful information for LLMs to answer the given question.…

Computation and Language · Computer Science 2024-12-20 Barys Liskavets , Maxim Ushakov , Shuvendu Roy , Mark Klibanov , Ali Etemad , Shane Luke