English
Related papers

Related papers: Compression as Adaptation: Implicit Visual Represe…

200 papers

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang , Haotian Wu , Richeng Jin , Deniz Gunduz , Krystian Mikolajczyk

With the increasing consumption of 3D displays and virtual reality, multi-view video has become a promising format. However, its high resolution and multi-camera shooting result in a substantial increase in data volume, making storage and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Chen Zhu , Guo Lu , Bing He , Rong Xie , Li Song

Visual analytics have played an increasingly critical role in the Internet of Things, where massive visual signals have to be compressed and fed into machines. But facing such big data and constrained bandwidth capacity, existing…

Computer Vision and Pattern Recognition · Computer Science 2021-06-17 Yueyu Hu , Wenhan Yang , Haofeng Huang , Jiaying Liu

While recent neural codecs achieve strong performance at low bitrates when optimized for perceptual quality, their effectiveness deteriorates significantly under ultra-low bitrate conditions. To mitigate this, generative compression methods…

Computer Vision and Pattern Recognition · Computer Science 2026-02-06 Chuqin Zhou , Xiaoyue Ling , Yunuo Chen , Jincheng Dai , Guo Lu , Wenjun Zhang

Existing compression methods typically focus on the removal of signal-level redundancies, while the potential and versatility of decomposing visual data into compact conceptual components still lack further study. To this end, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Jianhui Chang , Zhenghui Zhao , Chuanmin Jia , Shiqi Wang , Lingbo Yang , Qi Mao , Jian Zhang , Siwei Ma

We propose a method to compress full-resolution video sequences with implicit neural representations. Each frame is represented as a neural network that maps coordinate positions to pixel values. We use a separate implicit network to…

Machine Learning · Computer Science 2021-12-22 Yunfan Zhang , Ties van Rozendaal , Johann Brehmer , Markus Nagel , Taco Cohen

Compactly representing the visual signals is of fundamental importance in various image/video-centered applications. Although numerous approaches were developed for improving the image and video coding performance by removing the…

Image and Video Processing · Electrical Eng. & Systems 2020-08-14 Rongqun Lin , Linwei Zhu , Shiqi Wang , Sam Kwong

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Implicit Neural Networks (INRs) have emerged as powerful representations to encode all forms of data, including images, videos, audios, and scenes. With video, many INRs for video have been proposed for the compression task, and recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Shishira R Maiya , Anubhav Gupta , Matthew Gwilliam , Max Ehrlich , Abhinav Shrivastava

Perceptual studies demonstrate that conditional diffusion models excel at reconstructing video content aligned with human visual perception. Building on this insight, we propose a video compression framework that leverages conditional…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Fangqiu Yi , Jingyu Xu , Jiawei Shao , Chi Zhang , Xuelong Li

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang , Haofeng Huang , Yueyu Hu , Ling-Yu Duan , Jiaying Liu

Diffusion models have achieved remarkable success in generating high quality image and video data. More recently, they have also been used for image compression with high perceptual quality. In this paper, we present a novel approach to…

Image and Video Processing · Electrical Eng. & Systems 2024-02-15 Bohan Li , Yiming Liu , Xueyan Niu , Bo Bai , Lei Deng , Deniz Gündüz

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

We propose an end-to-end learned image data hiding framework that embeds and extracts secrets in the latent representations of a generic neural compressor. By leveraging a perceptual loss function in conjunction with our proposed message…

Cryptography and Security · Computer Science 2023-10-03 Chen-Hsiu Huang , Ja-Ling Wu

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla , Jean Gabriel Simard , Ross Goroshin , Chris Pal

We propose a framework for learned image and video compression using the generative sparse visual representation (SVR) guided by fidelity-preserving controls. By embedding inputs into a discrete latent space spanned by learned visual…

Image and Video Processing · Electrical Eng. & Systems 2024-04-10 Wei Jiang , Wei Wang

Finding compact representation of videos is an essential component in almost every problem related to video processing or understanding. In this paper, we propose a generative model to learn compact latent codes that can efficiently…

Computer Vision and Pattern Recognition · Computer Science 2020-04-22 Rakib Hyder , M. Salman Asif

Perceptual video compression leverages generative priors to reconstruct realistic textures and motions at low bitrates. However, existing perceptual codecs often lack native support for variable bitrate and progressive delivery, and their…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Daowen Li , Ruixiao Dong , Ying Chen , Kai Li , Ding Ding , Li Li

Visual data compression is shifting from human-centered reconstruction to machine-oriented representation coding. In this setting, an image is often mapped to a compact semantic embedding, which is then compressed and transmitted for…

Image and Video Processing · Electrical Eng. & Systems 2026-04-30 Andriy Enttsel , Vincent Corlay

Popularized by their strong image generation performance, diffusion and related methods for generative modeling have found widespread success in visual media applications. In particular, diffusion methods have enabled new approaches to data…

Image and Video Processing · Electrical Eng. & Systems 2026-01-28 Yibo Yang , Stephan Mandt
‹ Prev 1 2 3 10 Next ›