English
Related papers

Related papers: CoD: A Diffusion Foundation Model for Image Compre…

200 papers

Prevailing Dataset Distillation (DD) methods leveraging generative models confront two fundamental limitations. First, despite pioneering the use of diffusion models in DD and delivering impressive performance, the vast majority of…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Letian Zhou , Songhua Liu , Xinchao Wang

Recent advanced diffusion methods typically derive strong generative priors by scaling diffusion transformers. However, scaling fails to generalize when adapted for real-time compression scenarios that demand lightweight models. In this…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Zhaoyang Jia , Naifu Xue , Zihan Zheng , Jiahao Li , Bin Li , Xiaoyi Zhang , Zongyu Guo , Yuan Zhang , Houqiang Li , Yan Lu

Diffusion-based image compression has shown remarkable potential for achieving ultra-low bitrate coding (less than 0.05 bits per pixel) with high realism, by leveraging the generative priors of large pre-trained text-to-image diffusion…

Image and Video Processing · Electrical Eng. & Systems 2025-06-30 Tianyu Zhang , Xin Luo , Li Li , Dong Liu

Conventional class-guided diffusion models generally succeed in generating images with correct semantic content, but often struggle with texture details. This limitation stems from the usage of class priors, which only provide coarse and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Xiaoyu Yue , Zidong Wang , Zeyu Lu , Shuyang Sun , Meng Wei , Wanli Ouyang , Lei Bai , Luping Zhou

As generative technologies advance, visual content has evolved into a complex mix of natural and AI-generated images, driving the need for more efficient coding techniques that prioritize perceptual quality. Traditional codecs and learned…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jianhui Chang

Diffusion-based generative image compression has demonstrated remarkable potential for achieving realistic reconstruction at ultra-low bitrates. The key to unlocking this potential lies in making the entire compression process…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Xihua Sheng , Lingyu Zhu , Tianyu Zhang , Dong Liu , Shiqi Wang , Jing Wang

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He , Weixi Feng , Tsu-Jui Fu , Varun Jampani , Arjun Akula , Pradyumna Narayana , Sugato Basu , William Yang Wang , Xin Eric Wang

Recently, perceptual image compression has achieved significant advancements, delivering high visual quality at low bitrates for natural images. However, for screen content, existing methods often produce noticeable artifacts when…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Tongda Xu , Jiahao Li , Bin Li , Yan Wang , Ya-Qin Zhang , Yan Lu

Learned image compression codecs have recently achieved impressive compression performances surpassing the most efficient image coding architectures. However, most approaches are trained to minimize rate and distortion which often leads to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Daniele Mari , Simone Milani

Large generative diffusion models have revolutionized text-to-image generation and offer immense potential for conditional generation tasks such as image enhancement, restoration, editing, and compositing. However, their widespread adoption…

Computer Vision and Pattern Recognition · Computer Science 2024-02-20 Kangfu Mei , Mauricio Delbracio , Hossein Talebi , Zhengzhong Tu , Vishal M. Patel , Peyman Milanfar

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from…

Image and Video Processing · Electrical Eng. & Systems 2024-01-03 Ruihan Yang , Stephan Mandt

Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive…

Image and Video Processing · Electrical Eng. & Systems 2024-10-10 Lucas Relic , Roberto Azevedo , Markus Gross , Christopher Schroers

Capturing images is a key part of automation for high-level tasks such as scene text recognition. Low-light conditions pose a challenge for high-level perception stacks, which are often optimized on well-lit, artifact-free images.…

Image and Video Processing · Electrical Eng. & Systems 2023-11-01 Cindy M. Nguyen , Eric R. Chan , Alexander W. Bergman , Gordon Wetzstein

Artificial neural networks, especially recent diffusion-based models, have shown remarkable superiority in gaming, control, and QA systems, where the training tasks' datasets are usually static. However, in real-world applications, such as…

Machine Learning · Computer Science 2025-01-16 Jifeng Hu , Li Shen , Sili Huang , Zhejian Yang , Hechang Chen , Lichao Sun , Yi Chang , Dacheng Tao

Discrete diffusion models are a powerful, emerging paradigm for code generation. They construct programs through iterative refinement of partially corrupted token sequences and enable parallel token refinement. Importantly, this paradigm…

Computation and Language · Computer Science 2026-05-19 Lize Shao , Michael Cardei , Zichen Xie , Ferdinando Fioretto , Wenxi Wang

Camouflaged Object Detection (COD) is a challenging task in computer vision due to the high similarity between camouflaged objects and their surroundings. Existing COD methods primarily employ semantic segmentation, which suffers from…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Zhongxi Chen , Ke Sun , Xianming Lin , Rongrong Ji

Stable diffusion networks have emerged as a groundbreaking development for their ability to produce realistic and detailed visual content. This characteristic renders them ideal decoders, capable of producing high-quality and aesthetically…

Computer Vision and Pattern Recognition · Computer Science 2024-08-21 Kai Liu , Kang You , Pan Gao

Conditioning image generation facilitates seamless editing and the creation of photorealistic images. However, conditioning on noisy or Out-of-Distribution (OoD) images poses significant challenges, particularly in balancing fidelity to the…

Computer Vision and Pattern Recognition · Computer Science 2024-08-23 Bastien van Delft , Tommaso Martorella , Alexandre Alahi

Traditional image codecs emphasize signal fidelity and human perception, often at the expense of machine vision tasks. Deep learning methods have demonstrated promising coding performance by utilizing rich semantic embeddings optimized for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Sha Guo , Zhuo Chen , Yang Zhao , Ning Zhang , Xiaotong Li , Lingyu Duan

Discrete visual tokenizers transform images into a sequence of tokens, enabling token-based visual generation akin to language models. However, this process is inherently challenging, as it requires both compressing visual signals into a…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Zeyu Liu , Zanlin Ni , Yeguo Hua , Xin Deng , Xiao Ma , Cheng Zhong , Gao Huang
‹ Prev 1 2 3 10 Next ›