Related papers: Transformer-based Image Compression

Learned Image Compression with Mixed Transformer-CNN Architectures

Learned image compression (LIC) methods have exhibited promising progress and superior rate-distortion performance compared with classical image compression standards. Most existing LIC methods are Convolutional Neural Networks-based…

Image and Video Processing · Electrical Eng. & Systems 2023-03-28 Jinming Liu , Heming Sun , Jiro Katto

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Recently, the performance of neural image compression (NIC) has steadily improved thanks to the last line of study, reaching or outperforming state-of-the-art conventional codecs. Despite significant progress, current NIC methods still rely…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Ahmed Ghorbel , Wassim Hamidouche , Luce Morin

Convolutional Transformer-Based Image Compression

In this paper, we present a novel transformer-based architecture for end-to-end image compression. Our architecture incorporates blocks that effectively capture local dependencies between tokens, eliminating the need for positional encoding…

Image and Video Processing · Electrical Eng. & Systems 2024-09-09 Bouzid Arezki , Fangchen Feng , Anissa Mokraoui

Transformer-based Learned Image Compression for Joint Decoding and Denoising

This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate…

Image and Video Processing · Electrical Eng. & Systems 2024-02-21 Yi-Hsin Chen , Kuan-Wei Ho , Shiau-Rung Tsai , Guan-Hsun Lin , Alessandro Gnutti , Wen-Hsiao Peng , Riccardo Leonardi

Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression

Recent advancements in learned image compression (LIC) methods have demonstrated superior performance over traditional hand-crafted codecs. These learning-based methods often employ convolutional neural networks (CNNs) or Transformer-based…

Computer Vision and Pattern Recognition · Computer Science 2024-08-08 Hamidreza Soltani , Erfan Ghasemi

Enhancing Learned Image Compression via Cross Window-based Attention

In recent years, learned image compression methods have demonstrated superior rate-distortion performance compared to traditional image compression methods. Recent methods utilize convolutional neural networks (CNN), variational…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Priyanka Mudgal , Feng Liu

Towards End-to-End Image Compression and Analysis with Transformers

We propose an end-to-end image compression and analysis model with Transformers, targeting to the cloud-based image classification application. Instead of placing an existing Transformer-based image classification model directly after an…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Yuanchao Bai , Xu Yang , Xianming Liu , Junjun Jiang , Yaowei Wang , Xiangyang Ji , Wen Gao

AICT: An Adaptive Image Compression Transformer

Motivated by the efficiency investigation of the Tranformer-based transform coding framework, namely SwinT-ChARM, we propose to enhance the latter, as first, with a more straightforward yet effective Tranformer-based channel-wise…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Ahmed Ghorbel , Wassim Hamidouche , Luce Morin

Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering

Along with the prosperity of recurrent neural network in modelling sequential data and the power of attention mechanism in automatically identify salient information, image captioning, a.k.a., image description, has been remarkably advanced…

Computer Vision and Pattern Recognition · Computer Science 2016-12-16 Hao Liu , Yang Yang , Fumin Shen , Lixin Duan , Heng Tao Shen

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering…

Image and Video Processing · Electrical Eng. & Systems 2024-01-23 Yichi Zhang , Zhihao Duan , Ming Lu , Dandan Ding , Fengqing Zhu , Zhan Ma

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Questing for learned lossy image coding (LIC) with superior compression performance and computation throughput is challenging. The vital factor behind it is how to intelligently explore Adaptive Neighborhood Information Aggregation (ANIA)…

Image and Video Processing · Electrical Eng. & Systems 2022-10-13 Ming Lu , Fangdong Chen , Shiliang Pu , Zhan Ma

TransTIC: Transferring Transformer-based Image Compression from Human Perception to Machine Perception

This work aims for transferring a Transformer-based image compression codec from human perception to machine perception without fine-tuning the codec. We propose a transferable Transformer-based image compression framework, termed TransTIC.…

Image and Video Processing · Electrical Eng. & Systems 2023-08-21 Yi-Hsin Chen , Ying-Chieh Weng , Chia-Hao Kao , Cheng Chien , Wei-Chen Chiu , Wen-Hsiao Peng

Transformer for Image Quality Assessment

Transformer has become the new standard method in natural language processing (NLP), and it also attracts research interests in computer vision area. In this paper we investigate the application of Transformer in Image Quality (TRIQ)…

Computer Vision and Pattern Recognition · Computer Science 2021-08-11 Junyong You , Jari Korhonen

Modeling Lost Information in Lossy Image Compression

Lossy image compression is one of the most commonly used operators for digital images. Most recently proposed deep-learning-based image compression methods leverage the auto-encoder structure, and reach a series of promising results in this…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Yaolong Wang , Mingqing Xiao , Chang Liu , Shuxin Zheng , Tie-Yan Liu

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression

Diffusion-based image compression has recently shown outstanding perceptual fidelity, yet its practicality is hindered by prohibitive sampling overhead and high memory usage. Most existing diffusion codecs employ U-Net architectures, where…

Image and Video Processing · Electrical Eng. & Systems 2026-03-16 Junqi Shi , Ming Lu , Xingchen Li , Anle Ke , Ruiqi Zhang , Zhan Ma

An Autoencoder-based Learned Image Compressor: Description of Challenge Proposal by NCTU

We propose a lossy image compression system using the deep-learning autoencoder structure to participate in the Challenge on Learned Image Compression (CLIC) 2018. Our autoencoder uses the residual blocks with skip connections to reduce the…

Computer Vision and Pattern Recognition · Computer Science 2019-02-21 David Alexandre , Chih-Peng Chang , Wen-Hsiao Peng , Hsueh-Ming Hang

A Unified End-to-End Framework for Efficient Deep Image Compression

Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks.…

Image and Video Processing · Electrical Eng. & Systems 2020-05-26 Jiaheng Liu , Guo Lu , Zhihao Hu , Dong Xu

Neural Image Compression for Gigapixel Histopathology Image Analysis

We propose Neural Image Compression (NIC), a two-step method to build convolutional neural networks for gigapixel image analysis solely using weak image-level labels. First, gigapixel images are compressed using a neural network trained in…

Computer Vision and Pattern Recognition · Computer Science 2020-04-16 David Tellez , Geert Litjens , Jeroen van der Laak , Francesco Ciompi

Transformer Compressed Sensing via Global Image Tokens

Convolutional neural networks (CNN) have demonstrated outstanding Compressed Sensing (CS) performance compared to traditional, hand-crafted methods. However, they are broadly limited in terms of generalisability, inductive bias and…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Marlon Bran Lorenzana , Craig Engstrom , Shekhar S. Chandra

TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

The hybrid architecture of convolution neural networks (CNN) and Transformer has been the most popular method for medical image segmentation. However, the existing networks based on the hybrid architecture suffer from two problems. First,…

Image and Video Processing · Electrical Eng. & Systems 2023-12-21 Rui Sun , Tao Lei , Weichuan Zhang , Yong Wan , Yong Xia , Asoke K. Nandi