Related papers: Multistage Spatial Context Models for Learned Imag…

Checkerboard Context Model for Efficient Learned Image Compression

For learned image compression, the autoregressive context model is proved effective in improving the rate-distortion (RD) performance. Because it helps remove spatial redundancies among latent representations. However, the decoding process…

Image and Video Processing · Electrical Eng. & Systems 2021-04-02 Dailan He , Yaoyan Zheng , Baocheng Sun , Yan Wang , Hongwei Qin

Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation

Deep learning-based image compression has made great progresses recently. However, many leading schemes use serial context-adaptive entropy model to improve the rate-distortion (R-D) performance, which is very slow. In addition, the…

Image and Video Processing · Electrical Eng. & Systems 2023-09-07 Haisheng Fu , Feng Liang , Jie Liang , Yongqiang Wang , Guohe Zhang , Jingning Han

Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the…

Applications · Statistics 2024-03-25 Haisheng Fu , Feng Liang , Jie Liang , Zhenman Fang , Guohe Zhang , Jingning Han

Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

In the framework of learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations. To reduce the decoding time resulting from the serial autoregressive context model, the…

Image and Video Processing · Electrical Eng. & Systems 2023-12-01 Yang Sui , Ding Ding , Xiang Pan , Xiaozhong Xu , Shan Liu , Bo Yuan , Zhenzhong Chen

Causal Contextual Prediction for Learned Image Compression

Over the past several years, we have witnessed impressive progress in the field of learned image compression. Recent learned image codecs are commonly based on autoencoders, that first encode an image into low-dimensional latent…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Zongyu Guo , Zhizheng Zhang , Runsen Feng , Zhibo Chen

Learned Image Compression with Hierarchical Progressive Context Modeling

Context modeling is essential in learned image compression for accurately estimating the distribution of latents. While recent advanced methods have expanded context modeling capacity, they still struggle to efficiently exploit long-range…

Image and Video Processing · Electrical Eng. & Systems 2025-07-28 Yuqi Li , Haotian Zhang , Li Li , Dong Liu

Learned Image Compression with Generalized Octave Convolution and Cross-Resolution Parameter Estimation

The application of the context-adaptive entropy model significantly improves the rate-distortion (R-D) performance, in which hyperpriors and autoregressive models are jointly utilized to effectively capture the spatial redundancy of the…

Image and Video Processing · Electrical Eng. & Systems 2022-09-09 Haisheng Fu , Feng Liang

Progressive Checkerboards for Autoregressive Multiscale Image Generation

A key challenge in autoregressive image generation is to efficiently sample independent locations in parallel, while still modeling mutual dependencies with serial conditioning. Some recent works have addressed this by conditioning between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-26 David Eigen

Context Perception Parallel Decoder for Scene Text Recognition

Scene text recognition (STR) methods have struggled to attain high accuracy and fast inference speed. Autoregressive (AR)-based models implement the recognition in a character-by-character manner, showing superiority in accuracy but with…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Yongkun Du , Zhineng Chen , Caiyan Jia , Xiaoting Yin , Chenxia Li , Yuning Du , Yu-Gang Jiang

Learned Lossless Image Compression Through Interpolation With Low Complexity

With the increasing popularity of deep learning in image processing, many learned lossless image compression methods have been proposed recently. One group of algorithms that have shown good performance are based on learned pixel-based…

Image and Video Processing · Electrical Eng. & Systems 2022-12-27 Fatih Kamisli

Parallel Context Modeling for Sliding Window Attention in Neural Video Coding

Most neural video codecs rely on temporal conditioning, which makes them susceptible to error propagation over long sequences. While Transformer-based architectures like the VCT offer a drift-free alternative, they suffer from high…

Image and Video Processing · Electrical Eng. & Systems 2026-05-21 Alexander Kopte , André Kaup

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

Nowadays, scene text recognition has attracted more and more attention due to its diverse applications. Most state-of-the-art methods adopt an encoder-decoder framework with the attention mechanism, autoregressively generating text from…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Xiaomeng Yang , Zhi Qiao , Yu Zhou

Spatially-Adaptive Learning-Based Image Compression with Hierarchical Multi-Scale Latent Spaces

Adaptive block partitioning is responsible for large gains in current image and video compression systems. This method is able to compress large stationary image areas with only a few symbols, while maintaining a high level of quality in…

Image and Video Processing · Electrical Eng. & Systems 2023-07-13 Fabian Brand , Alexander Kopte , Kristian Fischer , André Kaup

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

We present Locality-aware Parallel Decoding (LPD) to accelerate autoregressive image generation. Traditional autoregressive image generation relies on next-patch prediction, a memory-bound process that leads to high latency. Existing works…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Zhuoyang Zhang , Luke J. Huang , Chengyue Wu , Shang Yang , Kelly Peng , Yao Lu , Song Han

Multi-Stage Progressive Image Restoration

Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose a novel synergistic design that can optimally balance these competing…

Computer Vision and Pattern Recognition · Computer Science 2021-03-17 Syed Waqas Zamir , Aditya Arora , Salman Khan , Munawar Hayat , Fahad Shahbaz Khan , Ming-Hsuan Yang , Ling Shao

Hierarchical Attention Networks for Lossless Point Cloud Attribute Compression

In this paper, we propose a deep hierarchical attention context model for lossless attribute compression of point clouds, leveraging a multi-resolution spatial structure and residual learning. A simple and effective Level of Detail (LoD)…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Yueru Chen , Wei Zhang , Dingquan Li , Jing Wang , Ge Li

Masked Non-Autoregressive Image Captioning

Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens. However,…

Computer Vision and Pattern Recognition · Computer Science 2019-06-04 Junlong Gao , Xi Meng , Shiqi Wang , Xia Li , Shanshe Wang , Siwei Ma , Wen Gao

ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression

Recently, deep learning methods have shown promising results in point cloud compression. For octree-based point cloud compression, previous works show that the information of ancestor nodes and sibling nodes are equally important for…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yiqi Jin , Ziyu Zhu , Tongda Xu , Yuhuan Lin , Yan Wang

Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression

Entropy estimation is essential for the performance of learned image compression. It has been demonstrated that a transformer-based entropy model is of critical importance for achieving a high compression ratio, however, at the expense of a…

Image and Video Processing · Electrical Eng. & Systems 2024-02-28 A. Burakhan Koyuncu , Panqi Jia , Atanas Boev , Elena Alshina , Eckehard Steinbach

Channel-wise Autoregressive Entropy Models for Learned Image Compression

In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained…

Image and Video Processing · Electrical Eng. & Systems 2020-07-20 David Minnen , Saurabh Singh