Related papers: Multistage Spatial Context Models for Learned Imag…

200 papers

For learned image compression, the autoregressive context model has proved effective in improving rate-distortion (RD) performance because it helps remove spatial redundancies among latent representations. However, the decoding process…

Image and Video Processing · Electrical Eng. & Systems 2021-04-02 Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin

Deep learning-based image compression has made great progress recently. However, many leading schemes use a serial context-adaptive entropy model to improve the rate-distortion (R-D) performance, which is very slow. In addition, the…

Image and Video Processing · Electrical Eng. & Systems 2023-09-07 Haisheng Fu, Feng Liang, Jie Liang, Yongqiang Wang, Guohe Zhang, Jingning Han

Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the…

Applications · Statistics 2024-03-25 Haisheng Fu, Feng Liang, Jie Liang, Zhenman Fang, Guohe Zhang, Jingning Han

In the framework of learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations. To reduce the decoding time resulting from the serial autoregressive context model, the…

Image and Video Processing · Electrical Eng. & Systems 2023-12-01 Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

Over the past several years, we have witnessed impressive progress in the field of learned image compression. Recent learned image codecs are commonly based on autoencoders that first encode an image into low-dimensional latent…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Zongyu Guo, Zhizheng Zhang, Runsen Feng, Zhibo Chen

Context modeling is essential in learned image compression for accurately estimating the distribution of latents. While recent advanced methods have expanded context modeling capacity, they still struggle to efficiently exploit long-range…

Image and Video Processing · Electrical Eng. & Systems 2025-07-28 Yuqi Li, Haotian Zhang, Li Li, Dong Liu

The application of the context-adaptive entropy model significantly improves the rate-distortion (R-D) performance, in which hyperpriors and autoregressive models are jointly utilized to effectively capture the spatial redundancy of the…

Image and Video Processing · Electrical Eng. & Systems 2022-09-09 Haisheng Fu, Feng Liang

A key challenge in autoregressive image generation is to efficiently sample independent locations in parallel, while still modeling mutual dependencies with serial conditioning. Some recent works have addressed this by conditioning between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-26 David Eigen

Scene text recognition (STR) methods have struggled to attain high accuracy and fast inference speed. Autoregressive (AR)-based models implement the recognition in a character-by-character manner, showing superiority in accuracy but with…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Yongkun Du, Zhineng Chen, Caiyan Jia, Xiaoting Yin, Chenxia Li, Yuning Du, Yu-Gang Jiang

With the increasing popularity of deep learning in image processing, many learned lossless image compression methods have been proposed recently. One group of algorithms that have shown good performance are based on learned pixel-based…

Image and Video Processing · Electrical Eng. & Systems 2022-12-27 Fatih Kamisli

Most neural video codecs rely on temporal conditioning, which makes them susceptible to error propagation over long sequences. While Transformer-based architectures like the VCT offer a drift-free alternative, they suffer from high…

Image and Video Processing · Electrical Eng. & Systems 2026-05-21 Alexander Kopte, André Kaup

Nowadays, scene text recognition has attracted more and more attention due to its diverse applications. Most state-of-the-art methods adopt an encoder-decoder framework with the attention mechanism, autoregressively generating text from…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Xiaomeng Yang, Zhi Qiao, Yu Zhou

Adaptive block partitioning is responsible for large gains in current image and video compression systems. This method is able to compress large stationary image areas with only a few symbols, while maintaining a high level of quality in…

Image and Video Processing · Electrical Eng. & Systems 2023-07-13 Fabian Brand, Alexander Kopte, Kristian Fischer, André Kaup

We present Locality-aware Parallel Decoding (LPD) to accelerate autoregressive image generation. Traditional autoregressive image generation relies on next-patch prediction, a memory-bound process that leads to high latency. Existing works…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Zhuoyang Zhang, Luke J. Huang, Chengyue Wu, Shang Yang, Kelly Peng, Yao Lu, Song Han

Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose a novel synergistic design that can optimally balance these competing…

Computer Vision and Pattern Recognition · Computer Science 2021-03-17 Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

In this paper, we propose a deep hierarchical attention context model for lossless attribute compression of point clouds, leveraging a multi-resolution spatial structure and residual learning. A simple and effective Level of Detail (LoD)…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Yueru Chen, Wei Zhang, Dingquan Li, Jing Wang, Ge Li

Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens. However,…

Computer Vision and Pattern Recognition · Computer Science 2019-06-04 Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao

Recently, deep learning methods have shown promising results in point cloud compression. For octree-based point cloud compression, previous works show that the information of ancestor nodes and sibling nodes is equally important for…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yiqi Jin, Ziyu Zhu, Tongda Xu, Yuhuan Lin, Yan Wang

Entropy estimation is essential for the performance of learned image compression. It has been demonstrated that a transformer-based entropy model is of critical importance for achieving a high compression ratio, but at the expense of a…

Image and Video Processing · Electrical Eng. & Systems 2024-02-28 A. Burakhan Koyuncu, Panqi Jia, Atanas Boev, Elena Alshina, Eckehard Steinbach

In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained…

Image and Video Processing · Electrical Eng. & Systems 2020-07-20 David Minnen, Saurabh Singh