Related papers: Explainable Machine Learning based Transform Codin…

On Energy Compaction of 2D Saab Image Transforms

The block Discrete Cosine Transform (DCT) is commonly used in image and video compression due to its good energy compaction property. The Saab transform was recently proposed as an effective signal transform for image understanding. In this…

Image and Video Processing · Electrical Eng. & Systems 2019-08-30 Na Li , Yongfei Zhang , Yun Zhang , C. -C. Jay Kuo

Multi-rate adaptive transform coding for video compression

Contemporary lossy image and video coding standards rely on transform coding, the process through which pixels are mapped to an alternative representation to facilitate efficient data compression. Despite impressive performance of…

Image and Video Processing · Electrical Eng. & Systems 2023-02-21 Lyndon R. Duong , Bohan Li , Cheng Chen , Jingning Han

Traditional Transformation Theory Guided Model for Learned Image Compression

Recently, many deep image compression methods have been proposed and achieved remarkable performance. However, these methods are dedicated to optimizing the compression performance and speed at medium and high bitrates, while research on…

Image and Video Processing · Electrical Eng. & Systems 2024-02-27 Zhiyuan Li , Chenyang Ge , Shun Li

Learning Optimal Linear Block Transform by Rate Distortion Minimization

Linear block transform coding remains a fundamental component of image and video compression. Although the Discrete Cosine Transform (DCT) is widely employed in all current compression standards, its sub-optimality has sparked ongoing…

Image and Video Processing · Electrical Eng. & Systems 2024-11-28 Alessandro Gnutti , Chia-Hao Kao , Wen-Hsiao Peng , Riccardo Leonardi

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

Conditional coding has lately emerged as the mainstream approach to learned video compression. However, a recent study shows that it may perform worse than residual coding when the information bottleneck arises. Conditional residual coding…

Image and Video Processing · Electrical Eng. & Systems 2024-07-11 Yi-Hsin Chen , Hong-Sheng Xie , Cheng-Wei Chen , Zong-Lin Gao , Martin Benjak , Wen-Hsiao Peng , Jörn Ostermann

Prediction of Transformed (DCT) Video Coding Residual for Video Compression

Video compression has been investigated by means of analysis-synthesis, and more particularly by means of inpainting. The first part of our approach has been to develop the inpainting of DCT coefficients in an image. This has shown good…

Information Theory · Computer Science 2014-04-17 Matthieu Moinard , Isabelle Amonou , Pierre Duhamel , Patrice Brault

Learned Image Coding for Machines: A Content-Adaptive Approach

Today, according to the Cisco Annual Internet Report (2018-2023), the fastest-growing category of Internet traffic is machine-to-machine communication. In particular, machine-to-machine communication of images and videos represents a new…

Image and Video Processing · Electrical Eng. & Systems 2021-10-14 Nam Le , Honglei Zhang , Francesco Cricri , Ramin Ghaznavi-Youvalari , Hamed Rezazadegan Tavakoli , Esa Rahtu

Combined neural network-based intra prediction and transform selection

The interactions between different tools added successively to a block-based video codec are critical to its rate-distortion efficiency. In particular, when deep neural network-based intra prediction modes are inserted into a block-based…

Image and Video Processing · Electrical Eng. & Systems 2021-08-19 Thierry Dumas , Franck Galpin , Philippe Bordes

A Multiparametric Class of Low-complexity Transforms for Image and Video Coding

Discrete transforms play an important role in many signal processing applications, and low-complexity alternatives for classical transforms became popular in recent years. Particularly, the discrete cosine transform (DCT) has proven to be…

Signal Processing · Electrical Eng. & Systems 2020-06-23 D. R. Canterle , T. L. T. da Silveira , F. M. Bayer , R. J. Cintra

A Class of Low-complexity DCT-like Transforms for Image and Video Coding

The discrete cosine transform (DCT) is a relevant tool in signal processing applications, mainly known for its good decorrelation properties. Current image and video coding standards -- such as JPEG and HEVC -- adopt the DCT as a…

Image and Video Processing · Electrical Eng. & Systems 2022-12-09 T. L. T. da Silveira , D. R. Canterle , D. F. G. Coelho , V. A. Coutinho , F. M. Bayer , R. J. Cintra

CVC: The Contourlet Video Compression algorithm for real-time applications

Nowadays, real-time video communication over the internet through video conferencing applications has become an invaluable tool in everyone's professional and personal life. This trend underlines the need for video coding algorithms that…

Multimedia · Computer Science 2015-10-05 Stamos Katsigiannis , Georgios Papaioannou , Dimitris Maroulis

Maximizing compression efficiency through block rotation

The Discrete Cosine Transform (DCT) is widely used in lossy image and video compression schemes, e.g., JPEG and MPEG. In this paper, we show that the compression efficiency of the DCT is dependent on the edge directions within a block. In…

Multimedia · Computer Science 2014-11-18 Rui F. C. Guerreiro , Pedro M. Q. Aguiar

Graph-based Transforms for Video Coding

In many state-of-the-art compression systems, signal transformation is an integral part of the encoding and decoding process, where transforms provide compact representations for the signals of interest. This paper introduces a class of…

Image and Video Processing · Electrical Eng. & Systems 2020-10-28 Hilmi E. Egilmez , Yung-Hsuan Chao , Antonio Ortega

Low-complexity Deep Video Compression with A Distributed Coding Architecture

Prevalent predictive coding-based video compression methods rely on a heavy encoder to reduce temporal redundancy, which makes it challenging to deploy them on resource-constrained devices. Since the 1970s, distributed source coding theory…

Image and Video Processing · Electrical Eng. & Systems 2023-04-04 Xinjie Zhang , Jiawei Shao , Jun Zhang

Decorrelation Speeds Up Vision Transformers

Masked Autoencoder (MAE) pre-training of vision transformers (ViTs) yields strong performance in low-label data regimes but comes with substantial computational costs, making it impractical in time- and resource-constrained industrial…

Computer Vision and Pattern Recognition · Computer Science 2026-01-16 Kieran Carrigg , Rob van Gastel , Melda Yeghaian , Sander Dalm , Faysal Boughorbel , Marcel van Gerven

Towards Video Codec Performance Evaluation: A Rate-Energy-Distortion Perspective

The Bj{\o}ntegaard Delta rate (BD-rate) objectively assesses the coding efficiency of video codecs using the rate-distortion (R-D) performance but overlooks encoding energy, which is crucial in practical applications, especially for those…

Image and Video Processing · Electrical Eng. & Systems 2024-10-03 Geetha Ramasubbu , André Kaup , Christian Herglotz

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

Steerable Discrete Cosine Transform

In image compression, classical block-based separable transforms tend to be inefficient when image blocks contain arbitrarily shaped discontinuities. For this reason, transforms incorporating directional information are an appealing…

Information Theory · Computer Science 2018-10-24 Giulia Fracastoro , Sophie Marie Fosson , Enrico Magli

INT-DTT+: Low-Complexity Data-Dependent Transforms for Video Coding

Discrete trigonometric transforms (DTTs), such as the DCT-2 and the DST-7, are widely used in video codecs for their balance between coding performance and computational efficiency. In contrast, data-dependent transforms, such as the…

Image and Video Processing · Electrical Eng. & Systems 2025-11-25 Samuel Fernández-Menduiña , Eduardo Pavez , Antonio Ortega , Tsung-Wei Huang , Thuong Nguyen Canh , Guan-Ming Su , Peng Yin

Transform-Based Feature Map Compression for CNN Inference

To achieve higher accuracy in machine learning tasks, very deep convolutional neural networks (CNNs) are designed recently. However, the large memory access of deep CNNs will lead to high power consumption. A variety of hardware-friendly…

Image and Video Processing · Electrical Eng. & Systems 2021-06-25 Yubo Shi , Meiqi Wang , Siyi Chen , Jinghe Wei , Zhongfeng Wang