Related papers: Parametric Graph-based Separable Transforms for Vi…

Graph-based Transforms for Video Coding

In many state-of-the-art compression systems, signal transformation is an integral part of the encoding and decoding process, where transforms provide compact representations for the signals of interest. This paper introduces a class of…

Image and Video Processing · Electrical Eng. & Systems 2020-10-28 Hilmi E. Egilmez , Yung-Hsuan Chao , Antonio Ortega

Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction

Current video coding standards, including H.264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary to Karhunen-Loeve transforms (KLTs) decorrelate the intra-prediction residuals.…

Image and Video Processing · Electrical Eng. & Systems 2024-02-27 Wen-Yang Lu , Eduardo Pavez , Antonio Ortega , Xin Zhao , Shan Liu

Low-Complexity Transform Adjustments For Video Coding

Recent video codecs with multiple separable transforms can achieve significant coding gains using asymmetric trigonometric transforms (DCTs and DSTs), because they can exploit diverse statistics of residual block signals. However, they add…

Image and Video Processing · Electrical Eng. & Systems 2025-05-30 Amir Said , Hilmi E. Egilmez , Yung-Hsuan Chao

INT-DTT+: Low-Complexity Data-Dependent Transforms for Video Coding

Discrete trigonometric transforms (DTTs), such as the DCT-2 and the DST-7, are widely used in video codecs for their balance between coding performance and computational efficiency. In contrast, data-dependent transforms, such as the…

Image and Video Processing · Electrical Eng. & Systems 2025-11-25 Samuel Fernández-Menduiña , Eduardo Pavez , Antonio Ortega , Tsung-Wei Huang , Thuong Nguyen Canh , Guan-Ming Su , Peng Yin

Variable-size Symmetry-based Graph Fourier Transforms for image compression

Modern compression systems use linear transformations in their encoding and decoding processes, with transforms providing compact signal representations. While multiple data-dependent transforms for image/video coding can adapt to diverse…

Image and Video Processing · Electrical Eng. & Systems 2024-11-26 Alessandro Gnutti , Fabrizio Guerrini , Riccardo Leonardi , Antonio Ortega

GIST: Distributed Training for Large-Scale Graph Convolutional Networks

The graph convolutional network (GCN) is a go-to solution for machine learning on graphs, but its training is notoriously difficult to scale both in terms of graph size and the number of model parameters. Although some work has explored…

Machine Learning · Computer Science 2022-03-15 Cameron R. Wolfe , Jingkang Yang , Arindam Chowdhury , Chen Dun , Artun Bayer , Santiago Segarra , Anastasios Kyrillidis

Graph Transform Optimization with Application to Image Compression

In this paper, we propose a new graph-based transform and illustrate its potential application to signal compression. Our approach relies on the careful design of a graph that optimizes the overall rate-distortion performance through an…

Information Theory · Computer Science 2019-07-31 Giulia Fracastoro , Dorina Thanou , Pascal Frossard

Deformable Graph Transformer

Transformer-based models have recently shown success in representation learning on graph-structured data beyond natural language processing and computer vision. However, the success is limited to small-scale graphs due to the drawbacks of…

Machine Learning · Computer Science 2022-10-05 Jinyoung Park , Seongjun Yun , Hyeonjin Park , Jaewoo Kang , Jisu Jeong , Kyung-Min Kim , Jung-woo Ha , Hyunwoo J. Kim

A Class of Low-complexity DCT-like Transforms for Image and Video Coding

The discrete cosine transform (DCT) is a relevant tool in signal processing applications, mainly known for its good decorrelation properties. Current image and video coding standards -- such as JPEG and HEVC -- adopt the DCT as a…

Image and Video Processing · Electrical Eng. & Systems 2022-12-09 T. L. T. da Silveira , D. R. Canterle , D. F. G. Coelho , V. A. Coutinho , F. M. Bayer , R. J. Cintra

Joint Optimization of Primary and Secondary Transforms Using Rate-Distortion Optimized Transform Design

Data-dependent transforms are increasingly being incorporated into next-generation video coding systems such as AVM, a codec under development by the Alliance for Open Media (AOM), and VVC. To circumvent the computational complexities…

Image and Video Processing · Electrical Eng. & Systems 2025-05-22 Darukeesan Pakiyarajah , Eduardo Pavez , Antonio Ortega , Debargha Mukherjee , Onur Guleryuz , Keng-Shih Lu , Kruthika Koratti Sivakumar

Versatile Video Tokenization with Generative 2D Gaussian Splatting

Video tokenization procedure is critical for a wide range of video processing tasks. Most existing approaches directly transform video into fixed-grid and patch-wise tokens, which exhibit limited versatility. Spatially, uniformly allocating…

Computer Vision and Pattern Recognition · Computer Science 2025-08-18 Zhenghao Chen , Zicong Chen , Lei Liu , Yiming Wu , Dong Xu

Highly Efficient Non-Separable Transforms for Next Generation Video Coding

For the last few decades, the application of signal-adaptive transform coding to video compression has been stymied by the large computational complexity of matrix-based solutions. In this paper, we propose a novel parametric approach to…

Image and Video Processing · Electrical Eng. & Systems 2025-05-29 Amir Said , Xin Zhao , Marta Karczewicz , Hilmi E. Egilmez , Vadim Seregin , Jianle Chen

Learning Optimal Linear Block Transform by Rate Distortion Minimization

Linear block transform coding remains a fundamental component of image and video compression. Although the Discrete Cosine Transform (DCT) is widely employed in all current compression standards, its sub-optimality has sparked ongoing…

Image and Video Processing · Electrical Eng. & Systems 2024-11-28 Alessandro Gnutti , Chia-Hao Kao , Wen-Hsiao Peng , Riccardo Leonardi

Prediction of Transformed (DCT) Video Coding Residual for Video Compression

Video compression has been investigated by means of analysis-synthesis, and more particularly by means of inpainting. The first part of our approach has been to develop the inpainting of DCT coefficients in an image. This has shown good…

Information Theory · Computer Science 2014-04-17 Matthieu Moinard , Isabelle Amonou , Pierre Duhamel , Patrice Brault

Image Coding via Perceptually Inspired Graph Learning

Most codec designs rely on the mean squared error (MSE) as a fidelity metric in rate-distortion optimization, which allows to choose the optimal parameters in the transform domain but may fail to reflect perceptual quality. Alternative…

Image and Video Processing · Electrical Eng. & Systems 2023-03-06 Samuel Fernández-Menduiña , Eduardo Pavez , Antonio Ortega

FaSST: Fast Sparsifying Secondary Transform

Data-dependent secondary transforms, which aim to decorrelate coefficients of a separable primary transform, can improve residual coding efficiency; however, their deployment is often constrained by computational complexity. Recent video…

Image and Video Processing · Electrical Eng. & Systems 2026-05-15 Darukeesan Pakiyarajah , Samuel Fernández-Menduiña , Eduardo Pavez , Antonio Ortega , Debargha Mukherjee

A Multiparametric Class of Low-complexity Transforms for Image and Video Coding

Discrete transforms play an important role in many signal processing applications, and low-complexity alternatives for classical transforms became popular in recent years. Particularly, the discrete cosine transform (DCT) has proven to be…

Signal Processing · Electrical Eng. & Systems 2020-06-23 D. R. Canterle , T. L. T. da Silveira , F. M. Bayer , R. J. Cintra

Neural Generation of Blocks for Video Coding

Well-trained generative neural networks (GNN) are very efficient at compressing visual information for static images in their learned parameters but not as efficient as inter- and intra-prediction for most video content. However, for…

Image and Video Processing · Electrical Eng. & Systems 2020-10-07 Jonah Probell

Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation

Generating longer textual sequences when conditioned on the visual information is an interesting problem to explore. The challenge here proliferate over the standard vision conditioned sentence-level generation (e.g., image or video…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Aditya Mogadala , Marius Mosbach , Dietrich Klakow

Graph Signal Processing Meets Blind Source Separation

In graph signal processing (GSP), prior information on the dependencies in the signal is collected in a graph which is then used when processing or analyzing the signal. Blind source separation (BSS) techniques have been developed and…

Methodology · Statistics 2021-09-21 Jari Miettinen , Eyal Nitzan , Sergiy A. Vorobyov , Esa Ollila