Related papers: Task Oriented Video Coding: A Survey

A Survey on Perceptually Optimized Video Coding

To provide users with more realistic visual experiences, videos are developing in the trends of Ultra High Definition (UHD), High Frame Rate (HFR), High Dynamic Range (HDR), Wide Color Gammut (WCG) and high clarity. However, the data amount…

Multimedia · Computer Science 2022-11-17 Yun Zhang , Linwei Zhu , Gangyi Jiang , Sam Kwong , C. -C. Jay Kuo

Learned Scalable Video Coding For Humans and Machines

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep…

Image and Video Processing · Electrical Eng. & Systems 2024-11-19 Hadi Hadizadeh , Ivan V. Bajić

Emerging Standards for Machine-to-Machine Video Coding

Machines are increasingly becoming the primary consumers of visual data, yet most deployments of machine-to-machine systems still rely on remote inference where pixel-based video is streamed using codecs optimized for human perception.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-12 Md Eimran Hossain Eimon , Velibor Adzic , Hari Kalva , Borko Furht

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame, and feature compression, which only preserves and transmits the most critical information, stand at two ends of the scale. That is, one is with compactness and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Ling-Yu Duan , Jiaying Liu , Wenhan Yang , Tiejun Huang , Wen Gao

NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines

The recent progress in artificial intelligence has led to an ever-increasing usage of images and videos by machine analysis algorithms, mainly neural networks. Nonetheless, compression, storage and transmission of media have traditionally…

Image and Video Processing · Electrical Eng. & Systems 2024-01-22 Jukka I. Ahonen , Nam Le , Honglei Zhang , Antti Hallapuro , Francesco Cricri , Hamed Rezazadegan Tavakoli , Miska M. Hannuksela , Esa Rahtu

Deep Video Codec Control for Vision Models

Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constraints. However, standard…

Image and Video Processing · Electrical Eng. & Systems 2024-10-08 Christoph Reich , Biplob Debnath , Deep Patel , Tim Prangemeier , Daniel Cremers , Srimat Chakradhar

Image and Video Compression with Neural Networks: A Review

In recent years, the image and video coding technologies have advanced by leaps and bounds. However, due to the popularization of image and video acquisition devices, the growth rate of image and video data is far beyond the improvement of…

Computer Vision and Pattern Recognition · Computer Science 2019-04-23 Siwei Ma , Xinfeng Zhang , Chuanmin Jia , Zhenghui Zhao , Shiqi Wang , Shanshe Wang

Video Quality Assessment and Coding Complexity of the Versatile Video Coding Standard

In recent years, the proliferation of multimedia applications and formats, such as IPTV, Virtual Reality (VR, 360-degree), and point cloud videos, has presented new challenges to the video compression research community. Simultaneously,…

Image and Video Processing · Electrical Eng. & Systems 2023-10-23 Thomas Amestoy , Naty Sidaty , Wassim Hamidouche , Pierrick Philippe , Daniel Menard

Rate-Accuracy Bounds in Visual Coding for Machines

Increasingly, visual signals such as images, videos and point clouds are being captured solely for the purpose of automated analysis by computer vision models. Applications include traffic monitoring, robotics, autonomous driving, smart…

Image and Video Processing · Electrical Eng. & Systems 2025-07-25 Ivan V. Bajić

Scalable Video Coding for Humans and Machines

Video content is watched not only by humans, but increasingly also by machines. For example, machine learning models analyze surveillance video for security and traffic monitoring, search through YouTube videos for inappropriate content,…

Image and Video Processing · Electrical Eng. & Systems 2022-08-05 Hyomin Choi , Ivan V. Bajić

Emerging Advances in Learned Video Compression: Models, Systems and Beyond

Video compression is a fundamental topic in the visual intelligence, bridging visual signal sensing/capturing and high-level visual analytics. The broad success of artificial intelligence (AI) technology has enriched the horizon of video…

Image and Video Processing · Electrical Eng. & Systems 2025-05-01 Chuanmin Jia , Feng Ye , Siwei Ma , Wen Gao , Huifang Sun , Leonardo Chiariglione

Image coding for machines: an end-to-end learned approach

Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images. Given the dramatic explosion in the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Nam Le , Honglei Zhang , Francesco Cricri , Ramin Ghaznavi-Youvalari , Esa Rahtu

Recent Standard Development Activities on Video Coding for Machines

In recent years, video data has dominated internet traffic and becomes one of the major data formats. With the emerging 5G and internet of things (IoT) technologies, more and more videos are generated by edge devices, sent across networks,…

Computer Vision and Pattern Recognition · Computer Science 2021-05-27 Wen Gao , Shan Liu , Xiaozhong Xu , Manouchehr Rafie , Yuan Zhang , Igor Curcio

Human-Machine Collaborative Video Coding Through Cuboidal Partitioning

Video coding algorithms encode and decode an entire video frame while feature coding techniques only preserve and communicate the most critical information needed for a given application. This is because video coding targets human…

Image and Video Processing · Electrical Eng. & Systems 2021-09-06 Ashek Ahmmed , Manoranjan Paul , Manzur Murshed , David Taubman

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang , Haofeng Huang , Yueyu Hu , Ling-Yu Duan , Jiaying Liu

Deep Learning-Based Video Coding: A Review and A Case Study

The past decade has witnessed great success of deep learning technology in many disciplines, especially in computer vision and image processing. However, deep learning-based video coding remains in its infancy. This paper reviews the…

Multimedia · Computer Science 2020-03-13 Dong Liu , Yue Li , Jianping Lin , Houqiang Li , Feng Wu

Learned Video Compression

We present a new algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all existing video codecs across nearly the entire bitrate range. To our knowledge, this is the first…

Image and Video Processing · Electrical Eng. & Systems 2018-11-20 Oren Rippel , Sanjay Nair , Carissa Lew , Steve Branson , Alexander G. Anderson , Lubomir Bourdev

AI Oriented Large-Scale Video Management for Smart City: Technologies, Standards and Beyond

Deep learning has achieved substantial success in a series of tasks in computer vision. Intelligent video analysis, which can be broadly applied to video surveillance in various smart city applications, can also be driven by such powerful…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Lingyu Duan , Yihang Lou , Shiqi Wang , Wen Gao , Yong Rui

Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

With advances in image recognition technology based on deep learning, automatic video analysis by Artificial Intelligence is becoming more widespread. As the amount of video used for image recognition increases, efficient compression…

Computer Vision and Pattern Recognition · Computer Science 2023-04-04 Takahiro Shindo , Taiju Watanabe , Kein Yamada , Hiroshi Watanabe