Related papers: Video Coding for Machine: Compact Visual Represent…

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame, and feature compression, which only preserves and transmits the most critical information, stand at two ends of the scale. That is, one is with compactness and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Ling-Yu Duan , Jiaying Liu , Wenhan Yang , Tiejun Huang , Wen Gao

An Emerging Coding Paradigm VCM: A Scalable Coding Approach Beyond Feature and Signal

In this paper, we study a new problem arising from the emerging MPEG standardization effort Video Coding for Machine (VCM), which aims to bridge the gap between visual feature compression and classical video coding. VCM is committed to…

Image and Video Processing · Electrical Eng. & Systems 2020-01-10 Sifeng Xia , Kunchangtai Liang , Wenhan Yang , Ling-Yu Duan , Jiaying Liu

End-to-End Learnable Multi-Scale Feature Compression for VCM

The proliferation of deep learning-based machine vision applications has given rise to a new type of compression, so called video coding for machine (VCM). VCM differs from traditional video coding in that it is optimized for machine vision…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Yeongwoong Kim , Hyewon Jeong , Janghyun Yu , Younhee Kim , Jooyoung Lee , Se Yoon Jeong , Hui Yong Kim

Revisit Visual Representation in Analytics Taxonomy: A Compression Perspective

Visual analytics have played an increasingly critical role in the Internet of Things, where massive visual signals have to be compressed and fed into machines. But facing such big data and constrained bandwidth capacity, existing…

Computer Vision and Pattern Recognition · Computer Science 2021-06-17 Yueyu Hu , Wenhan Yang , Haofeng Huang , Jiaying Liu

VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

Compression for machines is an emerging field, where inputs are encoded while optimizing the performance of downstream automated analysis. In scalable coding for humans and machines, the compressed representation used for machines is…

Image and Video Processing · Electrical Eng. & Systems 2023-05-19 Alon Harell , Yalda Foroutan , Ivan V. Bajic

Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling

Video Coding for Machines (VCM) aims to compress visual signals for machine analysis. However, existing methods only consider a few machines, neglecting the majority. Moreover, the machine's perceptual characteristics are not leveraged…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Qi Zhang , Shanshe Wang , Xinfeng Zhang , Chuanmin Jia , Zhao Wang , Siwei Ma , Wen Gao

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

Emerging Standards for Machine-to-Machine Video Coding

Machines are increasingly becoming the primary consumers of visual data, yet most deployments of machine-to-machine systems still rely on remote inference where pixel-based video is streamed using codecs optimized for human perception.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-12 Md Eimran Hossain Eimon , Velibor Adzic , Hari Kalva , Borko Furht

Symmetric Entropy-Constrained Video Coding for Machines

As video transmission increasingly serves machine vision systems (MVS) instead of human vision systems (HVS), video coding for machines (VCM) has become a critical research topic. Existing VCM methods often bind codecs to specific…

Image and Video Processing · Electrical Eng. & Systems 2025-11-04 Yuxiao Sun , Meiqin Liu , Chao Yao , Qi Tang , Jian Jin , Weisi Lin , Frederic Dufaux , Yao Zhao

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

The past decades have witnessed the rapid development of image and video coding techniques in the era of big data. However, the signal fidelity-driven coding pipeline design limits the capability of the existing image/video coding…

Computer Vision and Pattern Recognition · Computer Science 2020-01-13 Yueyu Hu , Shuai Yang , Wenhan Yang , Ling-Yu Duan , Jiaying Liu

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts

Image coding for machines (ICM) aims to compress images to support downstream AI analysis instead of human perception. For ICM, developing a unified codec to reduce information redundancy while empowering the compressed features to support…

Computer Vision and Pattern Recognition · Computer Science 2023-05-05 Ruoyu Feng , Jinming Liu , Xin Jin , Xiaohan Pan , Heming Sun , Zhibo Chen

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

Machines Serve Human: A Novel Variable Human-machine Collaborative Compression Framework

Human-machine collaborative compression has been receiving increasing research efforts for reducing image/video data, serving as the basis for both human perception and machine intelligence. Existing collaborative methods are dominantly…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Zifu Zhang , Shengxi Li , Xiancheng Sun , Mai Xu , Zhengyuan Liu , Jingyuan Xia

VVC Extension Scheme for Object Detection Using Contrast Reduction

In recent years, video analysis using Artificial Intelligence (AI) has been widely used, due to the remarkable development of image recognition technology using deep learning. In 2019, the Moving Picture Experts Group (MPEG) has started…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Takahiro Shindo , Taiju Watanabe , Kein Yamada , Hiroshi Watanabe

Low-Rank Adaptation of Pre-trained Vision Backbones for Energy-Efficient Image Coding for Machine

Image Coding for Machines (ICM) focuses on optimizing image compression for AI-driven analysis rather than human perception. Existing ICM frameworks often rely on separate codecs for specific tasks, leading to significant storage…

Image and Video Processing · Electrical Eng. & Systems 2025-05-30 Yichi Zhang , Zhihao Duan , Yuning Huang , Fengqing Zhu

Cross Modal Compression: Towards Human-comprehensible Semantic Compression

Traditional image/video compression aims to reduce the transmission/storage cost with signal fidelity as high as possible. However, with the increasing demand for machine analysis and semantic monitoring in recent years, semantic fidelity…

Image and Video Processing · Electrical Eng. & Systems 2022-09-07 Jiguo Li , Chuanmin Jia , Xinfeng Zhang , Siwei Ma , Wen Gao

High Efficiency Image Compression for Large Visual-Language Models

In recent years, large visual language models (LVLMs) have shown impressive performance and promising generalization capability in multi-modal tasks, thus replacing humans as receivers of visual information in various application scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-25 Binzhe Li , Shurun Wang , Shiqi Wang , Yan Ye

Image Coding for Machines with Omnipotent Feature Learning

Image Coding for Machines (ICM) aims to compress images for AI tasks analysis rather than meeting human perception. Learning a kind of feature that is both general (for AI tasks) and compact (for compression) is pivotal for its success. In…

Computer Vision and Pattern Recognition · Computer Science 2022-07-08 Ruoyu Feng , Xin Jin , Zongyu Guo , Runsen Feng , Yixin Gao , Tianyu He , Zhizheng Zhang , Simeng Sun , Zhibo Chen

Image Coding for Machines with Object Region Learning

Compression technology is essential for efficient image transmission and storage. With the rapid advances in deep learning, images are beginning to be used for image recognition as well as for human vision. For this reason, research has…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Takahiro Shindo , Taiju Watanabe , Kein Yamada , Hiroshi Watanabe

LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Image Compression for Machines (ICM) aims to compress images for machine vision tasks rather than human viewing. Current works predominantly concentrate on high-level tasks like object detection and semantic segmentation. However, the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Yuan Xue , Qi Zhang , Chuanmin Jia , Shiqi Wang