Related papers: Image Coding for Machines with Edge Information Le…

Image Coding for Machines with Object Region Learning

Compression technology is essential for efficient image transmission and storage. With the rapid advances in deep learning, images are beginning to be used for image recognition as well as for human vision. For this reason, research has…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Takahiro Shindo , Taiju Watanabe , Kein Yamada , Hiroshi Watanabe

Image Coding for Machines with Omnipotent Feature Learning

Image Coding for Machines (ICM) aims to compress images for AI tasks analysis rather than meeting human perception. Learning a kind of feature that is both general (for AI tasks) and compact (for compression) is pivotal for its success. In…

Computer Vision and Pattern Recognition · Computer Science 2022-07-08 Ruoyu Feng , Xin Jin , Zongyu Guo , Runsen Feng , Yixin Gao , Tianyu He , Zhizheng Zhang , Simeng Sun , Zhibo Chen

Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss

Image coding for machines (ICM) aims to compress images for machine analysis using recognition models rather than human vision. Hence, in ICM, it is important for the encoder to recognize and compress the information necessary for the…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Kei Iino , Shunsuke Akamatsu , Hiroshi Watanabe , Shohei Enomoto , Akira Sakamoto , Takeharu Eda

Bridging the gap between image coding for machines and humans

Image coding for machines (ICM) aims at reducing the bitrate required to represent an image while minimizing the drop in machine vision analysis accuracy. In many use cases, such as surveillance, it is also important that the visual quality…

Image and Video Processing · Electrical Eng. & Systems 2024-01-22 Nam Le , Honglei Zhang , Francesco Cricri , Ramin G. Youvalari , Hamed Rezazadegan Tavakoli , Emre Aksu , Miska M. Hannuksela , Esa Rahtu

LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Image Compression for Machines (ICM) aims to compress images for machine vision tasks rather than human viewing. Current works predominantly concentrate on high-level tasks like object detection and semantic segmentation. However, the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Yuan Xue , Qi Zhang , Chuanmin Jia , Shiqi Wang

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts

Image coding for machines (ICM) aims to compress images to support downstream AI analysis instead of human perception. For ICM, developing a unified codec to reduce information redundancy while empowering the compressed features to support…

Computer Vision and Pattern Recognition · Computer Science 2023-05-05 Ruoyu Feng , Jinming Liu , Xin Jin , Xiaohan Pan , Heming Sun , Zhibo Chen

CI-ICM: Channel Importance-driven Learned Image Coding for Machines

Traditional human vision-centric image compression methods are suboptimal for machine vision centric compression due to different visual properties and feature characteristics. To address this problem, we propose a Channel Importance-driven…

Image and Video Processing · Electrical Eng. & Systems 2026-04-08 Yun Zhang , Junle Liu , Huan Zhang , Zhaoqing Pan , Gangyi Jiang , Weisi Lin

VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

Compression for machines is an emerging field, where inputs are encoded while optimizing the performance of downstream automated analysis. In scalable coding for humans and machines, the compressed representation used for machines is…

Image and Video Processing · Electrical Eng. & Systems 2023-05-19 Alon Harell , Yalda Foroutan , Ivan V. Bajic

Stereo Image Coding for Machines with Joint Visual Feature Compression

2D image coding for machines (ICM) has achieved great success in coding efficiency, while less effort has been devoted to stereo image fields. To promote the efficiency of stereo image compression (SIC) and intelligent analysis, the stereo…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Dengchao Jin , Jianjun Lei , Bo Peng , Zhaoqing Pan , Nam Ling , Qingming Huang

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

We present a new image compression paradigm to achieve ``intelligently coding for machine'' by cleverly leveraging the common sense of Large Multimodal Models (LMMs). We are motivated by the evidence that large language/multimodal models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Jinming Liu , Yuntao Wei , Junyan Lin , Shengyang Zhao , Heming Sun , Zhibo Chen , Wenjun Zeng , Xin Jin

Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression

Image Coding for Machines (ICM) is becoming more important as research in computer vision progresses. ICM is a vital research field that pursues the use of images for image recognition models, facilitating efficient image transmission and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-17 Takahiro Shindo , Taiju Watanabe , Yui Tatsumi , Hiroshi Watanabe

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang , Haofeng Huang , Yueyu Hu , Ling-Yu Duan , Jiaying Liu

Low-Rank Adaptation of Pre-trained Vision Backbones for Energy-Efficient Image Coding for Machine

Image Coding for Machines (ICM) focuses on optimizing image compression for AI-driven analysis rather than human perception. Existing ICM frameworks often rely on separate codecs for specific tasks, leading to significant storage…

Image and Video Processing · Electrical Eng. & Systems 2025-05-30 Yichi Zhang , Zhihao Duan , Yuning Huang , Fengqing Zhu

Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling

Video Coding for Machines (VCM) aims to compress visual signals for machine analysis. However, existing methods only consider a few machines, neglecting the majority. Moreover, the machine's perceptual characteristics are not leveraged…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Qi Zhang , Shanshe Wang , Xinfeng Zhang , Chuanmin Jia , Zhao Wang , Siwei Ma , Wen Gao

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Questing for learned lossy image coding (LIC) with superior compression performance and computation throughput is challenging. The vital factor behind it is how to intelligently explore Adaptive Neighborhood Information Aggregation (ANIA)…

Image and Video Processing · Electrical Eng. & Systems 2022-10-13 Ming Lu , Fangdong Chen , Shiliang Pu , Zhan Ma

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame, and feature compression, which only preserves and transmits the most critical information, stand at two ends of the scale. That is, one is with compactness and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Ling-Yu Duan , Jiaying Liu , Wenhan Yang , Tiejun Huang , Wen Gao

SLIM: Semantic-based Low-bitrate Image compression for Machines by leveraging diffusion

In recent years, the demand of image compression models for machine vision has increased dramatically. However, the training frameworks of image compression still focus on the vision of human, maintaining the excessive perceptual details,…

Image and Video Processing · Electrical Eng. & Systems 2025-12-24 Hyeonjin Lee , Jun-Hyuk Kim , Jong-Seok Lee

You Can Mask More For Extremely Low-Bitrate Image Compression

Learned image compression (LIC) methods have experienced significant progress during recent years. However, these methods are primarily dedicated to optimizing the rate-distortion (R-D) performance at medium and high bitrates (> 0.1 bits…

Computer Vision and Pattern Recognition · Computer Science 2023-06-28 Anqi Li , Feng Li , Jiaxin Han , Huihui Bai , Runmin Cong , Chunjie Zhang , Meng Wang , Weisi Lin , Yao Zhao

MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic. However, existing compression algorithms must sacrifice either consistency with the ground truth or…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Chunyi Li , Guo Lu , Donghui Feng , Haoning Wu , Zicheng Zhang , Xiaohong Liu , Guangtao Zhai , Weisi Lin , Wenjun Zhang

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything

With the development of Deep Neural Networks (DNNs), many efforts have been made to handle medical image segmentation. Traditional methods such as nnUNet train specific segmentation models on the individual datasets. Plenty of recent…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Xiaobao Wei , Jiajun Cao , Yizhu Jin , Ming Lu , Guangyu Wang , Shanghang Zhang