Related papers: Base Layer Efficiency in Scalable Human-Machine Co…

Scalable Image Coding for Humans and Machines

At present, and increasingly so in the future, much of the captured visual content will not be seen by humans. Instead, it will be used for automated machine vision analytics and may require occasional human viewing. Examples of such…

Image and Video Processing · Electrical Eng. & Systems 2022-04-13 Hyomin Choi, Ivan V. Bajic

Scalable Video Coding for Humans and Machines

Video content is watched not only by humans, but increasingly also by machines. For example, machine learning models analyze surveillance video for security and traffic monitoring, search through YouTube videos for inappropriate content,…

Image and Video Processing · Electrical Eng. & Systems 2022-08-05 Hyomin Choi, Ivan V. Bajić

Learned Scalable Video Coding For Humans and Machines

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep…

Image and Video Processing · Electrical Eng. & Systems 2024-11-19 Hadi Hadizadeh, Ivan V. Bajić

Rate-Distortion in Image Coding for Machines

In recent years, there has been a sharp increase in transmission of images to remote servers specifically for the purpose of computer vision. In many applications, such as surveillance, images are mostly transmitted for automated analysis,…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Alon Harell, Anderson De Andrade, Ivan V. Bajic

Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing

Scalable image coding for both humans and machines is a technique that has gained a lot of attention recently. This technology enables the hierarchical decoding of images for human vision and image recognition models. It is a highly…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Takahiro Shindo, Yui Tatsumi, Taiju Watanabe, Hiroshi Watanabe

Scalable Image Coding for Humans and Machines Using Feature Fusion Network

As image recognition models become more prevalent, scalable coding methods for machines and humans gain more importance. Applications of image recognition models include traffic monitoring and farm management. In these use cases, the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Takahiro Shindo, Taiju Watanabe, Yui Tatsumi, Hiroshi Watanabe

Human-Machine Collaborative Video Coding Through Cuboidal Partitioning

Video coding algorithms encode and decode an entire video frame while feature coding techniques only preserve and communicate the most critical information needed for a given application. This is because video coding targets human…

Image and Video Processing · Electrical Eng. & Systems 2021-09-06 Ashek Ahmmed, Manoranjan Paul, Manzur Murshed, David Taubman

VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

Compression for machines is an emerging field, where inputs are encoded while optimizing the performance of downstream automated analysis. In scalable coding for humans and machines, the compressed representation used for machines is…

Image and Video Processing · Electrical Eng. & Systems 2023-05-19 Alon Harell, Yalda Foroutan, Ivan V. Bajic

Rate-Accuracy Bounds in Visual Coding for Machines

Increasingly, visual signals such as images, videos and point clouds are being captured solely for the purpose of automated analysis by computer vision models. Applications include traffic monitoring, robotics, autonomous driving, smart…

Image and Video Processing · Electrical Eng. & Systems 2025-07-25 Ivan V. Bajić

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

The past decades have witnessed the rapid development of image and video coding techniques in the era of big data. However, the signal fidelity-driven coding pipeline design limits the capability of the existing image/video coding…

Computer Vision and Pattern Recognition · Computer Science 2020-01-13 Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, Jiaying Liu

Rate-Distortion Theory in Coding for Machines and its Application

Recent years have seen a tremendous growth in both the capability and popularity of automatic machine analysis of images and video. As a result, a growing need for efficient compression methods optimized for machine vision, rather than…

Image and Video Processing · Electrical Eng. & Systems 2025-03-05 Alon Harell, Yalda Foroutan, Nilesh Ahuja, Parual Datta, Bhavya Kanzariya, V. Srinivasa Somayazulu, Omesh Tickoo, Anderson de Andrade, Ivan V. Bajic

High efficiency compression for object detection

Image and video compression has traditionally been tailored to human vision. However, modern applications such as visual analytics and surveillance rely on computers seeing and analyzing the images before (or instead of) humans. For these…

Image and Video Processing · Electrical Eng. & Systems 2018-02-19 Hyomin Choi, Ivan V. Bajic

Task Oriented Video Coding: A Survey

Video coding technology has been continuously improved for higher compression ratio with higher resolution. However, the state-of-the-art video coding standards, such as H.265/HEVC and Versatile Video Coding, are still designed with the…

Image and Video Processing · Electrical Eng. & Systems 2022-11-22 Daniel Wood

Machine vision-aware quality metrics for compressed image and video assessment

A main goal in developing video-compression algorithms is to enhance human-perceived visual quality while maintaining file size. But modern video-analysis efforts such as detection and recognition, which are integral to video surveillance…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Mikhail Dremin, Konstantin Kozhemyakov, Ivan Molodetskikh, Malakhov Kirill, Artur Sagitov, Dmitriy Vatolin

On Annotation-free Optimization of Video Coding for Machines

Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, research on video coding for…

Image and Video Processing · Electrical Eng. & Systems 2024-06-13 Marc Windsheimer, Fabian Brand, André Kaup

Efficient coding along the visual hierarchy

Biological visual systems learn from limited experience, unlike deep learning models that rely on millions of training images. What learning principles make this possible? We tested whether efficient coding, the idea that neural…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Ananya Passi, Brian S. Robinson, Michael F. Bonner

Scalable Human-Machine Point Cloud Compression

Due to the limited computational capabilities of edge devices, deep learning inference can be quite expensive. One remedy is to compress and transmit point cloud data over the network for server-side processing. Unfortunately, this approach…

Computer Vision and Pattern Recognition · Computer Science 2024-02-26 Mateen Ulhaq, Ivan V. Bajić

Image coding for machines: an end-to-end learned approach

Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images. Given the dramatic explosion in the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Esa Rahtu

Deep Learning Technique for Human Parsing: A Survey and Outlook

Human parsing aims to partition humans in image or video into multiple pixel-level semantic parts. In the last decade, it has gained significantly increased interest in the computer vision community and has been utilized in a broad range of…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Lu Yang, Wenhe Jia, Shan Li, Qing Song

Learned Image Coding for Machines: A Content-Adaptive Approach

Today, according to the Cisco Annual Internet Report (2018-2023), the fastest-growing category of Internet traffic is machine-to-machine communication. In particular, machine-to-machine communication of images and videos represents a new…

Image and Video Processing · Electrical Eng. & Systems 2021-10-14 Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Hamed Rezazadegan Tavakoli, Esa Rahtu