Related papers: A Preprocessing Framework for Video Machine Vision…

Preprocessing Enhanced Image Compression for Machine Vision

Recently, more and more images are compressed and sent to the back-end devices for the machine analysis tasks~(\textit{e.g.,} object detection) instead of being purely watched by humans. However, most traditional or learned image codecs are…

Image and Video Processing · Electrical Eng. & Systems 2022-06-14 Guo Lu , Xingtong Ge , Tianxiong Zhong , Jing Geng , Qiang Hu

A Coding Framework and Benchmark towards Low-Bitrate Video Understanding

Video compression is indispensable to most video analysis systems. Despite saving transportation bandwidth, it also deteriorates downstream video understanding tasks, especially at low-bitrate settings. To systematically investigate this…

Image and Video Processing · Electrical Eng. & Systems 2024-09-24 Yuan Tian , Guo Lu , Yichao Yan , Guangtao Zhai , Li Chen , Zhiyong Gao

Rate-Perception Optimized Preprocessing for Video Coding

In the past decades, lots of progress have been done in the video compression field including traditional video codec and learning-based video codec. However, few studies focus on using preprocessing techniques to improve the…

Image and Video Processing · Electrical Eng. & Systems 2023-01-26 Chengqian Ma , Zhiqiang Wu , Chunlei Cai , Pengwei Zhang , Yi Wang , Long Zheng , Chao Chen , Quan Zhou

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang , Haofeng Huang , Yueyu Hu , Ling-Yu Duan , Jiaying Liu

Predictive Coding For Animation-Based Video Compression

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Goluck Konuko , Stéphane Lathuilière , Giuseppe Valenzise

Rate-Distortion Theory in Coding for Machines and its Application

Recent years have seen a tremendous growth in both the capability and popularity of automatic machine analysis of images and video. As a result, a growing need for efficient compression methods optimized for machine vision, rather than…

Image and Video Processing · Electrical Eng. & Systems 2025-03-05 Alon Harell , Yalda Foroutan , Nilesh Ahuja , Parual Datta , Bhavya Kanzariya , V. Srinivasa Somayazulu , Omesh Tickoo , Anderson de Andrade , Ivan V. Bajic

Embedding Compression Distortion in Video Coding for Machines

Currently, video transmission serves not only the Human Visual System (HVS) for viewing but also machine perception for analysis. However, existing codecs are primarily optimized for pixel-domain and HVS-perception metrics rather than the…

Image and Video Processing · Electrical Eng. & Systems 2025-03-28 Yuxiao Sun , Yao Zhao , Meiqin Liu , Chao Yao , Weisi Lin

Task Oriented Video Coding: A Survey

Video coding technology has been continuously improved for higher compression ratio with higher resolution. However, the state-of-the-art video coding standards, such as H.265/HEVC and Versatile Video Coding, are still designed with the…

Image and Video Processing · Electrical Eng. & Systems 2022-11-22 Daniel Wood

Sandwiched Video Compression: Efficiently Extending the Reach of Standard Codecs with Neural Wrappers

We propose sandwiched video compression -- a video compression system that wraps neural networks around a standard video codec. The sandwich framework consists of a neural pre- and post-processor with a standard video codec between them.…

Image and Video Processing · Electrical Eng. & Systems 2023-07-07 Berivan Isik , Onur G. Guleryuz , Danhang Tang , Jonathan Taylor , Philip A. Chou

DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy

We introduce a cutting-edge video compression framework tailored for the age of ubiquitous video data, uniquely designed to serve machine learning applications. Unlike traditional compression methods that prioritize human visual perception,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Huan Cui , Qing Li , Hanling Wang , Yong jiang

Machine vision-aware quality metrics for compressed image and video assessment

A main goal in developing video-compression algorithms is to enhance human-perceived visual quality while maintaining file size. But modern video-analysis efforts such as detection and recognition, which are integral to video surveillance…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Mikhail Dremin , Konstantin Kozhemyakov , Ivan Molodetskikh , Malakhov Kirill , Artur Sagitov , Dmitriy Vatolin

Compressed Vision for Efficient Video Understanding

Experience and reasoning occur across multiple temporal scales: milliseconds, seconds, hours or days. The vast majority of computer vision research, however, still focuses on individual images or short videos lasting only a few seconds.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Olivia Wiles , Joao Carreira , Iain Barr , Andrew Zisserman , Mateusz Malinowski

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

Learned Video Compression

We present a new algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all existing video codecs across nearly the entire bitrate range. To our knowledge, this is the first…

Image and Video Processing · Electrical Eng. & Systems 2018-11-20 Oren Rippel , Sanjay Nair , Carissa Lew , Steve Branson , Alexander G. Anderson , Lubomir Bourdev

Variable Rate Video Compression using a Hybrid Recurrent Convolutional Learning Framework

In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal…

Image and Video Processing · Electrical Eng. & Systems 2020-08-25 Aishwarya Jadhav

Scene Matters: Model-based Deep Video Compression

Video compression has always been a popular research area, where many traditional and deep video compression methods have been proposed. These methods typically rely on signal prediction theory to enhance compression performance by…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Lv Tang , Xinfeng Zhang , Gai Zhang , Xiaoqi Ma

End-to-end optimized image compression for multiple machine tasks

An increasing share of captured images and videos are transmitted for storage and remote analysis by computer vision algorithms, rather than to be viewed by humans. Contrary to traditional standard codecs with engineered tools, neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-09 Lahiru D. Chamain , Fabien Racapé , Jean Bégaint , Akshay Pushparaja , Simon Feltman

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

Video coding, which targets to compress and reconstruct the whole frame, and feature compression, which only preserves and transmits the most critical information, stand at two ends of the scale. That is, one is with compactness and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Ling-Yu Duan , Jiaying Liu , Wenhan Yang , Tiejun Huang , Wen Gao

TransVFC: A Transformable Video Feature Compression Framework for Machines

Nowadays, more and more video transmissions primarily aim at downstream machine vision tasks rather than humans. While widely deployed Human Visual System (HVS) oriented video coding standards like H.265/HEVC and H.264/AVC are efficient,…

Image and Video Processing · Electrical Eng. & Systems 2025-10-20 Yuxiao Sun , Yao Zhao , Meiqin Liu , Chao Yao , Huihui Bai , Chunyu Lin , Weisi Lin

End-to-End Learning for Video Frame Compression with Self-Attention

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou , Honglei Zhang , Francesco Cricri , Hamed R. Tavakoli , Jani Lainema , Emre Aksu , Miska Hannuksela , Esa Rahtu