Related papers: Multi-Task Learning for Screen Content Image Codin…

Image Segmentation For Improved Lossless Screen Content Compression

In recent years, it has been found that screen content images (SCI) can be effectively compressed based on appropriate probability modelling and suitable entropy coding methods such as arithmetic coding. The key objective is determining the…

Image and Video Processing · Electrical Eng. & Systems 2023-05-11 Shabhrish Reddy Uddehal , Tilo Strutz , Hannah Och , André Kaup

Scalable Image Coding for Humans and Machines

At present, and increasingly so in the future, much of the captured visual content will not be seen by humans. Instead, it will be used for automated machine vision analytics and may require occasional human viewing. Examples of such…

Image and Video Processing · Electrical Eng. & Systems 2022-04-13 Hyomin Choi , Ivan V. Bajic

Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan , Mateen Ulhaq , Hyomin Choi , Fabien Racape

End-to-end optimized image compression for machines, a study

An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately,…

Image and Video Processing · Electrical Eng. & Systems 2020-11-13 Lahiru D. Chamain , Fabien Racapé , Jean Bégaint , Akshay Pushparaja , Simon Feltman

Benefiting from Multitask Learning to Improve Single Image Super-Resolution

Despite significant progress toward super resolving more realistic images by deeper convolutional neural networks (CNNs), reconstructing fine and natural textures still remains a challenging problem. Recent works on single image super…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Mohammad Saeed Rad , Behzad Bozorgtabar , Claudiu Musat , Urs-Viktor Marti , Max Basler , Hazim Kemal Ekenel , Jean-Philippe Thiran

Joint Learning of Intrinsic Images and Semantic Segmentation

Semantic segmentation of outdoor scenes is problematic when there are variations in imaging conditions. It is known that albedo (reflectance) is invariant to all kinds of illumination effects. Thus, using reflectance images for semantic…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Anil S. Baslamisli , Thomas T. Groenestege , Partha Das , Hoang-An Le , Sezer Karaoglu , Theo Gevers

Adapting Learned Image Codecs to Screen Content via Adjustable Transformations

As learned image codecs (LICs) become more prevalent, their low coding efficiency for out-of-distribution data becomes a bottleneck for some applications. To improve the performance of LICs for screen content (SC) images without breaking…

Image and Video Processing · Electrical Eng. & Systems 2024-02-28 H. Burak Dogaroglu , A. Burakhan Koyuncu , Atanas Boev , Elena Alshina , Eckehard Steinbach

Image Coding for Machines with Object Region Learning

Compression technology is essential for efficient image transmission and storage. With the rapid advances in deep learning, images are beginning to be used for image recognition as well as for human vision. For this reason, research has…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Takahiro Shindo , Taiju Watanabe , Kein Yamada , Hiroshi Watanabe

Spatial Competition for Low-Complexity Learned Image Compression

Autoencoder-based image codecs achieve state-of-the-art compression performance but often incur high computational complexity, particularly at decoding time. This work introduces a low-complexity learned image compression framework based on…

Image and Video Processing · Electrical Eng. & Systems 2026-05-14 Théophile Blard , Pierrick Philippe , Théo Ladune , Xiaoran Jiang , Olivier Déforges

FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression

The learned image compression (LIC) methods have already surpassed traditional techniques in compressing natural scene (NS) images. However, directly applying these methods to screen content (SC) images, which possess distinct…

Image and Video Processing · Electrical Eng. & Systems 2025-02-24 Shiqi Jiang , Hui Yuan , Shuai Li , Huanqiang Zeng , Sam Kwong

End-to-end optimized image compression for multiple machine tasks

An increasing share of captured images and videos are transmitted for storage and remote analysis by computer vision algorithms, rather than to be viewed by humans. Contrary to traditional standard codecs with engineered tools, neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-09 Lahiru D. Chamain , Fabien Racapé , Jean Bégaint , Akshay Pushparaja , Simon Feltman

Rank Minimization for Snapshot Compressive Imaging

Snapshot compressive imaging (SCI) refers to compressive imaging systems where multiple frames are mapped into a single measurement, with video compressive imaging and hyperspectral compressive imaging as two representative applications.…

Computer Vision and Pattern Recognition · Computer Science 2018-10-09 Yang Liu , Xin Yuan , Jinli Suo , David J. Brady , Qionghai Dai

Semantic segmentation with coarse annotations

Semantic segmentation is the task of classifying each pixel in an image. Training a segmentation model achieves best results using annotated images, where each pixel is annotated with the corresponding class. When obtaining fine annotations…

Computer Vision and Pattern Recognition · Computer Science 2025-10-20 Jort de Jong , Mike Holenderski

Image coding for machines: an end-to-end learned approach

Over recent years, deep learning-based computer vision systems have been applied to images at an ever-increasing pace, oftentimes representing the only type of consumption for those images. Given the dramatic explosion in the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Nam Le , Honglei Zhang , Francesco Cricri , Ramin Ghaznavi-Youvalari , Esa Rahtu

Learned Scalable Video Coding For Humans and Machines

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep…

Image and Video Processing · Electrical Eng. & Systems 2024-11-19 Hadi Hadizadeh , Ivan V. Bajić

Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the…

Applications · Statistics 2024-03-25 Haisheng Fu , Feng Liang , Jie Liang , Zhenman Fang , Guohe Zhang , Jingning Han

Optimal Control with Natural Images: Efficient Reinforcement Learning using Overcomplete Sparse Codes

Optimal control and sequential decision making are widely used in many complex tasks. Optimal control over a sequence of natural images is a first step towards understanding the role of vision in control. Here, we formalize this problem as…

Machine Learning · Computer Science 2026-05-07 Peter N. Loxley

Self-Supervised Learning of Remote Sensing Scene Representations Using Contrastive Multiview Coding

In recent years self-supervised learning has emerged as a promising candidate for unsupervised representation learning. In the visual domain its applications are mostly studied in the context of images of natural scenes. However, its…

Computer Vision and Pattern Recognition · Computer Science 2021-06-04 Vladan Stojnić , Vladimir Risojević

MTLE: A Multitask Learning Encoder of Visual Feature Representations for Video and Movie Description

Learning visual feature representations for video analysis is a daunting task that requires a large amount of training samples and a proper generalization framework. Many of the current state of the art methods for video captioning and…

Machine Learning · Computer Science 2018-09-20 Oliver Nina , Washington Garcia , Scott Clouse , Alper Yilmaz

Towards annotation-efficient segmentation via image-to-image translation

Often in medical imaging, it is prohibitively challenging to produce enough boundary annotations to train deep neural networks for accurate tumor segmentation. We propose the use of weak labels about whether an image presents tumor or…

Computer Vision and Pattern Recognition · Computer Science 2021-06-15 Eugene Vorontsov , Pavlo Molchanov , Christopher Beckham , Jan Kautz , Samuel Kadoury