English

Sparse Point Clouds Assisted Learned Image Compression

Computer Vision and Pattern Recognition 2024-12-23 v1 Image and Video Processing

Abstract

In the field of autonomous driving, a variety of sensor data types exist, each representing different modalities of the same scene. Therefore, it is feasible to utilize data from other sensors to facilitate image compression. However, few techniques have explored the potential benefits of utilizing inter-modality correlations to enhance the image compression performance. In this paper, motivated by the recent success of learned image compression, we propose a new framework that uses sparse point clouds to assist in learned image compression in the autonomous driving scenario. We first project the 3D sparse point cloud onto a 2D plane, resulting in a sparse depth map. Utilizing this depth map, we proceed to predict camera images. Subsequently, we use these predicted images to extract multi-scale structural features. These features are then incorporated into learned image compression pipeline as additional information to improve the compression performance. Our proposed framework is compatible with various mainstream learned image compression models, and we validate our approach using different existing image compression methods. The experimental results show that incorporating point cloud assistance into the compression pipeline consistently enhances the performance.

Keywords

Cite

@article{arxiv.2412.15752,
  title  = {Sparse Point Clouds Assisted Learned Image Compression},
  author = {Yiheng Jiang and Haotian Zhang and Li Li and Dong Liu and Zhu Li},
  journal= {arXiv preprint arXiv:2412.15752},
  year   = {2024}
}

Comments

Accepted by TCSVT

R2 v1 2026-06-28T20:43:37.717Z