Related papers: Depth Estimation Algorithm Based on Transformer-En…

FuseFormer: A Transformer for Visual and Thermal Image Fusion

Due to the lack of a definitive ground truth for the image fusion problem, the loss functions are structured based on evaluation metrics, such as the structural similarity index measure (SSIM). However, in doing so, a bias is introduced…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Aytekin Erdogan, Erdem Akagündüz

Depth Estimation using Weighted-loss and Transfer Learning

Depth estimation from 2D images is a common computer vision task that has applications in many fields including autonomous vehicles, scene understanding and robotics. The accuracy of a supervised depth estimation method mainly relies on the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-12 Muhammad Adeel Hafeez, Michael G. Madden, Ganesh Sistu, Ihsan Ullah

Enhanced Encoder-Decoder Architecture for Accurate Monocular Depth Estimation

Estimating depth from a single 2D image is a challenging task due to the lack of stereo or multi-view data, which are typically required for depth perception. In state-of-the-art architectures, the main challenge is to efficiently capture…

Computer Vision and Pattern Recognition · Computer Science 2025-01-27 Dabbrata Das, Argho Deb Das, Farhan Sadaf

Multi-Frame Self-Supervised Depth with Transformers

Multi-frame depth estimation improves over single-frame approaches by also leveraging geometric relationships between images via feature matching, in addition to learning appearance-based features. In this paper we revisit feature matching…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Vitor Guizilini, Rares Ambrus, Dian Chen, Sergey Zakharov, Adrien Gaidon

Towards Comprehensive Monocular Depth Estimation: Multiple Heads Are Better Than One

Depth estimation attracts widespread attention in the computer vision community. However, it is still quite difficult to recover an accurate depth map using only one RGB image. We observe a phenomenon that existing methods tend to fail in…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Shuwei Shao, Ran Li, Zhongcai Pei, Zhong Liu, Weihai Chen, Wentao Zhu, Xingming Wu, Baochang Zhang

Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion

Attention-based models such as transformers have shown outstanding performance on dense prediction tasks, such as semantic segmentation, owing to their capability of capturing long-range dependency in an image. However, the benefit of…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Ashutosh Agarwal, Chetan Arora

SDformer: Efficient End-to-End Transformer for Depth Completion

Depth completion aims to predict dense depth maps with sparse depth measurements from a depth sensor. Currently, Convolutional Neural Network (CNN) based models are the most popular methods applied to depth completion tasks. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Jian Qian, Miao Sun, Ashley Lee, Jie Li, Shenglong Zhuo, Patrick Yin Chiang

Monocular Depth Estimation Using Multi Scale Neural Network And Feature Fusion

Depth estimation from monocular images is a challenging problem in computer vision. In this paper, we tackle this problem using a novel network architecture using multi scale feature fusion. Our network uses two different blocks, first…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Abhinav Sagar

Revisiting Single Image Depth Estimation: Toward Higher Resolution Maps with Accurate Object Boundaries

This paper considers the problem of single image depth estimation. The employment of convolutional neural networks (CNNs) has recently brought about significant advancements in the research of this problem. However, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Junjie Hu, Mete Ozay, Yan Zhang, Takayuki Okatani

Deep Neural Networks for Accurate Depth Estimation with Latent Space Features

Depth estimation plays a pivotal role in advancing human-robot interactions, especially in indoor environments where accurate 3D scene reconstruction is essential for tasks like navigation and object handling. Monocular depth estimation,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Siddiqui Muhammad Yasir, Hyunsik Ahn

Autoencoded Image Compression for Secure and Fast Transmission

With exponential growth in the use of digital image data, the need for efficient transmission methods has become imperative. Traditional image compression techniques often sacrifice image fidelity for reduced file sizes, challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-10-15 Aryan Kashyap Naveen, Sunil Thunga, Anuhya Murki, Mahati A Kalale, Shriya Anil

Probabilistic Multimodal Depth Estimation Based on Camera-LiDAR Sensor Fusion

Multi-modal depth estimation is one of the key challenges for endowing autonomous machines with robust robotic perception capabilities. There have been outstanding advances in the development of uni-modal depth estimation techniques based…

Robotics · Computer Science 2023-07-21 Johan S. Obando-Ceron, Victor Romero-Cano, Sildomar Monteiro

Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation

With an unprecedented increase in the number of agents and systems that aim to navigate the real world using visual cues and the rising impetus for 3D Vision Models, the importance of depth estimation is hard to understate. While supervised…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Snehal Singh Tomar, Maitreya Suin, A. N. Rajagopalan

DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture

We propose DepthTCM, a physics-aware end-to-end framework for depth map compression. In our framework of DepthTCM, the high-bit depth map is first converted to a conventional 3-channel image representation losslessly using a method inspired…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Young-Seo Chang, Yatong An, Jae-Sang Hyun

Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation

Unsupervised monocular depth learning generally relies on the photometric relation among temporally adjacent images. Most of previous works use both mean absolute error (MAE) and structure similarity index measure (SSIM) with conventional…

Computer Vision and Pattern Recognition · Computer Science 2025-06-06 Yijun Cao, Fuya Luo, Yongjie Li

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces

Self-supervised monocular depth estimation (SSMDE) has gained attention in the field of deep learning as it estimates depth without requiring ground truth depth maps. This approach typically uses a photometric consistency loss between a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Wonhyeok Choi, Kyumin Hwang, Minwoo Choi, Kiljoon Han, Wonjoon Choi, Mingyu Shin, Sunghoon Im

Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation

In computer vision, depth estimation is crucial for domains like robotics, autonomous vehicles, augmented reality, and virtual reality. Integrating semantics with depth enhances scene understanding through reciprocal information sharing.…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Md Awsafur Rahman, Shaikh Anowarul Fattah

Entroformer: A Transformer-based Entropy Model for Learned Image Compression

One critical component in lossy deep image compression is the entropy model, which predicts the probability distribution of the quantized latent representation in the encoding and decoding modules. Previous works build entropy models upon…

Image and Video Processing · Electrical Eng. & Systems 2023-03-16 Yichen Qian, Ming Lin, Xiuyu Sun, Zhiyu Tan, Rong Jin

Decoder Modulation for Indoor Depth Completion

Depth completion recovers a dense depth map from sensor measurements. Current methods are mostly tailored for very sparse depth measurements from LiDARs in outdoor settings, while for indoor scenes Time-of-Flight (ToF) or structured light…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Dmitry Senushkin, Mikhail Romanov, Ilia Belikov, Anton Konushin, Nikolay Patakin

Deformation Aware Image Compression

Lossy compression algorithms aim to compactly encode images in a way which enables to restore them with minimal error. We show that a key limitation of existing algorithms is that they rely on error measures that are extremely sensitive to…

Computer Vision and Pattern Recognition · Computer Science 2018-04-13 Tamar Rott Shaham, Tomer Michaeli