Related papers: DepthTransfer: Depth Extraction from Video Using N…

Depth Extraction from Video Using Non-parametric Sampling

We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our…

Computer Vision and Pattern Recognition · Computer Science 2020-02-12 Kevin Karsch , Ce Liu , Sing Bing Kang

Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries

We present an algorithm to estimate depth in dynamic video scenes. We propose to learn and infer depth in videos from appearance, motion, occlusion boundaries, and geometric context of the scene. Using our method, depth can be estimated…

Computer Vision and Pattern Recognition · Computer Science 2015-10-27 S. Hussain Raza , Omar Javed , Aveek Das , Harpreet Sawhney , Hui Cheng , Irfan Essa

Web Stereo Video Supervision for Depth Prediction from Dynamic Scenes

We present a fully data-driven method to compute depth from diverse monocular video sequences that contain large amounts of non-rigid objects, e.g., people. In order to learn reconstruction cues for non-rigid scenes, we introduce a new…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 Chaoyang Wang , Simon Lucey , Federico Perazzi , Oliver Wang

Seurat: From Moving Points to Depth

Accurate depth estimation from monocular videos remains challenging due to ambiguities inherent in single-view geometry, as crucial depth cues like stereopsis are absent. However, humans often perceive relative depth intuitively by…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Seokju Cho , Jiahui Huang , Seungryong Kim , Joon-Young Lee

Summary Transfer: Exemplar-based Subset Selection for Video Summarization

Video summarization has unprecedented importance to help us digest, browse, and search today's ever-growing video collections. We propose a novel subset selection technique that leverages supervision in the form of human-created summaries…

Computer Vision and Pattern Recognition · Computer Science 2016-05-02 Ke Zhang , Wei-Lun Chao , Fei Sha , Kristen Grauman

Playing for Depth

Estimating the relative depth of a scene is a significant step towards understanding the general structure of the depicted scenery, the relations of entities in the scene and their interactions. When faced with the task of estimating depth…

Computer Vision and Pattern Recognition · Computer Science 2018-10-16 Mohammad Mahdi Haji-Esmaeili , Gholamali Montazer

Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

We present a novel method for simultaneous learning of depth, egomotion, object motion, and camera intrinsics from monocular videos, using only consistency across neighboring video frames as supervision signal. Similarly to prior work, our…

Computer Vision and Pattern Recognition · Computer Science 2019-10-31 Ariel Gordon , Hanhan Li , Rico Jonschkowski , Anelia Angelova

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Estimating video depth in open-world scenarios is challenging due to the diversity of videos in appearance, content motion, camera movement, and length. We present DepthCrafter, an innovative method for generating temporally consistent long…

Computer Vision and Pattern Recognition · Computer Science 2024-12-02 Wenbo Hu , Xiangjun Gao , Xiaoyu Li , Sijie Zhao , Xiaodong Cun , Yong Zhang , Long Quan , Ying Shan

Learning non-rigid surface reconstruction from spatio-temporal image patches

We present a method to reconstruct a dense spatio-temporal depth map of a non-rigidly deformable object directly from a video sequence. The estimation of depth is performed locally on spatio-temporal patches of the video, and then the full…

Computer Vision and Pattern Recognition · Computer Science 2020-06-22 Matteo Pedone , Abdelrahman Mostafa , Janne heikkilä

Sampling Based Scene-Space Video Processing

Many compelling video processing effects can be achieved if per-pixel depth information and 3D camera calibrations are known. However, the success of such methods is highly dependent on the accuracy of this "scene-space" information. We…

Computer Vision and Pattern Recognition · Computer Science 2021-02-08 Felix Klose , Oliver Wang , Jean-Charles Bazin , Marcus Magnor , Alexander Sorkine-Hornung

Self-Supervised Human Depth Estimation from Monocular Videos

Previous methods on estimating detailed human depth often require supervised training with `ground truth' depth data. This paper presents a self-supervised method that can be trained on YouTube videos without known depth, which makes…

Computer Vision and Pattern Recognition · Computer Science 2020-05-08 Feitong Tan , Hao Zhu , Zhaopeng Cui , Siyu Zhu , Marc Pollefeys , Ping Tan

Depth-Map Generation using Pixel Matching in Stereoscopic Pair of Images

Modern day multimedia content generation and dissemination is moving towards the presentation of more and more `realistic' scenarios. The switch from 2-dimensional (2D) to 3-dimensional (3D) has been a major driving force in that direction.…

Computer Vision and Pattern Recognition · Computer Science 2019-05-16 Asra Aslam , Mohd. Samar Ansari

Consistent Video Depth Estimation

We present an algorithm for reconstructing dense, geometrically consistent depth for all pixels in a monocular video. We leverage a conventional structure-from-motion reconstruction to establish geometric constraints on pixels in the video.…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Xuan Luo , Jia-Bin Huang , Richard Szeliski , Kevin Matzen , Johannes Kopf

Graph Based Temporal Aggregation for Video Retrieval

Large scale video retrieval is a field of study with a lot of ongoing research. Most of the work in the field is on video retrieval through text queries using techniques such as VSE++. However, there is little research done on video…

Computer Vision and Pattern Recognition · Computer Science 2020-11-05 Arvind Srinivasan , Aprameya Bharadwaj , Aveek Saha , Subramanyam Natarajan

Video Autoencoder: self-supervised disentanglement of static 3D structure and motion

A video autoencoder is proposed for learning disentan- gled representations of 3D structure and camera pose from videos in a self-supervised manner. Relying on temporal continuity in videos, our work assumes that the 3D scene structure in…

Computer Vision and Pattern Recognition · Computer Science 2021-10-07 Zihang Lai , Sifei Liu , Alexei A. Efros , Xiaolong Wang

Accurate Human Body Reconstruction for Volumetric Video

In this work, we enhance a professional end-to-end volumetric video production pipeline to achieve high-fidelity human body reconstruction using only passive cameras. While current volumetric video approaches estimate depth maps using…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Decai Chen , Markus Worchel , Ingo Feldmann , Oliver Schreer , Peter Eisert

Edge-aware Consistent Stereo Video Depth Estimation

Video depth estimation is crucial in various applications, such as scene reconstruction and augmented reality. In contrast to the naive method of estimating depths from images, a more sophisticated approach uses temporal information,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-05 Elena Kosheleva , Sunil Jaiswal , Faranak Shamsafar , Noshaba Cheema , Klaus Illgner-Fehns , Philipp Slusallek

Depth Any Video with Scalable Synthetic Data

Video depth estimation has long been hindered by the scarcity of consistent and scalable ground truth data, leading to inconsistent and unreliable results. In this paper, we introduce Depth Any Video, a model that tackles the challenge…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Honghui Yang , Di Huang , Wei Yin , Chunhua Shen , Haifeng Liu , Xiaofei He , Binbin Lin , Wanli Ouyang , Tong He

Extraction of Key-frames of Endoscopic Videos by using Depth Information

A deep learning-based monocular depth estimation (MDE) technique is proposed for selection of most informative frames (key frames) of an endoscopic video. In most of the cases, ground truth depth maps of polyps are not readily available and…

Computer Vision and Pattern Recognition · Computer Science 2021-07-12 Pradipta Sasmal , Avinash Paul , M. K. Bhuyan , Yuji Iwahori

DynamicStereo: Consistent Dynamic Depth from Stereo Videos

We consider the problem of reconstructing a dynamic scene observed from a stereo camera. Most existing methods for depth from stereo treat different stereo frames independently, leading to temporally inconsistent depth predictions. Temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-05-04 Nikita Karaev , Ignacio Rocco , Benjamin Graham , Natalia Neverova , Andrea Vedaldi , Christian Rupprecht