Related papers: Sampling Based Scene-Space Video Processing

Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis

Robust scene segmentation and keyframe extraction are essential preprocessing steps in video understanding pipelines, supporting tasks such as indexing, summarization, and semantic retrieval. However, existing methods often lack…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Vasilii Korolkov

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Recovering 3D scenes from sparse views is a challenging task due to its inherent ill-posed problem. Conventional methods have developed specialized solutions (e.g., geometry regularization or feed-forward deterministic model) to mitigate…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Hanyang Wang , Fangfu Liu , Jiawei Chi , Yueqi Duan

Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames

Humans are remarkably efficient at forming spatial understanding from just a few visual observations. When browsing real estate or navigating unfamiliar spaces, they intuitively select a small set of views that summarize the spatial layout.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Chao Chen , Mingzhi Zhu , Ankush Pratap Singh , Yu Yan , Felix Juefei-Xu , Chen Feng

Towards Geometric and Textural Consistency 3D Scene Generation via Single Image-guided Model Generation and Layout Optimization

In recent years, 3D generation has made great strides in both academia and industry. However, generating 3D scenes from a single RGB image remains a significant challenge, as current approaches often struggle to ensure both object…

Graphics · Computer Science 2026-02-18 Xiang Tang , Ruotong Li , Xiaopeng Fan

Scene Matters: Model-based Deep Video Compression

Video compression has always been a popular research area, where many traditional and deep video compression methods have been proposed. These methods typically rely on signal prediction theory to enhance compression performance by…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Lv Tang , Xinfeng Zhang , Gai Zhang , Xiaoqi Ma

Consistent Depth of Moving Objects in Video

We present a method to estimate depth of a dynamic scene, containing arbitrary moving objects, from an ordinary video captured with a moving camera. We seek a geometrically and temporally consistent solution to this underconstrained…

Computer Vision and Pattern Recognition · Computer Science 2021-08-04 Zhoutong Zhang , Forrester Cole , Richard Tucker , William T. Freeman , Tali Dekel

Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond

Capturing and rendering novel views of complex real-world scenes is a long-standing problem in computer graphics and vision, with applications in augmented and virtual reality, immersive experiences and 3D photography. The advent of deep…

Graphics · Computer Science 2024-08-09 Ravi Ramamoorthi

Scene-Aware 3D Multi-Human Motion Capture from a Single Camera

In this work, we consider the problem of estimating the 3D position of multiple humans in a scene as well as their body shape and articulation from a single RGB video recorded with a static camera. In contrast to expensive marker-based or…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Diogo Luvizon , Marc Habermann , Vladislav Golyanik , Adam Kortylewski , Christian Theobalt

Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model

In this paper, we propose Scene Splatter, a momentum-based paradigm for video diffusion to generate generic scenes from single image. Existing methods, which employ video generation models to synthesize novel views, suffer from limited…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Shengjun Zhang , Jinzhao Li , Xin Fei , Hao Liu , Yueqi Duan

Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models

We present a new, fast and flexible pipeline for indoor scene synthesis that is based on deep convolutional generative models. Our method operates on a top-down image-based representation, and inserts objects iteratively into the scene by…

Computer Vision and Pattern Recognition · Computer Science 2018-12-03 Daniel Ritchie , Kai Wang , Yu-an Lin

View Synthesis of Dynamic Scenes based on Deep 3D Mask Volume

Image view synthesis has seen great success in reconstructing photorealistic visuals, thanks to deep learning and various novel representations. The next key step in immersive virtual experiences is view synthesis of dynamic scenes.…

Computer Vision and Pattern Recognition · Computer Science 2022-11-29 Kai-En Lin , Guowei Yang , Lei Xiao , Feng Liu , Ravi Ramamoorthi

3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation

We present 3DScenePrompt, a framework that generates the next video chunk from arbitrary-length input while enabling precise camera control and preserving scene consistency. Unlike methods conditioned on a single image or a short clip, we…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 JoungBin Lee , Jaewoo Jung , Jisang Han , Takuya Narihira , Kazumi Fukuda , Junyoung Seo , Sunghwan Hong , Yuki Mitsufuji , Seungryong Kim

Video Scene Parsing with Predictive Feature Learning

In this work, we address the challenging video scene parsing problem by developing effective representation learning methods given limited parsing annotations. In particular, we contribute two novel methods that constitute a unified parsing…

Computer Vision and Pattern Recognition · Computer Science 2016-12-14 Xiaojie Jin , Xin Li , Huaxin Xiao , Xiaohui Shen , Zhe Lin , Jimei Yang , Yunpeng Chen , Jian Dong , Luoqi Liu , Zequn Jie , Jiashi Feng , Shuicheng Yan

Unsupervised Learning of 3D Scene Flow from Monocular Camera

Scene flow represents the motion of points in the 3D space, which is the counterpart of the optical flow that represents the motion of pixels in the 2D image. However, it is difficult to obtain the ground truth of scene flow in the real…

Computer Vision and Pattern Recognition · Computer Science 2022-06-09 Guangming Wang , Xiaoyu Tian , Ruiqi Ding , Hesheng Wang

Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering

Scene parsing has attracted a lot of attention in computer vision. While parametric models have proven effective for this task, they cannot easily incorporate new training data. By contrast, nonparametric approaches, which bypass any…

Computer Vision and Pattern Recognition · Computer Science 2016-03-16 Mohammad Najafi , Sarah Taghavi Namin , Mathieu Salzmann , Lars Petersson

IntelliCap: Intelligent Guidance for Consistent View Sampling

Novel view synthesis from images, for example, with 3D Gaussian splatting, has made great progress. Rendering fidelity and speed are now ready even for demanding virtual reality applications. However, the problem of assisting humans in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Ayaka Yasunaga , Hideo Saito , Dieter Schmalstieg , Shohei Mori

3D Scene Inference from Transient Histograms

Time-resolved image sensors that capture light at pico-to-nanosecond timescales were once limited to niche applications but are now rapidly becoming mainstream in consumer devices. We propose low-cost and low-power imaging modalities that…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Sacha Jungerman , Atul Ingle , Yin Li , Mohit Gupta

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

Developing deep neural networks to generate 3D scenes is a fundamental problem in neural synthesis with immediate applications in architectural CAD, computer graphics, as well as in generating virtual robot training environments. This task…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Haitao Yang , Zaiwei Zhang , Siming Yan , Haibin Huang , Chongyang Ma , Yi Zheng , Chandrajit Bajaj , Qixing Huang

A new Video Synopsis Based Approach Using Stereo Camera

In today's world, the amount of data produced in every field has increased at an unexpected level. In the face of increasing data, the importance of data processing has increased remarkably. Our resource topic is on the processing of video…

Computer Vision and Pattern Recognition · Computer Science 2021-06-24 Talha Dilber , Mehmet Serdar Guzel , Erkan Bostanci

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

With the explosive growth of video data in real-world applications, a comprehensive representation of videos becomes increasingly important. In this paper, we address the problem of video scene recognition, whose goal is to learn a…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xuzheng Yu , Chen Jiang , Wei Zhang , Tian Gan , Linlin Chao , Jianan Zhao , Yuan Cheng , Qingpei Guo , Wei Chu