Related papers: A Structurally Coherent Spatial Phase Estimate

Introduction To The Monogenic Signal

The monogenic signal is an image analysis methodology that was introduced by Felsberg and Sommer in 2001 and has been employed for a variety of purposes in image processing and computer vision research. In particular, it has been found to…

Computer Vision and Pattern Recognition · Computer Science 2017-03-28 Christopher P. Bridge

The Monogenic Synchrosqueezed Wavelet Transform: A tool for the Decomposition/Demodulation of AM-FM images

The synchrosqueezing method aims at decomposing 1D functions as superpositions of a small number of "Intrinsic Modes", supposed to be well separated both in time and frequency. Based on the unidimensional wavelet transform and its…

Numerical Analysis · Mathematics 2012-11-22 Marianne Clausel, Thomas Oberlin, Valérie Perrier

MOGS: Monocular Object-guided Gaussian Splatting in Large Scenes

Recent advances in 3D Gaussian Splatting (3DGS) deliver striking photorealism, and extending it to large scenes opens new opportunities for semantic reasoning and prediction in applications such as autonomous driving. Today's…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Shengkai Zhang, Yuhe Liu, Jianhua He, Xuedou Xiao, Mozi Chen, Kezhong Liu

MSDS: Deep Structural Similarity with Multiscale Representation

Deep-feature-based perceptual similarity models have demonstrated strong alignment with human visual perception in Image Quality Assessment (IQA). However, most existing approaches operate at a single spatial scale, implicitly assuming that…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Danling Kang, Xue-Hua Chen, Bin Liu, Keke Zhang, Weiling Chen, Tiesong Zhao

Double soft-thresholded model for multi-group scalar on vector-valued image regression

In this paper, we develop a novel spatial variable selection method for scalar on vector-valued image regression in a multi-group setting. Here, 'vector-valued image' refers to the imaging datasets that contain vector-valued information at…

Methodology · Statistics 2024-10-22 Arkaprava Roy, Zhou Lan

Joint Depth Prediction and Semantic Segmentation with Multi-View SAM

Multi-task approaches to joint depth and segmentation prediction are well-studied for monocular images. Yet, predictions from a single view are inherently limited, while multiple views are available in many robotics applications. On the…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Mykhailo Shvets, Dongxu Zhao, Marc Niethammer, Roni Sengupta, Alexander C. Berg

SD-MVS: Segmentation-Driven Deformation Multi-View Stereo with Spherical Refinement and EM optimization

In this paper, we introduce Segmentation-Driven Deformation Multi-View Stereo (SD-MVS), a method that can effectively tackle challenges in 3D reconstruction of textureless areas. We are the first to adopt the Segment Anything Model (SAM) to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Zhenlong Yuan, Jiakai Cao, Zhaoxin Li, Hao Jiang, Zhaoqi Wang

MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion

Autonomous systems, such as self-driving cars, rely on reliable semantic environment perception for decision making. Despite great advances in video semantic segmentation, existing approaches ignore important inductive biases and lack…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Angel Villar-Corrales, Moritz Austermann, Sven Behnke

DepthSSC: Monocular 3D Semantic Scene Completion via Depth-Spatial Alignment and Voxel Adaptation

The task of 3D semantic scene completion using monocular cameras is gaining significant attention in the field of autonomous driving. This task aims to predict the occupancy status and semantic labels of each voxel in a 3D scene from…

Computer Vision and Pattern Recognition · Computer Science 2024-11-27 Jiawei Yao, Jusheng Zhang, Xiaochao Pan, Tong Wu, Canran Xiao

Two-step phase-shifting interferometry for phase-resolved imaging from a spatial light modulator

We demonstrate two-step phase-shifting interferometry (holography) of complex laser modes generated by a spatial light modulator (SLM), in which the amplitude and phase of the signal are determined directly from measurements of…

Optics · Physics 2024-09-16 Lark E. Bradsby, Andrew A. Voitiv, Mark E. Siemens

Spatial coherence control and analysis via micromirror-based mixed-state ptychography

Flexible and fast control of the phase and amplitude of coherent light, enabled by digital micromirror devices (DMDs) and spatial light modulators (SLMs), has been a driving force for recent advances in optical tweezers, nonlinear…

Optics · Physics 2021-07-07 Ruslan Röhrich, Femius Koenderink, Stefan Witte, Lars Loetgering

Recent advances in spatial light modulator-based three-dimensional optical imaging (Invited)

Phase-only spatial light modulators (SLMs) are used in optical systems for several purposes. In this article, the main landmarks of SLM-based imaging systems are surveyed. In addition to conventional two-dimensional imaging, these systems…

Optics · Physics 2026-03-10 Joseph Rosen

Performance Limits for Noisy Multi-Measurement Vector Problems

Compressed sensing (CS) demonstrates that sparse signals can be estimated from under-determined linear systems. Distributed CS (DCS) further reduces the number of measurements by considering joint sparsity within signal ensembles. DCS with…

Information Theory · Computer Science 2017-03-24 Junan Zhu, Dror Baron, Florent Krzakala

Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras

We aim to simultaneously estimate the 3D articulated pose and high fidelity volumetric occupancy of human performance, from multiple viewpoint video (MVV) with as few as two views. We use a multi-channel symmetric 3D convolutional…

Computer Vision and Pattern Recognition · Computer Science 2020-09-08 Andrew Gilbert, Matthew Trumble, Adrian Hilton, John Collomosse

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces

Self-supervised monocular depth estimation (SSMDE) has gained attention in the field of deep learning as it estimates depth without requiring ground truth depth maps. This approach typically uses a photometric consistency loss between a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Wonhyeok Choi, Kyumin Hwang, Minwoo Choi, Kiljoon Han, Wonjoon Choi, Mingyu Shin, Sunghoon Im

MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving

As one of the automotive sensors that have emerged in recent years, 4D millimeter-wave radar has a higher resolution than conventional 3D radar and provides precise elevation measurements. But its point clouds are still sparse and noisy,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-14 Hongsi Liu, Jun Liu, Guangfeng Jiang, Xin Jin

A Unified Framework for Multiscale Modeling using the Mori-Zwanzig Formalism and the Variational Multiscale Method

We describe a paradigm for multiscale modeling that combines the Mori-Zwanzig (MZ) formalism of Statistical Mechanics with the Variational Multiscale (VMS) method. The MZ-VMS approach leverages both VMS scale-separation projectors as well…

Numerical Analysis · Mathematics 2017-12-29 Eric J. Parish, Karthik Duraisamy

Semantic Scene Completion with Multi-Feature Data Balancing Network

Semantic Scene Completion (SSC) is a critical task in computer vision that is utilized in applications such as virtual reality (VR). SSC aims to construct detailed 3D models from partial views by transforming a single 2D image into a 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Mona Alawadh, Mahesan Niranjan, Hansung Kim

Moran's I 2-Stage Lasso: for Models with Spatial Correlation and Endogenous Variables

We propose a novel estimation procedure for models with endogenous variables in the presence of spatial correlation based on Eigenvector Spatial Filtering. The procedure, called Moran's $I$ 2-Stage Lasso (Mi-2SL), uses a two-stage Lasso…

Econometrics · Economics 2024-04-04 Sylvain Barde, Rowan Cherodian, Guy Tchuente

MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions

Deep learning has made significant impacts on multi-view stereo systems. State-of-the-art approaches typically involve building a cost volume, followed by multiple 3D convolution operations to recover the input image's pixel-wise depth.…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Zhenpei Yang, Zhile Ren, Qi Shan, Qixing Huang