Related papers: Self-supervised Geometric Perception

GPA-VGGT:Adapting VGGT to Large Scale Localization by Self-Supervised Learning with Geometry and Physics Aware Loss

Transformer-based general visual geometry frameworks have shown promising performance in camera pose estimation and 3D scene understanding. Recent advancements in Visual Geometry Grounded Transformer (VGGT) models have shown great promise…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Yangfan Xu , Lilian Zhang , Xiaofeng He , Pengdong Wu , Wenqi Wu , Jun Mao

Self-Supervised Image Representation Learning with Geometric Set Consistency

We propose a method for self-supervised image representation learning under the guidance of 3D geometric consistency. Our intuition is that 3D geometric consistency priors such as smooth regions and surface discontinuities may imply…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Nenglun Chen , Lei Chu , Hao Pan , Yan Lu , Wenping Wang

Formula-Supervised Visual-Geometric Pre-training

Throughout the history of computer vision, while research has explored the integration of images (visual) and point clouds (geometric), many advancements in image and 3D object recognition have tended to process these modalities separately.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Ryosuke Yamada , Kensho Hara , Hirokatsu Kataoka , Koshi Makihara , Nakamasa Inoue , Rio Yokota , Yutaka Satoh

Self-Supervised Feature Learning for Long-Term Metric Visual Localization

Visual localization is the task of estimating camera pose in a known scene, which is an essential problem in robotics and computer vision. However, long-term visual localization is still a challenge due to the environmental appearance…

Robotics · Computer Science 2022-12-02 Yuxuan Chen , Timothy D. Barfoot

Self-supervised Learning of Geometrically Stable Features Through Probabilistic Introspection

Self-supervision can dramatically cut back the amount of manually-labelled data required to train deep neural networks. While self-supervision has usually been considered for tasks such as image classification, in this paper we aim at…

Computer Vision and Pattern Recognition · Computer Science 2018-04-06 David Novotny , Samuel Albanie , Diane Larlus , Andrea Vedaldi

Self-supervised Learning of Hybrid Part-aware 3D Representations of 2D Gaussians and Superquadrics

Low-level 3D representations, such as point clouds, meshes, NeRFs and 3D Gaussians, are commonly used for modeling 3D objects and scenes. However, cognitive studies indicate that human perception operates at higher levels and interprets 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Zhirui Gao , Renjiao Yi , Yuhang Huang , Wei Chen , Chenyang Zhu , Kai Xu

Learning a Geometric Representation for Data-Efficient Depth Estimation via Gradient Field and Contrastive Loss

Estimating a depth map from a single RGB image has been investigated widely for localization, mapping, and 3-dimensional object detection. Recent studies on a single-view depth estimation are mostly based on deep Convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-18 Dongseok Shim , H. Jin Kim

Situation Graph Prediction: Structured Perspective Inference for User Modeling

Perspective-Aware AI requires modeling evolving internal states--goals, emotions, contexts--not merely preferences. Progress is limited by a data bottleneck: digital footprints are privacy-sensitive and perspective states are rarely…

Artificial Intelligence · Computer Science 2026-02-17 Jisung Shin , Daniel Platnick , Marjan Alirezaie , Hossein Rahnama

Self-Supervised Learning for Place Representation Generalization across Appearance Changes

Visual place recognition is a key to unlocking spatial navigation for animals, humans and robots. While state-of-the-art approaches are trained in a supervised manner and therefore hardly capture the information needed for generalizing to…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Mohamed Adel Musallam , Vincent Gaudillière , Djamila Aouada

Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting

Learning-based image matching critically depends on large-scale, diverse, and geometrically accurate training data. 3D Gaussian Splatting (3DGS) enables photorealistic novel-view synthesis and thus is attractive for data generation.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Juncheng Chen , Chao Xu , Yanjun Cao

Semantically-Guided Representation Learning for Self-Supervised Monocular Depth

Self-supervised learning is showing great promise for monocular depth estimation, using geometry as the only source of supervision. Depth networks are indeed capable of learning representations that relate visual appearance to 3D properties…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Vitor Guizilini , Rui Hou , Jie Li , Rares Ambrus , Adrien Gaidon

PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

The core of self-supervised point cloud learning lies in setting up appropriate pretext tasks, to construct a pre-training framework that enables the encoder to perceive 3D objects effectively. In this paper, we integrate two prevalent…

Computer Vision and Pattern Recognition · Computer Science 2025-04-07 Yun Liu , Peng Li , Xuefeng Yan , Liangliang Nan , Bing Wang , Honghua Chen , Lina Gong , Wei Zhao , Mingqiang Wei

SIGNet: Semantic Instance Aided Unsupervised 3D Geometry Perception

Unsupervised learning for geometric perception (depth, optical flow, etc.) is of great interest to autonomous systems. Recent works on unsupervised learning have made considerable progress on perceiving geometry; however, they usually…

Computer Vision and Pattern Recognition · Computer Science 2019-04-08 Yue Meng , Yongxi Lu , Aman Raj , Samuel Sunarjo , Rui Guo , Tara Javidi , Gaurav Bansal , Dinesh Bharadia

SuperGlue: Learning Feature Matching with Graph Neural Networks

This paper introduces SuperGlue, a neural network that matches two sets of local features by jointly finding correspondences and rejecting non-matchable points. Assignments are estimated by solving a differentiable optimal transport…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Paul-Edouard Sarlin , Daniel DeTone , Tomasz Malisiewicz , Andrew Rabinovich

Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud Registration

Point cloud registration, a fundamental task in 3D vision, has achieved remarkable success with learning-based methods in outdoor environments. Unsupervised outdoor point cloud registration methods have recently emerged to circumvent the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-25 Kezheng Xiong , Haoen Xiang , Qingshan Xu , Chenglu Wen , Siqi Shen , Jonathan Li , Cheng Wang

$S^3$Net: Semantic-Aware Self-supervised Depth Estimation with Monocular Videos and Synthetic Data

Solving depth estimation with monocular cameras enables the possibility of widespread use of cameras as low-cost depth estimation sensors in applications such as autonomous driving and robotics. However, learning such a scalable depth…

Computer Vision and Pattern Recognition · Computer Science 2020-07-30 Bin Cheng , Inderjot Singh Saggu , Raunak Shah , Gaurav Bansal , Dinesh Bharadia

GeoSurDepth: Harnessing Foundation Model for Spatial Geometry Consistency-Oriented Self-Supervised Surround-View Depth Estimation

Accurate surround-view depth estimation provides a competitive alternative to laser-based sensors and is essential for 3D scene understanding in autonomous driving. While empirical studies have proposed various approaches that primarily…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Weimin Liu , Wenjun Wang , Joshua H. Meng

Self-Supervised Depth Completion Guided by 3D Perception and Geometry Consistency

Depth completion, aiming to predict dense depth maps from sparse depth measurements, plays a crucial role in many computer vision related applications. Deep learning approaches have demonstrated overwhelming success in this task. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Yu Cai , Tianyu Shen , Shi-Sheng Huang , Hua Huang

Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences

The success of supervised learning requires large-scale ground truth labels which are very expensive, time-consuming, or may need special skills to annotate. To address this issue, many self- or un-supervised methods are developed. Unlike…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Longlong Jing , Yucheng Chen , Ling Zhang , Mingyi He , Yingli Tian

SuPerPM: A Surgical Perception Framework Based on Deep Point Matching Learned from Physical Constrained Simulation Data

A major source of endoscopic tissue tracking errors during deformations stems from wrong data association between observed sensor measurements with previously tracked scene. To mitigate this issue, we present a surgical perception…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Shan Lin , Albert J. Miao , Ali Alabiad , Fei Liu , Kaiyuan Wang , Jingpei Lu , Florian Richter , Michael C. Yip