Related papers: Learning Feature Descriptors using Camera Pose Sup…

SCENES: Subpixel Correspondence Estimation With Epipolar Supervision

Extracting point correspondences from two or more views of a scene is a fundamental computer vision problem with particular importance for relative camera pose estimation and structure-from-motion. Existing local feature matching…

Computer Vision and Pattern Recognition · Computer Science 2024-01-22 Dominik A. Kloepfer , João F. Henriques , Dylan Campbell

Digging Into Self-Supervised Learning of Feature Descriptors

Fully-supervised CNN-based approaches for learning local image descriptors have shown remarkable results in a wide range of geometric tasks. However, most of them require per-pixel ground-truth keypoint correspondence data which is…

Computer Vision and Pattern Recognition · Computer Science 2021-10-12 Iaroslav Melekhov , Zakaria Laskar , Xiaotian Li , Shuzhe Wang , Juho Kannala

Deep Patch Learning for Weakly Supervised Object Classification and Discovery

Patch-level image representation is very important for object classification and detection, since it is robust to spatial transformation, scale variation, and cluttered background. Many existing methods usually require fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2017-05-09 Peng Tang , Xinggang Wang , Zilong Huang , Xiang Bai , Wenyu Liu

Self-Supervised Feature Learning for Long-Term Metric Visual Localization

Visual localization is the task of estimating camera pose in a known scene, which is an essential problem in robotics and computer vision. However, long-term visual localization is still a challenge due to the environmental appearance…

Robotics · Computer Science 2022-12-02 Yuxuan Chen , Timothy D. Barfoot

Pose-Aware Weakly-Supervised Action Segmentation

Understanding human behavior is an important problem in the pursuit of visual intelligence. A challenge in this endeavor is the extensive and costly effort required to accurately label action segments. To address this issue, we consider…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Seth Z. Zhao , Reza Ghoddoosian , Isht Dwivedi , Nakul Agarwal , Behzad Dariush

Learning Descriptors for Object Recognition and 3D Pose Estimation

Detecting poorly textured objects and estimating their 3D pose reliably is still a very challenging problem. We introduce a simple but powerful approach to computing descriptors for object views that efficiently capture both the object…

Computer Vision and Pattern Recognition · Computer Science 2017-11-15 Paul Wohlhart , Vincent Lepetit

Decoupling Makes Weakly Supervised Local Feature Better

Weakly supervised learning can help local feature methods to overcome the obstacle of acquiring a large-scale dataset with densely labeled correspondences. However, since weak supervision cannot distinguish the losses caused by the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Kunhong Li , Longguang Wang , Li Liu , Qing Ran , Kai Xu , Yulan Guo

Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task

We address a core problem of computer vision: Detection and description of 2D feature points for image matching. For a long time, hand-crafted designs, like the seminal SIFT algorithm, were unsurpassed in accuracy and efficiency. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Aritra Bhowmik , Stefan Gumhold , Carsten Rother , Eric Brachmann

Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence

Visual correspondence is a crucial step in key computer vision tasks, including camera localization, image registration, and structure from motion. The most effective techniques for matching keypoints currently involve using learned sparse…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Felipe Cadar , Guilherme Potje , Renato Martins , Cédric Demonceaux , Erickson R. Nascimento

Learning to Guide Local Feature Matches

We tackle the problem of finding accurate and robust keypoint correspondences between images. We propose a learning-based approach to guide local feature matches via a learned approximate image matching. Our approach can boost the results…

Computer Vision and Pattern Recognition · Computer Science 2021-05-03 François Darmon , Mathieu Aubry , Pascal Monasse

Residual Learning for Image Point Descriptors

Local image feature descriptors have had a tremendous impact on the development and application of computer vision methods. It is therefore unsurprising that significant efforts are being made for learning-based image point descriptors.…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Rashik Shrestha , Ajad Chhatkuli , Menelaos Kanakis , Luc Van Gool

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations

To improve the generalization of 3D human pose estimators, many existing deep learning based models focus on adding different augmentations to training poses. However, data augmentation techniques are limited to the "seen" pose combinations…

Computer Vision and Pattern Recognition · Computer Science 2023-01-10 Cheng-Yen Yang , Jiajia Luo , Lu Xia , Yuyin Sun , Nan Qiao , Ke Zhang , Zhongyu Jiang , Jenq-Neng Hwang

Weakly-Supervised Learning of Dense Functional Correspondences

Establishing dense correspondences across image pairs is essential for tasks such as shape reconstruction and robot manipulation. In the challenging setting of matching across different categories, the function of an object, i.e., the…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Stefan Stojanov , Linan Zhao , Yunzhi Zhang , Daniel L. K. Yamins , Jiajun Wu

Semi- and Weakly-supervised Human Pose Estimation

For human pose estimation in still images, this paper proposes three semi- and weakly-supervised learning schemes. While recent advances of convolutional neural networks improve human pose estimation using supervised training data, our…

Computer Vision and Pattern Recognition · Computer Science 2019-06-05 Norimichi Ukita , Yusuke Uematsu

Learning Human Pose Estimation Features with Convolutional Networks

This paper introduces a new architecture for human pose estimation using a multi- layer convolutional network architecture and a modified learning technique that learns low-level features and higher-level weak spatial models. Unconstrained…

Computer Vision and Pattern Recognition · Computer Science 2014-04-24 Arjun Jain , Jonathan Tompson , Mykhaylo Andriluka , Graham W. Taylor , Christoph Bregler

When Regression Meets Manifold Learning for Object Recognition and Pose Estimation

In this work, we propose a method for object recognition and pose estimation from depth images using convolutional neural networks. Previous methods addressing this problem rely on manifold learning to learn low dimensional viewpoint…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Mai Bui , Sergey Zakharov , Shadi Albarqouni , Slobodan Ilic , Nassir Navab

Contrastive Learning for Weakly Supervised Phrase Grounding

Phrase grounding, the problem of associating image regions to caption words, is a crucial component of vision-language tasks. We show that phrase grounding can be learned by optimizing word-region attention to maximize a lower bound on…

Computer Vision and Pattern Recognition · Computer Science 2020-08-07 Tanmay Gupta , Arash Vahdat , Gal Chechik , Xiaodong Yang , Jan Kautz , Derek Hoiem

GoodPoint: unsupervised learning of keypoint detection and description

This paper introduces a new algorithm for unsupervised learning of keypoint detectors and descriptors, which demonstrates fast convergence and good performance across different datasets. The training procedure uses homographic…

Computer Vision and Pattern Recognition · Computer Science 2020-06-02 Anatoly Belikov , Alexey Potapov

DeepCap: Monocular Human Performance Capture Using Weak Supervision

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality. Many previous performance capture approaches either required expensive multi-view setups or…

Computer Vision and Pattern Recognition · Computer Science 2020-03-19 Marc Habermann , Weipeng Xu , Michael Zollhoefer , Gerard Pons-Moll , Christian Theobalt

Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation

Weakly supervised phrase grounding aims at learning region-phrase correspondences using only image-sentence pairs. A major challenge thus lies in the missing links between image regions and sentence phrases during training. To address this…

Computer Vision and Pattern Recognition · Computer Science 2021-04-27 Liwei Wang , Jing Huang , Yin Li , Kun Xu , Zhengyuan Yang , Dong Yu