Related papers: On Coordinate Decoding for Keypoint Estimation Tas…

Distribution-Aware Coordinate Representation for Human Pose Estimation

While being the de facto standard coordinate representation in human pose estimation, heatmap is never systematically investigated in the literature, to our best knowledge. This work fills this gap by studying the coordinate representation…

Computer Vision and Pattern Recognition · Computer Science 2019-10-15 Feng Zhang , Xiatian Zhu , Hanbin Dai , Mao Ye , Ce Zhu

Joint COCO and Mapillary Workshop at ICCV 2019 Keypoint Detection Challenge Track Technical Report: Distribution-Aware Coordinate Representation for Human Pose Estimation

In this paper, we focus on the coordinate representation in human pose estimation. While being the standard choice, heatmap based representation has not been systematically investigated. We found that the process of coordinate decoding…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Hanbin Dai , Liangbo Zhou , Feng Zhang , Zhengyu Zhang , Hong Hu , Xiatian Zhu , Mao Ye

Train Your Data Processor: Distribution-Aware and Error-Compensation Coordinate Decoding for Human Pose Estimation

Recently, the leading performance of human pose estimation is dominated by heatmap based methods. While being a fundamental component of heatmap processing, heatmap decoding (i.e. transforming heatmaps to coordinates) receives only limited…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Feiyu Yang , Zhan Song , Zhenzhong Xiao , Yu Chen , Zhe Pan , Min Zhang , Min Xue , Yaoyang Mo , Yao Zhang , Guoxiong Guan , Beibei Qian

Subpixel Heatmap Regression for Facial Landmark Localization

Deep Learning models based on heatmap regression have revolutionized the task of facial landmark localization with existing models working robustly under large poses, non-uniform illumination and shadows, occlusions and self-occlusions, low…

Computer Vision and Pattern Recognition · Computer Science 2021-11-04 Adrian Bulat , Enrique Sanchez , Georgios Tzimiropoulos

Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features

Detecting 3D keypoints with semantic consistency is widely used in many scenarios such as pose estimation, shape registration and robotics. Currently, most unsupervised 3D keypoint detection methods focus on the rigid-body objects. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Chengkai Hou , Zhengrong Xue , Bingyang Zhou , Jinghan Ke , Lin Shao , Huazhe Xu

Interpreting Encoding and Decoding Models

Encoding and decoding models are widely used in systems, cognitive, and computational neuroscience to make sense of brain-activity data. However, the interpretation of their results requires care. Decoding models can help reveal whether…

Neurons and Cognition · Quantitative Biology 2019-04-29 Nikolaus Kriegeskorte , Pamela K. Douglas

3D Keypoint Estimation Using Implicit Representation Learning

In this paper, we tackle the challenging problem of 3D keypoint estimation of general objects using a novel implicit representation. Previous works have demonstrated promising results for keypoint prediction through direct coordinate…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Xiangyu Zhu , Dong Du , Haibin Huang , Chongyang Ma , Xiaoguang Han

Learning to Predict Robot Keypoints Using Artificially Generated Images

This work considers robot keypoint estimation on color images as a supervised machine learning task. We propose the use of probabilistically created renderings to overcome the lack of labeled real images. Rather than sampling from…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Christoph Heindl , Sebastian Zambal , Josef Scharinger

Interpretable Semantic Photo Geolocation

Planet-scale photo geolocalization is the complex task of estimating the location depicted in an image solely based on its visual content. Due to the success of convolutional neural networks (CNNs), current approaches achieve super-human…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Jonas Theiner , Eric Mueller-Budack , Ralph Ewerth

Accurate Hand Keypoint Localization on Mobile Devices

We present a novel approach for 2D hand keypoint localization from regular color input. The proposed approach relies on an appropriately designed Convolutional Neural Network (CNN) that computes a set of heatmaps, one per hand keypoint of…

Computer Vision and Pattern Recognition · Computer Science 2018-12-20 Filippos Gouidis , Paschalis Panteleris , Iason Oikonomidis , Antonis Argyros

Model Guidance via Explanations Turns Image Classifiers into Segmentation Models

Heatmaps generated on inputs of image classification networks via explainable AI methods like Grad-CAM and LRP have been observed to resemble segmentations of input images in many cases. Consequently, heatmaps have also been leveraged for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Xiaoyan Yu , Jannik Franzen , Wojciech Samek , Marina M. -C. Höhne , Dagmar Kainmueller

Fourier Decomposition for Explicit Representation of 3D Point Cloud Attributes

While 3D point clouds are widely used in vision applications, their irregular and sparse nature make them challenging to handle. In response, numerous encoding approaches have been proposed to capture the rich semantic information of point…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Donghyun Kim , Chanyoung Kim , Hyunah Ko , Seong Jae Hwang

Accurate Grid Keypoint Learning for Efficient Video Prediction

Video prediction methods generally consume substantial computing resources in training and deployment, among which keypoint-based approaches show promising improvement in efficiency by simplifying dense image prediction to light keypoint…

Computer Vision and Pattern Recognition · Computer Science 2021-07-29 Xiaojie Gao , Yueming Jin , Qi Dou , Chi-Wing Fu , Pheng-Ann Heng

Multi-Point Proximity Encoding For Vector-Mode Geospatial Machine Learning

Vector-mode geospatial data -- points, lines, and polygons -- must be encoded into an appropriate form in order to be used with traditional machine learning and artificial intelligence models. Encoding methods attempt to represent a given…

Machine Learning · Computer Science 2025-06-06 John Collins

Towards Keypoint Guided Self-Supervised Depth Estimation

This paper proposes to use keypoints as a self-supervision clue for learning depth map estimation from a collection of input images. As ground truth depth from real images is difficult to obtain, there are many unsupervised and…

Computer Vision and Pattern Recognition · Computer Science 2020-11-09 Kristijan Bartol , David Bojanic , Tomislav Petkovic , Tomislav Pribanic , Yago Diez Donoso

Tensor Network Decoding Beyond 2D

Decoding algorithms based on approximate tensor network contraction have proven tremendously successful in decoding 2D local quantum codes such as surface/toric codes and color codes, effectively achieving optimal decoding accuracy. In this…

Quantum Physics · Physics 2024-10-10 Christophe Piveteau , Christopher T. Chubb , Joseph M. Renes

Numerical Coordinate Regression with Convolutional Neural Networks

We study deep learning approaches to inferring numerical coordinates for points of interest in an input image. Existing convolutional neural network-based solutions to this problem either take a heatmap matching approach or regress to…

Computer Vision and Pattern Recognition · Computer Science 2018-05-07 Aiden Nibali , Zhen He , Stuart Morgan , Luke Prendergast

Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial Predictions

Generating learning-friendly representations for points in a 2D space is a fundamental and long-standing problem in machine learning. Recently, multi-scale encoding schemes (such as Space2Vec) were proposed to directly encode any point in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-26 Gengchen Mai , Yao Xuan , Wenyun Zuo , Krzysztof Janowicz , Ni Lao

Augmenting Depth Estimation with Geospatial Context

Modern cameras are equipped with a wide array of sensors that enable recording the geospatial context of an image. Taking advantage of this, we explore depth estimation under the assumption that the camera is geocalibrated, a problem we…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Scott Workman , Hunter Blanton

Precision Enhancement of 3D Surfaces from Multiple Compressed Depth Maps

In texture-plus-depth representation of a 3D scene, depth maps from different camera viewpoints are typically lossily compressed via the classical transform coding / coefficient quantization paradigm. In this paper we propose to reduce…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Pengfei Wan , Gene Cheung , Philip A. Chou , Dinei Florencio , Cha Zhang , Oscar C. Au