Related papers: TorchSpatial: A Location Encoding Framework and Be…

Self-Supervised Representation Learning for Geospatial Objects: A Survey

The proliferation of various data sources in urban and territorial environments has significantly facilitated the development of geospatial artificial intelligence (GeoAI) across a wide range of geospatial applications. However, geospatial…

Artificial Intelligence · Computer Science 2025-04-28 Yile Chen , Weiming Huang , Kaiqi Zhao , Yue Jiang , Gao Cong

Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning

Existing self-supervised learning (SSL) methods primarily learn object-invariant representations but often neglect the spatial structure and relationships among object parts. To address this limitation, we introduce Spatial Prediction (SP),…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Yang Shen , Yusen Cai , Weronika Hryniewska-Guzik , Qing Lin , Mengmi Zhang

Spatially Consistent Representation Learning

Self-supervised learning has been widely used to obtain transferrable representations from unlabeled images. Especially, recent contrastive learning methods have shown impressive performances on downstream image classification tasks. While…

Computer Vision and Pattern Recognition · Computer Science 2021-04-29 Byungseok Roh , Wuhyun Shin , Ildoo Kim , Sungwoong Kim

Geospatial Representation Learning: A Survey from Deep Learning to The LLM Era

The ability to transform location-centric geospatial data into meaningful computational representations has become fundamental to modern spatial analysis and decision-making. Geospatial Representation Learning (GRL), the process of…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Xixuan Hao , Yutian Jiang , Xingchen Zou , Jiabo Liu , Yifang Yin , Song Gao , Flora Salim , Tianrui Li , Yuxuan Liang

A generic self-supervised learning (SSL) framework for representation learning from spectra-spatial feature of unlabeled remote sensing imagery

Remote sensing data has been widely used for various Earth Observation (EO) missions such as land use and cover classification, weather forecasting, agricultural management, and environmental monitoring. Most existing remote sensing…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Xin Zhang , Liangxiu Han

VG-SSL: Benchmarking Self-supervised Representation Learning Approaches for Visual Geo-localization

Visual Geo-localization (VG) is a critical research area for identifying geo-locations from visual inputs, particularly in autonomous navigation for robotics and vehicles. Current VG methods often learn feature extractors from geo-labeled…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Jiuhong Xiao , Gao Zhu , Giuseppe Loianno

TorchGeo: Deep Learning With Geospatial Data

Remotely sensed geospatial data are critical for applications including precision agriculture, urban planning, disaster monitoring and response, and climate change research, among others. Deep learning methods are particularly promising for…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Adam J. Stewart , Caleb Robinson , Isaac A. Corley , Anthony Ortiz , Juan M. Lavista Ferres , Arindam Banerjee

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes or learning to directly generate the…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Rujiao Long , Hangdi Xing , Zhibo Yang , Qi Zheng , Zhi Yu , Cong Yao , Fei Huang

Automated Feature-Topic Pairing: Aligning Semantic and Embedding Spaces in Spatial Representation Learning

Automated characterization of spatial data is a kind of critical geographical intelligence. As an emerging technique for characterization, Spatial Representation Learning (SRL) uses deep neural networks (DNNs) to learn non-linear embedded…

Machine Learning · Computer Science 2021-09-24 Dongjie Wang , Kunpeng Liu , David Mohaisen , Pengyang Wang , Chang-Tien Lu , Yanjie Fu

Learning Where to Learn in Cross-View Self-Supervised Learning

Self-supervised learning (SSL) has made enormous progress and largely narrowed the gap with the supervised ones, where the representation learning is mainly guided by a projection into an embedding space. During the projection, current…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Lang Huang , Shan You , Mingkai Zheng , Fei Wang , Chen Qian , Toshihiko Yamasaki

A Review of Location Encoding for GeoAI: Methods and Applications

A common need for artificial intelligence models in the broader geoscience is to represent and encode various types of spatial data, such as points (e.g., points of interest), polylines (e.g., trajectories), polygons (e.g., administrative…

Machine Learning · Computer Science 2022-03-14 Gengchen Mai , Krzysztof Janowicz , Yingjie Hu , Song Gao , Bo Yan , Rui Zhu , Ling Cai , Ni Lao

SARL: Spatially-Aware Self-Supervised Representation Learning for Visuo-Tactile Perception

Contact-rich robotic manipulation requires representations that encode local geometry. Vision provides global context but lacks direct measurements of properties such as texture and hardness, whereas touch supplies these cues. Modern…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Gurmeher Khurana , Lan Wei , Dandan Zhang

Self-supervised Trajectory Representation Learning with Temporal Regularities and Travel Semantics

Trajectory Representation Learning (TRL) is a powerful tool for spatial-temporal data analysis and management. TRL aims to convert complicated raw trajectories into low-dimensional representation vectors, which can be applied to various…

Machine Learning · Computer Science 2024-03-08 Jiawei Jiang , Dayan Pan , Houxing Ren , Xiaohan Jiang , Chao Li , Jingyuan Wang

Towards Spatial Transcriptomics-driven Pathology Foundation Models

Spatial transcriptomics (ST) provides spatially resolved measurements of gene expression, enabling characterization of the molecular landscape of human tissue beyond histological assessment as well as localized readouts that can be aligned…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Konstantin Hemker , Andrew H. Song , Cristina Almagro-Pérez , Guillaume Jaume , Sophia J. Wagner , Anurag Vaidya , Nikola Simidjievski , Mateja Jamnik , Faisal Mahmood

LORE: Logical Location Regression Network for Table Structure Recognition

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes, or learning to generate the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Hangdi Xing , Feiyu Gao , Rujiao Long , Jiajun Bu , Qi Zheng , Liangcheng Li , Cong Yao , Zhi Yu

Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks

Learning representations of geographical space is vital for any machine learning model that integrates geolocated data, spanning application domains such as remote sensing, ecology, or epidemiology. Recent work embeds coordinates using sine…

Machine Learning · Computer Science 2024-04-16 Marc Rußwurm , Konstantin Klemmer , Esther Rolf , Robin Zbinden , Devis Tuia

Global Context-aware Representation Learning for Spatially Resolved Transcriptomics

Spatially Resolved Transcriptomics (SRT) is a cutting-edge technique that captures the spatial context of cells within tissues, enabling the study of complex biological networks. Recent graph-based methods leverage both gene expression and…

Machine Learning · Computer Science 2025-06-24 Yunhak Oh , Junseok Lee , Yeongmin Kim , Sangwoo Seo , Namkyeong Lee , Chanyoung Park

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Spatial understanding remains a weakness of Large Vision-Language Models (LVLMs). Existing supervised fine-tuning (SFT) and recent reinforcement learning with verifiable rewards (RLVR) pipelines depend on costly supervision, specialized…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Yuhong Liu , Beichen Zhang , Yuhang Zang , Yuhang Cao , Long Xing , Xiaoyi Dong , Haodong Duan , Dahua Lin , Jiaqi Wang

SEAL: Spatio-Textual Similarity Search

Location-based services (LBS) have become more and more ubiquitous recently. Existing methods focus on finding relevant points-of-interest (POIs) based on users' locations and query keywords. Nowadays, modern LBS applications generate a new…

Databases · Computer Science 2012-05-31 Ju Fan , Guoliang Li , Lizhu Zhou , Shanshan Chen , Jun Hu

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Spatio-temporal predictive learning is a learning paradigm that enables models to learn spatial and temporal patterns by predicting future frames from given past frames in an unsupervised manner. Despite remarkable progress in recent years,…

Computer Vision and Pattern Recognition · Computer Science 2023-10-19 Cheng Tan , Siyuan Li , Zhangyang Gao , Wenfei Guan , Zedong Wang , Zicheng Liu , Lirong Wu , Stan Z. Li