
Related papers: GeoBlocks: A Query-Cache Accelerated Data Structur…

200 papers

Modeling geospatial tabular data with deep learning has become a promising alternative to traditional statistical and machine learning approaches. However, existing deep learning models often face challenges related to scalability and…

Machine Learning · Computer Science 2025-02-24 Rui Deng, Ziqi Li, Mingshu Wang

Accurate forecasting of bus ridership (passenger numbers) is crucial for efficient management and optimization of public transport systems. Traditional forecasting models often fail to capture the unique and localized dynamics of different…

Machine Learning · Computer Science 2026-05-04 Daniel Azenkot, Michael Fire, Eran Ben Elia

Web-based services often run randomized experiments to improve their products. A popular way to run these experiments is to use geographical regions as units of experimentation, since this does not require tracking of individual users or…

Social and Information Networks · Computer Science 2019-02-19 David Rolnick, Kevin Aydin, Jean Pouget-Abadie, Shahab Kamali, Vahab Mirrokni, Amir Najmi

The inherent connectivity and dependency of graph-structured data, combined with its unique topology-driven access patterns, pose fundamental challenges to conventional data replication and request routing strategies in geo-distributed…

Databases · Computer Science 2025-10-22 Feng Yao, Xiaokang Yang, Shufeng Gong, Song Yu, Yanfeng Zhang, Ge Yu

With the sharp increase in the number of vehicles, the issue of parking difficulties has emerged as an urgent challenge that many cities need to address promptly. In the task of predicting large-scale urban parking data, existing research…

Machine Learning · Computer Science 2025-02-24 Yixuan Wang, Zhenwu Chen, Kangshuai Zhang, Yunduan Cui, Yang Yang, Lei Peng

Block diffusion enables efficient parallel refinement in diffusion language models, but its decoding behavior depends critically on block size. Existing block-sizing strategies rely on fixed rules or heuristic signals and do not account for…

Computation and Language · Computer Science 2026-03-31 Lipeng Wan, Junjie Ma, Jianhui Gu, Zeyang Liu, Xuyang Lu, Xuguang Lan

We present GeoRocket, software for the management of very large geospatial datasets in the cloud. GeoRocket employs a novel way to handle arbitrarily large datasets by splitting them into chunks that are processed individually. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-04 Michel Krämer

Data aggregation in Geographic Information Systems (GIS) is only marginally present in commercial systems nowadays, mostly through ad-hoc solutions. In this paper, we first present a formal model for representing spatial data. This model…

Databases · Computer Science 2007-07-31 Leticia Gomez, Sofie Haesevoets, Bart Kuijpers, Alejandro Vaisman

The availability of low cost sensors has led to an unprecedented growth in the volume of spatial data. However, the time required to evaluate even simple spatial queries over large data sets greatly hampers our ability to interactively…

Databases · Computer Science 2020-04-09 Harish Doraiswamy, Juliana Freire

We study the problem of aggregating polygons by covering them with disjoint representative regions, thereby inducing a clustering of the polygons. Our objective is to minimize a weighted sum of the total area and the total perimeter of the…

Cities play a pivotal role in human development and sustainability, yet studying them presents significant challenges due to the vast scale and complexity of spatial-temporal data. One such challenge is the need to uncover universal urban…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-04 Zhenhui Li, Hongwei Zhang, Kan Wu

Current point cloud segmentation architectures suffer from limited long-range feature modeling, as they mostly rely on aggregating information with local neighborhoods. Furthermore, in order to learn point features at multiple scales, most…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Zhening Huang, Xiaoyang Wu, Hengshuang Zhao, Lei Zhu, Shujun Wang, Georgios Hadjidemetriou, Ioannis Brilakis

This study presents a novel small-area estimation framework to enhance urban transportation planning through detailed characterization of travel behavior. Our approach improves on the four-step travel model by employing publicly available…

Machine Learning · Computer Science 2025-10-07 Yangyang Wang, Tayo Fabusuyi

Explainable numerical representations or latent information of otherwise complex datasets are more convenient to analyze and study. These representations assist in identifying clusters and outliers, assessing similar data points, and exploring…

Computers and Society · Computer Science 2023-10-11 Deepank Verma, Olaf Mumm, Vanessa Miriam Carlow

Optimization tasks over relational data, such as clustering, often suffer from the prohibitive cost of join operations, which are necessary to access the full dataset. While geometric data structures like BBD trees yield fast approximation…

Databases · Computer Science 2026-03-13 Aryan Esmailpour, Stavros Sintos

We selected 48 European cities and gathered their public transport timetables in the GTFS format. We utilized Uber's H3 spatial index to divide each city into hexagonal micro-regions. Based on the timetable data, we created certain features…

Machine Learning · Computer Science 2021-11-04 Piotr Gramacki, Szymon Woźniak, Piotr Szymański

Distributed data mining techniques, and distributed clustering in particular, have been widely used in the last decade because they deal with very large and heterogeneous datasets which cannot be gathered centrally. Current distributed clustering…

Databases · Computer Science 2018-02-02 Malika Bendechache, M-Tahar Kechadi

Many industries rely on visual insights to support decision-making processes in their businesses. In mining, the analysis of drills and geological shapes, represented as 3D geometries, is an important tool to assist geologists on the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-30 Lucas C. Villa Real, Bruno Silva

Pandemic measures such as social distancing and contact tracing can be enhanced by rapidly integrating dynamic location data and demographic data. Projecting billions of longitude and latitude locations onto hundreds of thousands of highly…

Airborne magnetic data are commonly used to produce preliminary geological maps. Machine learning has the potential to partly fulfill this task rapidly and objectively, as geological mapping is comparable to a semantic segmentation problem.…
