Related papers: Faster Relational Algorithms Using Geometric Data …

Joins via Geometric Resolutions: Worst-case and Beyond

We present a simple geometric framework for the relational join. Using this framework, we design an algorithm that achieves the fractional hypertree-width bound, which generalizes classical and recent worst-case algorithmic results on…

Databases · Computer Science 2016-12-26 Mahmoud Abo Khamis, Hung Q. Ngo, Christopher Ré, Atri Rudra

Rk-means: Fast Clustering for Relational Data

Conventional machine learning algorithms cannot be applied until a data matrix is available to process. When the data matrix needs to be obtained from a relational database via a feature extraction query, the computation cost can be…

Machine Learning · Computer Science 2019-10-14 Ryan Curtin, Ben Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

Boosting Relational Deep Learning with Pretrained Tabular Models

Relational databases, organized into tables connected by primary-foreign key relationships, are a common format for organizing data. Making predictions on relational data often involves transforming them into a flat tabular format through…

Databases · Computer Science 2025-04-08 Veronica Lachi, Antonio Longa, Beatrice Bevilacqua, Bruno Lepri, Andrea Passerini, Bruno Ribeiro

Improved Approximation Algorithms for Relational Clustering

Clustering plays a crucial role in computer science, facilitating data analysis and problem-solving across numerous fields. By partitioning large datasets into meaningful groups, clustering reveals hidden structures and relationships within…

Databases · Computer Science 2026-02-19 Aryan Esmailpour, Stavros Sintos

Seeing the Trees for the Forest: Leveraging Tree-Shaped Substructures in Property Graphs

Property graphs often contain tree-shaped substructures, yet they are not captured by existing proposals for graph schemas; likewise, query languages and query engines offer little-to-no native support for managing them systematically. As a…

Databases · Computer Science 2026-03-16 Daniel Aarao Reis Arturi, Christoph Köhnen, George Fletcher, Bettina Kemme, Stefanie Scherzinger

Geometric Relational Embeddings: A Survey

Geometric relational embeddings map relational data as geometric objects that combine vector information suitable for machine learning and structured/relational information for structured/relational reasoning, typically in low dimensions.…

Artificial Intelligence · Computer Science 2023-04-25 Bo Xiong, Mojtaba Nayyeri, Ming Jin, Yunjie He, Michael Cochez, Shirui Pan, Steffen Staab

Making RDBMSs Efficient on Graph Workloads Through Predefined Joins

Joins in native graph database management systems (GDBMSs) are predefined to the system as edges, which are indexed in adjacency list indices and serve as pointers. This contrasts with and can be more performant than value-based joins in…

Databases · Computer Science 2021-08-25 Guodong Jin, Semih Salihoglu

Geometric Relational Embeddings

Relational representation learning transforms relational data into continuous and low-dimensional vector representations. However, vector-based representations fall short in capturing crucial properties of relational data that are complex…

Machine Learning · Computer Science 2024-09-25 Bo Xiong

Geometric Deep Learning for Structure-Based Drug Design: A Survey

Structure-based drug design (SBDD) leverages the three-dimensional geometry of proteins to identify potential drug candidates. Traditional approaches, rooted in physicochemical modeling and domain expertise, are often resource-intensive.…

Quantitative Methods · Quantitative Biology 2024-11-19 Zaixi Zhang, Jiaxian Yan, Yining Huang, Qi Liu, Enhong Chen, Mengdi Wang, Marinka Zitnik

Optimal Joins using Compact Data Structures

Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now count with several algorithms that are optimal in the worst case, and many of them have been implemented and validated in practice.…

Databases · Computer Science 2020-01-10 Gonzalo Navarro, Juan L. Reutter, Javiel Rojas-Ledesma

Revealing the Basis: Ordinal Embedding Through Geometry

Ordinal Embedding places n objects into R^d based on comparisons such as "a is closer to b than c." Current optimization-based approaches suffer from scalability problems and an abundance of low quality local optima. We instead consider a…

Computational Geometry · Computer Science 2018-05-22 Jesse Anderton, Virgil Pavlu, Javed Aslam

Fast tree-based algorithms for DBSCAN for low-dimensional data on GPUs

DBSCAN is a well-known density-based clustering algorithm to discover arbitrary shape clusters. While conceptually simple in serial, the algorithm is challenging to efficiently parallelize on manycore GPU architectures. Common pitfalls,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-30 Andrey Prokopenko, Damien Lebrun-Grandie, Daniel Arndt

Fast Hierarchical Clustering and Other Applications of Dynamic Closest Pairs

We develop data structures for dynamic closest pair problems with arbitrary distance functions, that do not necessarily come from any geometric structure on the objects. Based on a technique previously used by the author for Euclidean…

Data Structures and Algorithms · Computer Science 2010-01-21 David Eppstein

Relational Algorithms for k-means Clustering

This paper gives a k-means approximation algorithm that is efficient in the relational algorithms model. This is an algorithm that operates directly on a relational database without performing a join to convert it to a matrix whose rows…

Data Structures and Algorithms · Computer Science 2021-05-24 Benjamin Moseley, Kirk Pruhs, Alireza Samadian, Yuyan Wang

The Case for Distance-Bounded Spatial Approximations

Spatial approximations have been traditionally used in spatial databases to accelerate the processing of complex geometric operations. However, approximations are typically only used in a first filtering step to determine a set of candidate…

Databases · Computer Science 2021-01-22 Eleni Tzirita Zacharatou, Andreas Kipf, Ibrahim Sabek, Varun Pandey, Harish Doraiswamy, Volker Markl

Relational Boosted Regression Trees

Many tasks use data housed in relational databases to train boosted regression tree models. In this paper, we give a relational adaptation of the greedy algorithm for training boosted regression trees. For the subproblem of calculating the…

Databases · Computer Science 2021-07-28 Sonia Cromp, Alireza Samadian, Kirk Pruhs

Efficient and scalable geometric hashing method for searching protein 3D structures

As the structural databases continue to expand, efficient methods are required to search similar structures of the query structure from the database. There are many previous works about comparing protein 3D structures and scanning the…

Databases · Computer Science 2011-02-16 Gook-Pil Roh, Seung-won Hwang, Byoung-Kee Yi

A method to search for local structural similarities in proteins at atomic resolution is presented. It is demonstrated that a huge amount of structural data can be handled within a reasonable CPU time by using a conventional relational…

Biomolecules · Quantitative Biology 2007-12-28 Akira R. Kinjo, Haruki Nakamura

HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces

Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due to the curse of dimensionality. A flavor of approximation is, therefore, necessary to practically solve the problem of nearest neighbor…

Databases · Computer Science 2018-04-24 Akhil Arora, Sakshi Sinha, Piyush Kumar, Arnab Bhattacharya

Maximum Inner-Product Search using Tree Data-structures

The problem of efficiently finding the best match for a query in a given set with respect to the Euclidean distance or the cosine similarity has been extensively studied in literature. However, a closely related problem of efficiently…

Computational Geometry · Computer Science 2021-06-24 Parikshit Ram, Alexander G. Gray