Related papers: Stable Tuple Embeddings for Dynamic Databases

Selecting Walk Schemes for Database Embedding

Machinery for data analysis often requires a numeric representation of the input. Towards that, a common practice is to embed components of structured data into a high-dimensional vector space. We study the embedding of the tuples of a…

Machine Learning · Computer Science 2024-01-23 Yuval Lev Lubarsky , Jan Tönshoff , Martin Grohe , Benny Kimelfeld

Local Embeddings for Relational Data Integration

Deep learning based techniques have been recently used with promising results for data integration problems. Some methods directly use pre-trained embeddings that were trained on a large corpus such as Wikipedia. However, they may not…

Databases · Computer Science 2020-09-04 Riccardo Cappuzzo , Paolo Papotti , Saravanan Thirumuruganathan

Incorporating Deep Learning Design in Database Queries

Deep learning over relational databases is conventionally realized by translating data into graph representations and applying graph-based neural networks within external frameworks. This round-trip between the database and external machine…

Databases · Computer Science 2026-05-26 Yuval Lev Lubarsky , Dean Light , Boaz Berger , Shunit Agmon , Benny Kimelfeld

Universal Embeddings of Tabular Data

Tabular data in relational databases represents a significant portion of industrial data. Hence, analyzing and interpreting tabular data is of utmost importance. Application tasks on tabular data are manifold and are often not specified…

Machine Learning · Computer Science 2025-07-09 Astrid Franz , Frederik Hoppe , Marianne Michaelis , Udo Göbel

Towards a Flexible Embedding Learning Framework

Representation learning is a fundamental building block for analyzing entities in a database. While the existing embedding learning methods are effective in various data mining problems, their applicability is often limited because these…

Machine Learning · Computer Science 2020-09-24 Chin-Chia Michael Yeh , Dhruv Gelda , Zhongfang Zhuang , Yan Zheng , Liang Gou , Wei Zhang

Towards Universal Tabular Embeddings: A Benchmark Across Data Tasks

Tabular foundation models aim to learn universal representations of tabular data that transfer across tasks and domains, enabling applications such as table retrieval, semantic search and table-based prediction. Despite the growing number…

Machine Learning · Computer Science 2026-04-24 Liane Vogel , Kavitha Srinivas , Niharika D'Souza , Sola Shirai , Oktie Hassanzadeh , Horst Samulowitz

Fast Algorithm for Embedded Order Dependency Validation (Extended Version)

Order Dependencies (ODs) have many applications, such as query optimization, data integration, and data cleaning. Although many works addressed the problem of discovering OD (and its variants), they do not consider datasets with missing…

Databases · Computer Science 2024-01-01 Alejandro Ramos , Takuya Uemura , Daichi Amagata , Ryo Shirai , Takahiro Hara

On Embeddings in Relational Databases

We address the problem of learning a distributed representation of entities in a relational database using a low-dimensional embedding. Low-dimensional embeddings aim to encapsulate a concise vector representation for an underlying dataset…

Databases · Computer Science 2020-05-14 Siddhant Arora , Srikanta Bedathur

Symbolic Querying of Vector Spaces: Probabilistic Databases Meets Relational Embeddings

We propose unifying techniques from probabilistic databases and relational embedding models with the goal of performing complex queries on incomplete and uncertain data. We formalize a probabilistic database model with respect to which all…

Artificial Intelligence · Computer Science 2020-06-30 Tal Friedman , Guy Van den Broeck

A Survey on Embedding Dynamic Graphs

Embedding static graphs in low-dimensional vector spaces plays a key role in network analytics and inference, supporting applications like node classification, link prediction, and graph visualization. However, many real-world networks…

Machine Learning · Computer Science 2021-07-23 Claudio D. T. Barros , Matheus R. F. Mendonça , Alex B. Vieira , Artur Ziviani

Deep Feature Embedding for Tabular Data

Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper…

Machine Learning · Computer Science 2024-09-02 Yuqian Wu , Hengyi Luo , Raymond S. T. Lee

Word Embeddings: Stability and Semantic Change

Word embeddings are computed by a class of techniques within natural language processing (NLP), that create continuous vector representations of words in a language from a large text corpus. The stochastic nature of the training process of…

Computation and Language · Computer Science 2020-08-03 Lucas Rettenmeier

Dynamic Planar Embedding is in DynFO

Planar Embedding is a drawing of a graph on the plane such that the edges do not intersect each other except at the vertices. We know that testing the planarity of a graph and computing its embedding (if it exists), can efficiently be…

Data Structures and Algorithms · Computer Science 2023-07-19 Samir Datta , Asif Khan , Anish Mukherjee

ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

Node embedding refers to techniques that generate low-dimensional vector representations of nodes in a graph while preserving specific properties of the nodes. A key challenge in the field is developing scalable methods that can preserve…

Machine Learning · Computer Science 2025-04-03 Mario Heidrich , Jeffrey Heidemann , Rüdiger Buchkremer , Gonzalo Wandosell Fernández de Bobadilla

Fixed-Length Protein Embeddings using Contextual Lenses

The Basic Local Alignment Search Tool (BLAST) is currently the most popular method for searching databases of biological sequences. BLAST compares sequences via similarity defined by a weighted edit distance, which results in it being…

Biomolecules · Quantitative Biology 2020-10-29 Amir Shanehsazzadeh , David Belanger , David Dohan

A Simple and Powerful Framework for Stable Dynamic Network Embedding

In this paper, we address the problem of dynamic network embedding, that is, representing the nodes of a dynamic network as evolving vectors within a low-dimensional space. While the field of static network embedding is wide and…

Social and Information Networks · Computer Science 2023-11-17 Ed Davis , Ian Gallagher , Daniel John Lawson , Patrick Rubin-Delanchy

ML Based Lineage in Databases

We track the lineage of tuples throughout their database lifetime. That is, we consider a scenario in which tuples (records) that are produced by a query may affect other tuple insertions into the DB, as part of a normal workflow. As time…

Databases · Computer Science 2021-10-05 Michael Leybovich , Oded Shmueli

On the Surprising Behaviour of node2vec

Graph embedding techniques are a staple of modern graph learning research. When using embeddings for downstream tasks such as classification, information about their stability and robustness, i.e., their susceptibility to sources of noise,…

Machine Learning · Computer Science 2022-08-22 Celia Hacker , Bastian Rieck

RETRO: Relation Retrofitting For In-Database Machine Learning on Textual Data

There are massive amounts of textual data residing in databases, valuable for many machine learning (ML) tasks. Since ML techniques depend on numerical input representations, word embeddings are increasingly utilized to convert symbolic…

Databases · Computer Science 2020-01-23 Michael Günther , Maik Thiele , Wolfgang Lehner

Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings

Research on adversarial robustness is primarily focused on image and text data. Yet, many scenarios in which lack of robustness can result in serious risks, such as fraud detection, medical diagnosis, or recommender systems often do not…

Machine Learning · Computer Science 2023-12-14 Klim Kireev , Maksym Andriushchenko , Carmela Troncoso , Nicolas Flammarion