English
Related papers

Related papers: Stable Tuple Embeddings for Dynamic Databases

200 papers

Machinery for data analysis often requires a numeric representation of the input. Towards that, a common practice is to embed components of structured data into a high-dimensional vector space. We study the embedding of the tuples of a…

Machine Learning · Computer Science 2024-01-23 Yuval Lev Lubarsky , Jan Tönshoff , Martin Grohe , Benny Kimelfeld

Deep learning based techniques have been recently used with promising results for data integration problems. Some methods directly use pre-trained embeddings that were trained on a large corpus such as Wikipedia. However, they may not…

Databases · Computer Science 2020-09-04 Riccardo Cappuzzo , Paolo Papotti , Saravanan Thirumuruganathan

Deep learning over relational databases is conventionally realized by translating data into graph representations and applying graph-based neural networks within external frameworks. This round-trip between the database and external machine…

Databases · Computer Science 2026-05-26 Yuval Lev Lubarsky , Dean Light , Boaz Berger , Shunit Agmon , Benny Kimelfeld

Tabular data in relational databases represents a significant portion of industrial data. Hence, analyzing and interpreting tabular data is of utmost importance. Application tasks on tabular data are manifold and are often not specified…

Machine Learning · Computer Science 2025-07-09 Astrid Franz , Frederik Hoppe , Marianne Michaelis , Udo Göbel

Representation learning is a fundamental building block for analyzing entities in a database. While the existing embedding learning methods are effective in various data mining problems, their applicability is often limited because these…

Machine Learning · Computer Science 2020-09-24 Chin-Chia Michael Yeh , Dhruv Gelda , Zhongfang Zhuang , Yan Zheng , Liang Gou , Wei Zhang

Tabular foundation models aim to learn universal representations of tabular data that transfer across tasks and domains, enabling applications such as table retrieval, semantic search and table-based prediction. Despite the growing number…

Machine Learning · Computer Science 2026-04-24 Liane Vogel , Kavitha Srinivas , Niharika D'Souza , Sola Shirai , Oktie Hassanzadeh , Horst Samulowitz

Order Dependencies (ODs) have many applications, such as query optimization, data integration, and data cleaning. Although many works addressed the problem of discovering OD (and its variants), they do not consider datasets with missing…

Databases · Computer Science 2024-01-01 Alejandro Ramos , Takuya Uemura , Daichi Amagata , Ryo Shirai , Takahiro Hara

We address the problem of learning a distributed representation of entities in a relational database using a low-dimensional embedding. Low-dimensional embeddings aim to encapsulate a concise vector representation for an underlying dataset…

Databases · Computer Science 2020-05-14 Siddhant Arora , Srikanta Bedathur

We propose unifying techniques from probabilistic databases and relational embedding models with the goal of performing complex queries on incomplete and uncertain data. We formalize a probabilistic database model with respect to which all…

Artificial Intelligence · Computer Science 2020-06-30 Tal Friedman , Guy Van den Broeck

Embedding static graphs in low-dimensional vector spaces plays a key role in network analytics and inference, supporting applications like node classification, link prediction, and graph visualization. However, many real-world networks…

Machine Learning · Computer Science 2021-07-23 Claudio D. T. Barros , Matheus R. F. Mendonça , Alex B. Vieira , Artur Ziviani

Tabular data learning has extensive applications in deep learning but its existing embedding techniques are limited in numerical and categorical features such as the inability to capture complex relationships and engineering. This paper…

Machine Learning · Computer Science 2024-09-02 Yuqian Wu , Hengyi Luo , Raymond S. T. Lee

Word embeddings are computed by a class of techniques within natural language processing (NLP), that create continuous vector representations of words in a language from a large text corpus. The stochastic nature of the training process of…

Computation and Language · Computer Science 2020-08-03 Lucas Rettenmeier

Planar Embedding is a drawing of a graph on the plane such that the edges do not intersect each other except at the vertices. We know that testing the planarity of a graph and computing its embedding (if it exists), can efficiently be…

Data Structures and Algorithms · Computer Science 2023-07-19 Samir Datta , Asif Khan , Anish Mukherjee

Node embedding refers to techniques that generate low-dimensional vector representations of nodes in a graph while preserving specific properties of the nodes. A key challenge in the field is developing scalable methods that can preserve…

The Basic Local Alignment Search Tool (BLAST) is currently the most popular method for searching databases of biological sequences. BLAST compares sequences via similarity defined by a weighted edit distance, which results in it being…

Biomolecules · Quantitative Biology 2020-10-29 Amir Shanehsazzadeh , David Belanger , David Dohan

In this paper, we address the problem of dynamic network embedding, that is, representing the nodes of a dynamic network as evolving vectors within a low-dimensional space. While the field of static network embedding is wide and…

Social and Information Networks · Computer Science 2023-11-17 Ed Davis , Ian Gallagher , Daniel John Lawson , Patrick Rubin-Delanchy

We track the lineage of tuples throughout their database lifetime. That is, we consider a scenario in which tuples (records) that are produced by a query may affect other tuple insertions into the DB, as part of a normal workflow. As time…

Databases · Computer Science 2021-10-05 Michael Leybovich , Oded Shmueli

Graph embedding techniques are a staple of modern graph learning research. When using embeddings for downstream tasks such as classification, information about their stability and robustness, i.e., their susceptibility to sources of noise,…

Machine Learning · Computer Science 2022-08-22 Celia Hacker , Bastian Rieck

There are massive amounts of textual data residing in databases, valuable for many machine learning (ML) tasks. Since ML techniques depend on numerical input representations, word embeddings are increasingly utilized to convert symbolic…

Databases · Computer Science 2020-01-23 Michael Günther , Maik Thiele , Wolfgang Lehner

Research on adversarial robustness is primarily focused on image and text data. Yet, many scenarios in which lack of robustness can result in serious risks, such as fraud detection, medical diagnosis, or recommender systems often do not…

Machine Learning · Computer Science 2023-12-14 Klim Kireev , Maksym Andriushchenko , Carmela Troncoso , Nicolas Flammarion
‹ Prev 1 2 3 10 Next ›