English
Related papers

Related papers: Database Alignment with Gaussian Features

200 papers

Database alignment is a variant of the graph alignment problem: Given a pair of anonymized databases containing separate yet correlated features for a set of users, the problem is to identify the correspondence between the features and…

Information Theory · Computer Science 2023-07-06 Osman Emre Dai , Daniel Cullina , Negar Kiyavash

In this work, we propose an efficient two-stage algorithm solving a joint problem of correlation detection and partial alignment recovery between two Gaussian databases. Correlation detection is a hypothesis testing problem; under the null…

Information Theory · Computer Science 2023-05-26 Ran Tamir

We consider the problem of aligning a pair of databases with correlated entries. We introduce a new measure of correlation in a joint distribution that we call cycle mutual information. This measure has operational significance: it…

Information Theory · Computer Science 2018-05-11 Daniel Cullina , Prateek Mittal , Negar Kiyavash

The problem of merging databases arises in many government and commercial applications. Schema matching, a common first step, identifies equivalent fields between databases. We introduce a schema matching framework that builds nonparametric…

Information Retrieval · Computer Science 2015-07-07 Erik M. Ferragut , Jason Laska

An applied problem facing all areas of data science is harmonizing data sources. Joining data from multiple origins with unmapped and only partially overlapping features is a prerequisite to developing and testing robust, generalizable…

We establish thresholds for the feasibility of random multi-graph alignment in two models. In the Gaussian model, we demonstrate an "all-or-nothing" phenomenon: above a critical threshold, exact alignment is achievable with high…

Statistics Theory · Mathematics 2026-05-25 Louis Vassaux , Laurent Massoulié

The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over…

Applications · Statistics 2015-01-19 Abel Rodriguez , Scott C. Schmidler

Protein similarity searches are a routine job for molecular biologists where a query sequence of amino acids needs to be compared and ranked against an ever-growing database of proteins. All available algorithms in this field can be grouped…

Computational Engineering, Finance, and Science · Computer Science 2015-08-27 Akash Nag , Sunil Karforma

In Gaussian graphical models, the likelihood equations must typically be solved iteratively. We investigate two algorithms: A version of iterative proportional scaling which avoids inversion of large matrices, and an algorithm based on…

Computation · Statistics 2023-12-12 Søren Højsgaard , Steffen Lauritzen

A Bayesian filtering algorithm is developed for a class of state-space systems that can be modelled via Gaussian mixtures. In general, the exact solution to this filtering problem involves an exponential growth in the number of mixture…

Machine Learning · Statistics 2023-07-03 Adrian G. Wills , Johannes Hendriks , Christopher Renton , Brett Ninness

This thesis studies the graph alignment problem, the noisy version of the graph isomorphism problem, which aims to find a matching between the nodes of two graphs which preserves most of the edges. Focusing on the planted version where the…

Data Structures and Algorithms · Computer Science 2024-04-22 Luca Ganassali

This paper addresses the problem of registering multiple point sets. Solutions to this problem are often approximated by repeatedly solving for pairwise registration, which results in an uneven treatment of the sets forming a pair: a model…

Computer Vision and Pattern Recognition · Computer Science 2018-10-15 Georgios Evangelidis , Radu Horaud

Alignment algorithms usually rely on simplified models of gaps for computational efficiency. Based on an isomorphism between alignments and physical helix-coil models, we show in statistical mechanics that alignments with realistic laws for…

Genomics · Quantitative Biology 2015-06-26 E. Yeramian , E. Debonneuil

De-anonymizing user identities by matching various forms of user data available on the internet raises privacy concerns. A fundamental understanding of the privacy leakage in such scenarios requires a careful study of conditions under which…

Information Theory · Computer Science 2021-05-21 Serhat Bakirtas , Elza Erkip

This paper studies the problem of recovering a hidden vertex correspondence between two correlated graphs when both edge weights and node features are observed. While most existing work on graph alignment relies primarily on edge…

Statistics Theory · Mathematics 2026-04-07 Dong Huang , Chenyang Tian , Pengkun Yang

Graph alignment aims at finding the vertex correspondence between two correlated graphs, a task that frequently occurs in graph mining applications such as social network analysis. Attributed graph alignment is a variant of graph alignment,…

Data Structures and Algorithms · Computer Science 2024-03-13 Ziao Wang , Ning Zhang , Weina Wang , Lele Wang

Finding vertex-to-vertex correspondences in real-world graphs is a challenging task with applications in a wide variety of domains. Structural matching based on graphs connectivities has attracted considerable attention, while the…

Data Structures and Algorithms · Computer Science 2024-10-01 Raphaël Candelier

In this paper we relate a number of parsing algorithms which have been developed in very different areas of parsing theory, and which include deterministic algorithms, tabular algorithms, and a parallel algorithm. We show that these…

cmp-lg · Computer Science 2008-02-03 Mark-Jan Nederhof

In this paper, we consider large-scale ranking problems where one is given a set of (possibly non-redundant) pairwise comparisons and the underlying ranking explained by those comparisons is desired. We show that stochastic gradient descent…

Optimization and Control · Mathematics 2024-07-04 Benjamin Jarman , Lara Kassab , Deanna Needell , Alexander Sietsema

Data sets are growing in complexity thanks to the increasing facilities we have nowadays to both generate and store data. This poses many challenges to machine learning that are leading to the proposal of new methods and paradigms, in order…

Machine Learning · Computer Science 2018-12-04 Irene Córdoba , Concha Bielza , Pedro Larrañaga
‹ Prev 1 2 3 10 Next ›