English
Related papers

Related papers: Measuring Approximate Functional Dependencies: a C…

200 papers

Given a database and a target attribute of interest, how can we tell whether there exists a functional, or approximately functional dependence of the target on any set of other attributes in the data? How can we reliably, without bias to…

Databases · Computer Science 2017-06-20 Panagiotis Mandros , Mario Boley , Jilles Vreeken

Approximate functional dependencies (AFDs) relax exact functional dependencies by tolerating a bounded degree of violation, making them suited for data quality auditing. Threshold-based discovery returns all dependencies above a…

Databases · Computer Science 2026-05-26 Xiaolong Wan , Xixian Han

Functional dependencies (FDs) specify the intended data semantics while violations of FDs indicate deviation from these semantics. In this paper, we study a data cleaning problem in which the FDs may not be completely correct, e.g., due to…

Databases · Computer Science 2012-07-25 George Beskales , Ihab F. Ilyas , Lukasz Golab , Artur Galiullin

The concept of matching dependencies (mds) is recently pro- posed for specifying matching rules for object identification. Similar to the functional dependencies (with conditions), mds can also be applied to various data quality…

Databases · Computer Science 2009-06-13 Shaoxu Song , Lei Chen

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any…

Methodology · Statistics 2011-11-16 Jianqing Fan , Xu Han , Weijie Gu

Order dependencies (ODs) capture relationships between ordered domains of attributes. Approximate ODs (AODs) capture such relationships even when there exist exceptions in the data. During automated discovery of ODs, validation is the…

Databases · Computer Science 2021-01-07 Reza Karegar , Parke Godfrey , Lukasz Golab , Mehdi Kargar , Divesh Srivastava , Jaroslaw Szlichta

In real life, data are often of poor quality as a result, for instance, of uncertainty, mismeasurements, missing values or bad inputs. This issue hampers an implicit yet crucial operation of every database management system: equality…

Logic in Computer Science · Computer Science 2024-04-30 Lhouari Nourine , Jean Marc Petit , Simon Vilmin

A possible world of an incomplete database table is obtained by imputing values from the attributes (infinite) domain to the place of \texttt{NULL} s. A table satisfies a possible key or possible functional dependency constraint if there…

Databases · Computer Science 2024-02-09 Munqath Al-atar , Attila Sali

Differential dependencies (DDs) capture the relationships between data columns of relations. They are more general than functional dependencies (FDs) and and the difference is that DDs are defined on the distances between values of two…

Databases · Computer Science 2013-09-17 Jixue Liu , Selasi Kwashie , Jiuyong Li , Feiyue Ye , Millist Vincent

Functional dependencies -- traditional, approximate and conditional are of critical importance in relational databases, as they inform us about the relationships between attributes. They are useful in schema normalization, data…

Databases · Computer Science 2010-12-14 Sushovan De , Subbarao Kambhampati

Measuring a strength of dependence of random variables is an important problem in statistical practice. In this paper, we propose a new function valued measure of dependence of two random variables. It allows one to study and visualize…

Methodology · Statistics 2014-05-12 Teresa Ledwina

Usually, density functional models are considered approximations to density functional theory, However, there is no systematic connection between the two, and this can make us doubt about a linkage. This attitude can be further enforced by…

Chemical Physics · Physics 2020-11-10 Andreas Savin

Time-dependent density functional theory continues to draw a large number of users in a wide range of fields exploring myriad applications involving electronic spectra and dynamics. Although in principle exact, the predictivity of the…

Chemical Physics · Physics 2023-08-14 Lionel Lacombe , Neepa T. Maitra

We take a different look at the problem of testing the independence of two metric-space-valued random variables using the distance correlation. Instead of testing if the distance correlation vanishes exactly, we are interested in the…

Statistics Theory · Mathematics 2025-11-19 Holger Dette , Marius Kroll

Learning about density functional approximations (DFAs), or approximations for the exchange-correlation functional, can be intimidating. Density Functional Theory is now one of the primary simulation tools for the practicing chemist or…

Materials Science · Physics 2022-11-03 M. K. Horton

Two families of dependence measures between random variables are introduced. They are based on the R\'enyi divergence of order $\alpha$ and the relative $\alpha$-entropy, respectively, and both dependence measures reduce to Shannon's mutual…

Information Theory · Computer Science 2019-08-22 Amos Lapidoth , Christoph Pfister

We study the problem of discovering functional dependencies (FD) from a noisy dataset. We focus on FDs that correspond to statistical dependencies in a dataset and draw connections between FD discovery and structure learning in…

Databases · Computer Science 2019-05-07 Zhihan Guo , Theodoros Rekatsinas

Multiple testing has been a popular topic in statistical research. Although vast works have been done, controlling the false discoveries remains a challenging task when the corresponding test statistics are dependent. Various methods have…

Statistics Theory · Mathematics 2022-07-05 Meng Mei , Tao Yu , Yuan Jiang

An increasing number of generative music models can be conditioned on an audio prompt that serves as musical context for which the model is to create an accompaniment (often further specified using a text prompt). Evaluation of how well…

Sound · Computer Science 2024-12-31 Maarten Grachten

Partial dependence curves (FPD) introduced by Friedman, are an important model interpretation tool, but are often not accessible to business analysts and scientists who typically lack the skills to choose, tune, and assess machine learning…

Machine Learning · Computer Science 2020-04-28 Terence Parr , James D. Wilson
‹ Prev 1 2 3 10 Next ›