Related papers: Information Distance: New Developments

Information Distance in Multiples

Information distance is a parameter-free similarity measure based on compression, used in pattern recognition, data mining, phylogeny, clustering, and classification. The notion of information distance is extended from pairs to multiples…

Computer Vision and Pattern Recognition · Computer Science 2009-05-21 Paul M. B. Vitanyi

The Basics of Information Geometry

To what extent can we distinguish one probability distribution from another? Are there quantitative measures of distinguishability? The goal of this tutorial is to approach such questions by introducing the notion of the "distance" between…

Data Analysis, Statistics and Probability · Physics 2015-06-23 Ariel Caticha

Normalized information-based divergences

This paper is devoted to the mathematical study of some divergences based on the mutual information well-suited to categorical random vectors. These divergences are generalizations of the "entropy distance" and "information distance". Their…

Statistics Theory · Mathematics 2016-08-16 Jean-François Coeurjolly , Rémy Drouilhet , Jean-François Robineau

Network Distance Based on Laplacian Flows on Graphs

Distance plays a fundamental role in measuring similarity between objects. Various visualization techniques and learning tasks in statistics and machine learning such as shape matching, classification, dimension reduction and clustering…

Machine Learning · Statistics 2025-04-23 Dianbin Bao , Kisung You , Lizhen Lin

Proximity Measure of Information Object Features for Solving the Problem of Their Identification in Information Systems

The paper considers a new quantitative-qualitative proximity measure for the features of information objects, where data enters a common information resource from several sources independently. The goal is to determine the possibility of…

Artificial Intelligence · Computer Science 2026-04-08 Volodymyr Yuzefovych

Networks and Cities: An Information Perspective

Traffic is constrained by the information involved in locating the receiver and the physical distance between sender and receiver. We here focus on the former, and investigate traffic in the perspective of information handling. We re-plot…

Disordered Systems and Neural Networks · Physics 2007-05-23 M. Rosvall , A. Trusina , P. Minnhagen , K. Sneppen

A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning

Object perception is a fundamental sub-field of Computer Vision, covering a multitude of individual areas and having contributed high-impact results. While Machine Learning has been traditionally applied to address related problems, recent…

Computer Vision and Pattern Recognition · Computer Science 2020-03-18 Filippos Gouidis , Alexandros Vassiliades , Theodore Patkos , Antonis Argyros , Nick Bassiliades , Dimitris Plexousakis

Normalized Information Distance

The normalized information distance is a universal distance measure for objects of all kinds. It is based on Kolmogorov complexity and thus uncomputable, but there are ways to utilize it. First, compression algorithms can be used to…

Information Retrieval · Computer Science 2008-09-16 Paul M. B. Vitanyi , Frank J. Balbach , Rudi L. Cilibrasi , Ming Li

Distances between Data Sets Based on Summary Statistics

The concepts of similarity and distance are crucial in data mining. We consider the problem of defining the distance between two data sets by comparing summary statistics computed from the data sets. The initial definition of our distance…

Data Structures and Algorithms · Computer Science 2019-02-05 Nikolaj Tatti

The Extended Edit Distance Metric

Similarity search is an important problem in information retrieval. This similarity is based on a distance. Symbolic representation of time series has attracted many researchers recently, since it reduces the dimensionality of these high…

Information Retrieval · Computer Science 2010-06-18 Muhammad Marwan Muhammad Fuad , Pierre-François Marteau

Information Distance

While Kolmogorov complexity is the accepted absolute measure of information content in an individual finite object, a similarly absolute notion is needed for the information distance between two individual objects, for example, two…

Information Theory · Computer Science 2010-06-18 Charles H. Bennett , Peter Gacs , Ming Li , Paul M. B. Vitanyi , Wojciech H. Zurek

Generalization of distance to higher dimensional objects

The measurement of distance between two objects is generalized to the case where the objects are no longer points but are one-dimensional. Additional concepts such as non-extensibility, curvature constraints, and non-crossing become central…

Soft Condensed Matter · Physics 2008-03-04 Steven S. Plotkin

Modeling pattern formation in communities by using information particles

Understanding the pattern formation in communities has been at the center of attention in various fields. Here we introduce a novel model, called an "information-particle model," which is based on the reaction-diffusion model and the…

Physics and Society · Physics 2023-07-21 Junichi Miyakoshi

Ranking the information content of distance measures

Real-world data typically contain a large number of features that are often heterogeneous in nature, relevance, and also units of measure. When assessing the similarity between data points, one can build various distance measures using…

Machine Learning · Statistics 2022-05-27 Aldo Glielmo , Claudio Zeni , Bingqing Cheng , Gabor Csanyi , Alessandro Laio

An Information-Geometric Distance on the Space of Tasks

This paper prescribes a distance between learning tasks modeled as joint distributions on data and labels. Using tools in information geometry, the distance is defined to be the length of the shortest weight trajectory on a Riemannian…

Machine Learning · Computer Science 2024-05-07 Yansong Gao , Pratik Chaudhari

Metric Statistics: Exploration and Inference for Random Objects With Distance Profiles

This article provides an overview on the statistical modeling of complex data as increasingly encountered in modern data analysis. It is argued that such data can often be described as elements of a metric space that satisfies certain…

Methodology · Statistics 2024-02-28 Paromita Dubey , Yaqing Chen , Hans-Georg Müller

Information Carriers and Identification of Information Objects: An Ontological Approach

Even though library and archival practice, as well as Digital Preservation, have a long tradition in identifying information objects, the question of their precise identity under change of carrier or migration is still a riddle to science.…

Digital Libraries · Computer Science 2012-12-13 Martin Doerr , Yannis Tzitzikas

We survey the emerging area of compression-based, parameter-free, similarity distance measures useful in data-mining, pattern recognition, learning and automatic semantics extraction. Given a family of distances on a set of objects, a…

Computer Vision and Pattern Recognition · Computer Science 2007-05-23 Rudi Cilibrasi , Paul Vitanyi

Predictive Information

Observations on the past provide some hints about what will happen in the future, and this can be quantified using information theory. The ``predictive information'' defined in this way has connections to measures of complexity that have…

Statistical Mechanics · Physics 2007-05-23 William Bialek , Naftali Tishby

Detecting Visual Relationships with Deep Relational Networks

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task.…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Bo Dai , Yuqi Zhang , Dahua Lin