Related papers: A Biologically Inspired Classifier

Towards Quantifying the Distance between Opinions

Increasingly, critical decisions in public policy, governance, and business strategy rely on a deeper understanding of the needs and opinions of constituent members (e.g. citizens, shareholders). While it has become easier to collect a…

Computation and Language · Computer Science 2020-01-28 Saket Gurukar , Deepak Ajwani , Sourav Dutta , Juho Lauri , Srinivasan Parthasarathy , Alessandra Sala

We introduce a conceptually simple and effective method to quantify the similarity between relations in knowledge bases. Specifically, our approach is based on the divergence between the conditional probability distributions over entity…

Artificial Intelligence · Computer Science 2019-07-23 Weize Chen , Hao Zhu , Xu Han , Zhiyuan Liu , Maosong Sun

This paper presents a new approach for measuring semantic similarity/distance between words and concepts. It combines a lexical taxonomy structure with corpus statistical information so that the semantic distance between nodes in the…

cmp-lg · Computer Science 2008-02-03 Jay J. Jiang , David W. Conrath

Uncertainty in Ontology Matching: A Decision Rule-Based Approach

Considering the high heterogeneity of the ontologies pub-lished on the web, ontology matching is a crucial issue whose aim is to establish links between an entity of a source ontology and one or several entities from a target ontology.…

Artificial Intelligence · Computer Science 2015-01-26 Amira Essaid , Arnaud Martin , Grégory Smits , Boutheina Ben Yaghlane

Ranking relations using analogies in biological and information networks

Analogical reasoning depends fundamentally on the ability to learn and generalize about relations between objects. We develop an approach to relational learning which, given a set of pairs of objects…

Methodology · Statistics 2013-08-30 Ricardo Silva , Katherine Heller , Zoubin Ghahramani , Edoardo M. Airoldi

Correlation-Based Method for Sentiment Classification

The classic supervised classification algorithms are efficient, but time-consuming, complicated and not interpretable, which makes it difficult to analyze their results that limits the possibility to improve them based on real observations.…

Computation and Language · Computer Science 2018-03-05 Hussam Hamdan

Information Distance in Multiples

Information distance is a parameter-free similarity measure based on compression, used in pattern recognition, data mining, phylogeny, clustering, and classification. The notion of information distance is extended from pairs to multiples…

Computer Vision and Pattern Recognition · Computer Science 2009-05-21 Paul M. B. Vitanyi

Generalization of bibliographic coupling and co-citation using the node split network

Bibliographic coupling (BC) and co-citation (CC) are the two most common citation-based coupling measures of similarity between scientific items. One can interpret these measures as second-neighbor relations distinguished by the direction…

Physics and Society · Physics 2021-11-01 Jinhyuk Yun

Exploring Heritability of Functional Brain Networks with Inexact Graph Matching

Data-driven brain parcellations aim to provide a more accurate representation of an individual's functional connectivity, since they are able to capture individual variability that arises due to development or disease. This renders…

Neurons and Cognition · Quantitative Biology 2017-03-30 Sofia Ira Ktena , Salim Arslan , Sarah Parisot , Daniel Rueckert

Towards Case-Based Preference Elicitation: Similarity Measures on Preference Structures

While decision theory provides an appealing normative framework for representing rich preference structures, eliciting utility or value functions typically incurs a large cost. For many applications involving interactive systems this…

Artificial Intelligence · Computer Science 2013-02-01 Vu A. Ha , Peter Haddawy

Building and Interpreting Deep Similarity Models

Many learning algorithms such as kernel machines, nearest neighbors, clustering, or anomaly detection, are based on the concept of 'distance' or 'similarity'. Before similarities are used for training an actual machine learning model, we…

Machine Learning · Computer Science 2020-09-08 Oliver Eberle , Jochen Büttner , Florian Kräutli , Klaus-Robert Müller , Matteo Valleriani , Grégoire Montavon

A Study of Metrics of Distance and Correlation Between Ranked Lists for Compositionality Detection

Compositionality in language refers to how much the meaning of some phrase can be decomposed into the meaning of its constituents and the way these constituents are combined. Based on the premise that substitution by synonyms is…

Computation and Language · Computer Science 2017-03-13 Christina Lioma , Niels Dalum Hansen

Compression-based Similarity

First we consider pair-wise distances for literal objects consisting of finite binary files. These files are taken to contain all of their meaning, like genomes or books. The distances are based on compression of the objects concerned,…

Information Theory · Computer Science 2011-10-21 Paul M. B. Vitanyi

Matching Methods for Causal Inference: A Review and a Look Forward

When estimating causal effects using observational data, it is desirable to replicate a randomized experiment as closely as possible by obtaining treated and control groups with similar covariate distributions. This goal can often be…

Methodology · Statistics 2010-10-28 Elizabeth A. Stuart

Ordinal Characterization of Similarity Judgments

Characterizing judgments of similarity within a perceptual or semantic domain, and making inferences about the underlying structure of this domain from these judgments, has an increasingly important role in cognitive and systems…

Neurons and Cognition · Quantitative Biology 2025-08-13 Jonathan D. Victor , Guillermo Aguilar , Suniyya A. Waraich

Return to basics: Clustering of scientific literature using structural information

Scholars frequently employ relatedness measures to estimate the similarity between two different items (e.g., documents, authors, and institutes). Such relatedness measures are commonly based on overlapping references ($\textit{i.e.}$,…

Social and Information Networks · Computer Science 2020-04-14 Jinhyuk Yun , Sejung Ahn , June Young Lee

Formal Languages and Algorithms for Similarity based Retrieval from Sequence Databases

The paper considers various formalisms based on Automata, Temporal Logic and Regular Expressions for specifying queries over sequences. Unlike traditional binary semantics, the paper presents a similarity based semantics for thse…

Logic in Computer Science · Computer Science 2007-05-23 A. Prasad Sistla

C-Rank: A Link-based Similarity Measure for Scientific Literature Databases

As the number of people who use scientific literature databases grows, the demand for literature retrieval services has been steadily increased. One of the most popular retrieval services is to find a set of papers similar to the paper…

Digital Libraries · Computer Science 2011-09-07 Seok-Ho Yoon , Sang-Wook Kim , Sunju Park

Validated Intraclass Correlation Statistics to Test Item Performance Models

A new method, with an application program in Matlab code, is proposed for testing item performance models on empirical databases. This method uses data intraclass correlation statistics as expected correlations to which one compares simple…

Methodology · Statistics 2011-04-13 Pierre Courrieu , Muriele Brand-D'Abrescia , Ronald Peereman , Daniel Spieler , Arnaud Rey

The Google Similarity Distance

Words and phrases acquire meaning from the way they are used in society, from their relative semantics to other words and phrases. For computers the equivalent of `society' is `database,' and the equivalent of `use' is `way to search the…

Computation and Language · Computer Science 2007-06-13 Rudi Cilibrasi , Paul M. B. Vitanyi