English
Related papers

Related papers: Geometric Aspects of Biological Sequence Compariso…

200 papers

Tools that effectively analyze and compare sequences are of great importance in various areas of applied computational research, especially in the framework of molecular biology. In the present paper, we introduce simple geometric criteria…

Quantitative Methods · Quantitative Biology 2013-08-14 Boris Brimkov , Valentin E. Brimkov

A quasi-metric is a distance function which satisfies the triangle inequality but is not symmetric: it can be thought of as an asymmetric metric. The central result of this thesis, developed in Chapter 3, is that a natural correspondence…

Information Retrieval · Computer Science 2008-10-31 Aleksandar Stojmirovic

A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied. We propose a new ``normalized information distance'', based on the noncomputable notion of…

Computational Complexity · Computer Science 2011-11-09 Ming Li , Xin Chen , Xin Li , Bin Ma , Paul Vitanyi

Given a set of sequences, the distance between pairs of them helps us to find their similarity and derive structural relationship amongst them. For genomic sequences such measures make it possible to construct the evolution tree of…

Information Theory · Computer Science 2012-08-29 Sandeep Hosangadi

Several measures exist for string similarity, including notable ones like the edit distance and the indel distance. The former measures the count of insertions, deletions, and substitutions required to transform one string into another,…

Data Structures and Algorithms · Computer Science 2024-10-15 Sudatta Bhattacharya , Sanjana Dey , Elazar Goldenberg , Michal Koucký

We propose a family of near-metrics based on local graph diffusion to capture similarity for a wide class of data sets. These quasi-metametrics, as their names suggest, dispense with one or two standard axioms of metric spaces, specifically…

Machine Learning · Statistics 2017-10-18 Chu Wang , Iraj Saniee , William S. Kennedy , Chris A. White

This paper proposes a general framework for matching similar subsequences in both time series and string databases. The matching results are pairs of query subsequences and database subsequences. The framework finds all possible pairs of…

Databases · Computer Science 2012-08-02 Haohan Zhu , George Kollios , Vassilis Athitsos

Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly.…

Populations and Evolution · Quantitative Biology 2011-01-11 Roland F. Schwarz , William Fletcher , Frank Förster , Benjamin Merget , Matthias Wolf , Jörg Schultz , Florian Markowetz

Distance measuring is a very important task in digital geometry and digital image processing. Due to our natural approach to geometry we think of the set of points that are equally far from a given point as a Euclidean circle. Using the…

Metric Geometry · Mathematics 2010-06-18 Janos Farkas , Szabolcs Bajak , Benedek Nagy

The distance on a set is a comparative function. The smaller the distance between two elements of that set, the closer, or more similar, those elements are. Fr\'echet axiomatized the distance into what is today known as a metric. In this…

General Topology · Mathematics 2016-07-05 Samer Assaf

String matching algorithm plays the vital role in the Computational Biology. The functional and structural relationship of the biological sequence is determined by similarities on that sequence. For that, the researcher is supposed to aware…

Data Structures and Algorithms · Computer Science 2014-01-30 Pandiselvam. P , Marimuthu. T , Lawrance. R

Sequence classification algorithms, such as SVM, require a definition of distance (similarity) measure between two sequences. A commonly used notion of similarity is the number of matches between $k$-mers ($k$-length subsequences) in the…

Data Structures and Algorithms · Computer Science 2017-12-13 Muhammad Farhan , Juvaria Tariq , Arif Zaman , Mudassir Shabbir , Imdad Ullah Khan

Many learning algorithms such as kernel machines, nearest neighbors, clustering, or anomaly detection, are based on the concept of 'distance' or 'similarity'. Before similarities are used for training an actual machine learning model, we…

The author has recently introduced abstract algebraic frameworks of analogical proportions and similarity within the general setting of universal algebra. The purpose of this paper is to build a bridge from similarity to analogical…

Logic in Computer Science · Computer Science 2024-02-29 Christian Antić

This paper proposes a new method for determining similarity and anomalies between time series, most practically effective in large collections of (likely related) time series, by measuring distances between structural breaks within such a…

Machine Learning · Computer Science 2020-12-01 Nick James , Max Menzies , Lamiae Azizi , Jennifer Chan

We propose a novel semiparametric classifier based on Mahalanobis distances of an observation from the competing classes. Our tool is a generalized additive model with the logistic link function that uses these distances as features to…

Methodology · Statistics 2025-02-05 Annesha Ghosh , Anil K. Ghosh , Rita SahaRay , Soham Sarkar

Sequence comparison is a basic task to capture similarities and differences between two or more sequences of symbols, with countless applications such as in computational biology. An alignment is a way to compare sequences, where a giving…

Discrete Mathematics · Computer Science 2023-03-16 Eloi Araujo , Fábio V. Martinez , Carlos H. A. Higa , José Soares

In this article, we propose tree edit distance with variables, which is an extension of the tree edit distance to handle trees with variables and has a potential application to measuring the similarity between mathematical formulas,…

Data Structures and Algorithms · Computer Science 2021-05-12 Tatsuya Akutsu , Tomoya Mori , Naotoshi Nakamura , Satoshi Kozawa , Yuhei Ueno , Thomas N. Sato

We define a distance analogous to the Gromov-Hausdorff distance that enables the comparison of arbitrary quasi-isometric spaces. We also investigate properties preserved under limits with respect to this distance, as well as properties of…

Metric Geometry · Mathematics 2026-05-28 Alexei Naianzin

Modelling the substitution of nucleotides along a phylogenetic tree is usually done by a hidden Markov process. This allows to define a distribution of characters at the leaves of the trees and one might be able to obtain polynomial…

Populations and Evolution · Quantitative Biology 2020-10-12 Marta Casanellas , Jesús Fernández-Sánchez , Marina Garrote-López
‹ Prev 1 2 3 10 Next ›