Related papers: Sequence alignment and mutual information

Sequence alignment, mutual information, and dissimilarity measures for constructing phylogenies

Existing sequence alignment algorithms use heuristic scoring schemes which cannot be used as objective distance metrics. Therefore one relies on measures like the p- or log-det distances, or makes explicit, and often simplistic, assumptions…

Genomics · Quantitative Biology 2015-05-19 Orion Penner , Peter Grassberger , Maya Paczuski

Hierarchical Clustering Based on Mutual Information

Motivation: Clustering is a frequently used concept in variety of bioinformatical applications. We present a new method for hierarchical clustering of data called mutual information clustering (MIC) algorithm. It uses mutual information…

Quantitative Methods · Quantitative Biology 2007-05-23 Alexander Kraskov , Harald Stögbauer , Ralph G. Andrzejak , Peter Grassberger

Multiple Sequence Alignment is not a Solved Problem

Multiple sequence alignment is a basic procedure in molecular biology, and it is often treated as being essentially a solved computational problem. However, this is not so, and here I review the evidence for this claim, and outline the…

Populations and Evolution · Quantitative Biology 2018-08-24 David A. Morrison

Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning

Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used for comparative analysis of biological genomes. However, the…

Quantitative Methods · Quantitative Biology 2020-10-27 Yong Joon Song , Dong Jin Ji , Hye In Seo , Gyu Bum Han , Dong Ho Cho

New Sequence Alignment Algorithm using AI Rules and Dynamic Seeds

DNA sequence alignment is important today as it is usually the first step in finding gene mutation, evolutionary similarities, protein structure, drug development and cancer treatment. Covid-19 is one recent example. There are many…

Genomics · Quantitative Biology 2023-06-01 Suchindra , Preetam Nagaraj

Technology dictates algorithms: Recent developments in read alignment

Massively parallel sequencing techniques have revolutionized biological and medical sciences by providing unprecedented insight into the genomes of humans, animals, and microbes. Modern sequencing platforms generate enormous amounts of…

Genomics · Quantitative Biology 2023-11-21 Mohammed Alser , Jeremy Rotman , Kodi Taraszka , Huwenbo Shi , Pelin Icer Baykal , Harry Taegyun Yang , Victor Xue , Sergey Knyazev , Benjamin D. Singer , Brunilda Balliu , David Koslicki , Pavel Skums , Alex Zelikovsky , Can Alkan , Onur Mutlu , Serghei Mangul

Small Coupling Expansion for Multiple Sequence Alignment

The alignment of biological sequences such as DNA, RNA, and proteins, is one of the basic tools that allow to detect evolutionary patterns, as well as functional/structural characterizations between homologous sequences in different…

Quantitative Methods · Quantitative Biology 2023-05-01 Louise Budzynski , Andrea Pagnani

Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification

Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed…

Genomics · Quantitative Biology 2015-01-21 Ivan Borozan , Stuart Watt , Vincent Ferretti

Approximating mutual information of high-dimensional variables using learned representations

Mutual information (MI) is a general measure of statistical dependence with widespread application across the sciences. However, estimating MI between multi-dimensional variables is challenging because the number of samples necessary to…

Quantitative Methods · Quantitative Biology 2025-03-06 Gokul Gowri , Xiao-Kang Lun , Allon M. Klein , Peng Yin

Sequence Alignment Algorithm for Statistical Similarity Assessment

This paper presents a new approach to statistical similarity assessment based on sequence alignment. The algorithm performs mutual matching of two random sequences by successively searching for common elements and by applying sequence…

Signal Processing · Electrical Eng. & Systems 2021-06-09 Jakub Nikonowicz , Łukasz Matuszewski , Paweł Kubczak

Evaluation of the Topological Agreement of Network Alignments

Aligning protein interaction networks (PPI) of two or more organisms consists of finding a mapping of the nodes (proteins) of the networks that captures important structural and functional associations (similarity). It is a well studied but…

Molecular Networks · Quantitative Biology 2020-10-12 Concettina Guerra , Pietro Hiram Guzzi

Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment

Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult…

Quantitative Methods · Quantitative Biology 2015-01-09 Stefano Iantorno , Kevin Gori , Nick Goldman , Manuel Gil , Christophe Dessimoz

Getting aligned on representational alignment

Biological and artificial information processing systems form representations of the world that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the similarity between the representations formed by…

Neurons and Cognition · Quantitative Biology 2024-11-27 Ilia Sucholutsky , Lukas Muttenthaler , Adrian Weller , Andi Peng , Andreea Bobu , Been Kim , Bradley C. Love , Christopher J. Cueva , Erin Grant , Iris Groen , Jascha Achterberg , Joshua B. Tenenbaum , Katherine M. Collins , Katherine L. Hermann , Kerem Oktar , Klaus Greff , Martin N. Hebart , Nathan Cloos , Nikolaus Kriegeskorte , Nori Jacoby , Qiuyi Zhang , Raja Marjieh , Robert Geirhos , Sherol Chen , Simon Kornblith , Sunayana Rane , Talia Konkle , Thomas P. O'Connell , Thomas Unterthiner , Andrew K. Lampinen , Klaus-Robert Müller , Mariya Toneva , Thomas L. Griffiths

Accurate Estimation of Mutual Information in High Dimensional Data

Mutual information (MI) is a fundamental measure of statistical dependence between two variables, yet accurate estimation from finite data remains notoriously difficult. No estimator is universally reliable, and common approaches fail in…

Data Analysis, Statistics and Probability · Physics 2025-10-02 Eslam Abdelaleem , K. Michael Martini , Ilya Nemenman

Hierarchical Clustering Using Mutual Information

We present a method for hierarchical clustering of data called {\it mutual information clustering} (MIC) algorithm. It uses mutual information (MI) as a similarity measure and exploits its grouping property: The MI between three objects $X,…

Quantitative Methods · Quantitative Biology 2007-05-23 Alexander Kraskov , Harald Stoegbauer , Ralph G. Andrzejak , Peter Grassberger

Multiple sequence alignment for short sequences

Multiple sequence alignment (MSA) has been one of the most important problems in bioinformatics for more decades and it is still heavily examined by many mathematicians and biologists. However, mostly because of the practical motivation of…

Quantitative Methods · Quantitative Biology 2015-11-17 Kristóf Takács

SANA: Simulated Annealing Network Alignment Applied to Biological Networks

The alignment of biological networks has the potential to teach us as much about biology and disease as has sequence alignment. Sequence alignment can be optimally solved in polynomial time. In contrast, network alignment is $NP$-hard,…

Molecular Networks · Quantitative Biology 2016-07-12 Nil Mamano , Wayne Hayes

Unaligned Sequence Similarity Search Using Deep Learning

Gene annotation has traditionally required direct comparison of DNA sequences between an unknown gene and a database of known ones using string comparison methods. However, these methods do not provide useful information when a gene does…

Machine Learning · Computer Science 2019-09-17 James K. Senter , Taylor M. Royalty , Andrew D. Steen , Amir Sadovnik

Fast computation of mutual information in the frequency domain with applications to global multimodal image alignment

Multimodal image alignment is the process of finding spatial correspondences between images formed by different imaging techniques or under different conditions, to facilitate heterogeneous data fusion and correlative analysis. The…

Computer Vision and Pattern Recognition · Computer Science 2022-07-01 Johan Öfverstedt , Joakim Lindblad , Nataša Sladoje

A Space-Efficient Approach towards Distantly Homologous Protein Similarity Searches

Protein similarity searches are a routine job for molecular biologists where a query sequence of amino acids needs to be compared and ranked against an ever-growing database of proteins. All available algorithms in this field can be grouped…

Computational Engineering, Finance, and Science · Computer Science 2015-08-27 Akash Nag , Sunil Karforma