Related papers: Aligning coding sequences with frameshift extensio…

Alignment of protein-coding sequences with frameshift extension penalties

We introduce an algorithm for the alignment of protein-coding sequences accounting for frameshifts. The main specificity of this algorithm as compared to previously published protein-coding sequence alignment methods is the introduction of…

Data Structures and Algorithms · Computer Science 2015-08-21 François Bélanger, Aïda Ouangraoua

Back-translation for discovering distant protein homologies

Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of…

Quantitative Methods · Quantitative Biology 2011-01-18 Marta L. Gîrdea, Laurent Noé, Gregory Kucherov

Aligning biological sequences by exploiting residue conservation and coevolution

Sequences of nucleotides (for DNA and RNA) or amino acids (for proteins) are central objects in biology. Among the most important computational problems is that of sequence alignment, i.e. arranging sequences from different organisms in…

Quantitative Methods · Quantitative Biology 2020-12-08 Anna Paola Muntoni, Andrea Pagnani, Martin Weigt, Francesco Zamponi

Pairwise alignment of the DNA sequence using hypercomplex number representation

A new set of DNA base-nucleic acid codes and their hypercomplex number representation have been introduced for taking the probability of each nucleotide into full account. A new scoring system has been proposed to suit the hypercomplex…

Other Quantitative Biology · Quantitative Biology 2014-03-12 Jian-Jun Shu, Li Shan Ouw

New Sequence Alignment Algorithm using AI Rules and Dynamic Seeds

DNA sequence alignment is important today as it is usually the first step in finding gene mutation, evolutionary similarities, protein structure, drug development and cancer treatment. Covid-19 is one recent example. There are many…

Genomics · Quantitative Biology 2023-06-01 Suchindra, Preetam Nagaraj

Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning

Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used for comparative analysis of biological genomes. However, the…

Quantitative Methods · Quantitative Biology 2020-10-27 Yong Joon Song, Dong Jin Ji, Hye In Seo, Gyu Bum Han, Dong Ho Cho

Aligning the True Semantics: Constrained Decoupling and Distribution Sampling for Cross-Modal Alignment

Cross-modal alignment is a crucial task in multimodal learning aimed at achieving semantic consistency between vision and language. This requires that image-text pairs exhibit similar semantics. Traditional algorithms pursue embedding…

Machine Learning · Computer Science 2026-03-09 Xiang Ma, Lexin Fang, Litian Xu, Caiming Zhang

Adaptive Learning of Rank-One Models for Efficient Pairwise Sequence Alignment

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinformatics and typically represents a heavy computational burden. State-of-the-art approaches to speed up this task use hashing to identify short segments (k-mers) that…

Machine Learning · Computer Science 2021-02-16 Govinda M. Kamath, Tavor Z. Baharav, Ilan Shomorony

A context dependent pair hidden Markov model for statistical alignment

This article proposes a novel approach to statistical alignment of nucleotide sequences by introducing a context dependent structure on the substitution process in the underlying evolutionary model. We propose to estimate alignments and…

Statistics Theory · Mathematics 2011-07-18 Ana Arribas-Gil, Catherine Matias

Small Coupling Expansion for Multiple Sequence Alignment

The alignment of biological sequences such as DNA, RNA, and proteins is one of the basic tools that make it possible to detect evolutionary patterns, as well as functional/structural characterizations between homologous sequences in different…

Quantitative Methods · Quantitative Biology 2023-05-01 Louise Budzynski, Andrea Pagnani

A biological sequence comparison algorithm using quantum computers

Genetic information is encoded in a linear sequence of nucleotides, represented by letters ranging from thousands to billions. Mutations refer to changes in the DNA or RNA nucleotide sequence. Thus, mutation detection is vital in all areas…

Quantum Physics · Physics 2024-03-14 Büsra Kösoglu-Kind, Robert Loredo, Michele Grossi, Christian Bernecker, Jody M Burks, Rudiger Buchkremer

Bayesian Protein Sequence and Structure Alignment

The structure of a protein is crucial in determining its functionality, and is much more conserved than sequence during evolution. A key task in structural biology is to compare protein structures in order to determine evolutionary…

Methodology · Statistics 2019-11-06 Christopher Fallaize, Peter Green, Kanti Mardia, Stuart Barber

Pairwise sequence alignment at arbitrarily large evolutionary distance

Ancestral sequence reconstruction is a key task in computational biology. It consists in inferring a molecular sequence at an ancestral species of a known phylogeny, given descendant sequences at the tip of the tree. In addition to its many…

Populations and Evolution · Quantitative Biology 2022-07-27 Brandon Legried, Sebastien Roch

Aligning 415 519 proteins in less than two hours on PC

Rapid development of modern sequencing platforms enabled an unprecedented growth of protein families databases. The abundance of sets composed of hundreds of thousands of sequences is a great challenge for multiple sequence alignment…

Genomics · Quantitative Biology 2017-03-03 Sebastian Deorowicz, Agnieszka Debudaj-Grabysz, Adam Gudys

Sketching and Sequence Alignment: A Rate-Distortion Perspective

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinformatics and typically represents a heavy computational burden. A standard approach to speed up this task is to compute "sketches" of the DNA reads (typically via…

Information Theory · Computer Science 2021-07-12 Ilan Shomorony, Govinda M. Kamath

Embed-Search-Align: DNA Sequence Alignment using Transformer Models

DNA sequence alignment involves assigning short DNA reads to the most probable locations on an extensive reference genome. This process is crucial for various genomic analyses, including variant calling, transcriptomics, and epigenomics.…

Genomics · Quantitative Biology 2024-12-06 Pavan Holur, K. C. Enevoldsen, Shreyas Rajesh, Lajoyce Mboning, Thalia Georgiou, Louis-S. Bouchard, Matteo Pellegrini, Vwani Roychowdhury

Algorithms for normalized multiple sequence alignments

Sequence alignment supports numerous tasks in bioinformatics, natural language processing, pattern recognition, social sciences, and other fields. While the alignment of two sequences may be performed swiftly in many applications, the…

Data Structures and Algorithms · Computer Science 2021-12-06 Eloi Araujo, Luiz Rozante, Diego P. Rubert, Fabio V. Martinez

Genetic Programming for Evolving Similarity Functions for Clustering: Representations and Analysis

Clustering is a difficult and widely-studied data mining task, with many varieties of clustering algorithms proposed in the literature. Nearly all algorithms use a similarity measure such as a distance metric (e.g. Euclidean distance) to…

Neural and Evolutionary Computing · Computer Science 2019-10-24 Andrew Lensen, Bing Xue, Mengjie Zhang

Technology dictates algorithms: Recent developments in read alignment

Massively parallel sequencing techniques have revolutionized biological and medical sciences by providing unprecedented insight into the genomes of humans, animals, and microbes. Modern sequencing platforms generate enormous amounts of…

Genomics · Quantitative Biology 2023-11-21 Mohammed Alser, Jeremy Rotman, Kodi Taraszka, Huwenbo Shi, Pelin Icer Baykal, Harry Taegyun Yang, Victor Xue, Sergey Knyazev, Benjamin D. Singer, Brunilda Balliu, David Koslicki, Pavel Skums, Alex Zelikovsky, Can Alkan, Onur Mutlu, Serghei Mangul

Multi-Alignment Contrastive Learning for Enzyme–Reaction Retrieval

Identifying enzymes that catalyze target biochemical reactions is a key step in computational enzyme discovery and biocatalyst design. Recent representation-learning methods formulate this problem as enzyme–reaction matching, where paired…

Biomolecules · Quantitative Biology 2026-05-26 Gengmo Zhou, Feng Yu, Wenda Wang, Zhifeng Gao, Guolin Ke, Zhewei Wei, Zhen Wang