Related papers: Aligning coding sequences with frameshift extensio…

200 papers

We introduce an algorithm for the alignment of protein-coding sequences accounting for frameshifts. The main specificity of this algorithm as compared to previously published protein-coding sequence alignment methods is the introduction of…

Data Structures and Algorithms · Computer Science 2015-08-21 François Bélanger , Aïda Ouangraoua

Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of…

Quantitative Methods · Quantitative Biology 2011-01-18 Marta L. Gîrdea , Laurent Noé , Gregory Kucherov

Sequences of nucleotides (for DNA and RNA) or amino acids (for proteins) are central objects in biology. Among the most important computational problems is that of sequence alignment, i.e. arranging sequences from different organisms in…

Quantitative Methods · Quantitative Biology 2020-12-08 Anna Paola Muntoni , Andrea Pagnani , Martin Weigt , Francesco Zamponi

A new set of DNA base-nucleic acid codes and their hypercomplex number representation have been introduced for taking the probability of each nucleotide into full account. A new scoring system has been proposed to suit the hypercomplex…

Other Quantitative Biology · Quantitative Biology 2014-03-12 Jian-Jun Shu , Li Shan Ouw

DNA sequence alignment is important today as it is usually the first step in finding gene mutation, evolutionary similarities, protein structure, drug development and cancer treatment. Covid-19 is one recent example. There are many…

Genomics · Quantitative Biology 2023-06-01 Suchindra , Preetam Nagaraj

Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used for comparative analysis of biological genomes. However, the…

Quantitative Methods · Quantitative Biology 2020-10-27 Yong Joon Song , Dong Jin Ji , Hye In Seo , Gyu Bum Han , Dong Ho Cho

Cross-modal alignment is a crucial task in multimodal learning aimed at achieving semantic consistency between vision and language. This requires that image-text pairs exhibit similar semantics. Traditional algorithms pursue embedding…

Machine Learning · Computer Science 2026-03-09 Xiang Ma , Lexin Fang , Litian Xu , Caiming Zhang

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinformatics and typically represents a heavy computational burden. State-of-the-art approaches to speed up this task use hashing to identify short segments (k-mers) that…

Machine Learning · Computer Science 2021-02-16 Govinda M. Kamath , Tavor Z. Baharav , Ilan Shomorony

This article proposes a novel approach to statistical alignment of nucleotide sequences by introducing a context dependent structure on the substitution process in the underlying evolutionary model. We propose to estimate alignments and…

Statistics Theory · Mathematics 2011-07-18 Ana Arribas-Gil , Catherine Matias

The alignment of biological sequences such as DNA, RNA, and proteins is one of the basic tools that allow the detection of evolutionary patterns, as well as functional/structural characterizations between homologous sequences in different…

Quantitative Methods · Quantitative Biology 2023-05-01 Louise Budzynski , Andrea Pagnani

Genetic information is encoded in a linear sequence of nucleotides, represented by letters ranging from thousands to billions. Mutations refer to changes in the DNA or RNA nucleotide sequence. Thus, mutation detection is vital in all areas…

The structure of a protein is crucial in determining its functionality, and is much more conserved than sequence during evolution. A key task in structural biology is to compare protein structures in order to determine evolutionary…

Methodology · Statistics 2019-11-06 Christopher Fallaize , Peter Green , Kanti Mardia , Stuart Barber

Ancestral sequence reconstruction is a key task in computational biology. It consists in inferring a molecular sequence at an ancestral species of a known phylogeny, given descendant sequences at the tip of the tree. In addition to its many…

Populations and Evolution · Quantitative Biology 2022-07-27 Brandon Legried , Sebastien Roch

Rapid development of modern sequencing platforms enabled an unprecedented growth of protein families databases. The abundance of sets composed of hundreds of thousands of sequences is a great challenge for multiple sequence alignment…

Genomics · Quantitative Biology 2017-03-03 Sebastian Deorowicz , Agnieszka Debudaj-Grabysz , Adam Gudys

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinformatics and typically represents a heavy computational burden. A standard approach to speed up this task is to compute "sketches" of the DNA reads (typically via…

Information Theory · Computer Science 2021-07-12 Ilan Shomorony , Govinda M. Kamath

DNA sequence alignment involves assigning short DNA reads to the most probable locations on an extensive reference genome. This process is crucial for various genomic analyses, including variant calling, transcriptomics, and epigenomics.…

Sequence alignment supports numerous tasks in bioinformatics, natural language processing, pattern recognition, social sciences, and other fields. While the alignment of two sequences may be performed swiftly in many applications, the…

Data Structures and Algorithms · Computer Science 2021-12-06 Eloi Araujo , Luiz Rozante , Diego P. Rubert , Fabio V. Martinez

Clustering is a difficult and widely-studied data mining task, with many varieties of clustering algorithms proposed in the literature. Nearly all algorithms use a similarity measure such as a distance metric (e.g. Euclidean distance) to…

Neural and Evolutionary Computing · Computer Science 2019-10-24 Andrew Lensen , Bing Xue , Mengjie Zhang

Massively parallel sequencing techniques have revolutionized biological and medical sciences by providing unprecedented insight into the genomes of humans, animals, and microbes. Modern sequencing platforms generate enormous amounts of…

Identifying enzymes that catalyze target biochemical reactions is a key step in computational enzyme discovery and biocatalyst design. Recent representation-learning methods formulate this problem as enzyme--reaction matching, where paired…

Biomolecules · Quantitative Biology 2026-05-26 Gengmo Zhou , Feng Yu , Wenda Wang , Zhifeng Gao , Guolin Ke , Zhewei Wei , Zhen Wang