Related papers: Faster exon assembly by sparse spliced alignment

200 papers

Motivation: Spliced alignment refers to the alignment of messenger RNA (mRNA) or protein sequences to eukaryotic genomes. It plays a critical role in gene annotation and the study of gene functions. Accurate spliced alignment demands…

Genomics · Quantitative Biology 2025-09-23 Siying Yang, Neng Huang, Heng Li

The Spliced Alignment Problem (SAP), which consists of finding an optimal semi-global alignment of a spliced RNA sequence on an unspliced genomic sequence, has been widely considered for the prediction and the annotation of gene structures in…

Data Structures and Algorithms · Computer Science 2017-09-20 Jean-David Aguilar, Safa Jammali, Esaie Kuitche, Aïda Ouangraoua

Unbalanced translocations are among the most frequent chromosomal alterations, accounting for 30% of all losses of heterozygosity, a major genetic event causing inactivation of tumor suppressor genes. Despite their central role in…

Data Structures and Algorithms · Computer Science 2018-12-04 Domenico Cantone, Simone Faro, Arianna Pavone

In genomics, pattern matching against a sequence of nucleotides plays a pivotal role in DNA sequence alignment and genome comparison. This helps tackle some diseases, such as cancer in humans. The complexity of searching biological…

Quantitative Methods · Quantitative Biology 2017-10-04 Fereshte Mozafari, Hossein Babashah, Somayyeh Koohi, Zahra Kavehvash

Multiple Sequence Alignment (MSA) is one of the most computationally intensive tasks in Computational Biology. The best known existing solutions for multiple sequence alignment take several hours (in some cases days) of computation time to…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-01-20 Fahad Saeed, Ashfaq Khokhar

Current computational methods for exon-intron structure prediction from a cluster of transcript (EST, mRNA) data do not exhibit the time and space efficiency necessary to process large clusters of more than 20,000 ESTs and genes longer than…

Genomics · Quantitative Biology 2010-05-11 Paola Bonizzoni, Gianluca Della Vedova, Yuri Pirola, Raffaella Rizzi

We introduce space-efficient plane-sweep algorithms for basic planar geometric problems. It is assumed that the input is in a read-only array of $n$ items and that the available workspace is $\Theta(s)$ bits, where $\lg n \leq s \leq n…

Data Structures and Algorithms · Computer Science 2016-04-25 Amr Elmasry, Frank Kammer

A longest repeat query on a string, motivated by its applications in many subfields including computational biology, asks for the longest repetitive substring(s) covering a particular string position (point query). In this paper, we extend…

Data Structures and Algorithms · Computer Science 2015-11-10 Bojian Xu

The sparse regression problem, also known as the best subset selection problem, can be cast as follows: Given a set $S$ of $n$ points in $\mathbb{R}^d$, a point $y\in \mathbb{R}^d$, and an integer $2 \leq k \leq d$, find an affine combination…

Data Structures and Algorithms · Computer Science 2020-01-01 Jean Cardinal, Aurélien Ooms

Described are two algorithms to find long approximate palindromes in a string, for example a DNA sequence. A simple algorithm requires $O(n)$ space and almost always runs in $O(k \cdot n)$ time, where $n$ is the length of the string and $k$ is the…

Data Structures and Algorithms · Computer Science 2007-05-23 L. Allison

The problem of approximate string matching is important in many different areas such as computational biology, text processing and pattern recognition. A great effort has been made to design efficient algorithms addressing several variants…

Data Structures and Algorithms · Computer Science 2008-07-29 Dimitris Papamichail, Georgios Papamichail

The problem of assembling DNA fragments starting from imperfect strings given by a sequencer, classified as NP-hard when trying to get perfect answers, is of great importance in several fields because of its relation with the possibility of…

Data Structures and Algorithms · Computer Science 2017-05-26 Juan Manuel Ciro Restrepo, Andrés Felipe Zapata Palacio, Mauricio Toro

Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose order and/or…

Genomics · Quantitative Biology 2022-05-27 Sergey Aganezov, Pavel Avdeyev, Nikita Alexeev, Yongwu Rong, Max A. Alekseyev

Important papers have appeared recently on the problem of indexing binary strings for jumbled pattern matching, and further lowering the time bounds in terms of the input size would now be a breakthrough with broad implications. We can…

Data Structures and Algorithms · Computer Science 2017-02-15 Luís Cunha, Simone Dantas, Travis Gagie, Roland Wittler, Luis Kowada, Jens Stoye

This report presents the implementation of a protein sequence comparison algorithm specifically designed to speed up the time-consuming part on parallel hardware such as SSE instructions, multicore architectures, or graphics boards. Three…

Quantitative Methods · Quantitative Biology 2008-12-18 Van Hoa Nguyen, Dominique Lavenier

Motivation: A pan-genome graph represents a collection of genomes and encodes sequence variations between them. It is a powerful data structure for studying multiple similar genomes. Sequence-to-graph alignment is an essential step for the…

Genomics · Quantitative Biology 2022-06-29 Haowen Zhang, Shiqi Wu, Srinivas Aluru, Heng Li

We consider the problem of finding a semi-matching in bipartite graphs, which is also extensively studied under various names in the scheduling literature. We give faster algorithms for both the weighted and unweighted cases. For the…

Data Structures and Algorithms · Computer Science 2012-06-15 Jittat Fakcharoenphol, Bundit Laekhanukit, Danupon Nanongkai

We revisit the complexity of approximate pattern matching in an elastic-degenerate string. Such a string is a sequence of $n$ finite sets of strings of total length $N$, and compactly describes a collection of strings obtained by first…

Data Structures and Algorithms · Computer Science 2025-03-04 Paweł Gawrychowski, Adam Górkiewicz, Pola Marciniak, Solon P. Pissis, Karol Pokorski

Given a text $T$ of length $n$ and a pattern $P$ of length $m$, the string matching problem is the task of finding all occurrences of $P$ in $T$. In this study, we propose an algorithm that solves this problem in $O((n + m)q)$ time considering…

Data Structures and Algorithms · Computer Science 2020-04-14 Satoshi Kobayashi, Diptarama Hendrian, Ryo Yoshinaka, Ayumi Shinohara

Searching for all occurrences of a pattern in a text is a fundamental problem in computer science with applications in many other fields, like natural language processing, information retrieval and computational biology. Sampled string…

Data Structures and Algorithms · Computer Science 2019-08-19 Simone Faro, Arianna Pavone, Francesco Pio Marino