Related papers: A Misclassification Network-Based Method for Compa…

Alignment-Free Sequence Analysis and Applications

Genome and metagenome comparisons based on large amounts of next-generation sequencing (NGS) data pose significant challenges for alignment-based approaches due to the huge data size and the relatively short length of the reads.…

Quantitative Methods · Quantitative Biology 2018-03-28 Jie Ren, Xin Bai, Yang Young Lu, Kujin Tang, Ying Wang, Gesine Reinert, Fengzhu Sun

Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification

Alignment-based sequence similarity searches, while accurate for some types of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed…

Genomics · Quantitative Biology 2015-01-21 Ivan Borozan, Stuart Watt, Vincent Ferretti

Unaligned Sequence Similarity Search Using Deep Learning

Gene annotation has traditionally required direct comparison of DNA sequences between an unknown gene and a database of known ones using string comparison methods. However, these methods do not provide useful information when a gene does…

Machine Learning · Computer Science 2019-09-17 James K. Senter, Taylor M. Royalty, Andrew D. Steen, Amir Sadovnik

Local versus Global Biological Network Alignment

Network alignment (NA) aims to find regions of similarities between molecular networks of different species. There exist two NA categories: local (LNA) or global (GNA). LNA finds small highly conserved network regions and produces a…

Molecular Networks · Quantitative Biology 2015-09-30 Lei Meng, Aaron Striegel, Tijana Milenkovic

A Comparative Analysis of Ensemble Classifiers: Case Studies in Genomics

The combination of multiple classifiers using ensemble methods is increasingly important for making progress in a variety of difficult prediction problems. We present a comparative analysis of several ensemble methods through two case…

Machine Learning · Computer Science 2013-09-20 Sean Whalen, Gaurav Pandey

Alignment-free Genomic Analysis via a Big Data Spark Platform

Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a well established alternative to two and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-26 Umberto Ferraro Petrillo, Francesco Palini, Giuseppe Cattaneo, Raffaele Giancarlo

GRAMEP: an alignment-free method based on the Maximum Entropy Principle for identifying SNPs

Background: Advances in high throughput sequencing technologies provide a huge number of genomes to be analyzed. Thus, computational methods play a crucial role in analyzing and extracting knowledge from the data generated. Investigating…

Genomics · Quantitative Biology 2024-11-22 Matheus Henrique Pimenta-Zanon, André Yoshiaki Kashiwabara, André Luís Laforga Vanzela, Fabricio Martins Lopes

Recent Advances in Network-based Methods for Disease Gene Prediction

Disease-gene association through Genome-wide association study (GWAS) is an arduous task for researchers. Investigating single nucleotide polymorphisms (SNPs) that correlate with specific diseases needs statistical analysis of associations.…

Quantitative Methods · Quantitative Biology 2020-12-21 Sezin Kircali Ata, Min Wu, Yuan Fang, Le Ou-Yang, Chee Keong Kwoh, Xiao-Li Li

Missing Data Prediction and Classification: The Use of Auto-Associative Neural Networks and Optimization Algorithms

This paper presents methods which are aimed at finding approximations to missing data in a dataset by using optimization algorithms to optimize the network parameters after which prediction and classification tasks can be performed. The…

Neural and Evolutionary Computing · Computer Science 2014-03-24 Collins Leke, Bhekisipho Twala, T. Marwala

Multiple network alignment via multiMAGNA++

Network alignment (NA) aims to find a node mapping between molecular networks of different species that identifies topologically or functionally similar network regions. Analogous to genomic sequence alignment, NA can be used to transfer…

Molecular Networks · Quantitative Biology 2016-04-07 Vipin Vijayan, Tijana Milenkovic

Exploring The Potential Of GANs In Biological Sequence Analysis

Biological sequence analysis is an essential step toward building a deeper understanding of the underlying functions, structures, and behaviors of the sequences. It can help in identifying the characteristics of the associated organisms,…

Machine Learning · Computer Science 2023-03-07 Taslim Murad, Sarwan Ali, Murray Patterson

Misclassification in Automated Content Analysis Causes Bias in Regression. Can We Fix It? Yes We Can!

Automated classifiers (ACs), often built via supervised machine learning (SML), can categorize large, statistically powerful samples of data ranging from text to images and video, and have become widely popular measurement devices in…

Machine Learning · Computer Science 2024-12-10 Nathan TeBlunthuis, Valerie Hase, Chung-Hong Chan

Fair Evaluation of Global Network Aligners

Biological network alignment identifies topologically and functionally conserved regions between networks of different species. It encompasses two algorithmic steps: node cost function (NCF), which measures similarities between nodes in…

Molecular Networks · Quantitative Biology 2014-07-21 Joseph Crawford, Yihan Sun, Tijana Milenković

Rank and Align: Towards Effective Source-free Graph Domain Adaptation

Graph neural networks (GNNs) have achieved impressive performance in graph domain adaptation. However, extensive source graphs could be unavailable in real-world scenarios due to privacy and storage concerns. To this end, we investigate an…

Machine Learning · Computer Science 2024-08-23 Junyu Luo, Zhiping Xiao, Yifan Wang, Xiao Luo, Jingyang Yuan, Wei Ju, Langechuan Liu, Ming Zhang

GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly

Repetitive DNA (repeats) poses significant challenges for accurate and efficient genome assembly and sequence alignment. This is particularly true for metagenomic data, where genome dynamics such as horizontal gene transfer, gene…

Machine Learning · Computer Science 2024-02-15 Ali Azizpour, Advait Balaji, Todd J. Treangen, Santiago Segarra

Sequence Alignment Algorithm for Statistical Similarity Assessment

This paper presents a new approach to statistical similarity assessment based on sequence alignment. The algorithm performs mutual matching of two random sequences by successively searching for common elements and by applying sequence…

Signal Processing · Electrical Eng. & Systems 2021-06-09 Jakub Nikonowicz, Łukasz Matuszewski, Paweł Kubczak

Generic Neural Architecture Search via Regression

Most existing neural architecture search (NAS) algorithms are dedicated to and evaluated by the downstream tasks, e.g., image classification in computer vision. However, extensive experiments have shown that, prominent neural architectures,…

Machine Learning · Computer Science 2021-11-18 Yuhong Li, Cong Hao, Pan Li, Jinjun Xiong, Deming Chen

Scalable Graph Neural Networks for Heterogeneous Graphs

Graph neural networks (GNNs) are a popular class of parametric model for learning over graph-structured data. Recent work has argued that GNNs primarily use the graph for feature smoothing, and have shown competitive results on benchmark…

Machine Learning · Computer Science 2020-11-20 Lingfan Yu, Jiajun Shen, Jinyang Li, Adam Lerer

Multiple Genome Analytics Framework: The Case of All SARS-CoV-2 Complete Variants

Pattern detection and string matching are fundamental problems in computer science and the accelerated expansion of bioinformatics and computational biology have made them a core topic for both disciplines. The SARS-CoV-2 pandemic has made…

Genomics · Quantitative Biology 2025-05-14 Konstantinos Xylogiannopoulos

AGS-GNN: Attribute-guided Sampling for Graph Neural Networks

We propose AGS-GNN, a novel attribute-guided sampling algorithm for Graph Neural Networks (GNNs) that exploits node features and connectivity structure of a graph while simultaneously adapting for both homophily and heterophily in graphs.…

Machine Learning · Computer Science 2024-05-27 Siddhartha Shankar Das, S M Ferdous, Mahantesh M Halappanavar, Edoardo Serra, Alex Pothen