English
Related papers

Related papers: Detecting Superbubbles in Assembly Graphs

200 papers

DNA sequencing is the process of determining the exact order of the nucleotide bases of an individual's genome in order to catalogue sequence variation and understand its biological implications. Whole-genome sequencing techniques produce…

Data Structures and Algorithms · Computer Science 2015-09-18 Ljiljana Brankovic , Costas S. Iliopoulos , Ritu Kundu , Manal Mohamed , Solon P. Pissis , Fatima Vayani

Superbubbles are acyclic induced subgraphs of a digraph with single entrance and exit that naturally arise in the context of genome assembly and the analysis of genome alignments in computational biology. These structures can be computed in…

The first step in any genome assembly algorithm entails the conversion from the domain of strings and overlaps to the language of graphs and paths, typically using one of the two conventional methods: de Bruijn graphs or overlap graphs.…

Genomics · Quantitative Biology 2026-04-27 Anton Bankevich

This paper studies the problem of detecting anomalous graphs using a machine learning model trained on only normal graphs, which has many applications in molecule, biology, and social network data analysis. We present a self-discriminative…

Machine Learning · Computer Science 2023-10-11 Jinyu Cai , Yunhe Zhang , Jicong Fan

Maximal clique enumeration is a fundamental graph mining task, but its utility is often limited by computational intractability and highly redundant output. To address these challenges, we introduce \emph{$\rho$-dense aggregators}, a novel…

Data Structures and Algorithms · Computer Science 2025-12-04 Noga Alon , Sabyasachi Basu , Shweta Jain , Haim Kaplan , Jakub Łącki , Blair D. Sullivan

The problem of finding dense components of a graph is a widely explored area in data analysis, with diverse applications in fields and branches of study including community mining, spam detection, computer security and bioinformatics. This…

Information Retrieval · Computer Science 2021-03-02 B. D. M. De Zoysa , Y. A. M. M. A. Ali , M. D. I. Maduranga , Indika Perera , Saliya Ekanayake , Anil Vullikanti

The maximum clique problem is a well known NP-Hard problem with applications in data mining, network analysis, information retrieval and many other areas related to the World Wide Web. There exist several algorithms for the problem with…

Data Structures and Algorithms · Computer Science 2014-12-01 Bharath Pattabiraman , Md. Mostofa Ali Patwary , Assefaw H. Gebremedhin , Wei-keng Liao , Alok Choudhary

Community detection is a fundamental task in graph analysis, with methods often relying on fitting models like the Stochastic Block Model (SBM) to observed networks. While many algorithms can accurately estimate SBM parameters when the…

Machine Learning · Statistics 2025-06-05 Leonardo Martins Bianco , Christine Keribin , Zacharie Naulet

De novo genome assembly focuses on finding connections between a vast amount of short sequences in order to reconstruct the original genome. The central problem of genome assembly could be described as finding a Hamiltonian path through a…

Machine Learning · Computer Science 2020-11-11 Lovro Vrček , Petar Veličković , Mile Šikić

Assembling genomic sequences from a set of overlapping reads is one of the most fundamental problems in computational biology. Algorithms addressing the assembly problem fall into two broad categories -- based on the data structures which…

Data Structures and Algorithms · Computer Science 2010-03-10 Vamsi Kundeti , Sanguthevar Rajasekaran , Hieu Dinh

Bidirected graphs are a common generalisation of directed graphs where arcs can also be incoming to both their incident nodes, or outgoing from both their incident nodes. Such arcs allow a walk to change direction. Some algorithms can…

Data Structures and Algorithms · Computer Science 2026-05-14 Sebastian Schmidt , Juha Harviainen , Corentin Moumard , Aleksandr Politov , Francisco Sena , Alexandru I. Tomescu

Finding dense substructures in a graph is a fundamental graph mining operation, with applications in bioinformatics, social networks, and visualization to name a few. Yet most standard formulations of this problem (like clique, quasiclique,…

Social and Information Networks · Computer Science 2015-03-10 Ahmet Erdem Sariyuce , C. Seshadhri , Ali Pinar , Umit V. Catalyurek

Genome assembly is a prominent problem studied in bioinformatics, which computes the source string using a set of its overlapping substrings. Classically, genome assembly uses assembly graphs built using this set of substrings to compute…

Data Structures and Algorithms · Computer Science 2024-09-24 Saumya Talera , Parth Bansal , Shabnam Khan , Shahbaz Khan

Molecule generation is central to a variety of applications. Current attention has been paid to approaching the generation task as subgraph prediction and assembling. Nevertheless, these methods usually rely on hand-crafted or external…

Machine Learning · Computer Science 2022-12-20 Xiangzhe Kong , Wenbing Huang , Zhixing Tan , Yang Liu

Graphs are naturally used to describe the structures of various real-world systems in biology, society, computer science etc., where subgraphs or motifs as basic blocks play an important role in function expression and information…

Social and Information Networks · Computer Science 2021-02-11 Jinhuan Wang , Pengtao Chen , Bin Ma , Jiajun Zhou , Zhongyuan Ruan , Guanrong Chen , Qi Xuan

A fundamental algorithmic problem in computational biology is to find all subgraphs of a given type (superbubbles, snarls, and ultrabubbles) in a directed or bidirected input graph. These correspond to regions of genetic variation and are…

To understand the structure of a network, it can be useful to break it down into its constituent pieces. This is the approach taken in a multitude of successful network analysis methods, such as motif analysis. These methods require one to…

Physics and Society · Physics 2023-08-02 Tarmo Nurmi , Mikko Kivelä

In the subgraph counting problem, we are given a input graph $G(V, E)$ and a target graph $H$; the goal is to estimate the number of occurrences of $H$ in $G$. Our focus here is on designing sublinear-time algorithms for approximately…

Data Structures and Algorithms · Computer Science 2018-11-20 Sepehr Assadi , Michael Kapralov , Sanjeev Khanna

Recently we have developed a new method in graph theory based on the Regularity Lemma. The method is applied to find certain spanning subgraphs in dense graphs. The other main general tool of the method, beside the Regularity Lemma, is the…

Combinatorics · Mathematics 2016-09-07 János Komlós , Gabor N. Sarkozy , Endre Szemerédi

We develop random graph models where graphs are generated by connecting not only pairs of vertices by edges but also larger subsets of vertices by copies of small atomic subgraphs of arbitrary topology. This allows the for the generation of…

Statistics Theory · Mathematics 2021-04-21 Anatol E. Wegner , Sofia Olhede
‹ Prev 1 2 3 10 Next ›