Related papers: Detecting Superbubbles in Assembly Graphs

Linear-Time Superbubble Identification Algorithm for Genome Assembly

DNA sequencing is the process of determining the exact order of the nucleotide bases of an individual's genome in order to catalogue sequence variation and understand its biological implications. Whole-genome sequencing techniques produce…

Data Structures and Algorithms · Computer Science 2015-09-18 Ljiljana Brankovic , Costas S. Iliopoulos , Ritu Kundu , Manal Mohamed , Solon P. Pissis , Fatima Vayani

Superbubbles as an Empirical Characteristic of Directed Networks

Superbubbles are acyclic induced subgraphs of a digraph with single entrance and exit that naturally arise in the context of genome assembly and the analysis of genome alignments in computational biology. These structures can be computed in…

Social and Information Networks · Computer Science 2021-02-10 Fabian Gärtner , Felix Kühnl , Carsten R. Seemann , The Students of the Graphs , Networks Computer Lab 2018/19 , Christian Höner zu Siederdissen , Peter F. Stadler

Supregraph: Enabling Information-Optimal Assembly Graph Representation of a Read Set

The first step in any genome assembly algorithm entails the conversion from the domain of strings and overlaps to the language of graphs and paths, typically using one of the two conventional methods: de Bruijn graphs or overlap graphs.…

Genomics · Quantitative Biology 2026-04-27 Anton Bankevich

Self-Discriminative Modeling for Anomalous Graph Detection

This paper studies the problem of detecting anomalous graphs using a machine learning model trained on only normal graphs, which has many applications in molecule, biology, and social network data analysis. We present a self-discriminative…

Machine Learning · Computer Science 2023-10-11 Jinyu Cai , Yunhe Zhang , Jicong Fan

Aggregating maximal cliques in real-world graphs

Maximal clique enumeration is a fundamental graph mining task, but its utility is often limited by computational intractability and highly redundant output. To address these challenges, we introduce \emph{$\rho$-dense aggregators}, a novel…

Data Structures and Algorithms · Computer Science 2025-12-04 Noga Alon , Sabyasachi Basu , Shweta Jain , Haim Kaplan , Jakub Łącki , Blair D. Sullivan

Parallel Algorithms for Densest Subgraph Discovery Using Shared Memory Model

The problem of finding dense components of a graph is a widely explored area in data analysis, with diverse applications in fields and branches of study including community mining, spam detection, computer security and bioinformatics. This…

Information Retrieval · Computer Science 2021-03-02 B. D. M. De Zoysa , Y. A. M. M. A. Ali , M. D. I. Maduranga , Indika Perera , Saliya Ekanayake , Anil Vullikanti

Fast Algorithms for the Maximum Clique Problem on Massive Graphs with Applications to Overlapping Community Detection

The maximum clique problem is a well known NP-Hard problem with applications in data mining, network analysis, information retrieval and many other areas related to the World Wide Web. There exist several algorithms for the problem with…

Data Structures and Algorithms · Computer Science 2014-12-01 Bharath Pattabiraman , Md. Mostofa Ali Patwary , Assefaw H. Gebremedhin , Wei-keng Liao , Alok Choudhary

SubSearch: Robust Estimation and Outlier Detection for Stochastic Block Models via Subgraph Search

Community detection is a fundamental task in graph analysis, with methods often relying on fitting models like the Stochastic Block Model (SBM) to observed networks. While many algorithms can accurately estimate SBM parameters when the…

Machine Learning · Statistics 2025-06-05 Leonardo Martins Bianco , Christine Keribin , Zacharie Naulet

A step towards neural genome assembly

De novo genome assembly focuses on finding connections between a vast amount of short sequences in order to reconstruct the original genome. The central problem of genome assembly could be described as finding a Hamiltonian path through a…

Machine Learning · Computer Science 2020-11-11 Lovro Vrček , Petar Veličković , Mile Šikić

Efficient Parallel and Out of Core Algorithms for Constructing Large Bi-directed de Bruijn Graphs

Assembling genomic sequences from a set of overlapping reads is one of the most fundamental problems in computational biology. Algorithms addressing the assembly problem fall into two broad categories -- based on the data structures which…

Data Structures and Algorithms · Computer Science 2010-03-10 Vamsi Kundeti , Sanguthevar Rajasekaran , Hieu Dinh

The Power of Graph Doubling: Computing Ultrabubbles in a Bidirected Graph by Reducing to Weak Superbubbles

Bidirected graphs are a common generalisation of directed graphs where arcs can also be incoming to both their incident nodes, or outgoing from both their incident nodes. Such arcs allow a walk to change direction. Some algorithms can…

Data Structures and Algorithms · Computer Science 2026-05-14 Sebastian Schmidt , Juha Harviainen , Corentin Moumard , Aleksandr Politov , Francisco Sena , Alexandru I. Tomescu

Finding the Hierarchy of Dense Subgraphs using Nucleus Decompositions

Finding dense substructures in a graph is a fundamental graph mining operation, with applications in bioinformatics, social networks, and visualization to name a few. Yet most standard formulations of this problem (like clique, quasiclique,…

Social and Information Networks · Computer Science 2015-03-10 Ahmet Erdem Sariyuce , C. Seshadhri , Ali Pinar , Umit V. Catalyurek

Practical algorithms for Hierarchical overlap graphs

Genome assembly is a prominent problem studied in bioinformatics, which computes the source string using a set of its overlapping substrings. Classically, genome assembly uses assembly graphs built using this set of substrings to compute…

Data Structures and Algorithms · Computer Science 2024-09-24 Saumya Talera , Parth Bansal , Shabnam Khan , Shahbaz Khan

Molecule Generation by Principal Subgraph Mining and Assembling

Molecule generation is central to a variety of applications. Current attention has been paid to approaching the generation task as subgraph prediction and assembling. Nevertheless, these methods usually rely on hand-crafted or external…

Machine Learning · Computer Science 2022-12-20 Xiangzhe Kong , Wenbing Huang , Zhixing Tan , Yang Liu

Sampling Subgraph Network with Application to Graph Classification

Graphs are naturally used to describe the structures of various real-world systems in biology, society, computer science etc., where subgraphs or motifs as basic blocks play an important role in function expression and information…

Social and Information Networks · Computer Science 2021-02-11 Jinhuan Wang , Pengtao Chen , Bin Ma , Jiajun Zhou , Zhongyuan Ruan , Guanrong Chen , Qi Xuan

Identifying bubble-like subgraphs in linear-time via a unified SPQR-tree framework

A fundamental algorithmic problem in computational biology is to find all subgraphs of a given type (superbubbles, snarls, and ultrabubbles) in a directed or bidirected input graph. These correspond to regions of genetic variation and are…

Data Structures and Algorithms · Computer Science 2026-04-10 Francisco Sena , Aleksandr Politov , Corentin Moumard , Massimo Cairo , Romeo Rizzi , Manuel Cáceres , Sebastian Schmidt , Juha Harviainen , Alexandru I. Tomescu

Subnetwork enumeration algorithms for multilayer networks

To understand the structure of a network, it can be useful to break it down into its constituent pieces. This is the approach taken in a multitude of successful network analysis methods, such as motif analysis. These methods require one to…

Physics and Society · Physics 2023-08-02 Tarmo Nurmi , Mikko Kivelä

A Simple Sublinear-Time Algorithm for Counting Arbitrary Subgraphs via Edge Sampling

In the subgraph counting problem, we are given a input graph $G(V, E)$ and a target graph $H$; the goal is to estimate the number of occurrences of $H$ in $G$. Our focus here is on designing sublinear-time algorithms for approximately…

Data Structures and Algorithms · Computer Science 2018-11-20 Sepehr Assadi , Michael Kapralov , Sanjeev Khanna

An Algorithmic Version of the Blow-up Lemma

Recently we have developed a new method in graph theory based on the Regularity Lemma. The method is applied to find certain spanning subgraphs in dense graphs. The other main general tool of the method, beside the Regularity Lemma, is the…

Combinatorics · Mathematics 2016-09-07 János Komlós , Gabor N. Sarkozy , Endre Szemerédi

Atomic subgraphs and the statistical mechanics of networks

We develop random graph models where graphs are generated by connecting not only pairs of vertices by edges but also larger subsets of vertices by copies of small atomic subgraphs of arbitrary topology. This allows the for the generation of…

Statistics Theory · Mathematics 2021-04-21 Anatol E. Wegner , Sofia Olhede