Related papers: Biological sequence analysis

Multidimensional Stochastic Process Model and its Applications to Analysis of Longitudinal Data with Genetic Information

The Stochastic Process Model has many applications in the analysis of longitudinal biodemographic data. Such data contain various physiological variables (sometimes known as covariates). They can also potentially contain genetic information available…

Populations and Evolution · Quantitative Biology 2016-05-31 Ilya Zhbannikov, Konstantin Arbeev, Anatoliy Yashin

Statistical Model Checking for Biological Applications

In this paper we survey recent work on the use of statistical model checking techniques for biological applications. We begin with an overview of the basic modelling techniques for biochemical reactions and their corresponding stochastic…

Logic in Computer Science · Computer Science 2014-11-04 Paolo Zuliani

Getting started in probabilistic graphical models

Probabilistic graphical models (PGMs) have become a popular tool for computational analysis of biological data in a variety of domains. But, what exactly are they and how do they work? How can we use PGMs to discover patterns that are…

Quantitative Methods · Quantitative Biology 2010-02-22 Edoardo M Airoldi

Mutational paths with sequence-based models of proteins: from sampling to mean-field characterisation

Identifying and characterizing mutational paths is an important issue in evolutionary biology and in bioengineering. We here introduce a generic description of mutational paths in terms of the goodness of sequences and of the mutational…

Biomolecules · Quantitative Biology 2023-03-29 Eugenio Mauri, Simona Cocco, Rémi Monasson

BSM: Small but Powerful Biological Sequence Model for Genes and Proteins

Modeling biological sequences such as DNA, RNA, and proteins is crucial for understanding complex processes like gene regulation and protein synthesis. However, most current models either focus on a single type or treat multiple types of…

Genomics · Quantitative Biology 2024-10-16 Weixi Xiang, Xueting Han, Xiujuan Chai, Jing Bai

Spectral Sequence Motif Discovery

Sequence discovery tools play a central role in several fields of computational biology. In the framework of Transcription Factor binding studies, motif finding algorithms of increasingly high performance are required to process the big…

Quantitative Methods · Quantitative Biology 2014-08-27 Nicolò Colombo, Nikos Vlassis

Finding Sequence Features in Tissue-specific Sequences

The discovery of motifs underlying gene expression is a challenging one. Some of these motifs are known transcription factors, but sequence inspection often provides valuable clues, even discovery of novel motifs with uncharacterized…

Genomics · Quantitative Biology 2007-05-23 Arvind Rao, Alfred O. Hero, David J. States, James Douglas Engel

100 years after Smoluchowski: stochastic processes in cell biology

100 years after Smoluchowski introduced his approach to stochastic processes, they now form the basis of mathematical and physical modeling in cellular biology: they are used, for example, to analyse and extract features from large…

Data Analysis, Statistics and Probability · Physics 2017-03-08 David Holcman, Zeev Schuss

Sequential Bayesian Learning for Hidden Semi-Markov Models

In this paper, we explore the class of the Hidden Semi-Markov Model (HSMM), a flexible extension of the popular Hidden Markov Model (HMM) that allows the underlying stochastic process to be a semi-Markov chain. HSMMs are typically used less…

Applications · Statistics 2023-01-26 Patrick Aschermayr, Konstantinos Kalogeropoulos

Biological Sequence with Language Model Prompting: A Survey

Large Language models (LLMs) have emerged as powerful tools for addressing challenges across diverse domains. Notably, recent studies have demonstrated that large language models significantly enhance the efficiency of biomolecular analysis…

Computation and Language · Computer Science 2025-03-07 Jiyue Jiang, Zikang Wang, Yuheng Shan, Heyan Chai, Jiayi Li, Zixian Ma, Xinrui Zhang, Yu Li

Feature extraction in protein sequences classification: a new stability measure

Feature extraction is an unavoidable task, especially in the critical step of preprocessing biological sequences. This step consists for example in transforming the biological sequences into vectors of motifs where each motif is a…

Machine Learning · Computer Science 2016-08-24 Rabie Saidi, Sabeur Aridhi, Mondher Maddouri, Engelbert Mephu Nguifo

Bayesian analysis of biological networks: clusters, motifs, cross-species correlations

An important part of the analysis of bio-molecular networks is to detect different functional units. Different functions are reflected in a different evolutionary dynamics, and hence in different statistical characteristics of network…

Molecular Networks · Quantitative Biology 2007-05-23 Johannes Berg, Michael Lässig

The theoretical analysis of sequencing bioinformatics algorithms and beyond

The theoretical analysis of performance has been an important tool in the engineering of algorithms in many application domains. Its goals are to predict the empirical performance of an algorithm and to be a yardstick that drives the design…

Data Structures and Algorithms · Computer Science 2022-11-15 Paul Medvedev

On the uses and abuses of regression models: a call for reform of statistical practice and teaching

Regression methods dominate the practice of biostatistical analysis, but biostatistical training emphasises the details of regression models and methods ahead of the purposes for which such modelling might be useful. More broadly,…

Methodology · Statistics 2024-09-12 John B. Carlin, Margarita Moreno-Betancur

Stochastic ordering tools for continuous-time Markov chains and applications to reaction network models

Stochastic reaction networks are mathematical models with a wide range of applications in biochemistry, ecology, and epidemiology, and are often complex to analyze. Except for some special cases, it is generally difficult to predict how the…

Probability · Mathematics 2026-04-02 Daniele Cappelletti, Giulio Cuniberti, Paola Siri

A Deep Learning Approach to Analyzing Continuous-Time Systems

Scientists often use observational time series data to study complex natural processes, but regression analyses often assume simplistic dynamics. Recent advances in deep learning have yielded startling improvements to the performance of…

Machine Learning · Computer Science 2023-04-21 Cory Shain, William Schuler

Sifting through the Noise: A Survey of Diffusion Probabilistic Models and Their Applications to Biomolecules

Diffusion probabilistic models have made their way into a number of high-profile applications since their inception. In particular, there has been a wave of research into using diffusion models in the prediction and design of biomolecular…

Biomolecules · Quantitative Biology 2024-06-05 Trevor Norton, Debswapna Bhattacharya

Biological Sequence Clustering: A Survey

The rapid development of high-throughput sequencing technologies has led to an explosive increase in biological sequence data, making sequence clustering a fundamental task in large-scale bioinformatics analyses. Unlike traditional…

Genomics · Quantitative Biology 2026-01-22 Simeng Zhang, Xinying Liu, Jun Lou, Mudi Jiang, Quan Zou, Zengyou He

Bayesian History Reconstruction of Complex Human Gene Clusters on a Phylogeny

Clusters of genes that have evolved by repeated segmental duplication present difficult challenges throughout genomic analysis, from sequence assembly to functional analysis. Improved understanding of these clusters is of utmost importance,…

Machine Learning · Computer Science 2010-01-25 Tomáš Vinař, Broňa Brejová, Giltae Song, Adam Siepel

Stochastic modeling of auto-regulatory genetic feedback loops: a review and comparative study

Auto-regulatory feedback loops are one of the most common network motifs. A wide variety of stochastic models have been constructed to understand how the fluctuations in protein numbers in these loops are influenced by the kinetic…

Subcellular Processes · Quantitative Biology 2020-04-22 James Holehouse, Zhixing Cao, Ramon Grima