Related papers: BOAssembler: a Bayesian Optimization Framework to …

SOAPdenovo-Trans: De novo transcriptome assembly with short RNA-Seq reads

Motivation: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining the sequences for a large number of genes from an organism with no reference genome. With the rapidly increasing throughputs and…

Genomics · Quantitative Biology 2013-08-12 Yinlong Xie , Gengxiong Wu , Jingbo Tang , Ruibang Luo , Jordan Patterson , Shanlin Liu , Weihua Huang , Guangzhu He , Shengchang Gu , Shengkang Li , Xin Zhou , Tak-Wah Lam , Yingrui Li , Xun Xu , Gane Ka-Shu Wong , Jun Wang

Augmenting transcriptome assembly combinatorially

RNA-seq allows detection and precise quantification of transcripts, provides comprehensive understanding of exon/intron boundaries, aids discovery of alternatively spliced isoforms and fusion transcripts along with measurement of…

Genomics · Quantitative Biology 2013-06-03 Prachi Jain , Neeraja M. Krishnan , Binay Panda

Pangenome-guided sequence assembly via binary optimisation

De novo genome assembly is challenging in highly repetitive regions; however, reference-guided assemblers often suffer from bias. We propose a framework for pangenome-guided sequence assembly, which can resolve short-read data in complex…

Quantum Physics · Physics 2026-02-11 Josh Cudby , James Bonfield , Chenxi Zhou , Richard Durbin , Sergii Strelchuk

Fast Approximate Inference of Transcript Expression Levels from RNA-seq Data

Motivation: The mapping of RNA-seq reads to their transcripts of origin is a fundamental task in transcript expression estimation and differential expression scoring. Where ambiguities in mapping exist due to transcripts sharing sequence,…

Genomics · Quantitative Biology 2015-01-28 James Hensman , Peter Glaus , Antti Honkela , Magnus Rattray

Improving transcriptome assembly through error correction of high-throughput sequence reads

The study of functional genomics--particularly in non-model organisms has been dramatically improved over the last few years by use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally…

Genomics · Quantitative Biology 2013-07-25 Matthew D MacManes , Michael B Eisen

Sequential Sampling for Optimal Bayesian Classification of Sequencing Count Data

High throughput technologies have become the practice of choice for comparative studies in biomedical applications. Limited number of sample points due to sequencing cost or access to organisms of interest necessitates the development of…

Methodology · Statistics 2018-07-17 Ariana Broumand , Siamak Zamani Dadaneh

Fast and accurate approximate inference of transcript expression from RNA-seq data

Motivation: Assigning RNA-seq reads to their transcript of origin is a fundamental task in transcript expression estimation. Where ambiguities in assignments exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles,…

Quantitative Methods · Quantitative Biology 2015-07-01 James Hensman , Panagiotis Papastamoulis , Peter Glaus , Antti Honkela , Magnus Rattray

Baa.pl: A tool to evaluate de novo genome assemblies with RNA transcripts

Assessing the correctness of genome assemblies is an important step in any genome project. Several methods exist, but most are computationally intensive and, in some cases, inappropriate. Here I present baa.pl, a fast and easy-to-use…

Genomics · Quantitative Biology 2014-02-10 Joseph F. Ryan

Hyperparameter Transfer Learning with Adaptive Complexity

Bayesian optimization (BO) is a sample efficient approach to automatically tune the hyperparameters of machine learning models. In practice, one frequently has to solve similar hyperparameter tuning problems sequentially. For example, one…

Machine Learning · Computer Science 2021-02-26 Samuel Horváth , Aaron Klein , Peter Richtárik , Cédric Archambeau

RNA-seq data science: From raw data to effective interpretation

RNA-sequencing (RNA-seq) has become an exemplar technology in modern biology and clinical applications over the past decade. It has gained immense popularity in the recent years driven by continuous efforts of the bioinformatics community…

Genomics · Quantitative Biology 2021-02-17 Dhrithi Deshpande , Karishma Chhugani , Yutong Chang , Aaron Karlsberg , Caitlin Loeffler , Jinyang Zhang , Agata Muszynska , Jeremy Rotman , Laura Tao , Brunilda Balliu , Elizabeth Tseng , Eleazar Eskin , Fangqing Zhao , Pejman Mohammadi , Pawel P Labaj , Serghei Mangul

Assembly of repetitive regions using next-generation sequencing data

High read depth can be used to assemble short sequence repeats. The existing genome assemblers fail in repetitive regions of longer than average read. I propose a new algorithm for a DNA assembly which uses the relative frequency of reads…

Genomics · Quantitative Biology 2015-01-08 Robert M. Nowak

Statistical Modeling of RNA-Seq Data

Recently, ultra high-throughput sequencing of RNA (RNA-Seq) has been developed as an approach for analysis of gene expression. By obtaining tens or even hundreds of millions of reads of transcribed sequences, an RNA-Seq experiment can offer…

Methodology · Statistics 2011-06-17 Julia Salzman , Hui Jiang , Wing Hung Wong

SPATA: A Seeding and Patching Algorithm for Hybrid Transcriptome Assembly

Transcriptome assembly from RNA-Seq reads is an active area of bioinformatics research. The ever-declining cost and the increasing depth of RNA-Seq have provided unprecedented opportunities to better identify expressed transcripts. However,…

Computational Engineering, Finance, and Science · Computer Science 2013-06-07 Tin Chi Nguyen , Zhiyu Zhao , Dongxiao Zhu

A Bayesian model selection approach for identifying differentially expressed transcripts from RNA-Seq data

Recent advances in molecular biology allow the quantification of the transcriptome and scoring transcripts as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available…

Methodology · Statistics 2017-02-08 Panagiotis Papastamoulis , Magnus Rattray

Detecting and Correcting Sample-by-Sample Scale Distortion in RNA Sequencing Data

RNA sequencing (RNA-seq) is the conventional genome-scale approach used to capture the expression levels of all detectable genes in a biological sample. This is now regularly used for population-based studies designed to identify genetic…

Genomics · Quantitative Biology 2026-05-25 Christopher Thron , Farhad Jafari

Bayesian Estimation of Negative Binomial Parameters with Applications to RNA-Seq Data

RNA-Seq data characteristically exhibits large variances, which need to be appropriately accounted for in the model. We first explore the effects of this variability on the maximum likelihood estimator (MLE) of the overdispersion parameter…

Methodology · Statistics 2015-12-03 Luis Leon-Novelo , Claudio Fuentes , Sarah Emerson

Bayesian ensemble refinement by replica simulations and reweighting

We describe different Bayesian ensemble refinement methods, examine their interrelation, and discuss their practical application. With ensemble refinement, the properties of dynamic and partially disordered (bio)molecular structures can be…

Data Analysis, Statistics and Probability · Physics 2016-01-20 Gerhard Hummer , Jürgen Köfinger

Compression of structured high-throughput sequencing data

Large biological datasets are being produced at a rapid pace and create substantial storage challenges, particularly in the domain of high-throughput sequencing (HTS). Most approaches currently used to store HTS data are either unable to…

Quantitative Methods · Quantitative Biology 2014-03-05 Fabien Campagne , Kevin C. Dorff , Nyasha Chambwe , James T. Robinson , Jill P. Mesirov , Thomas D. Wu

BayesAdapter: Being Bayesian, Inexpensively and Reliably, via Bayesian Fine-tuning

Despite their theoretical appealingness, Bayesian neural networks (BNNs) are left behind in real-world adoption, mainly due to persistent concerns on their scalability, accessibility, and reliability. In this work, we develop the…

Machine Learning · Computer Science 2022-10-14 Zhijie Deng , Jun Zhu

An optimized protocol for single cell transcriptional profiling by combinatorial indexing

Single cell combinatorial indexing RNA sequencing (sci-RNA-seq) is a powerful method for recovering gene expression data from an exponentially scalable number of individual cells or nuclei. However, sci-RNA-seq is a complex protocol that…

Genomics · Quantitative Biology 2022-01-07 Beth K. Martin , Chengxiang Qiu , Eva Nichols , Melissa Phung , Rula Green-Gladden , Sanjay Srivatsan , Ronnie Blecher-Gonen , Brian J. Beliveau , Cole Trapnell , Junyue Cao , Jay Shendure