English
Related papers

Related papers: A Simple Data-Adaptive Probabilistic Variant Calli…

200 papers

Variant calling, the problem of estimating whether a position in a DNA sequence differs from a reference sequence, given noisy, redundant, overlapping short sequences that cover that position, is fundamental to genomics. We propose a deep…

Genomics · Quantitative Biology 2020-03-17 Nikolai Yakovenko , Avantika Lal , Johnny Israeli , Bryan Catanzaro

Comparing allele frequencies among populations that differ in environment has long been a tool for detecting loci involved in local adaptation. However, such analyses are complicated by an imperfect knowledge of population allele…

Populations and Evolution · Quantitative Biology 2012-09-17 Torsten Günther , Graham Coop

Deep Neural Networks (DNNs) have been shown to be susceptible to memorization or overfitting in the presence of noisily-labelled data. For the problem of robust learning under such noisy data, several algorithms have been proposed. A…

Machine Learning · Computer Science 2022-12-06 Deep Patel , P. S. Sastry

Whole and targeted sequencing of human genomes is a promising, increasingly feasible tool for discovering genetic contributions to risk of complex diseases. A key step is calling an individual's genotype from the multiple aligned short read…

Applications · Statistics 2012-06-29 Baiyu Zhou , Alice S. Whittemore

The detection of rare variants is important for understanding the genetic heterogeneity in mixed samples. Recently, next-generation sequencing (NGS) technologies have enabled the identification of single nucleotide variants (SNVs) in mixed…

Genomics · Quantitative Biology 2016-04-25 Fan Zhang , Patrick Flaherty

Motivation: Most existing methods for DNA sequence analysis rely on accurate sequences or genotypes. However, in applications of the next-generation sequencing (NGS), accurate genotypes may not be easily obtained (e.g. multi-sample…

Genomics · Quantitative Biology 2013-03-19 Heng Li

Kernel methods are typically formulated under the assumption of exact, noise-free access to the Gram matrix. However, in emerging settings such as quantum machine learning, each kernel entry must be inferred from noisy observations, and its…

Machine Learning · Computer Science 2026-05-22 Artur Miroszewski

This paper describes a methodology for detecting anomalies from sequentially observed and potentially noisy data. The proposed approach consists of two main elements: (1) {\em filtering}, or assigning a belief or likelihood to each…

Machine Learning · Computer Science 2016-11-17 Maxim Raginsky , Rebecca Willett , Corinne Horn , Jorge Silva , Roummel Marcia

Deep neural networks (DNN) have been used successfully in many scientific problems for their high prediction accuracy, but their application to genetic studies remains challenging due to their poor interpretability. In this paper, we…

Machine Learning · Computer Science 2021-10-01 Peyman H. Kassani , Fred Lu , Yann Le Guen , Zihuai He

Label noise is ubiquitous in real-world scenarios, posing a practical challenge to supervised models due to its effect in hurting the generalization performance of deep neural networks. Existing methods primarily employ the sample selection…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Mengmeng Sheng , Zeren Sun , Tao Chen , Shuchao Pang , Yucheng Wang , Yazhou Yao

Visual anomaly detection targets to detect images that notably differ from normal pattern, and it has found extensive application in identifying defective parts within the manufacturing industry. These anomaly detection paradigms…

Computer Vision and Pattern Recognition · Computer Science 2024-11-15 Anindya Sundar Das , Guansong Pang , Monowar Bhuyan

Automatic lyrics to polyphonic audio alignment is a challenging task not only because the vocals are corrupted by background music, but also there is a lack of annotated polyphonic corpus for effective acoustic modeling. In this work, we…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-26 Chitralekha Gupta , Emre Yılmaz , Haizhou Li

Second-generation sequencing technologies have replaced array-based technologies and become the default method for genomics and epigenomics analysis. Second-generation sequencing technologies sequence tens of millions of DNA/cDNA fragments…

Methodology · Statistics 2017-02-08 Ping Ma , Nan Zhang , Jianhua Z. Huang , Wenxuan Zhong

When only limited target domain data is available, domain adaptation could be used to promote performance of deep neural network (DNN) acoustic model by leveraging well-trained source model and target domain data. However, suffering from…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-06 Han Zhu , Jiangjiang Zhao , Yuling Ren , Li Wang , Pengyuan Zhang

Motivation: Computational methods are essential to extract actionable information from raw sequencing data, and to thus fulfill the promise of next-generation sequencing technology. Unfortunately, computational tools developed to call…

We exploit the observation that stochastic variational inference (SVI) is a form of annealing and present a modified SVI approach -- applicable to both large and small datasets -- that allows the amount of annealing done by SVI to be tuned.…

Machine Learning · Computer Science 2025-11-17 John Paisley , Ghazal Fazelnia , Brian Barr

We propose a resampling-based fast variable selection technique for detecting relevant single nucleotide polymorphisms (SNP) in a multi-marker mixed effect model. Due to computational complexity, current practice primarily involves testing…

Applications · Statistics 2025-04-30 Subhabrata Majumdar , Saonli Basu , Matt McGue , Snigdhansu Chatterjee

Genomic copy number variation (CNV) is a large source of variation between organisms, and its consequences include phenotypic differences and genetic disorders. CNVs are commonly detected by hybridizing genomic DNA to microarrays of nucleic…

DNA sequencing to identify genetic variants is becoming increasingly valuable in clinical settings. Assessment of variants in such sequencing data is commonly implemented through Bayesian heuristic algorithms. Machine learning has shown…

SGD does not produce robust results on datasets with label noise. Because the gradients calculated according to the losses of the noisy samples cause the optimization process to go in the wrong direction. In this paper, as an alternative to…

Machine Learning · Computer Science 2022-03-29 Enes Dedeoglu , Himmet Toprak Kesgin , Mehmet Fatih Amasyali
‹ Prev 1 2 3 10 Next ›