Machine Learning for Genomic Data
Genomics
2021-11-17 v1 Machine Learning
Abstract
This report explores the application of machine learning techniques on short timeseries gene expression data. Although standard machine learning algorithms work well on longer time-series', they often fail to find meaningful insights from fewer timepoints. In this report, we explore model-based clustering techniques. We combine popular unsupervised learning techniques like K-Means, Gaussian Mixture Models, Bayesian Networks, Hidden Markov Models with the well-known Expectation Maximization algorithm. K-Means and Gaussian Mixture Models are fairly standard, while Hidden Markov Model and Bayesian Networks clustering are more novel ideas that suit time-series gene expression data.
Cite
@article{arxiv.2111.08507,
title = {Machine Learning for Genomic Data},
author = {Akankshita Dash},
journal= {arXiv preprint arXiv:2111.08507},
year = {2021}
}
Comments
Number of pages: 53