Related papers: Machine Learning for Genomic Data

Model-based clustering with Hidden Markov Model regression for time series with regime changes

This paper introduces a novel model-based clustering approach for clustering time series which present changes in regime. It consists of a mixture of polynomial regressions governed by hidden Markov chains. The underlying hidden process for…

Machine Learning · Statistics 2013-12-30 Faicel Chamroukhi, Allou Samé, Patrice Aknin, Gérard Govaert

Gamma-based clustering via ordered means with application to gene-expression analysis

Discrete mixture models provide a well-known basis for effective clustering algorithms, although technical challenges have limited their scope. In the context of gene-expression data analysis, a model is presented that mixes over a finite…

Methodology · Statistics 2012-11-12 Michael A. Newton, Lisa M. Chung

Effective Clustering Algorithms for Gene Expression Data

Microarrays have made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of co-expressed genes and coherent patterns is the central goal in microarray or…

Computational Engineering, Finance, and Science · Computer Science 2012-01-25 T. Chandrasekhar, K. Thangavel, E. Elayaraja

A Latent Gaussian Mixture Model for Clustering Longitudinal Data

Finite mixture models have become a popular tool for clustering. Amongst other uses, they have been applied for clustering longitudinal data and clustering high-dimensional data. In the latter case, a latent Gaussian mixture model is…

Methodology · Statistics 2018-04-17 Vanessa S. E. Bierling, Paul D. McNicholas

Robust model-based clustering with gene ranking

Cluster analysis of biological samples using gene expression measurements is a common task which aids the discovery of heterogeneous biological sub-populations having distinct mRNA profiles. Several model-based clustering algorithms have…

Methodology · Statistics 2012-01-30 Alberto Cozzini, Ajay Jasra, Giovanni Montana

Quantum Expectation-Maximization Algorithm

Clustering algorithms are a cornerstone of machine learning applications. Recently, a quantum algorithm for clustering based on the k-means algorithm has been proposed by Kerenidis, Landman, Luongo and Prakash. Based on their work, we…

Quantum Physics · Physics 2020-01-23 Hideyuki Miyahara, Kazuyuki Aihara, Wolfgang Lechner

Gene Expression Data Knowledge Discovery using Global and Local Clustering

To understand complex biological systems, the research community has produced a huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting…

Computational Engineering, Finance, and Science · Computer Science 2010-03-28 Swathi. H

Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

Clustering is a widely used technique with a long and rich history in a variety of areas. However, most existing algorithms do not scale well to large datasets, or are missing theoretical guarantees of convergence. This paper introduces a…

Machine Learning · Statistics 2024-10-16 Yijia Zhou, Kyle A. Gallivan, Adrian Barbu

Unsupervised Gene Expression Data using Enhanced Clustering Method

Microarrays have made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of co-expressed genes and coherent patterns is the central goal in microarray or…

Computational Engineering, Finance, and Science · Computer Science 2013-07-15 T. Chandrasekhar, K. Thangavel, E. Elayaraja, E. N. Sathishkumar

Model-based clustering and segmentation of time series with changes in regime

Mixture model-based clustering, usually applied to multidimensional data, has become a popular approach in many data analysis problems, both for its good statistical properties and for the simplicity of implementation of the…

Methodology · Statistics 2013-12-30 Allou Samé, Faicel Chamroukhi, Gérard Govaert, Patrice Aknin

Fuzzy soft rough K-Means clustering approach for gene expression data

Clustering is one of the widely used data mining techniques for medical diagnosis. Clustering can be considered as the most important unsupervised learning technique. Most of the clustering methods group data based on distance and few…

Machine Learning · Computer Science 2012-12-24 K. Dhanalakshmi, H. Hannah Inbarani

Clustering Approaches for Mixed-Type Data: A Comparative Study

Clustering is widely used in unsupervised learning to find homogeneous groups of observations within a dataset. However, clustering mixed-type data remains a challenge, as few existing approaches are suited for this task. This study…

Machine Learning · Statistics 2025-11-26 Badih Ghattas, Alvaro Sanchez San-Benito

K-ARMA Models for Clustering Time Series Data

We present an approach to clustering time series data using a model-based generalization of the K-Means algorithm which we call K-Models. We prove the convergence of this general algorithm and relate it to the hard-EM algorithm for mixture…

Methodology · Statistics 2022-07-04 Derek O. Hoare, David S. Matteson, Martin T. Wells

Data Aggregation for Reducing Training Data in Symbolic Regression

The growing volume of data makes the use of computationally intense machine learning techniques such as symbolic regression with genetic programming more and more impractical. This work discusses methods to reduce the training data and…

Machine Learning · Computer Science 2021-08-25 Lukas Kammerer, Gabriel Kronberger, Michael Kommenda

Model Based Clustering for Mixed Data: clustMD

A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.…

Methodology · Statistics 2015-11-06 Damien McParland, Isobel Claire Gormley

Time Series Clustering with an EM algorithm for Mixtures of Linear Gaussian State Space Models

In this paper, we consider the task of clustering a set of individual time series while modeling each cluster, that is, model-based time series clustering. The task requires a parametric model with sufficient flexibility to describe the…

Machine Learning · Computer Science 2023-02-23 Ryohei Umatani, Takashi Imai, Kaoru Kawamoto, Shutaro Kunimasa

Clustering based on Mixtures of Sparse Gaussian Processes

Creating low dimensional representations of a high dimensional data set is an important component in many machine learning applications. How to cluster data using their low dimensional embedded space is still a challenging problem in…

Machine Learning · Computer Science 2023-03-27 Zahra Moslehi, Abdolreza Mirzaei, Mehran Safayani

A comprehensive survey on computational learning methods for analysis of gene expression data

Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous…

Genomics · Quantitative Biology 2022-09-28 Nikita Bhandari, Rahee Walambe, Ketan Kotecha, Satyajeet Khare

Minimum Message Length Clustering Using Gibbs Sampling

The K-Mean and EM algorithms are popular in clustering and mixture modeling, due to their simplicity and ease of implementation. However, they have several significant limitations. Both converge to a local optimum of their respective…

Machine Learning · Computer Science 2013-01-18 Ian Davidson

A sparse negative binomial mixture model for clustering RNA-seq count data

Clustering with variable selection is a challenging yet critical task for modern small-n-large-p data. Existing methods based on sparse Gaussian mixture models or sparse K-means provide solutions to continuous data. With the prevalence of…

Machine Learning · Statistics 2020-04-28 Tanbin Rahman, Yujia Li, Tianzhou Ma, Lu Tang, George Tseng