English
Related papers

Related papers: Machine Learning for Genomic Data

200 papers

This paper introduces a novel model-based clustering approach for clustering time series which present changes in regime. It consists of a mixture of polynomial regressions governed by hidden Markov chains. The underlying hidden process for…

Machine Learning · Statistics 2013-12-30 Faicel Chamroukhi , Allou Samé , Patrice Aknin , Gérard Govaert

Discrete mixture models provide a well-known basis for effective clustering algorithms, although technical challenges have limited their scope. In the context of gene-expression data analysis, a model is presented that mixes over a finite…

Methodology · Statistics 2012-11-12 Michael A. Newton , Lisa M. Chung

Microarrays are made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of co-expressed genes and coherent patterns is the central goal in microarray or…

Computational Engineering, Finance, and Science · Computer Science 2012-01-25 T. Chandrasekhar , K. Thangavel , E. Elayaraja

Finite mixture models have become a popular tool for clustering. Amongst other uses, they have been applied for clustering longitudinal data and clustering high-dimensional data. In the latter case, a latent Gaussian mixture model is…

Methodology · Statistics 2018-04-17 Vanessa S. E. Bierling , Paul D. McNicholas

Cluster analysis of biological samples using gene expression measurements is a common task which aids the discovery of heterogeneous biological sub-populations having distinct mRNA profiles. Several model-based clustering algorithms have…

Methodology · Statistics 2012-01-30 Alberto Cozzini , Ajay Jasra , Giovanni Montana

Clustering algorithms are a cornerstone of machine learning applications. Recently, a quantum algorithm for clustering based on the k-means algorithm has been proposed by Kerenidis, Landman, Luongo and Prakash. Based on their work, we…

Quantum Physics · Physics 2020-01-23 Hideyuki Miyahara , Kazuyuki Aihara , Wolfgang Lechner

To understand complex biological systems, the research community has produced huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting…

Computational Engineering, Finance, and Science · Computer Science 2010-03-28 Swathi. H

Clustering is a widely used technique with a long and rich history in a variety of areas. However, most existing algorithms do not scale well to large datasets, or are missing theoretical guarantees of convergence. This paper introduces a…

Machine Learning · Statistics 2024-10-16 Yijia Zhou , Kyle A. Gallivan , Adrian Barbu

Microarrays are made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of co-expressed genes and coherent patterns is the central goal in microarray or…

Computational Engineering, Finance, and Science · Computer Science 2013-07-15 T. Chandrasekhar , K. Thangavel , E. Elayaraja , E. N. Sathishkumar

Mixture model-based clustering, usually applied to multidimensional data, has become a popular approach in many data analysis problems, both for its good statistical properties and for the simplicity of implementation of the…

Methodology · Statistics 2013-12-30 Allou Samé , Faicel Chamroukhi , Gérard Govaert , Patrice Aknin

Clustering is one of the widely used data mining techniques for medical diagnosis. Clustering can be considered as the most important unsupervised learning technique. Most of the clustering methods group data based on distance and few…

Machine Learning · Computer Science 2012-12-24 K. Dhanalakshmi , H. Hannah Inbarani

Clustering is widely used in unsupervised learning to find homogeneous groups of observations within a dataset. However, clustering mixed-type data remains a challenge, as few existing approaches are suited for this task. This study…

Machine Learning · Statistics 2025-11-26 Badih Ghattas , Alvaro Sanchez San-Benito

We present an approach to clustering time series data using a model-based generalization of the K-Means algorithm which we call K-Models. We prove the convergence of this general algorithm and relate it to the hard-EM algorithm for mixture…

Methodology · Statistics 2022-07-04 Derek O. Hoare , David S. Matteson , Martin T. Wells

The growing volume of data makes the use of computationally intense machine learning techniques such as symbolic regression with genetic programming more and more impractical. This work discusses methods to reduce the training data and…

Machine Learning · Computer Science 2021-08-25 Lukas Kammerer , Gabriel Kronberger , Michael Kommenda

A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.…

Methodology · Statistics 2015-11-06 Damien McParland , Isobel Claire Gormley

In this paper, we consider the task of clustering a set of individual time series while modeling each cluster, that is, model-based time series clustering. The task requires a parametric model with sufficient flexibility to describe the…

Machine Learning · Computer Science 2023-02-23 Ryohei Umatani , Takashi Imai , Kaoru Kawamoto , Shutaro Kunimasa

Creating low dimensional representations of a high dimensional data set is an important component in many machine learning applications. How to cluster data using their low dimensional embedded space is still a challenging problem in…

Machine Learning · Computer Science 2023-03-27 Zahra Moslehi , Abdolreza Mirzaei , Mehran Safayani

Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous…

Genomics · Quantitative Biology 2022-09-28 Nikita Bhandari , Rahee Walambe , Ketan Kotecha , Satyajeet Khare

The K-Mean and EM algorithms are popular in clustering and mixture modeling, due to their simplicity and ease of implementation. However, they have several significant limitations. Both coverage to a local optimum of their respective…

Machine Learning · Computer Science 2013-01-18 Ian Davidson

Clustering with variable selection is a challenging yet critical task for modern small-n-large-p data. Existing methods based on sparse Gaussian mixture models or sparse K-means provide solutions to continuous data. With the prevalence of…

Machine Learning · Statistics 2020-04-28 Tanbin Rahman , Yujia Li , Tianzhou Ma , Lu Tang , George Tseng
‹ Prev 1 2 3 10 Next ›