Related papers: Identifying statistically significant patterns in …

Triclustering of Gene Expression Microarray Data Using Coarse-Grained Parallel Genetic Algorithm

Microarray data analysis is one of the major area of research in the field computational biology. Numerous techniques like clustering, biclustering are often applied to microarray data to extract meaningful outcomes which play key roles in…

Neural and Evolutionary Computing · Computer Science 2019-09-04 Shubhankar Mohapatra , Moumita Sarkar , Anjali Mohapatra , Bhawani Sankar Biswal

Clustering Gene Expression Time Series with Coregionalization: Speed propagation of ALS

Clustering of gene expression time series gives insight into which genes may be coregulated, allowing us to discern the activity of pathways in a given microarray experiment. Of particular interest is how a given group of genes varies with…

Quantitative Methods · Quantitative Biology 2018-02-13 Muhammad Arifur Rahman , Paul R. Heath , Neil D. Lawrence

Statistical Significance for Hierarchical Clustering

Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other…

Methodology · Statistics 2014-11-20 Patrick K. Kimes , Yufeng Liu , D. Neil Hayes , J. S. Marron

Gene Expression Data Knowledge Discovery using Global and Local Clustering

To understand complex biological systems, the research community has produced huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting…

Computational Engineering, Finance, and Science · Computer Science 2010-03-28 Swathi. H

Identification of relevant subtypes via preweighted sparse clustering

Cluster analysis methods are used to identify homogeneous subgroups in a data set. In biomedical applications, one frequently applies cluster analysis in order to identify biologically interesting subgroups. In particular, one may wish to…

Methodology · Statistics 2016-09-23 Sheila Gaynor , Eric Bair

Coupled Two-Way Clustering Analysis of Gene Microarray Data

We present a novel coupled two-way clustering approach to gene microarray data analysis. The main idea is to identify subsets of the genes and samples, such that when one of these is used to cluster the other, stable and significant…

Biological Physics · Physics 2009-11-06 G. Getz , E. Levine , E. Domany

A comprehensive survey on computational learning methods for analysis of gene expression data

Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous…

Genomics · Quantitative Biology 2022-09-28 Nikita Bhandari , Rahee Walambe , Ketan Kotecha , Satyajeet Khare

Multi-scale analysis and clustering of co-expression networks

The increasing capacity of high-throughput genomic technologies for generating time-course data has stimulated a rich debate on the most appropriate methods to highlight crucial aspects of data structure. In this work, we address the…

Quantitative Methods · Quantitative Biology 2017-12-05 Nuno R. Nené

EXCLUVIS: A MATLAB GUI Software for Comparative Study of Clustering and Visualization of Gene Expression Data

Clustering is a popular data mining technique that aims to partition an input space into multiple homogeneous regions. There exist several clustering algorithms in the literature. The performance of a clustering algorithm depends on its…

Human-Computer Interaction · Computer Science 2020-08-20 Sudip Poddar , Anirban Mukhopadhyay

Unsupervised Gene Expression Data using Enhanced Clustering Method

Microarrays are made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of co-expressed genes and coherent patterns is the central goal in microarray or…

Computational Engineering, Finance, and Science · Computer Science 2013-07-15 T. Chandrasekhar , K. Thangavel , E. Elayaraja , E. N. Sathishkumar

Non-Parametric Cluster Significance Testing with Reference to a Unimodal Null Distribution

Cluster analysis is an unsupervised learning strategy that can be employed to identify subgroups of observations in data sets of unknown structure. This strategy is particularly useful for analyzing high-dimensional data such as microarray…

Methodology · Statistics 2016-10-07 Erika S. Helgeson , Eric Bair

A Hash-based Co-Clustering Algorithm for Categorical Data

Many real-life data are described by categorical attributes without a pre-classification. A common data mining method used to extract information from this type of data is clustering. This method group together the samples from the data…

Machine Learning · Computer Science 2014-07-30 Fabricio Olivetti de França

Diverse correlation structures in gene expression data and their utility in improving statistical inference

It is well known that correlations in microarray data represent a serious nuisance deteriorating the performance of gene selection procedures. This paper is intended to demonstrate that the correlation structure of microarray data provides…

Applications · Statistics 2007-12-18 Lev Klebanov , Andrei Yakovlev

Statistical Significance of Clustering using Soft Thresholding

Clustering methods have led to a number of important discoveries in bioinformatics and beyond. A major challenge in their use is determining which clusters represent important underlying structure, as opposed to spurious sampling artifacts.…

Methodology · Statistics 2021-10-20 Hanwen Huang , Yufeng Liu , Ming Yuan , J. S. Marron

Clustering Hierarchies via a Semi-Parametric Generalized Linear Mixed Model: a statistical significance-based approach

We introduce a novel statistical significance-based approach for clustering hierarchical data using semi-parametric linear mixed-effects models designed for responses with laws in the exponential family (e.g., Poisson and Bernoulli). Within…

Methodology · Statistics 2025-02-04 Alessandra Ragni , Chiara Masci , Francesca Ieva , Anna Maria Paganoni

Clustering and Classification of Genetic Data Through U-Statistics

Genetic data are frequently categorical and have complex dependence structures that are not always well understood. For this reason, clustering and classification based on genetic data, while highly relevant, are challenging statistical…

Methodology · Statistics 2016-06-13 Gabriela Bettella Cybis , Marcio Valk , Silvia Regina Costa Lopes

Analysis of a Gibbs sampler method for model based clustering of gene expression data

Over the last decade, a large variety of clustering algorithms have been developed to detect coregulatory relationships among genes from microarray gene expression data. Model based clustering approaches have emerged as statistically well…

Quantitative Methods · Quantitative Biology 2008-01-15 Anagha Joshi , Yves Van de Peer , Tom Michoel

Performance Analysis of Clustering Algorithms for Gene Expression Data

Microarray technology is a process that allows thousands of genes simultaneously monitor to various experimental conditions. It is used to identify the co-expressed genes in specific cells or tissues that are actively used to make proteins,…

Computational Engineering, Finance, and Science · Computer Science 2013-07-16 T. Chandrasekhar , K. Thangavel , E. Elayaraja

Genetic Programming for Evolving Similarity Functions for Clustering: Representations and Analysis

Clustering is a difficult and widely-studied data mining task, with many varieties of clustering algorithms proposed in the literature. Nearly all algorithms use a similarity measure such as a distance metric (e.g. Euclidean distance) to…

Neural and Evolutionary Computing · Computer Science 2019-10-24 Andrew Lensen , Bing Xue , Mengjie Zhang

Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data

Although there is no shortage of clustering algorithms proposed in the literature, the question of the most relevant strategy for clustering compositional data (i.e., data made up of profiles, whose rows belong to the simplex) remains…

Statistics Theory · Mathematics 2018-05-17 Antoine Godichon-Baggioni , Cathy Maugis-Rabusseau , Andrea Rau