Related papers: A Co-analysis Framework for Exploring Multivariate…

Co-clustering based exploratory analysis of mixed-type data tables

Co-clustering is a class of unsupervised data analysis techniques that extract the existing underlying dependency structure between the instances and variables of a data table as homogeneous blocks. Most of those techniques are limited to…

Machine Learning · Computer Science 2022-12-23 Aichetou Bouchareb, Marc Boullé, Fabrice Clérot, Fabrice Rossi

voxel2vec: A Natural Language Processing Approach to Learning Distributed Representations for Scientific Data

Relationships in scientific data, such as the numerical and spatial distribution relations of features in univariate data, the scalar-value combinations' relations in multivariate data, and the association of volumes in time-varying and…

Machine Learning · Computer Science 2022-07-25 Xiangyang He, Yubo Tao, Shuoliu Yang, Haoran Dai, Hai Lin

A Family of Mixture Models for Biclustering

Biclustering is used for simultaneous clustering of the observations and variables when there is no group structure known a priori. It is being increasingly used in bioinformatics, text analytics, etc. Previously, biclustering has…

Methodology · Statistics 2020-09-14 Wangshu Tu, Sanjeena Subedi

Contributions to Biclustering of Microarray Data Using Formal Concept Analysis

Biclustering is an unsupervised data mining technique that aims to unveil patterns (biclusters) from gene expression data matrices. In the framework of this thesis, we propose new biclustering algorithms for microarray data. The latter is…

Machine Learning · Computer Science 2018-11-26 Amina Houari

Mining Biclusters of Similar Values with Triadic Concept Analysis

Biclustering numerical data became a popular data-mining task in the beginning of 2000's, especially for analysing gene expression data. A bicluster reflects a strong association between a subset of objects and a subset of attributes in a…

Data Structures and Algorithms · Computer Science 2011-11-15 Mehdi Kaytoue, Sergei O. Kuznetsov, Juraj Macko, Wagner Meira, Amedeo Napoli

The bixplot: A variation on the boxplot suited for bimodal data

Boxplots and related visualization methods are widely used exploratory tools for taking a first look at collections of univariate variables. In this note an extension is provided that is specifically designed to detect and display…

Methodology · Statistics 2026-05-05 Camille M. Montalcini, Peter J. Rousseeuw

Estimation of Gaussian Bi-Clusters with General Block-Diagonal Covariance Matrix and Applications

Bi-clustering is a technique that allows for the simultaneous clustering of observations and features in a dataset. This technique is often used in bioinformatics, text mining, and time series analysis. An important advantage of…

Computation · Statistics 2023-02-09 Anastasiia Livochka, Ryan Browne, Sanjeena Subedi

A method for visual identification of small sample subgroups and potential biomarkers

In order to find previously unknown subgroups in biomedical data and generate testable hypotheses, visually guided exploratory analysis can be of tremendous importance. In this paper we propose a new dissimilarity measure that can be used…

Applications · Statistics 2011-12-01 Charlotte Soneson, Magnus Fontes

Co-clustering of time-dependent data via Shape Invariant Model

Multivariate time-dependent data, where multiple features are observed over time for a set of individuals, are increasingly widespread in many application domains. To model these data we need to account for relations among both time…

Methodology · Statistics 2021-04-08 Alessandro Casa, Charles Bouveyron, Elena Erosheva, Giovanna Menardi

BiFold visualization of bipartite datasets

The emerging domain of data-enabled science necessitates development of algorithms and tools for knowledge discovery. Human interaction with data through well-constructed graphical representation can take special advantage of our visual…

Social and Information Networks · Computer Science 2017-05-30 Yazhen Jiang, Joseph Skufca, Jie Sun

Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions

We propose a novel method for multiple clustering that assumes a co-clustering structure (partitions in both rows and columns of the data matrix) in each view. The new method is applicable to high-dimensional data. It is based on a…

Machine Learning · Statistics 2019-07-03 Tomoki Tokuda, Junichiro Yoshimoto, Yu Shimizu, Shigeru Toki, Go Okada, Masahiro Takamura, Tetsuya Yamamoto, Shinpei Yoshimura, Yasumasa Okamoto, Shigeto Yamawaki, Kenji Doya

Model Based Co-clustering of Mixed Numerical and Binary Data

Co-clustering is a data mining technique used to extract the underlying block structure between the rows and columns of a data matrix. Many approaches have been studied and have shown their capacity to extract such structures in continuous,…

Machine Learning · Computer Science 2022-12-23 Aichetou Bouchareb, Marc Boullé, Fabrice Clérot, Fabrice Rossi

Visualizing class specific heterogeneous tendencies in categorical data

In multiple correspondence analysis, both individuals (observations) and categories can be represented in a biplot that jointly depicts the relationships across categories or individuals, as well as the associations between them. Additional…

Methodology · Statistics 2019-01-10 Mariko Takagishi, Michel van de Velden

Biclustering Via Sparse Clustering

In many situations it is desirable to identify clusters that differ with respect to only a subset of features. Such clusters may represent homogeneous subgroups of patients with a disease, such as cancer or chronic pain. We define a…

Methodology · Statistics 2014-07-14 Qian Liu, Guanhua Chen, Michael R. Kosorok, Eric Bair

Convex Biclustering

In the biclustering problem, we seek to simultaneously group observations and features. While biclustering has applications in a wide array of domains, ranging from text mining to collaborative filtering, the problem of identifying…

Methodology · Statistics 2018-06-07 Eric C. Chi, Genevera I. Allen, Richard G. Baraniuk

Improving Visualization Interpretation Using Counterfactuals

Complex, high-dimensional data is used in a wide range of domains to explore problems and make decisions. Analysis of high-dimensional data, however, is vulnerable to the hidden influence of confounding variables, especially as users apply…

Human-Computer Interaction · Computer Science 2022-07-01 Smiti Kaul, David Borland, Nan Cao, David Gotz

A Review on Analysis and Visualization Methods for Biclustering

Recently, biclustering is one of the hot topics in bioinformatics and takes the attention of authors from several different disciplines. Hence, many different methodologies from a variety of disciplines are proposed as a solution to the…

Human-Computer Interaction · Computer Science 2021-11-26 Melih Sozdinler

Discovering and Visualizing Hierarchy in Multivariate Data

How to extract useful insights from data is always a challenge, especially if the data is multidimensional. Often, the data can be organized according to certain hierarchical structure that are stemmed either from data collection process or…

Applications · Statistics 2016-04-21 Kun Yang, Wing Hung Wong

SightBi: Exploring Cross-View Data Relationships with Biclusters

Multiple-view visualization (MV) has been heavily used in visual analysis tools for sensemaking of data in various domains (e.g., bioinformatics, cybersecurity and text analytics). One common task of visual analysis with multiple views is…

Human-Computer Interaction · Computer Science 2021-09-29 Maoyuan Sun, Abdul Rahman Shaikh, Hamed Alhoori, Jian Zhao

Selection Bias Tracking and Detailed Subset Comparison for High-Dimensional Data

The collection of large, complex datasets has become common across a wide variety of domains. Visual analytics tools increasingly play a key role in exploring and answering complex questions about these large datasets. However, many…

Human-Computer Interaction · Computer Science 2020-06-19 David Borland, Wenyuan Wang, Jonathan Zhang, Joshua Shrestha, David Gotz