Related papers: Capacity Preserving Mapping for High-dimensional D…

An Analytical Survey on Recent Trends in High Dimensional Data Visualization

Data visualization is the process by which data of any size or dimensionality is processed to produce an understandable set of data in a lower dimensionality, allowing it to be manipulated and understood more easily by people. The goal of…

Graphics · Computer Science 2021-07-06 Alexander Kiefer, Md. Khaledur Rahman

High-Dimensional Data Clustering

Clustering in high-dimensional spaces is a difficult problem which is recurrent in many domains, for example in image analysis. The difficulty is due to the fact that high-dimensional data usually live in different low-dimensional subspaces…

Statistics Theory · Mathematics 2016-08-16 Charles Bouveyron, Stéphane Girard, Cordelia Schmid

Cluster-based multidimensional scaling embedding tool for data visualization

We present a new technique for visualizing high-dimensional data called cluster MDS (cl-MDS), which addresses a common difficulty of dimensionality reduction methods: preserving both local and global structures of the original sample in a…

Graphics · Computer Science 2024-05-27 Patricia Hernández-León, Miguel A. Caro

In search of the most efficient and memory-saving visualization of high dimensional data

Interactive exploration of large, multidimensional datasets plays a very important role in various scientific fields. It makes it possible not only to identify important structural features and forms, such as clusters of vertices and their…

Machine Learning · Computer Science 2023-03-10 Bartosz Minch

A Distance-preserving Matrix Sketch

Visualizing very large matrices involves many formidable problems. Various popular solutions to these problems involve sampling, clustering, projection, or feature selection to reduce the size and complexity of the original task. An…

Human-Computer Interaction · Computer Science 2022-06-06 Leland Wilkinson, Hengrui Luo

Dimension reduction for model-based clustering

We introduce a dimension reduction method for visualizing the clustering structure obtained from a finite mixture of Gaussian densities. Information on the dimension reduction subspace is obtained from the variation on group means and,…

Methodology · Statistics 2015-08-10 Luca Scrucca

Visual Analysis of Large Multivariate Scattered Data using Clustering and Probabilistic Summaries

Rapidly growing data sizes of scientific simulations pose significant challenges for interactive visualization and analysis techniques. In this work, we propose a compact probabilistic representation to interactively visualize large…

Graphics · Computer Science 2020-10-16 Tobias Rapp, Christoph Peters, Carsten Dachsbacher

IT-map: an Effective Nonlinear Dimensionality Reduction Method for Interactive Clustering

Scientists in many fields have the common and basic need of dimensionality reduction: visualizing the underlying structure of the massive multivariate data in a low-dimensional space. However, many dimensionality reduction methods confront…

Machine Learning · Statistics 2015-03-19 Teng Qiu, Yongjie Li

MapReduce and Streaming Algorithms for Diversity Maximization in Metric Spaces of Bounded Doubling Dimension

Given a dataset of points in a metric space and an integer $k$, a diversity maximization problem requires determining a subset of $k$ points maximizing some diversity objective measure, e.g., the minimum or the average distance between two…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-01-24 Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci, Eli Upfal

Understanding High Dimensional Spaces through Visual Means Employing Multidimensional Projections

Data visualisation helps understanding data represented by multiple variables, also called features, stored in a large matrix where individuals are stored in lines and variable values in columns. These data structures are frequently called…

Human-Computer Interaction · Computer Science 2022-07-25 Haseeb Younis, Paul Trust, Rosane Minghim

On class visualisation for high dimensional data: Exploring scientific datasets

Parametric Embedding (PE) has recently been proposed as a general-purpose algorithm for class visualisation. It takes class posteriors produced by a mixture-based clustering algorithm and projects them in 2D for visualisation. However,…

Astrophysics · Physics 2009-11-11 Ata Kaban, Jianyong Sun, Somak Raychaudhury, Louisa Nolan

Topology-Preserving Dimensionality Reduction via Interleaving Optimization

Dimensionality reduction techniques are powerful tools for data preprocessing and visualization which typically come with few guarantees concerning the topological correctness of an embedding. The interleaving distance between the…

Machine Learning · Computer Science 2022-02-01 Bradley J. Nelson, Yuan Luo

Compact Representations of Event Sequences

We introduce a new technique for the efficient management of large sequences of multidimensional data, which takes advantage of regularities that arise in real-world datasets and supports different types of aggregation queries. More…

Data Structures and Algorithms · Computer Science 2018-03-08 Nieves R. Brisaboa, Guillermo de Bernardo, Gonzalo Navarro, Tirso V. Rodeiro, Diego Seco

Consensus dimension reduction via multi-view learning

A plethora of dimension reduction methods have been developed to visualize high-dimensional data in low dimensions. However, different dimension reduction methods often output different and possibly conflicting visualizations of the same…

Methodology · Statistics 2025-12-19 Bingxue An, Tiffany M. Tang

Low-dimensional embeddings of high-dimensional data

Large collections of high-dimensional data have become nearly ubiquitous across many academic fields and application domains, ranging from biology to the humanities. Since working directly with high-dimensional data poses challenges, the…

Machine Learning · Computer Science 2025-08-25 Cyril de Bodt, Alex Diaz-Papkovich, Michael Bleher, Kerstin Bunte, Corinna Coupette, Sebastian Damrich, Enrique Fita Sanmartin, Fred A. Hamprecht, Emőke-Ágnes Horvát, Dhruv Kohli, Smita Krishnaswamy, John A. Lee, Boudewijn P. F. Lelieveldt, Leland McInnes, Ian T. Nabney, Maximilian Noichl, Pavlin G. Poličar, Bastian Rieck, Guy Wolf, Gal Mishne, Dmitry Kobak

Randomized Dimensionality Reduction for Euclidean Maximization and Diversity Measures

Randomized dimensionality reduction is a widely-used algorithmic technique for speeding up large-scale Euclidean optimization problems. In this paper, we study dimension reduction for a variety of maximization problems, including…

Data Structures and Algorithms · Computer Science 2025-06-03 Jie Gao, Rajesh Jayaram, Benedikt Kolbe, Shay Sapir, Chris Schwiegelshohn, Sandeep Silwal, Erik Waingarten

On random embeddings and their application to optimisation

Random embeddings project high-dimensional spaces to low-dimensional ones; they are careful constructions which allow the approximate preservation of key properties, such as the pair-wise distances between points. Often in the field of…

Optimization and Control · Mathematics 2022-06-08 Zhen Shao

An Incremental Dimensionality Reduction Method for Visualizing Streaming Multidimensional Data

Dimensionality reduction (DR) methods are commonly used for analyzing and visualizing multidimensional data. However, when data is a live streaming feed, conventional DR methods cannot be directly used because of their computational…

Graphics · Computer Science 2019-10-16 Takanori Fujiwara, Jia-Kai Chou, Shilpika, Panpan Xu, Liu Ren, Kwan-Liu Ma

Learning to compress and search visual data in large-scale systems

The problem of high-dimensional and large-scale representation of visual data is addressed from an unsupervised learning perspective. The emphasis is put on discrete representations, where the description length can be measured in bits and…

Machine Learning · Computer Science 2019-01-25 Sohrab Ferdowsi

Dimension Reduction with Prior Information for Knowledge Discovery

This paper addresses the problem of mapping high-dimensional data to a low-dimensional space, in the presence of other known features. This problem is ubiquitous in science and engineering as there are often controllable/measurable features…

Machine Learning · Statistics 2024-01-01 Anh Tuan Bui