Related papers: Stochastic Cluster Embedding

Stochastic Neighbor Embedding separates well-separated clusters

Stochastic Neighbor Embedding and its variants are widely used dimensionality reduction techniques -- despite their popularity, no theoretical results are known. We prove that the optimal SNE embedding of well-separated clusters from high…

Machine Learning · Statistics 2017-02-24 Uri Shaham, Stefan Steinerberger

EmbedOR: Provable Cluster-Preserving Visualizations with Curvature-Based Stochastic Neighbor Embeddings

Stochastic Neighbor Embedding (SNE) algorithms like UMAP and tSNE often produce visualizations that do not preserve the geometry of noisy and high dimensional data. In particular, they can spuriously separate connected components of the…

Machine Learning · Computer Science 2025-09-05 Tristan Luca Saidi, Abigail Hickok, Bastian Rieck, Andrew J. Blumberg

T-SNE Is Not Optimized to Reveal Clusters in Data

Cluster visualization is an essential task for nonlinear dimensionality reduction as a data analysis tool. It is often believed that Student t-Distributed Stochastic Neighbor Embedding (t-SNE) can show clusters for well clusterable data,…

Machine Learning · Computer Science 2021-10-07 Zhirong Yang, Yuwei Chen, Jukka Corander

Doubly Stochastic Neighbor Embedding on Spheres

Stochastic Neighbor Embedding (SNE) methods minimize the divergence between the similarity matrix of a high-dimensional data set and its counterpart from a low-dimensional embedding, leading to widely applied tools for data visualization.…

Machine Learning · Computer Science 2018-09-13 Yao Lu, Jukka Corander, Zhirong Yang

Stochastic Neighbor Embedding under f-divergences

The t-distributed Stochastic Neighbor Embedding (t-SNE) is a powerful and popular method for visualizing high-dimensional data. It minimizes the Kullback-Leibler (KL) divergence between the original and embedded data distributions. In this…

Machine Learning · Computer Science 2018-11-06 Daniel Jiwoong Im, Nakul Verma, Kristin Branson

ENS-t-SNE: Embedding Neighborhoods Simultaneously t-SNE

When visualizing a high-dimensional dataset, dimension reduction techniques are commonly employed which provide a single 2-dimensional view of the data. We describe ENS-t-SNE: an algorithm for Embedding Neighborhoods Simultaneously that…

Machine Learning · Computer Science 2024-04-02 Jacob Miller, Vahan Huroyan, Raymundo Navarrete, Md Iqbal Hossain, Stephen Kobourov

Cluster and then Embed: A Modular Approach for Visualization

Dimensionality reduction methods such as t-SNE and UMAP are popular methods for visualizing data with a potential (latent) clustered structure. They are known to group data points at the same time as they embed them, resulting in…

Machine Learning · Computer Science 2025-09-04 Elizabeth Coda, Ery Arias-Castro, Gal Mishne

Index $t$-SNE: Tracking Dynamics of High-Dimensional Datasets with Coherent Embeddings

$t$-SNE is an embedding method that the data science community has widely used. Two interesting characteristics of t-SNE are the structure preservation property and the answer to the crowding problem, where all neighbors in high dimensional space…

Machine Learning · Computer Science 2021-09-23 Gaëlle Candel, David Naccache

Supervised Stochastic Neighbor Embedding Using Contrastive Learning

The stochastic neighbor embedding (SNE) methods $t$-SNE and UMAP are two of the most popular dimensionality reduction methods for data visualization. Contrastive learning, especially self-supervised contrastive learning (SSCL), has shown great success…

Machine Learning · Computer Science 2023-09-18 Yi Zhang

Your Contrastive Learning Is Secretly Doing Stochastic Neighbor Embedding

Contrastive learning, especially self-supervised contrastive learning (SSCL), has achieved great success in extracting powerful features from unlabeled data. In this work, we contribute to the theoretical understanding of SSCL and uncover…

Machine Learning · Computer Science 2023-06-05 Tianyang Hu, Zhili Liu, Fengwei Zhou, Wenjia Wang, Weiran Huang

Parametric t-Stochastic Neighbor Embedding With Quantum Neural Network

t-Stochastic Neighbor Embedding (t-SNE) is a non-parametric data visualization method in classical machine learning. It maps the data from the high-dimensional space into a low-dimensional space, especially a two-dimensional plane, while…

Quantum Physics · Physics 2022-02-10 Yoshiaki Kawase, Kosuke Mitarai, Keisuke Fujii

Opening the black-box of Neighbor Embedding with Hotelling's T2 statistic and Q-residuals

In contrast to classical techniques for exploratory analysis of high-dimensional data sets, such as principal component analysis (PCA), neighbor embedding (NE) techniques tend to better preserve the local structure/topology of…

Machine Learning · Statistics 2022-09-07 Roman Josef Rainer, Michael Mayr, Johannes Himmelbauer, Ramin Nikzad-Langerodi

Memory-Based Neighbourhood Embedding for Visual Recognition

Learning discriminative image feature embeddings is of great importance to visual recognition. To achieve better feature embeddings, most current methods focus on designing different network structures or loss functions, and the estimated…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao

Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data

This paper investigates the theoretical foundations of the t-distributed stochastic neighbor embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data visualization method. A novel theoretical framework for the analysis…

Machine Learning · Statistics 2022-11-02 T. Tony Cai, Rong Ma

FUnc-SNE: A flexible, Fast, and Unconstrained algorithm for neighbour embeddings

Neighbour embeddings (NE) allow the representation of high dimensional datasets into lower dimensional spaces and are often used in data visualisation. In practice, accelerated approximations are employed to handle very large datasets.…

Machine Learning · Computer Science 2025-09-10 Pierre Lambert, Edouard Couplet, Michel Verleysen, John Aldo Lee

Revised Conditional t-SNE: Looking Beyond the Nearest Neighbors

Conditional t-SNE (ct-SNE) is a recent extension to t-SNE that allows removal of known cluster information from the embedding, to obtain a visualization revealing structure beyond label information. This is useful, for example, when one…

Machine Learning · Computer Science 2023-04-12 Edith Heiter, Bo Kang, Ruth Seurinck, Jefrey Lijffijt

Stochastic Neighbor Embedding with Gaussian and Student-t Distributions: Tutorial and Survey

Stochastic Neighbor Embedding (SNE) is a manifold learning and dimensionality reduction method with a probabilistic approach. In SNE, every point is considered to be the neighbor of all other points with some probability and this probability…

Machine Learning · Statistics 2022-08-04 Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

Visualizing Data Velocity using DSNE

We present a new technique called "DSNE" which learns the velocity embeddings of low dimensional map points when given the high-dimensional data points with their velocities. The technique is a variation of Stochastic Neighbor Embedding,…

Machine Learning · Computer Science 2021-03-16 Songting Shi

Multi-view Data Visualisation via Manifold Learning

Non-linear dimensionality reduction can be performed by manifold learning approaches, such as Stochastic Neighbour Embedding (SNE), Locally Linear Embedding (LLE) and Isometric Feature Mapping (ISOMAP). These methods aim to produce…

Machine Learning · Statistics 2021-12-09 Theodoulos Rodosthenous, Vahid Shahrezaei, Marina Evangelou

Node Embeddings via Neighbor Embeddings

Node embeddings are a paradigm in non-parametric graph representation learning, where graph nodes are embedded into a given vector space to enable downstream processing. State-of-the-art node-embedding algorithms, such as DeepWalk and…

Machine Learning · Computer Science 2025-11-25 Jan Niklas Böhm, Marius Keute, Alica Guzmán, Sebastian Damrich, Andrew Draganov, Dmitry Kobak