Parametric t-Distributed Stochastic Exemplar-centered Embedding

Martin Renqiang Min; Hongyu Guo; Dinghan Shen

Machine Learning 2018-04-24 v5

Abstract

Parametric embedding methods such as parametric t-SNE (pt-SNE) have been widely adopted for data visualization and for embedding out-of-sample data without further computationally expensive optimization or approximation. However, pt-SNE is highly sensitive to the batch-size hyper-parameter because of conflicting optimization goals, and it often produces dramatically different embeddings for different user-defined perplexities. To address these issues, we present parametric t-distributed stochastic exemplar-centered embedding methods. Our strategy learns embedding parameters by comparing the given data only with precomputed exemplars, resulting in a cost function with linear computational and memory complexity, which is further reduced by noise contrastive samples. Moreover, we propose a shallow embedding network with high-order feature interactions for data visualization, which is much easier to tune than the deep neural network employed by pt-SNE yet achieves comparable performance. We empirically demonstrate on several benchmark datasets that our proposed methods significantly outperform pt-SNE in terms of robustness, visual effects, and quantitative evaluations.
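
The exemplar-centered idea can be illustrated with a minimal sketch. It is not the paper's exact objective: it assumes Gaussian affinities with a single fixed bandwidth `sigma` in the input space (no perplexity search), a Student-t kernel in the embedding space, and normalization over exemplars separately for each data point. The names `exemplar_kl_loss` and `project` are hypothetical, the first few data points stand in for precomputed exemplars, and a random linear map stands in for the embedding network. The point is only that each of the n data points is compared with m exemplars, so the cost is O(nm) rather than O(n^2).

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def normalize(row):
    """Scale a list of non-negative weights so that it sums to 1."""
    total = sum(row)
    return [v / total for v in row]


def exemplar_kl_loss(x, exemplars, y, y_exemplars, sigma=1.0):
    """Mean KL(P_i || Q_i) over data points, using point-to-exemplar affinities only.

    Assumption (not from the paper): fixed Gaussian bandwidth `sigma` and
    per-point normalization over the exemplars.
    """
    eps = 1e-12  # guards log(0) when an affinity underflows
    total = 0.0
    for xi, yi in zip(x, y):
        # Input-space affinities: Gaussian kernel, shifted by the minimum
        # distance for numerical stability before normalizing.
        d_hi = [sq_dist(xi, e) for e in exemplars]
        d_min = min(d_hi)
        p = normalize([math.exp(-(d - d_min) / (2.0 * sigma ** 2)) for d in d_hi])
        # Embedding-space affinities: heavy-tailed Student-t kernel.
        q = normalize([1.0 / (1.0 + sq_dist(yi, e)) for e in y_exemplars])
        total += sum(pj * (math.log(pj + eps) - math.log(qj + eps))
                     for pj, qj in zip(p, q))
    return total / len(x)


# Toy usage: 50 points in 8 dimensions, 5 stand-in exemplars, 2-D embedding.
random.seed(0)
x = [[random.gauss(0, 1) for _ in range(8)] for _ in range(50)]
exemplars = x[:5]  # hypothetical stand-in for precomputed exemplars
w = [[random.gauss(0, 1) for _ in range(2)] for _ in range(8)]


def project(v):
    """Hypothetical linear embedding; a trained network would replace this."""
    return [sum(vi * w[i][k] for i, vi in enumerate(v)) for k in range(2)]


ys = [project(v) for v in x]
y_exemplars = [project(e) for e in exemplars]
loss = exemplar_kl_loss(x, exemplars, ys, y_exemplars)
print(loss)
```

In a parametric setting, the gradient of such a loss with respect to the embedding parameters would be computed by an automatic differentiation framework; the sketch above only evaluates the cost.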

Keywords

sparse learning; randomized algorithm; dimensionality reduction

Cite

@article{arxiv.1710.05128,
  title  = {Parametric t-Distributed Stochastic Exemplar-centered Embedding},
  author = {Martin Renqiang Min and Hongyu Guo and Dinghan Shen},
  journal = {arXiv preprint arXiv:1710.05128},
  year   = {2018}
}

Comments

fixed typos
