English
Related papers

Related papers: Company classification using machine learning

200 papers

Cluster visualization is an essential task for nonlinear dimensionality reduction as a data analysis tool. It is often believed that Student t-Distributed Stochastic Neighbor Embedding (t-SNE) can show clusters for well clusterable data,…

Machine Learning · Computer Science 2021-10-07 Zhirong Yang , Yuwei Chen , Jukka Corander

In this work we used unsupervised machine learning methods in order to find possible clustering structures in superconducting materials data sets. We used the SuperCon database, as well as our own data sets complied from literature, in…

Superconductivity · Physics 2022-07-13 B. Roter , N. Ninkovic , S. V. Dordevic

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology. Building on recent advances in speeding up t-SNE and obtaining finer-grained structure, we combine the two to create tree-SNE, a…

Machine Learning · Computer Science 2020-02-14 Isaac Robinson , Emma Pierce-Hoffman

Unsupervised machine learning, and in particular data clustering, is a powerful approach for the analysis of datasets and identification of characteristic features occurring throughout a dataset. It is gaining popularity across scientific…

Mesoscale and Nanoscale Physics · Physics 2021-03-23 Maria El Abbassi , Jan Overbeck , Oliver Braun , Michel Calame , Herre S. J. van der Zant , Mickael L. Perrin

The t-distributed Stochastic Neighbor Embedding (tSNE) algorithm has become in recent years one of the most used and insightful techniques for the exploratory data analysis of high-dimensional data. tSNE reveals clusters of high-dimensional…

This paper investigates the theoretical foundations of the t-distributed stochastic neighbor embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data visualization method. A novel theoretical framework for the analysis…

Machine Learning · Statistics 2022-11-02 T. Tony Cai , Rong Ma

A fundamental task in machine learning involves visualizing high-dimensional data sets that arise in high-impact application domains. When considering the context of large imbalanced data, this problem becomes much more challenging. In this…

Machine Learning · Computer Science 2021-09-21 Parisa Hajibabaee , Farhad Pourkamali-Anaraki , Mohammad Amin Hariri-Ardebili

t-distributed Stochastic Neighborhood Embedding (t-SNE) is a method for dimensionality reduction and visualization that has become widely popular in recent years. Efficient implementations of t-SNE are available, but they scale poorly to…

Machine Learning · Computer Science 2019-02-26 George C. Linderman , Manas Rachh , Jeremy G. Hoskins , Stefan Steinerberger , Yuval Kluger

t-distributed Stochastic Neighborhood Embedding (t-SNE), a clustering and visualization method proposed by van der Maaten & Hinton in 2008, has rapidly become a standard tool in a number of natural sciences. Despite its overwhelming…

Machine Learning · Computer Science 2017-06-09 George C. Linderman , Stefan Steinerberger

The dimensionality reduction has been widely introduced to use the high-dimensional data for regression, classification, feature analysis, and visualization. As the one technique of dimensionality reduction, a stochastic neighbor embedding…

Computer Vision and Pattern Recognition · Computer Science 2020-12-03 Motoshi Abe , Junichi Miyao , Takio Kurita

The t-distributed Stochastic Neighbor Embedding (t-SNE) is a powerful and popular method for visualizing high-dimensional data. It minimizes the Kullback-Leibler (KL) divergence between the original and embedded data distributions. In this…

Machine Learning · Computer Science 2018-11-06 Daniel Jiwoong Im , Nakul Verma , Kristin Branson

Neighbor Embedding (NE) aims to preserve pairwise similarities between data items and has been shown to yield an effective principle for data visualization. However, even the best existing NE methods such as Stochastic Neighbor Embedding…

Machine Learning · Computer Science 2021-09-15 Zhirong Yang , Yuwei Chen , Denis Sedov , Samuel Kaski , Jukka Corander

This article presents a novel application of the t-distributed Stochastic Neighbor Embedding (t-SNE) clustering algorithm to the telecommunication field. t-SNE is a dimensionality reduction (DR) algorithm that allows the visualization of…

Signal Processing · Electrical Eng. & Systems 2023-04-28 Alejandro Ramírez-Arroyo , Luz García , Antonio Alex-Amor , Juan F. Valenzuela-Valdés

t-Stochastic Neighbor Embedding (t-SNE) is a non-parametric data visualization method in classical machine learning. It maps the data from the high-dimensional space into a low-dimensional space, especially a two-dimensional plane, while…

Quantum Physics · Physics 2022-02-10 Yoshiaki Kawase , Kosuke Mitarai , Keisuke Fujii

$t$-SNE is an embedding method that the data science community has widely Two interesting characteristics of t-SNE are the structure preservation property and the answer to the crowding problem, where all neighbors in high dimensional space…

Machine Learning · Computer Science 2021-09-23 Gaëlle Candel , David Naccache

As the number of publicly traded companies as well as the amount of their financial data grows rapidly, it is highly desired to have tracking, analysis, and eventually stock selections automated. There have been few works focusing on…

Statistical Finance · Quantitative Finance 2014-06-04 Sercan Arik , Sukru Burc Eryilmaz , Adam Goldberg

The optimization of electric machines at multiple operating points is crucial for applications that require frequent changes on speeds and loads, such as the electric vehicles, to strive for the machine optimal performance across the entire…

Machine Learning · Computer Science 2019-11-05 Shen Zhang , Shibo Zhang , Sufei Li , Liang Du , Thomas G. Habetler

T-SNE is a well-known approach to embedding high-dimensional data and has been widely used in data visualization. The basic assumption of t-SNE is that the data are non-constrained in the Euclidean space and the local proximity can be…

Machine Learning · Computer Science 2015-08-06 Mian Wang , Dong Wang

Correct risk estimation of policyholders is of great significance to auto insurance companies. While the current tools used in this field have been proven in practice to be quite efficient and beneficial, we argue that there is still a lot…

Artificial Intelligence · Computer Science 2023-03-02 Joseph Levitas , Konstantin Yavilberg , Oleg Korol , Genadi Man

Malicious software are categorized into families based on their static and dynamic characteristics, infection methods, and nature of threat. Visual exploration of malware instances and families in a low dimensional space helps in giving a…

Cryptography and Security · Computer Science 2019-01-31 Mohamed Nassar , Haidar Safa
‹ Prev 1 2 3 10 Next ›