Related papers: A scale-based approach to finding effective dimens…

Dimensionality compression and expansion in Deep Neural Networks

Datasets such as images, text, or movies are embedded in high-dimensional spaces. However, in important cases such as images of objects, the statistical structure in the data constrains samples to a manifold of dramatically lower…

Machine Learning · Computer Science 2019-10-29 Stefano Recanatesi , Matthew Farrell , Madhu Advani , Timothy Moore , Guillaume Lajoie , Eric Shea-Brown

Pattern Search Multidimensional Scaling

We present a novel view of nonlinear manifold learning using derivative-free optimization techniques. Specifically, we propose an extension of the classical multi-dimensional scaling (MDS) method, where instead of performing gradient…

Machine Learning · Computer Science 2019-11-01 Georgios Paraskevopoulos , Efthymios Tzinis , Emmanouil-Vasileios Vlatakis-Gkaragkounis , Alexandros Potamianos

Estimating the intrinsic dimension of datasets by a minimal neighborhood information

Analyzing large volumes of high-dimensional data is an issue of fundamental importance in data science, molecular simulations and beyond. Several approaches work on the assumption that the important content of a dataset belongs to a…

Machine Learning · Statistics 2018-03-20 Elena Facco , Maria d'Errico , Alex Rodriguez , Alessandro Laio

Learning gradients on manifolds

A common belief in high-dimensional data analysis is that data are concentrated on a low-dimensional manifold. This motivates simultaneous dimension reduction and regression on manifolds. We provide an algorithm for learning gradients on…

Statistics Theory · Mathematics 2010-02-24 Sayan Mukherjee , Qiang Wu , Ding-Xuan Zhou

A Novel Approach for Intrinsic Dimension Estimation

The real-life data have a complex and non-linear structure due to their nature. These non-linearities and the large number of features can usually cause problems such as the empty-space phenomenon and the well-known curse of dimensionality.…

Machine Learning · Computer Science 2025-03-13 Kadir Özçoban , Murat Manguoğlu , Emrullah Fatih Yetkin

Dimensionality-Driven Learning with Noisy Labels

Datasets with significant proportions of noisy (incorrect) class labels present challenges for training accurate Deep Neural Networks (DNNs). We propose a new perspective for understanding DNN generalization for such datasets, by…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Xingjun Ma , Yisen Wang , Michael E. Houle , Shuo Zhou , Sarah M. Erfani , Shu-Tao Xia , Sudanthi Wijewickrema , James Bailey

A New Estimator of Intrinsic Dimension Based on the Multipoint Morisita Index

The size of datasets has been increasing rapidly both in terms of number of variables and number of events. As a result, the empty space phenomenon and the curse of dimensionality complicate the extraction of useful information. But, in…

Data Analysis, Statistics and Probability · Physics 2015-05-07 Jean Golay , Mikhail Kanevski

Hierarchical Subspace Learning for Dimensionality Reduction to Improve Classification Accuracy in Large Data Sets

Manifold learning is used for dimensionality reduction, with the goal of finding a projection subspace to increase and decrease the inter- and intraclass variances, respectively. However, a bottleneck for subspace learning methods often…

Machine Learning · Computer Science 2021-05-26 Parisa Abdolrahim Poorheravi , Vincent Gaudet

Manifold Approximation by Moving Least-Squares Projection (MMLS)

In order to avoid the curse of dimensionality, frequently encountered in Big Data analysis, there was a vast development in the field of linear and nonlinear dimension reduction techniques in recent years. These techniques (sometimes…

Graphics · Computer Science 2020-02-27 Barak Sober , David Levin

Manifold learning: what, how, and why

Manifold learning (ML), known also as non-linear dimension reduction, is a set of methods to find the low dimensional structure of data. Dimension reduction for large, high dimensional data is not merely a way to reduce the data; the new…

Machine Learning · Statistics 2023-11-08 Marina Meilă , Hanyu Zhang

Multi-Objective Genetic Programming for Manifold Learning: Balancing Quality and Dimensionality

Manifold learning techniques have become increasingly valuable as data continues to grow in size. By discovering a lower-dimensional representation (embedding) of the structure of a dataset, manifold learning algorithms can substantially…

Neural and Evolutionary Computing · Computer Science 2020-01-31 Andrew Lensen , Mengjie Zhang , Bing Xue

A Survey and Comparative Evaluation of Intrinsic Dimension Estimators under the Manifold Hypothesis

The manifold hypothesis suggests that high-dimensional data often lie on or near a low-dimensional manifold. Estimating the dimension of this manifold is essential for leveraging its structure, yet existing work on dimension estimation is…

Machine Learning · Computer Science 2026-04-02 Zelong Bi , Pierre Lafaye de Micheaux

Estimating the effective dimension of large biological datasets using Fisher separability analysis

Modern large-scale datasets are frequently said to be high-dimensional. However, their data point clouds frequently possess structures, significantly decreasing their intrinsic dimensionality (ID) due to the presence of clusters, points…

Machine Learning · Computer Science 2019-01-21 Luca Albergante , Jonathan Bac , Andrei Zinovyev

Dimension Reduction Using Active Manifolds

Scientists and engineers rely on accurate mathematical models to quantify the objects of their studies, which are often high-dimensional. Unfortunately, high-dimensional models are inherently difficult, i.e. when observations are sparse or…

Machine Learning · Computer Science 2018-02-13 Robert A. Bridges , Chris Felder , Chelsey Hoff

Topological Singularity Detection at Multiple Scales

The manifold hypothesis, which assumes that data lies on or close to an unknown manifold of low intrinsic dimension, is a staple of modern machine learning research. However, recent work has shown that real-world data exhibits distinct…

Machine Learning · Computer Science 2023-06-16 Julius von Rohrscheidt , Bastian Rieck

Invertible Manifold Learning for Dimension Reduction

Dimension reduction (DR) aims to learn low-dimensional representations of high-dimensional data with the preservation of essential information. In the context of manifold learning, we define that the representation after…

Machine Learning · Computer Science 2021-07-01 Siyuan Li , Haitao Lin , Zelin Zang , Lirong Wu , Jun Xia , Stan Z. Li

Efficient Manifold-Constrained Neural ODE for High-Dimensional Datasets

Neural ordinary differential equations (NODE) have garnered significant attention for their design of continuous-depth neural networks and the ability to learn data/feature dynamics. However, for high-dimensional systems, estimating…

Machine Learning · Computer Science 2025-10-07 Muhao Guo , Haoran Li , Yang Weng

Robust estimation of the intrinsic dimension of data sets with quantum cognition machine learning

We propose a new data representation method based on Quantum Cognition Machine Learning and apply it to manifold learning, specifically to the estimation of intrinsic dimension of data sets. The idea is to learn a representation of each…

Machine Learning · Statistics 2024-09-20 Luca Candelori , Alexander G. Abanov , Jeffrey Berger , Cameron J. Hogan , Vahagn Kirakosyan , Kharen Musaelian , Ryan Samson , James E. T. Smith , Dario Villani , Martin T. Wells , Mengjia Xu

Modified Multidimensional Scaling and High Dimensional Clustering

Multidimensional scaling is an important dimension reduction tool in statistics and machine learning. Yet few theoretical results characterizing its statistical performance exist, not to mention any in high dimensions. By considering a…

Methodology · Statistics 2022-03-30 Xiucai Ding , Qiang Sun

The Intrinsic Dimension of Images and Its Impact on Learning

It is widely believed that natural image data exhibits low-dimensional structure despite the high dimensionality of conventional pixel representations. This idea underlies a common intuition for the remarkable success of deep learning in…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Phillip Pope , Chen Zhu , Ahmed Abdelkader , Micah Goldblum , Tom Goldstein