Related papers: Computationally Efficient Learning of Statistical …

Exact Computation of a Manifold Metric, via Lipschitz Embeddings and Shortest Paths on a Graph

Data-sensitive metrics adapt distances locally based on the density of data points with the goal of aligning distances and some notion of similarity. In this paper, we give the first exact algorithm for computing a data-sensitive metric called…

Computational Geometry · Computer Science 2020-04-22 Timothy Chu, Gary Miller, Donald Sheehy

Manifold Approximation by Moving Least-Squares Projection (MMLS)

To avoid the curse of dimensionality, frequently encountered in Big Data analysis, there has been a vast development in the field of linear and nonlinear dimension reduction techniques in recent years. These techniques (sometimes…

Graphics · Computer Science 2020-02-27 Barak Sober, David Levin

Minimax rates for cost-sensitive learning on manifolds with approximate nearest neighbours

We study the approximate nearest neighbour method for cost-sensitive classification on low-dimensional manifolds embedded within a high-dimensional feature space. We determine the minimax learning rates for distributions on a smooth…

Machine Learning · Computer Science 2018-03-02 Henry WJ Reeve, Gavin Brown

Testing the Manifold Hypothesis

The hypothesis that high dimensional data tend to lie in the vicinity of a low dimensional manifold is the basis of manifold learning. The goal of this paper is to develop an algorithm (with accompanying complexity guarantees) for fitting a…

Statistics Theory · Mathematics 2013-12-23 Charles Fefferman, Sanjoy Mitter, Hariharan Narayanan

Approximation of Riemannian Distances and Applications to Distance-Based Learning on Manifolds

Several important algorithms for machine learning and data analysis use pairwise distances as input. On Riemannian manifolds these distances may be prohibitively costly to compute, in particular for large datasets. To tackle this problem,…

Differential Geometry · Mathematics 2019-04-29 Philipp Harms, Elodie Maignant, Stefan Schlager

Nonparametric Divergence Estimation with Applications to Machine Learning on Distributions

Low-dimensional embedding, manifold learning, clustering, classification, and anomaly detection are among the most important problems in machine learning. The existing methods usually consider the case when each instance has a fixed,…

Machine Learning · Computer Science 2012-02-20 Barnabas Poczos, Liang Xiong, Jeff Schneider

Manifold Hypothesis in Data Analysis: Double Geometrically-Probabilistic Approach to Manifold Dimension Estimation

The manifold hypothesis states that data points in high-dimensional space actually lie in close vicinity of a manifold of much lower dimension. In many cases this hypothesis was empirically verified and used to enhance unsupervised and…

Machine Learning · Computer Science 2021-07-09 Alexander Ivanov, Gleb Nosovskiy, Alexey Chekunov, Denis Fedoseev, Vladislav Kibkalo, Mikhail Nikulin, Fedor Popelenskiy, Stepan Komkov, Ivan Mazurenko, Aleksandr Petiushko

Statistical Inference for Manifold Similarity and Alignability across Noisy High-Dimensional Datasets

The rapid growth of high-dimensional datasets across various scientific domains has created a pressing need for new statistical methods to compare distributions supported on their underlying structures. Assessing similarity between datasets…

Statistics Theory · Mathematics 2025-11-27 Hongrui Chen, Rong Ma

Distance Learner: Incorporating Manifold Prior to Model Training

The manifold hypothesis (real world data concentrates near low-dimensional manifolds) is suggested as the principle behind the effectiveness of machine learning algorithms in very high dimensional problems that are common in domains such as…

Machine Learning · Computer Science 2022-07-15 Aditya Chetan, Nipun Kwatra

Proximal Algorithms in Statistics and Machine Learning

In this paper we develop proximal methods for statistical learning. Proximal point algorithms are useful in statistics and machine learning for obtaining optimization solutions for composite functions. Our approach exploits closed-form…

Machine Learning · Statistics 2015-06-02 Nicholas G. Polson, James G. Scott, Brandon T. Willard

Metric Learning on Manifolds

Recent literature has shown that symbolic data, such as text and graphs, is often better represented by points on a curved manifold, rather than in Euclidean space. However, geometrical operations on manifolds are generally more complicated…

Machine Learning · Computer Science 2019-02-06 Max Aalto, Nakul Verma

Efficient Distance Approximation for Structured High-Dimensional Distributions via Learning

We design efficient distance approximation algorithms for several classes of structured high-dimensional distributions. Specifically, we show algorithms for the following problems: - Given sample access to two Bayesian networks $P_1$ and…

Data Structures and Algorithms · Computer Science 2020-02-17 Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel, N. V. Vinodchandran

An Explicit Nonlinear Mapping for Manifold Learning

Manifold learning is a hot research topic in the field of computer science and has many applications in the real world. A main drawback of manifold learning methods is, however, that there are no explicit mappings from the input data…

Computer Vision and Pattern Recognition · Computer Science 2010-01-18 Hong Qiao, Peng Zhang, Di Wang, Bo Zhang

Learning on manifolds without manifold learning

Function approximation based on data drawn randomly from an unknown distribution is an important problem in machine learning. The manifold hypothesis assumes that the data is sampled from an unknown submanifold of a high dimensional…

Machine Learning · Computer Science 2024-08-20 H. N. Mhaskar, Ryan O'Dowd

Distributed Stochastic Proximal Algorithm on Riemannian Submanifolds for Weakly-convex Functions

This paper aims to investigate the distributed stochastic optimization problems on compact embedded submanifolds (in the Euclidean space) for multi-agent network systems. To address the manifold structure, we propose a distributed…

Optimization and Control · Mathematics 2025-10-28 Jishu Zhao, Xi Wang, Jinlong Lei, Shixiang Chen

Adaptive Neighboring Selection Algorithm Based on Curvature Prediction in Manifold Learning

Recently, manifold learning algorithms for dimensionality reduction have attracted more and more interest, and various linear and nonlinear, global and local algorithms have been proposed. The key step of a manifold learning algorithm is the neighboring…

Methodology · Statistics 2017-04-14 Lin Ma, Caifa Zhou, Xi Liu, Yubin Xu

Learned k-NN Distance Estimation

Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life…

Databases · Computer Science 2022-11-29 Daichi Amagata, Yusuke Arai, Sumio Fujita, Takahiro Hara

Boundary Detection Algorithm Inspired by Locally Linear Embedding

In the study of high-dimensional data, it is often assumed that the data set possesses an underlying lower-dimensional structure. A practical model for this structure is an embedded compact manifold with boundary. Since the underlying…

Machine Learning · Statistics 2025-08-22 Pei-Cheng Kuo, Nan Wu

Manifold Learning with Sparse Regularised Optimal Transport

Manifold learning is a central task in modern statistics and data science. Many datasets (cells, documents, images, molecules) can be represented as point clouds embedded in a high dimensional ambient space; however, the degrees of freedom…

Machine Learning · Statistics 2025-02-18 Stephen Zhang, Gilles Mordant, Tetsuya Matsumoto, Geoffrey Schiebinger

Optimizing Variational Representations of Divergences and Accelerating their Statistical Estimation

Variational representations of divergences and distances between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in…

Machine Learning · Computer Science 2022-03-25 Jeremiah Birrell, Markos A. Katsoulakis, Yannis Pantazis