English
Related papers

Related papers: Computationally Efficient Learning of Statistical …

200 papers

Data-sensitive metrics adapt distances locally based the density of data points with the goal of aligning distances and some notion of similarity. In this paper, we give the first exact algorithm for computing a data-sensitive metric called…

Computational Geometry · Computer Science 2020-04-22 Timothy Chu , Gary Miller , Donald Sheehy

In order to avoid the curse of dimensionality, frequently encountered in Big Data analysis, there was a vast development in the field of linear and nonlinear dimension reduction techniques in recent years. These techniques (sometimes…

Graphics · Computer Science 2020-02-27 Barak Sober , David Levin

We study the approximate nearest neighbour method for cost-sensitive classification on low-dimensional manifolds embedded within a high-dimensional feature space. We determine the minimax learning rates for distributions on a smooth…

Machine Learning · Computer Science 2018-03-02 Henry WJ Reeve , Gavin Brown

The hypothesis that high dimensional data tend to lie in the vicinity of a low dimensional manifold is the basis of manifold learning. The goal of this paper is to develop an algorithm (with accompanying complexity guarantees) for fitting a…

Statistics Theory · Mathematics 2013-12-23 Charles Fefferman , Sanjoy Mitter , Hariharan Narayanan

Several important algorithms for machine learning and data analysis use pairwise distances as input. On Riemannian manifolds these distances may be prohibitively costly to compute, in particular for large datasets. To tackle this problem,…

Differential Geometry · Mathematics 2019-04-29 Philipp Harms , Elodie Maignant , Stefan Schlager

Low-dimensional embedding, manifold learning, clustering, classification, and anomaly detection are among the most important problems in machine learning. The existing methods usually consider the case when each instance has a fixed,…

Machine Learning · Computer Science 2012-02-20 Barnabas Poczos , Liang Xiong , Jeff Schneider

Manifold hypothesis states that data points in high-dimensional space actually lie in close vicinity of a manifold of much lower dimension. In many cases this hypothesis was empirically verified and used to enhance unsupervised and…

The rapid growth of high-dimensional datasets across various scientific domains has created a pressing need for new statistical methods to compare distributions supported on their underlying structures. Assessing similarity between datasets…

Statistics Theory · Mathematics 2025-11-27 Hongrui Chen , Rong Ma

The manifold hypothesis (real world data concentrates near low-dimensional manifolds) is suggested as the principle behind the effectiveness of machine learning algorithms in very high dimensional problems that are common in domains such as…

Machine Learning · Computer Science 2022-07-15 Aditya Chetan , Nipun Kwatra

In this paper we develop proximal methods for statistical learning. Proximal point algorithms are useful in statistics and machine learning for obtaining optimization solutions for composite functions. Our approach exploits closed-form…

Machine Learning · Statistics 2015-06-02 Nicholas G. Polson , James G. Scott , Brandon T. Willard

Recent literature has shown that symbolic data, such as text and graphs, is often better represented by points on a curved manifold, rather than in Euclidean space. However, geometrical operations on manifolds are generally more complicated…

Machine Learning · Computer Science 2019-02-06 Max Aalto , Nakul Verma

We design efficient distance approximation algorithms for several classes of structured high-dimensional distributions. Specifically, we show algorithms for the following problems: - Given sample access to two Bayesian networks $P_1$ and…

Data Structures and Algorithms · Computer Science 2020-02-17 Arnab Bhattacharyya , Sutanu Gayen , Kuldeep S. Meel , N. V. Vinodchandran

Manifold learning is a hot research topic in the field of computer science and has many applications in the real world. A main drawback of manifold learning methods is, however, that there is no explicit mappings from the input data…

Computer Vision and Pattern Recognition · Computer Science 2010-01-18 Hong Qiao , Peng Zhang , Di Wang , Bo Zhang

Function approximation based on data drawn randomly from an unknown distribution is an important problem in machine learning. The manifold hypothesis assumes that the data is sampled from an unknown submanifold of a high dimensional…

Machine Learning · Computer Science 2024-08-20 H. N. Mhaskar , Ryan O'Dowd

This paper aims to investigate the distributed stochastic optimization problems on compact embedded submanifolds (in the Euclidean space) for multi-agent network systems. To address the manifold structure, we propose a distributed…

Optimization and Control · Mathematics 2025-10-28 Jishu Zhao , Xi Wang , Jinlong Lei , Shixiang Chen

Recently manifold learning algorithm for dimensionality reduction attracts more and more interests, and various linear and nonlinear, global and local algorithms are proposed. The key step of manifold learning algorithm is the neighboring…

Methodology · Statistics 2017-04-14 Lin Ma , Caifa Zhou , Xi Liu , Yubin Xu

Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life…

Databases · Computer Science 2022-11-29 Daichi Amagata , Yusuke Arai , Sumio Fujita , Takahiro Hara

In the study of high-dimensional data, it is often assumed that the data set possesses an underlying lower-dimensional structure. A practical model for this structure is an embedded compact manifold with boundary. Since the underlying…

Machine Learning · Statistics 2025-08-22 Pei-Cheng Kuo , Nan Wu

Manifold learning is a central task in modern statistics and data science. Many datasets (cells, documents, images, molecules) can be represented as point clouds embedded in a high dimensional ambient space, however the degrees of freedom…

Machine Learning · Statistics 2025-02-18 Stephen Zhang , Gilles Mordant , Tetsuya Matsumoto , Geoffrey Schiebinger

Variational representations of divergences and distances between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in…

Machine Learning · Computer Science 2022-03-25 Jeremiah Birrell , Markos A. Katsoulakis , Yannis Pantazis
‹ Prev 1 2 3 10 Next ›