English
Related papers

Related papers: Principal Graphs and Manifolds

200 papers

In the course of the last century, Principal Component Analysis (PCA) have become one of the pillars of modern scientific methods. Although PCA is normally addressed as a statistical tool aiming at finding orthogonal directions on which the…

Statistics Theory · Mathematics 2019-07-30 Yariv Aizenbud , Barak Sober

In this paper, auto-associative models are proposed as candidates to the generalization of Principal Component Analysis. We show that these models are dedicated to the approximation of the dataset by a manifold. Here, the word "manifold"…

Machine Learning · Statistics 2011-04-01 Stéphane Girard , Serge Iovleff

Suppose $V$ is an $n$-element set where for each $x \in V$, the elements of $V \setminus \{x\}$ are ranked by their similarity to $x$. The $K$-nearest neighbor graph is a directed graph including an arc from each $x$ to the $K$ points of $V…

Combinatorics · Mathematics 2020-12-29 Jacob D. Baron , R. W. R. Darling

Principal manifolds serve as useful tool for many practical applications. These manifolds are defined as lines or surfaces passing through "the middle" of data distribution. We propose an algorithm for fast construction of grid…

Disordered Systems and Neural Networks · Physics 2011-04-20 A. N. Gorban , A. Yu. Zinovyev

One develops a fast computational methodology for principal component analysis on manifolds. Instead of estimating intrinsic principal components on an object space with a Riemannian structure, one embeds the object space in a numerical…

Methodology · Statistics 2024-10-04 Ka Chun Wong , Vic Patrangenaru , Robert L. Paige , Mihaela Pricop Jeckstadt

The $k$-cut problem asks, given a connected graph $G$ and a positive integer $k$, to find a minimum-weight set of edges whose removal splits $G$ into $k$ connected components. We give the first polynomial-time algorithm with approximation…

Data Structures and Algorithms · Computer Science 2018-11-12 MohammadHossein Bateni , Alireza Farhadi , MohammadTaghi Hajiaghayi

We present several applications of non-linear data modeling, using principal manifolds and principal graphs constructed using the metaphor of elasticity (elastic principal graph approach). These approaches are generalizations of the…

Neural and Evolutionary Computing · Computer Science 2010-07-27 A. N. Gorban , A. Zinovyev

How to measure the complexity of a finite set of vectors embedded in a multidimensional space? This is a non-trivial question which can be approached in many different ways. Here we suggest a set of data complexity measures using universal…

Machine Learning · Computer Science 2013-01-03 Andrei Zinovyev , Evgeny Mirkes

Principal Component Analysis (PCA) is a well known procedure to reduce intrinsic complexity of a dataset, essentially through simplifying the covariance structure or the correlation structure. We introduce a novel algebraic, model-based…

Methodology · Statistics 2021-12-09 Martin Schlather , Felix Reinbott

We show how to approximate a data matrix $\mathbf{A}$ with a much smaller sketch $\mathbf{\tilde A}$ that can be used to solve a general class of constrained k-rank approximation problems to within $(1+\epsilon)$ error. Importantly, this…

Data Structures and Algorithms · Computer Science 2015-04-06 Michael B. Cohen , Sam Elder , Cameron Musco , Christopher Musco , Madalina Persu

The Euclidean $k$-means problem is a classical problem that has been extensively studied in the theoretical computer science, machine learning and the computational geometry communities. In this problem, we are given a set of $n$ points in…

Computational Complexity · Computer Science 2015-02-12 Pranjal Awasthi , Moses Charikar , Ravishankar Krishnaswamy , Ali Kemal Sinop

Revealing hidden geometry and topology in noisy data sets is a challenging task. Elastic principal graph is a computationally efficient and flexible data approximator based on embedding a graph into the data space and minimizing the energy…

Data Structures and Algorithms · Computer Science 2019-09-25 A. N. Gorban , E. M. Mirkes , A. Zinovyev

Contours may be viewed as the 2D outline of the image of an object. This type of data arises in medical imaging as well as in computer vision and can be modeled as data on a manifold and can be studied using statistical shape analysis.…

Applications · Statistics 2017-05-17 Chalani Prematilake , Leif Ellingson

We give the first polynomial-time approximation schemes (PTASs) for the following problems: (1) uniform facility location in edge-weighted planar graphs; (2) $k$-median and $k$-means in edge-weighted planar graphs; (3) $k$-means in…

Data Structures and Algorithms · Computer Science 2016-04-08 Vincent Cohen-Addad , Philip N. Klein , Claire Mathieu

Many scientific datasets are of high dimension, and the analysis usually requires visual manipulation by retaining the most important structures of data. Principal curve is a widely used approach for this purpose. However, many existing…

Artificial Intelligence · Computer Science 2016-01-19 Qi Mao , Li Wang , Ivor W. Tsang , Yijun Sun

A method of {\it topological grammars} is proposed for multidimensional data approximation. For data with complex topology we define a {\it principal cubic complex} of low dimension and given complexity that gives the best approximation for…

Neural and Evolutionary Computing · Computer Science 2007-05-23 A. N. Gorban , N. R. Sumner , A. Y. Zinovyev

An algorithm has been developed for finding the global minimum of a multidimensional error function by fitting model spectral maps into observed ones. Principal component analysis is applied to reduce the dimensionality of the model and the…

Astrophysics of Galaxies · Physics 2021-03-10 L. E. Pirogov , P. M. Zemlyanukha

Nearest neighbor search and k-nearest neighbor graph construction are two fundamental issues arise from many disciplines such as multimedia information retrieval, data-mining and machine learning. They become more and more imminent given…

Information Retrieval · Computer Science 2020-09-18 Wan-Lei Zhao , Hui Wang , Chong-Wah Ngo

Most biological data are multidimensional, posing a major challenge to human comprehension and computational analysis. Principal component analysis is the most popular approach to rendering two- or three-dimensional representations of the…

Methodology · Statistics 2016-09-13 Tom M. W. Nye , Xiaoxian Tang , Grady Weyenberg , Ruriko Yoshida

If we pick $n$ random points uniformly in $[0,1]^d$ and connect each point to its $k-$nearest neighbors, then it is well known that there exists a giant connected component with high probability. We prove that in $[0,1]^d$ it suffices to…

Combinatorics · Mathematics 2017-11-15 George C. Linderman , Gal Mishne , Yuval Kluger , Stefan Steinerberger
‹ Prev 1 2 3 10 Next ›