English
Related papers

Related papers: Learning Polynomial Transformations

200 papers

The question of polynomial learnability of probability distributions, particularly Gaussian mixture distributions, has recently received significant attention in theoretical computer science and machine learning. However, despite major…

Machine Learning · Computer Science 2010-05-13 Mikhail Belkin , Kaushik Sinha

We study the problem of learning hierarchical polynomials over the standard Gaussian distribution with three-layer neural networks. We specifically consider target functions of the form $h = g \circ p$ where $p : \mathbb{R}^d \rightarrow…

Machine Learning · Computer Science 2023-11-27 Zihao Wang , Eshaan Nichani , Jason D. Lee

We establish connections between the problem of learning a two-layer neural network and tensor decomposition. We consider a model with feature vectors $\boldsymbol x \in \mathbb R^d$, $r$ hidden units with weights $\{\boldsymbol w_i\}_{1\le…

Machine Learning · Computer Science 2018-10-11 Marco Mondelli , Andrea Montanari

Efficiently learning mixture of Gaussians is a fundamental problem in statistics and learning theory. Given samples coming from a random one out of k Gaussian distributions in Rn, the learning problem asks to estimate the means and the…

Machine Learning · Computer Science 2015-03-11 Rong Ge , Qingqing Huang , Sham M. Kakade

We give a new algorithm for learning mixtures of $k$ Gaussians (with identity covariance in $\mathbb{R}^n$) to TV error $\varepsilon$, with quasi-polynomial ($O(n^{\text{poly\,log}\left(\frac{n+k}{\varepsilon}\right)})$) time and sample…

Machine Learning · Computer Science 2025-03-05 Khashayar Gatmiry , Jonathan Kelner , Holden Lee

We study the task of agnostic learning of multiclass linear classifiers under the Gaussian distribution. Given labeled examples $(x, y)$ from a distribution over $\mathbb{R}^d \times [k]$, with Gaussian $x$-marginal, the goal is to output a…

Machine Learning · Computer Science 2026-05-21 Ilias Diakonikolas , Giannis Iakovidis , Mingchen Ma

In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimension. More precisely, we prove that a mixture with known identical covariance matrices whose number of components is a polynomial of any…

Machine Learning · Computer Science 2014-02-19 Joseph Anderson , Mikhail Belkin , Navin Goyal , Luis Rademacher , James Voss

Training neural networks is a challenging non-convex optimization problem, and backpropagation or gradient descent can get stuck in spurious local optima. We propose a novel algorithm based on tensor decomposition for guaranteed training of…

Machine Learning · Computer Science 2016-01-13 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Capturing the temporal evolution of Gaussian properties such as position, rotation, and scale is a challenging task due to the vast number of time-varying parameters and the limited photometric data available, which generally results in…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Bingbing Hu , Yanyan Li , Rui Xie , Bo Xu , Haoye Dong , Junfeng Yao , Gim Hee Lee

We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the first polynomial-time algorithm that succeeds whenever $k$ is a…

Machine Learning · Computer Science 2023-04-21 Sitan Chen , Zehao Dou , Surbhi Goel , Adam R Klivans , Raghu Meka

We undertake Bayesian learning of the high-dimensional functional relationship between a system parameter vector and an observable, that is in general tensor-valued. The ultimate aim is Bayesian inverse prediction of the system parameters,…

Methodology · Statistics 2018-04-17 Kangrui Wang , Dalia Chakrabarty

Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate the mixture parameters. We give an algorithm for this problem that has a running time, and data requirement polynomial in the dimension and…

Machine Learning · Computer Science 2010-04-27 Ankur Moitra , Gregory Valiant

Continual learning, the ability of a model to adapt to an ongoing sequence of tasks without forgetting earlier ones, is a central goal of artificial intelligence. To better understand its underlying mechanisms, we study the limitations of…

Machine Learning · Statistics 2026-04-21 Hossein Taheri , Avishek Ghosh , Arya Mazumdar

This paper proposes a novel method for learning highly nonlinear, multivariate functions from examples. Our method takes advantage of the property that continuous functions can be approximated by polynomials, which in turn are representable…

Machine Learning · Computer Science 2020-05-05 Sandor Szedmak , Anna Cichonska , Heli Julkunen , Tapio Pahikkala , Juho Rousu

We present novel, computationally efficient, and differentially private algorithms for two fundamental high-dimensional learning problems: learning a multivariate Gaussian and learning a product distribution over the Boolean hypercube in…

Data Structures and Algorithms · Computer Science 2019-05-31 Gautam Kamath , Jerry Li , Vikrant Singhal , Jonathan Ullman

This work represents a natural coalescence of two important lines of work: learning mixtures of Gaussians and algorithmic robust statistics. In particular we give the first provably robust algorithm for learning mixtures of any constant…

Data Structures and Algorithms · Computer Science 2021-07-27 Allen Liu , Ankur Moitra

Low rank tensor decompositions are a powerful tool for learning generative models, and uniqueness results give them a significant advantage over matrix decomposition methods. However, tensors pose significant algorithmic challenges and…

Data Structures and Algorithms · Computer Science 2014-01-21 Aditya Bhaskara , Moses Charikar , Ankur Moitra , Aravindan Vijayaraghavan

Polynomial regression is a basic primitive in learning and statistics. In its most basic form the goal is to fit a degree $d$ polynomial to a response variable $y$ in terms of an $n$-dimensional input vector $x$. This is extremely…

Data Structures and Algorithms · Computer Science 2020-04-30 Sitan Chen , Raghu Meka

We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the…

Data Structures and Algorithms · Computer Science 2024-11-20 Sitan Chen , Vasilis Kontonis , Kulin Shah

We study the algorithmic task of testably learning general Massart halfspaces under the Gaussian distribution. In the testable learning setting, the aim is the design of a tester-learner pair satisfying the following properties: (1) if the…

Data Structures and Algorithms · Computer Science 2026-02-27 Ilias Diakonikolas , Giannis Iakovidis , Daniel M. Kane , Sihan Liu
‹ Prev 1 2 3 10 Next ›