Related papers: Learning Polynomial Transformations

Polynomial Learning of Distribution Families

The question of polynomial learnability of probability distributions, particularly Gaussian mixture distributions, has recently received significant attention in theoretical computer science and machine learning. However, despite major…

Machine Learning · Computer Science 2010-05-13 Mikhail Belkin , Kaushik Sinha

Learning Hierarchical Polynomials with Three-Layer Neural Networks

We study the problem of learning hierarchical polynomials over the standard Gaussian distribution with three-layer neural networks. We specifically consider target functions of the form $h = g \circ p$ where $p : \mathbb{R}^d \rightarrow…

Machine Learning · Computer Science 2023-11-27 Zihao Wang , Eshaan Nichani , Jason D. Lee

On the Connection Between Learning Two-Layers Neural Networks and Tensor Decomposition

We establish connections between the problem of learning a two-layer neural network and tensor decomposition. We consider a model with feature vectors $\boldsymbol x \in \mathbb R^d$, $r$ hidden units with weights $\{\boldsymbol w_i\}_{1\le…

Machine Learning · Computer Science 2018-10-11 Marco Mondelli , Andrea Montanari

Learning Mixtures of Gaussians in High Dimensions

Efficiently learning mixture of Gaussians is a fundamental problem in statistics and learning theory. Given samples coming from a random one out of k Gaussian distributions in Rn, the learning problem asks to estimate the means and the…

Machine Learning · Computer Science 2015-03-11 Rong Ge , Qingqing Huang , Sham M. Kakade

Learning Mixtures of Gaussians Using Diffusion Models

We give a new algorithm for learning mixtures of $k$ Gaussians (with identity covariance in $\mathbb{R}^n$) to TV error $\varepsilon$, with quasi-polynomial ($O(n^{\text{poly\,log}\left(\frac{n+k}{\varepsilon}\right)})$) time and sample…

Machine Learning · Computer Science 2025-03-05 Khashayar Gatmiry , Jonathan Kelner , Holden Lee

Polynomial-Time Robust Multiclass Linear Classification under Gaussian Marginals

We study the task of agnostic learning of multiclass linear classifiers under the Gaussian distribution. Given labeled examples $(x, y)$ from a distribution over $\mathbb{R}^d \times [k]$, with Gaussian $x$-marginal, the goal is to output a…

Machine Learning · Computer Science 2026-05-21 Ilias Diakonikolas , Giannis Iakovidis , Mingchen Ma

The More, the Merrier: the Blessing of Dimensionality for Learning Large Gaussian Mixtures

In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimension. More precisely, we prove that a mixture with known identical covariance matrices whose number of components is a polynomial of any…

Machine Learning · Computer Science 2014-02-19 Joseph Anderson , Mikhail Belkin , Navin Goyal , Luis Rademacher , James Voss

Beating the Perils of Non-Convexity: Guaranteed Training of Neural Networks using Tensor Methods

Training neural networks is a challenging non-convex optimization problem, and backpropagation or gradient descent can get stuck in spurious local optima. We propose a novel algorithm based on tensor decomposition for guaranteed training of…

Machine Learning · Computer Science 2016-01-13 Majid Janzamin , Hanie Sedghi , Anima Anandkumar

Learnable Infinite Taylor Gaussian for Dynamic View Rendering

Capturing the temporal evolution of Gaussian properties such as position, rotation, and scale is a challenging task due to the vast number of time-varying parameters and the limited photometric data available, which generally results in…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Bingbing Hu , Yanyan Li , Rui Xie , Bo Xu , Haoye Dong , Junfeng Yao , Gim Hee Lee

Learning Narrow One-Hidden-Layer ReLU Networks

We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the first polynomial-time algorithm that succeeds whenever $k$ is a…

Machine Learning · Computer Science 2023-04-21 Sitan Chen , Zehao Dou , Surbhi Goel , Adam R Klivans , Raghu Meka

Deep Bayesian Supervised Learning given Hypercuboidally-shaped, Discontinuous Data, using Compound Tensor-Variate & Scalar-Variate Gaussian Processes

We undertake Bayesian learning of the high-dimensional functional relationship between a system parameter vector and an observable, that is in general tensor-valued. The ultimate aim is Bayesian inverse prediction of the system parameters,…

Methodology · Statistics 2018-04-17 Kangrui Wang , Dalia Chakrabarty

Settling the Polynomial Learnability of Mixtures of Gaussians

Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate the mixture parameters. We give an algorithm for this problem that has a running time, and data requirement polynomial in the dimension and…

Machine Learning · Computer Science 2010-04-27 Ankur Moitra , Gregory Valiant

On the Theory of Continual Learning with Gradient Descent for Neural Networks

Continual learning, the ability of a model to adapt to an ongoing sequence of tasks without forgetting earlier ones, is a central goal of artificial intelligence. To better understand its underlying mechanisms, we study the limitations of…

Machine Learning · Statistics 2026-04-21 Hossein Taheri , Avishek Ghosh , Arya Mazumdar

A Solution for Large Scale Nonlinear Regression with High Rank and Degree at Constant Memory Complexity via Latent Tensor Reconstruction

This paper proposes a novel method for learning highly nonlinear, multivariate functions from examples. Our method takes advantage of the property that continuous functions can be approximated by polynomials, which in turn are representable…

Machine Learning · Computer Science 2020-05-05 Sandor Szedmak , Anna Cichonska , Heli Julkunen , Tapio Pahikkala , Juho Rousu

Privately Learning High-Dimensional Distributions

We present novel, computationally efficient, and differentially private algorithms for two fundamental high-dimensional learning problems: learning a multivariate Gaussian and learning a product distribution over the Boolean hypercube in…

Data Structures and Algorithms · Computer Science 2019-05-31 Gautam Kamath , Jerry Li , Vikrant Singhal , Jonathan Ullman

Settling the Robust Learnability of Mixtures of Gaussians

This work represents a natural coalescence of two important lines of work: learning mixtures of Gaussians and algorithmic robust statistics. In particular we give the first provably robust algorithm for learning mixtures of any constant…

Data Structures and Algorithms · Computer Science 2021-07-27 Allen Liu , Ankur Moitra

Smoothed Analysis of Tensor Decompositions

Low rank tensor decompositions are a powerful tool for learning generative models, and uniqueness results give them a significant advantage over matrix decomposition methods. However, tensors pose significant algorithmic challenges and…

Data Structures and Algorithms · Computer Science 2014-01-21 Aditya Bhaskara , Moses Charikar , Ankur Moitra , Aravindan Vijayaraghavan

Learning Polynomials of Few Relevant Dimensions

Polynomial regression is a basic primitive in learning and statistics. In its most basic form the goal is to fit a degree $d$ polynomial to a response variable $y$ in terms of an $n$-dimensional input vector $x$. This is extremely…

Data Structures and Algorithms · Computer Science 2020-04-30 Sitan Chen , Raghu Meka

Learning general Gaussian mixtures with efficient score matching

We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the…

Data Structures and Algorithms · Computer Science 2024-11-20 Sitan Chen , Vasilis Kontonis , Kulin Shah

Testable Learning of General Halfspaces under Massart Noise

We study the algorithmic task of testably learning general Massart halfspaces under the Gaussian distribution. In the testable learning setting, the aim is the design of a tester-learner pair satisfying the following properties: (1) if the…

Data Structures and Algorithms · Computer Science 2026-02-27 Ilias Diakonikolas , Giannis Iakovidis , Daniel M. Kane , Sihan Liu