Learning a Single Neuron with Gradient Methods

Gilad Yehudai; Ohad Shamir

Learning a Single Neuron with Gradient Methods

Machine Learning 2022-03-01 v3 Neural and Evolutionary Computing Machine Learning

Authors: Gilad Yehudai , Ohad Shamir

Abstract

We consider the fundamental problem of learning a single neuron $x \mapsto\sigma(w^\top x)$ using standard gradient methods. As opposed to previous works, which considered specific (and not always realistic) input distributions and activation functions $\sigma(\cdot)$ , we ask whether a more general result is attainable, under milder assumptions. On the one hand, we show that some assumptions on the distribution and the activation function are necessary. On the other hand, we prove positive guarantees under mild assumptions, which go beyond those studied in the literature so far. We also point out and study the challenges in further strengthening and generalizing our results.

Keywords

neural networks neural network neural network training

Cite

@article{arxiv.2001.05205,
  title  = {Learning a Single Neuron with Gradient Methods},
  author = {Gilad Yehudai and Ohad Shamir},
  journal= {arXiv preprint arXiv:2001.05205},
  year   = {2022}
}

Comments

Fixed a small bug in the proof of Theorem 4.2

Learning a Single Neuron with Gradient Methods

Abstract

Keywords

Cite

Comments

Related papers