English
Related papers

Related papers: Estimation consistante de l'architecture des perce…

200 papers

We consider regression models involving multilayer perceptrons (MLP) with one hidden layer and a Gaussian noise. The estimation of the parameters of the MLP can be done by maximizing the likelihood of the model. In this framework, it is…

Statistics Theory · Mathematics 2008-02-25 Joseph Rynkiewicz

We consider regression models involving multilayer perceptrons (MLP) with one hidden layer and a Gaussian noise. The data are assumed to be generated by a true MLP model and the estimation of the parameters of the MLP is done by maximizing…

Statistics Theory · Mathematics 2010-12-01 Joseph Rynkiewicz

This work concerns estimation of multidimensional nonlinear regression models using multilayer perceptron (MLP). The main problem with such model is that we have to know the covariance matrix of the noise to get optimal estimator. however…

Statistics Theory · Mathematics 2008-02-22 Joseph Rynkiewicz

This work concerns the estimation of multidimensional nonlinear regression models using multilayer perceptrons (MLPs). The main problem with such models is that we need to know the covariance matrix of the noise to get an optimal estimator.…

Statistics Theory · Mathematics 2008-02-22 Joseph Rynkiewicz

This work concerns testing the number of parameters in one hidden layer multilayer perceptron (MLP). For this purpose we assume that we have identifiable models, up to a finite group of transformations on the weights, this is for example…

Statistics Theory · Mathematics 2008-02-22 Joseph Rynkiewicz

Multilayer perceptrons (MLP) with one hidden layer have been used for a long time to deal with non-linear regression. However, in some task, MLP's are too powerful models and a small mean square error (MSE) may be more due to overfitting…

Statistics Theory · Mathematics 2012-05-10 Joseph Rynkiewicz

We give analytical results for propagation of uncertainty through trained multi-layer perceptrons (MLPs) with a single hidden layer and ReLU activation functions. More precisely, we give expressions for the mean and variance of the output…

Machine Learning · Computer Science 2026-01-26 Andrew Thompson , Miles McCrory

Multi-layer perceptron (MLP) is a fundamental component of deep learning, and recent MLP-based architectures, especially the MLP-Mixer, have achieved significant empirical success. Nevertheless, our understanding of why and how the…

Machine Learning · Computer Science 2024-05-08 Tomohiro Hayase , Ryo Karakida

In this paper, a novel multi-head multi-layer perceptron (MLP) structure is presented for implicit neural representation (INR). Since conventional rectified linear unit (ReLU) networks are shown to exhibit spectral bias towards learning…

Machine Learning · Computer Science 2022-02-28 Arya Aftab , Alireza Morsali

In this work we revisit the most fundamental building block in deep learning, the multi-layer perceptron (MLP), and study the limits of its performance on vision tasks. Empirical insights into MLPs are important for multiple reasons. (1)…

Machine Learning · Computer Science 2023-10-04 Gregor Bachmann , Sotiris Anagnostidis , Thomas Hofmann

Image denoising can be described as the problem of mapping from a noisy image to a noise-free image. The best currently available denoising methods approximate this mapping with cleverly engineered algorithms. In this work we attempt to…

Computer Vision and Pattern Recognition · Computer Science 2012-11-12 Harold Christopher Burger , Christian J. Schuler , Stefan Harmeling

Stochastic binary hidden units in a multi-layer perceptron (MLP) network give at least three potential benefits when compared to deterministic MLP networks. (1) They allow to learn one-to-many type of mappings. (2) They can be used in…

Machine Learning · Statistics 2015-04-10 Tapani Raiko , Mathias Berglund , Guillaume Alain , Laurent Dinh

Multilayer perceptron (MLP) is a class of networks composed of multiple layers of perceptrons, and it is essentially a mathematical function. Based on MLP, we develop a new numerical method to find the extrema of functionals. As…

Machine Learning · Computer Science 2020-07-03 Tao Liu

We propose a new type of hidden layer for a multilayer perceptron, and demonstrate that it obtains the best reported performance for an MLP on the MNIST dataset.

Machine Learning · Statistics 2013-01-23 Ian J. Goodfellow

We consider the problem of reconstructing a signal from multi-layered (possibly) non-linear measurements. Using non-rigorous but standard methods from statistical physics we present the Multi-Layer Approximate Message Passing (ML-AMP)…

Information Theory · Computer Science 2020-01-22 Andre Manoel , Florent Krzakala , Marc Mézard , Lenka Zdeborová

In case of decision making problems, classification of pattern is a complex and crucial task. Pattern classification using multilayer perceptron (MLP) trained with back propagation learning becomes much complex with increase in number of…

Neural and Evolutionary Computing · Computer Science 2016-01-15 Tirtharaj Dash , H. S. Behera

Implicit Neural Representations (INRs) employ neural networks to represent continuous functions by mapping coordinates to the corresponding values of the target function, with applications e.g., inverse graphics. However, INRs face a…

Computer Vision and Pattern Recognition · Computer Science 2025-02-11 Mingze Ma , Qingtian Zhu , Yifan Zhan , Zhengwei Yin , Hongjun Wang , Yinqiang Zheng

The construction of a multilayer perceptron (MLP) as a piecewise low-order polynomial approximator using a signal processing approach is presented in this work. The constructed MLP contains one input, one intermediate and one output layers.…

Machine Learning · Computer Science 2020-10-16 Ruiyuan Lin , Suya You , Raghuveer Rao , C. -C. Jay Kuo

We consider {\em Mixed Linear Regression (MLR)}, where training data have been generated from a mixture of distinct linear models (or clusters) and we seek to identify the corresponding coefficient vectors. We introduce a {\em Mixed Integer…

Machine Learning · Statistics 2019-09-10 Taiyao Wang , Ioannis Ch. Paschalidis

Many machine learning applications use latent variable models to explain structure in data, whereby visible variables (= coordinates of the given datapoint) are explained as a probabilistic function of some hidden variables. Finding…

Machine Learning · Computer Science 2016-12-30 Sanjeev Arora , Rong Ge , Tengyu Ma , Andrej Risteski
‹ Prev 1 2 3 10 Next ›