Related papers: Universal Approximation Using Shuffled Linear Mode…

Universal Spin Models are Universal Approximators in Machine Learning

One of the theoretical pillars that sustain certain machine learning models are universal approximation theorems, which prove that they can approximate all functions from a function class to arbitrary precision. Independently, classical…

Disordered Systems and Neural Networks · Physics 2026-04-28 Tobias Reinhart , Gemma De les Coves

Beyond the Central Limit Theorem: Universal and Non-universal Simulations of Random Variables by General Mappings

Motivated by the Central Limit Theorem, in this paper, we study both universal and non-universal simulations of random variables with an arbitrary target distribution $Q_{Y}$ by general mappings, not limited to linear ones (as in the…

Probability · Mathematics 2018-12-05 Lei Yu

Augmented Space Linear Model

The linear model uses the space defined by the input to project the target or desired signal and find the optimal set of model parameters. When the problem is nonlinear, the adaption requires nonlinear models for good performance, but it…

Machine Learning · Computer Science 2018-02-05 Zhengda Qin , Badong Chen , Nanning Zheng , Jose C. Principe

Fast expectation-maximization algorithms for spatial generalized linear mixed models

Spatial generalized linear mixed models (SGLMMs) are popular and flexible models for non-Gaussian spatial data. They are useful for spatial interpolations as well as for fitting regression models that account for spatial dependence, and are…

Methodology · Statistics 2021-10-26 Yawen Guan , Murali Haran

Sublinear Optimization for Machine Learning

We give sublinear-time approximation algorithms for some optimization problems arising in machine learning, such as training linear classifiers and finding minimum enclosing balls. Our algorithms can be extended to some kernelized versions…

Machine Learning · Computer Science 2010-10-22 Kenneth L. Clarkson , Elad Hazan , David P. Woodruff

Minimal Learning Machine: Theoretical Results and Clustering-Based Reference Point Selection

The Minimal Learning Machine (MLM) is a nonlinear supervised approach based on learning a linear mapping between distance matrices computed in the input and output data spaces, where distances are calculated using a subset of points called…

Machine Learning · Computer Science 2020-10-08 Joonas Hämäläinen , Alisson S. C. Alencar , Tommi Kärkkäinen , César L. C. Mattos , Amauri H. Souza Júnior , João P. P. Gomes

Sufficiently Accurate Model Learning for Planning

Data driven models of dynamical systems help planners and controllers to provide more precise and accurate motions. Most model learning algorithms will try to minimize a loss function between the observed data and the model's predictions.…

Artificial Intelligence · Computer Science 2021-02-12 Clark Zhang , Santiago Paternain , Alejandro Ribeiro

Scalable Subset Selection in Linear Mixed Models

Linear mixed models (LMMs), which incorporate fixed and random effects, are key tools for analyzing heterogeneous data, such as in personalized medicine. Nowadays, this type of data is increasingly wide, sometimes containing thousands of…

Machine Learning · Statistics 2026-05-15 Ryan Thompson , Matt P. Wand , Joanna J. J. Wang

Improving predictions of Bayesian neural nets via local linearization

The generalized Gauss-Newton (GGN) approximation is often used to make practical Bayesian deep learning approaches scalable by replacing a second order derivative with a product of first order derivatives. In this paper we argue that the…

Machine Learning · Statistics 2021-02-26 Alexander Immer , Maciej Korzepa , Matthias Bauer

Maximum Approximated Likelihood Estimation

Empirical economic research frequently applies maximum likelihood estimation in cases where the likelihood function is analytically intractable. Most of the theoretical literature focuses on maximum simulated likelihood (MSL) estimators,…

Econometrics · Economics 2019-08-13 Michael Griebel , Florian Heiss , Jens Oettershagen , Constantin Weiser

The Stochastic Multi-Proximal Method for Nonsmooth Optimization

Stochastic gradient descent type methods are ubiquitous in machine learning, but they are only applicable to the optimization of differentiable functions. Proximal algorithms are more general and applicable to nonsmooth functions. We…

Optimization and Control · Mathematics 2025-05-20 Laurent Condat , Elnur Gasanov , Peter Richtárik

Potentially Predictive Variance Reducing Subsample Locations in Local Gaussian Process Regression

Gaussian process models are commonly used as emulators for computer experiments. However, developing a Gaussian process emulator can be computationally prohibitive when the number of experimental samples is even moderately large. Local…

Methodology · Statistics 2018-09-26 Chih-Li Sung , Robert B. Gramacy , Benjamin Haaland

Composite Optimization using Local Models and Global Approximations

This work presents a unified framework that combines global approximations with locally built models to handle challenging nonconvex and nonsmooth composite optimization problems, including cases involving extended real-valued functions. We…

Optimization and Control · Mathematics 2026-02-19 Welington de Oliveira , Johannes O. Royset

Matrix Approximation under Local Low-Rank Assumption

Matrix approximation is a common tool in machine learning for building accurate prediction models for recommendation systems, text mining, and computer vision. A prevalent assumption in constructing matrix approximations is that the…

Machine Learning · Computer Science 2013-01-16 Joonseok Lee , Seungyeon Kim , Guy Lebanon , Yoram Singer

Dynamic Universal Approximation Theory: The Basic Theory for Transformer-based Large Language Models

Language models have emerged as a critical area of focus in artificial intelligence, particularly with the introduction of groundbreaking innovations like ChatGPT. Large-scale Transformer networks have quickly become the leading approach…

Artificial Intelligence · Computer Science 2024-12-12 Wei Wang , Qing Li

SLM: End-to-end Feature Selection via Sparse Learnable Masks

Feature selection has been widely used to alleviate compute requirements during training, elucidate model interpretability, and improve model generalizability. We propose SLM -- Sparse Learnable Masks -- a canonical approach for end-to-end…

Machine Learning · Computer Science 2023-04-07 Yihe Dong , Sercan O. Arik

From Local SGD to Local Fixed-Point Methods for Federated Learning

Most algorithms for solving optimization problems or finding saddle points of convex-concave functions are fixed-point algorithms. In this work we consider the generic problem of finding a fixed point of an average of operators, or an…

Machine Learning · Computer Science 2020-06-17 Grigory Malinovsky , Dmitry Kovalev , Elnur Gasanov , Laurent Condat , Peter Richtárik

Locally Linear Continual Learning for Time Series based on VC-Theoretical Generalization Bounds

Most machine learning methods assume fixed probability distributions, limiting their applicability in nonstationary real-world scenarios. While continual learning methods address this issue, current approaches often rely on black-box models…

Machine Learning · Computer Science 2026-03-17 Yan V. G. Ferreira , Igor B. Lima , Pedro H. G. Mapa S. , Felipe V. Campos , Antonio P. Braga

Projection Methods for Operator Learning and Universal Approximation

We obtain a new universal approximation theorem for continuous (possibly nonlinear) operators on arbitrary Banach spaces using the Leray-Schauder mapping. Moreover, we introduce and study a method for operator learning in Banach spaces…

Numerical Analysis · Mathematics 2026-03-17 Emanuele Zappala

The Universal Approximation Property

The universal approximation property of various machine learning models is currently only understood on a case-by-case basis, limiting the rapid development of new theoretically justified neural network architectures and blurring our…

Machine Learning · Statistics 2020-12-01 Anastasis Kratsios