Related papers: Stochastic Separation Theorems

General stochastic separation theorems with optimal bounds

Phenomenon of stochastic separability was revealed and used in machine learning to correct errors of Artificial Intelligence (AI) systems and analyze AI instabilities. In high-dimensional datasets under broad assumptions each point can be…

Artificial Intelligence · Computer Science 2021-03-04 Bogdan Grechuk , Alexander N. Gorban , Ivan Y. Tyukin

Linear and Fisher Separability of Random Points in the d-dimensional Spherical Layer

Stochastic separation theorems play important role in high-dimensional data analysis and machine learning. It turns out that in high dimension any point of a random set of points can be separated from other points by a hyperplane with high…

Probability · Mathematics 2020-12-02 Sergey Sidorov , Nikolai Zolotykh

High-dimensional separability for one- and few-shot learning

This work is driven by a practical question: corrections of Artificial Intelligence (AI) errors. These corrections should be quick and non-iterative. To solve this problem without modification of a legacy AI system, we propose special…

Machine Learning · Computer Science 2021-10-26 Alexander N. Gorban , Bogdan Grechuk , Evgeny M. Mirkes , Sergey V. Stasenko , Ivan Y. Tyukin

Correction of AI systems by linear discriminants: Probabilistic foundations

Artificial Intelligence (AI) systems sometimes make errors and will make errors in the future, from time to time. These errors are usually unexpected, and can lead to dramatic consequences. Intensive development of AI and its practical…

Machine Learning · Computer Science 2019-03-01 A. N. Gorban , A. Golubkov , B. Grechuk , E. M. Mirkes , I. Y. Tyukin

Dynamic Latent Separation for Deep Learning

A core problem in machine learning is to learn expressive latent variables for model prediction on complex data that involves multiple sub-components in a flexible and interpretable fashion. Here, we develop an approach that improves…

Machine Learning · Computer Science 2024-02-13 Yi-Lin Tuan , Zih-Yun Chiu , William Yang Wang

On the Optimality of Averaging in Distributed Statistical Learning

A common approach to statistical learning with big-data is to randomly split it among $m$ machines and learn the parameter of interest by averaging the $m$ individual estimates. In this paper, focusing on empirical risk minimization, or…

Machine Learning · Statistics 2016-06-14 Jonathan Rosenblatt , Boaz Nadler

One-Class Semi-Supervised Learning: Detecting Linearly Separable Class by its Mean

In this paper, we presented a novel semi-supervised one-class classification algorithm which assumes that class is linearly separable from other elements. We proved theoretically that class is linearly separable if and only if it is maximal…

Machine Learning · Statistics 2017-05-03 Evgeny Bauman , Konstantin Bauman

Stochastic Approximation with Random Step Sizes and Urn Models with Random Replacement Matrices Having Finite Mean

Stochastic approximation algorithm is a useful technique which has been exploited successfully in probability theory and statistics for a long time. The step sizes used in stochastic approximation are generally taken to be deterministic and…

Probability · Mathematics 2019-09-25 Ujan Gangopadhyay , Krishanu Maulik

Blessing of dimensionality: mathematical foundations of the statistical physics of data

The concentration of measure phenomena were discovered as the mathematical background of statistical mechanics at the end of the XIX - beginning of the XX century and were then explored in mathematics of the XX-XXI centuries. At the…

Machine Learning · Computer Science 2018-03-20 A. N. Gorban , I. Y. Tyukin

On the Separability of Stochastic Geometric Objects, with Applications

In this paper, we study the linear separability problem for stochastic geometric objects under the well-known unipoint/multipoint uncertainty models. Let $S=S_R \cup S_B$ be a given set of stochastic bichromatic points, and define $n =…

Computational Geometry · Computer Science 2016-04-06 Jie Xue , Yuan Li , Ravi Janardan

For better learning, large datasets are often split into small batches and fed sequentially to the predictive model. In this paper, we study such batch decompositions from a probabilistic perspective. We assume that data points (possibly…

Machine Learning · Computer Science 2025-04-10 Ghurumuruhan Ganesan

Parameter Identification in a Probabilistic Setting

Parameter identification problems are formulated in a probabilistic language, where the randomness reflects the uncertainty about the knowledge of the true values. This setting allows conceptually easily to incorporate new information, e.g.…

Numerical Analysis · Computer Science 2013-03-19 Bojana V. Rosić , Anna Kučerová , Jan Sýkora , Oliver Pajonk , Alexander Litvinenko , Hermann G. Matthies

Boosting Bayesian Parameter Inference of Nonlinear Stochastic Differential Equation Models by Hamiltonian Scale Separation

Parameter inference is a fundamental problem in data-driven modeling. Given observed data that is believed to be a realization of some parameterized model, the aim is to find parameter values that are able to explain the observed data. In…

Data Structures and Algorithms · Computer Science 2016-04-20 Carlo Albert , Simone Ulzega , Ruedi Stoop

Stochastic Segmentation Networks: Modelling Spatially Correlated Aleatoric Uncertainty

In image segmentation, there is often more than one plausible solution for a given input. In medical imaging, for example, experts will often disagree about the exact location of object boundaries. Estimating this inherent uncertainty and…

Computer Vision and Pattern Recognition · Computer Science 2020-12-23 Miguel Monteiro , Loïc Le Folgoc , Daniel Coelho de Castro , Nick Pawlowski , Bernardo Marques , Konstantinos Kamnitsas , Mark van der Wilk , Ben Glocker

Discriminant analysis in small and large dimensions

We study the distributional properties of the linear discriminant function under the assumption of normality by comparing two groups with the same covariance matrix but different mean vectors. A stochastic representation for the…

Statistics Theory · Mathematics 2017-05-09 Taras Bodnar , Stepan Mazur , Edward Ngailo , Nestor Parolya

A Probabilistic Separation Logic

Probabilistic independence is a useful concept for describing the result of random sampling---a basic operation in all probabilistic languages---and for reasoning about groups of random variables. Nevertheless, existing verification methods…

Programming Languages · Computer Science 2020-07-21 Gilles Barthe , Justin Hsu , Kevin Liao

From Spectral Theorem to Statistical Independence with Application to System Identification

High dimensional random dynamical systems are ubiquitous, including -- but not limited to -- cyber-physical systems, daily return on different stocks of S&P 1500 and velocity profile of interacting particle systems around McKeanVlasov…

Statistics Theory · Mathematics 2023-10-17 Muhammad Abdullah Naeem , Amir Khazraei , Miroslav Pajic

Stochastic Descent Analysis of Representation Learning Algorithms

Although stochastic approximation learning methods have been widely used in the machine learning literature for over 50 years, formal theoretical analyses of specific machine learning algorithms are less common because stochastic…

Machine Learning · Statistics 2017-04-21 Richard M. Golden

Computationally Efficient Robust Estimation of Sparse Functionals

Many conventional statistical procedures are extremely sensitive to seemingly minor deviations from modeling assumptions. This problem is exacerbated in modern high-dimensional settings, where the problem dimension can grow with and…

Machine Learning · Statistics 2017-02-27 Simon S. Du , Sivaraman Balakrishnan , Aarti Singh

Near-Optimal Density Estimation in Near-Linear Time Using Variable-Width Histograms

Let $p$ be an unknown and arbitrary probability distribution over $[0,1)$. We consider the problem of {\em density estimation}, in which a learning algorithm is given i.i.d. draws from $p$ and must (with high probability) output a…

Machine Learning · Computer Science 2014-11-04 Siu-On Chan , Ilias Diakonikolas , Rocco A. Servedio , Xiaorui Sun