Related papers: Generalised Perceptron Learning

Lifted Bregman Training of Neural Networks

We introduce a novel mathematical formulation for the training of feed-forward neural networks with (potentially non-smooth) proximal maps as activation functions. This formulation is based on Bregman distances and a key advantage is that…

Optimization and Control · Mathematics 2022-08-19 Xiaoyu Wang , Martin Benning

A Simple Convergence Analysis of Bregman Proximal Gradient Algorithm

In this paper, we provide a simple convergence analysis of proximal gradient algorithm with Bregman distance, which provides a tighter bound than existing result. In particular, for the problem of minimizing a class of convex objective…

Optimization and Control · Mathematics 2017-12-19 Yi Zhou , Yingbin Liang , Lixin Shen

On Accelerated Perceptrons and Beyond

The classical Perceptron algorithm of Rosenblatt can be used to find a linear threshold function to correctly classify $n$ linearly separable data points, assuming the classes are separated by some margin $\gamma > 0$. A foundational result…

Machine Learning · Computer Science 2022-10-19 Guanghui Wang , Rafael Hanashiro , Etash Guha , Jacob Abernethy

Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration

Even for the gradient descent (GD) method applied to neural network training, understanding its optimization dynamics, including convergence rate, iterate trajectories, function value oscillations, and especially its implicit acceleration,…

Machine Learning · Computer Science 2026-05-22 Alexander Tyurin

GBM-based Bregman Proximal Algorithms for Constrained Learning

As the complexity of learning tasks surges, modern machine learning encounters a new constrained learning paradigm characterized by more intricate and data-driven function constraints. Prominent applications include Neyman-Pearson…

Machine Learning · Computer Science 2023-08-22 Zhenwei Lin , Qi Deng

A Bregman Method for Structure Learning on Sparse Directed Acyclic Graphs

We develop a Bregman proximal gradient method for structure learning on linear structural causal models. While the problem is non-convex, has high curvature and is in fact NP-hard, Bregman gradient methods allow us to neutralize at least…

Machine Learning · Statistics 2020-11-06 Manon Romain , Alexandre d'Aspremont

Generalization ability of a perceptron with non-monotonic transfer function

We investigate the generalization ability of a perceptron with non-monotonic transfer function of a reversed-wedge type in on-line mode. This network is identical to a parity machine, a multilayer network. We consider several learning…

Disordered Systems and Neural Networks · Physics 2009-10-30 Jun-ichi Inoue , Hidetoshi Nishimori , Yoshiyuki Kabashima

Gradient Descent as Implicit EM in Distance-Based Neural Models

Neural networks trained with standard objectives exhibit behaviors characteristic of probabilistic inference: soft clustering, prototype specialization, and Bayesian uncertainty tracking. These phenomena appear across architectures -- in…

Machine Learning · Computer Science 2026-01-01 Alan Oursland

Fast Reinforcement Learning with Incremental Gaussian Mixture Models

This work presents a novel algorithm that integrates a data-efficient function approximator with reinforcement learning in continuous state spaces. An online and incremental algorithm capable of learning from a single pass through data,…

Machine Learning · Computer Science 2020-11-03 Rafael Pinto

Bellman Gradient Iteration for Inverse Reinforcement Learning

This paper develops an inverse reinforcement learning algorithm aimed at recovering a reward function from the observed actions of an agent. We introduce a strategy to flexibly handle different types of actions with two approximations of…

Machine Learning · Computer Science 2017-07-26 Kun Li , Yanan Sui , Joel W. Burdick

Softly Induced Functional Simplicity: Implications for Neural Network Generalisation, Robustness, and Distillation

Learning robust and generalisable abstractions from high-dimensional input data is a central challenge in machine learning and its applications to high-energy physics (HEP). Solutions of lower functional complexity are known to produce…

Machine Learning · Computer Science 2026-01-16 Maciej Glowacki

Deep Learning as a Mixed Convex-Combinatorial Optimization Problem

As neural networks grow deeper and wider, learning networks with hard-threshold activations is becoming increasingly important, both for network quantization, which can drastically reduce time and energy requirements, and for creating large…

Machine Learning · Computer Science 2018-04-18 Abram L. Friesen , Pedro Domingos

An Adaptive Gradient Method with Energy and Momentum

We introduce a novel algorithm for gradient-based optimization of stochastic objective functions. The method may be seen as a variant of SGD with momentum equipped with an adaptive learning rate automatically adjusted by an 'energy'…

Optimization and Control · Mathematics 2022-03-24 Hailiang Liu , Xuping Tian

Convergence of energy-based learning in linear resistive networks

Energy-based learning algorithms are alternatives to backpropagation and are well-suited to distributed implementations in analog electronic devices. However, a rigorous theory of convergence is lacking. We make a first step in this…

Optimization and Control · Mathematics 2026-01-28 Anne-Men Huijzer , Thomas Chaffey , Bart Besselink , Henk J. van Waarde

Training a perceptron in a discrete weight space

On-line and batch learning of a perceptron in a discrete weight space, where each weight can take $2 L+1$ different values, are examined analytically and numerically. The learning algorithm is based on the training of the continuous…

Statistical Mechanics · Physics 2009-11-07 Michal Rosen-Zvi , Ido Kanter

A Kernel Loss for Solving the Bellman Equation

Value function learning plays a central role in many state-of-the-art reinforcement-learning algorithms. Many popular algorithms like Q-learning do not optimize any objective function, but are fixed-point iterations of some variant of…

Machine Learning · Computer Science 2020-01-10 Yihao Feng , Lihong Li , Qiang Liu

Reinforcement Learning with Goal-Distance Gradient

Reinforcement learning usually uses the feedback rewards of environmental to train agents. But the rewards in the actual environment are sparse, and even some environments will not rewards. Most of the current methods are difficult to get…

Machine Learning · Computer Science 2020-01-13 Kai Jiang , XiaoLong Qin

Bayesian Perceptron: Towards fully Bayesian Neural Networks

Artificial neural networks (NNs) have become the de facto standard in machine learning. They allow learning highly nonlinear transformations in a plethora of applications. However, NNs usually only provide point estimates without…

Machine Learning · Statistics 2020-09-11 Marco F. Huber

Functional Gradient Boosting based on Residual Network Perception

Residual Networks (ResNets) have become state-of-the-art models in deep learning and several theoretical studies have been devoted to understanding why ResNet works so well. One attractive viewpoint on ResNet is that it is optimizing the…

Machine Learning · Statistics 2018-07-10 Atsushi Nitanda , Taiji Suzuki

Generalization from correlated sets of patterns in the perceptron

Generalization is a central aspect of learning theory. Here, we propose a framework that explores an auxiliary task-dependent notion of generalization, and attempts to quantitatively answer the following question: given two sets of patterns…

Disordered Systems and Neural Networks · Physics 2020-01-08 Francesco Borra , Marco Cosentino Lagomarsino , Pietro Rotondo , Marco Gherardi