Related papers: Backprop as Functor: A compositional perspective o…

Learners' Languages

In "Backprop as Functor", the authors show that the fundamental elements of deep learning -- gradient descent and backpropagation -- can be conceptualized as a strong monoidal functor $\mathrm{Para}(\mathrm{Euc}) \to \mathrm{Learn}$ from the category of parameterized…

Category Theory · Mathematics 2025-06-09 David I. Spivak

Polynomial Regression as a Task for Understanding In-context Learning Through Finetuning and Alignment

Simple function classes have emerged as toy problems to better understand in-context learning in transformer-based architectures used for large language models. But previously proposed simple function classes like linear regression or…

Machine Learning · Computer Science 2024-07-30 Max Wilcoxson, Morten Svendgård, Ria Doshi, Dylan Davis, Reya Vir, Anant Sahai

Function regression using the forward forward training and inferring paradigm

Function regression/approximation is a fundamental application of machine learning. Neural networks (NNs) can be easily trained for function regression using a sufficient number of neurons and epochs. The forward-forward learning algorithm…

Machine Learning · Computer Science 2025-10-16 Shivam Padmani, Akshay Joshi

Fundamental Components of Deep Learning: A category-theoretic approach

Deep learning, despite its remarkable achievements, is still a young field. Like the early stages of many scientific disciplines, it is marked by the discovery of new phenomena, ad-hoc design decisions, and the lack of a uniform and…

Machine Learning · Computer Science 2024-03-21 Bruno Gavranović

Submodular Functions: Learnability, Structure, and Optimization

Submodular functions are discrete functions that model laws of diminishing returns and enjoy numerous algorithmic applications. They have been used in many areas, including combinatorial optimization, machine learning, and economics. In…

Data Structures and Algorithms · Computer Science 2012-08-24 Maria-Florina Balcan, Nicholas J. A. Harvey

Panprediction: Optimal Predictions for Any Downstream Task and Loss

Supervised learning is classically formulated as training a model to minimize a fixed loss function over a fixed distribution, or task. However, an emerging paradigm instead views model training as extracting enough information from data so…

Machine Learning · Computer Science 2025-11-03 Sivaraman Balakrishnan, Nika Haghtalab, Daniel Hsu, Brian Lee, Eric Zhao

Backprop Evolution

The back-propagation algorithm is the cornerstone of deep learning. Despite its importance, few variations of the algorithm have been attempted. This work presents an approach to discover new variations of the back-propagation equation. We…

Neural and Evolutionary Computing · Computer Science 2018-08-09 Maximilian Alber, Irwan Bello, Barret Zoph, Pieter-Jan Kindermans, Prajit Ramachandran, Quoc Le

Self-supervised Training of Proposal-based Segmentation via Background Prediction

While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is…

Computer Vision and Pattern Recognition · Computer Science 2019-07-19 Isinsu Katircioglu, Helge Rhodin, Victor Constantin, Jörg Spörri, Mathieu Salzmann, Pascal Fua

Proximal Backpropagation

We propose proximal backpropagation (ProxProp) as a novel algorithm that takes implicit instead of explicit gradient steps to update the network parameters during neural network training. Our algorithm is motivated by the step size…

Machine Learning · Computer Science 2018-02-21 Thomas Frerix, Thomas Möllenhoff, Michael Moeller, Daniel Cremers

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

This paper provides a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures. In particular, we show that generative…

Machine Learning · Computer Science 2019-05-21 Casey Chu, Jose Blanchet, Peter Glynn

Supervised Manifold Learning for Functional Data

Classification is a core topic in functional data analysis. A large number of functional classifiers have been proposed in the literature, most of which are based on functional principal component analysis or functional regression. In…

Methodology · Statistics 2025-10-14 Ruoxu Tan, Yiming Zang

Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks

Representations are fundamental to artificial intelligence. The performance of a learning system depends on the type of representation used for representing the data. Typically, these representations are hand-engineered using domain…

Machine Learning · Computer Science 2017-04-28 Vivek Veeriah, Shangtong Zhang, Richard S. Sutton

Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods

In this paper we propose a novel gradient algorithm to learn a policy from an expert's observed behavior assuming that the expert behaves optimally with respect to some unknown reward function of a Markovian Decision Problem. The…

Machine Learning · Computer Science 2012-06-26 Gergely Neu, Csaba Szepesvari

Analyzing Inexact Hypergradients for Bilevel Learning

Estimating hyperparameters has been a long-standing problem in machine learning. We consider the case where the task at hand is modeled as the solution to an optimization problem. Here the exact gradient with respect to the hyperparameters…

Optimization and Control · Mathematics 2023-11-16 Matthias J. Ehrhardt, Lindon Roberts

Functorial Manifold Learning

We adapt previous research on category theory and topological unsupervised learning to develop a functorial perspective on manifold learning, also known as nonlinear dimensionality reduction. We first characterize manifold learning…

Machine Learning · Computer Science 2022-11-04 Dan Shiebler

Gradients without Backpropagation

Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic…

Machine Learning · Computer Science 2022-02-18 Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

Optimized conformal classification using gradient descent approximation

Conformal predictors are an important class of algorithms that allow predictions to be made with a user-defined confidence level. They are able to do this by outputting prediction sets, rather than simple point predictions. The conformal…

Machine Learning · Computer Science 2021-05-25 Anthony Bellotti

Supervised Stochastic Gradient Algorithms for Multi-Trial Source Separation

We develop a stochastic algorithm for independent component analysis that incorporates multi-trial supervision, which is available in many scientific contexts. The method blends a proximal gradient-type algorithm in the space of invertible…

Machine Learning · Computer Science 2025-08-29 Ronak Mehta, Mateus Piovezan Otto, Noah Stanis, Azadeh Yazdan-Shahmorad, Zaid Harchaoui

Backpropagation in the Simply Typed Lambda-calculus with Linear Negation

Backpropagation is a classic automatic differentiation algorithm computing the gradient of functions specified by a certain class of simple, first-order programs, called computational graphs. It is a fundamental tool in several fields, most…

Logic in Computer Science · Computer Science 2019-11-07 Alois Brunel, Damiano Mazza, Michele Pagani

The Golden Ratio of Learning and Momentum

Gradient descent has been a central training principle for artificial neural networks from the early beginnings to today's deep learning networks. The most common implementation is the backpropagation algorithm for training feed-forward…

Machine Learning · Computer Science 2020-06-09 Stefan Jaeger