Tim Roith — Scifaro

Adaptive Regularization for Sparsity Control in Bregman-Based Optimizers

Sparse training reduces the memory and computational costs of deep neural networks. However, sparse optimization methods, e.g., those adding an $\ell_1$ penalty, often control sparsity only indirectly through a regularization parameter…

Machine Learning · Computer Science 2026-05-21 Ahmad Aloradi , Tim Roith , Emanuël A. P. Habets , Daniel Tenbrinck

Quantifying Concentration Phenomena of Mean-Field Transformers in the Low-Temperature Regime

Transformers with self-attention modules as their core components have become an integral architecture in modern large language and foundation models. In this paper, we study the evolution of tokens in deep encoder-only transformers at…

Analysis of PDEs · Mathematics 2026-05-12 Albert Alcalde , Leon Bungert , Konstantin Riedl , Tim Roith

Position-Blind Ptychography: Viability of image reconstruction via data-driven variational inference

In this work, we present and investigate the novel blind inverse problem of position-blind ptychography, i.e., ptychographic phase retrieval without any knowledge of scan positions, which then must be recovered jointly with the image. The…

Image and Video Processing · Electrical Eng. & Systems 2026-03-18 Simon Welker , Lorenz Kuger , Tim Roith , Berthy Feng , Martin Burger , Timo Gerkmann , Henry Chapman

Allure of Craquelure: A Variational-Generative Approach to Crack Detection in Paintings

Recent advances in imaging technologies, deep learning and numerical performance have enabled non-invasive detailed analysis of artworks, supporting their documentation and conservation. In particular, automated detection of craquelure in…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Laura Paul , Holger Rauhut , Martin Burger , Samira Kabri , Tim Roith

Adversarial flows: A gradient flow characterization of adversarial attacks

A popular method to perform adversarial attacks on neuronal networks is the so-called fast gradient sign method and its iterative variant. In this paper, we interpret this method as an explicit Euler discretization of a differential…

Machine Learning · Computer Science 2025-09-17 Lukas Weigand , Tim Roith , Martin Burger

Introduction to Regularization and Learning Methods for Inverse Problems

These lecture notes evolve around mathematical concepts arising in inverse problems. We start by introducing inverse problems through examples such as differentiation, deconvolution, computed tomography and phase retrieval. This then leads…

Numerical Analysis · Mathematics 2025-08-26 Danielle Bednarski , Tim Roith

MirrorCBO: A consensus-based optimization method in the spirit of mirror descent

In this work we propose MirrorCBO, a consensus-based optimization (CBO) method which generalizes standard CBO in the same way that mirror descent generalizes gradient descent. For this we apply the CBO methodology to a swarm of dual…

Optimization and Control · Mathematics 2025-07-17 Leon Bungert , Franca Hoffmann , Dohyeon Kim , Tim Roith

Consensus-based optimization for closed-box adversarial attacks and a connection to evolution strategies

Consensus-based optimization (CBO) has established itself as an efficient gradient-free optimization scheme, with attractive mathematical properties, such as mean-field convergence results for non-convex loss functions. In this work, we…

Optimization and Control · Mathematics 2025-07-01 Tim Roith , Leon Bungert , Philipp Wacker

Analysis of mean-field models arising from self-attention dynamics in transformer architectures with layer normalization

The aim of this paper is to provide a mathematical analysis of transformer architectures using a self-attention mechanism with layer normalization. In particular, observed patterns in such architectures resembling either clusters or uniform…

Analysis of PDEs · Mathematics 2025-04-29 Martin Burger , Samira Kabri , Yury Korolev , Tim Roith , Lukas Weigand

CBX: Python and Julia packages for consensus-based interacting particle methods

We introduce CBXPy and ConsensusBasedX.jl, Python and Julia implementations of consensus-based interacting particle systems (CBX), which generalise consensus-based optimization methods (CBO) for global, derivative-free optimisation. The…

Optimization and Control · Mathematics 2024-12-04 Rafael Bailo , Alethea Barbaro , Susana N. Gomes , Konstantin Riedl , Tim Roith , Claudia Totzeck , Urbain Vaes

Ratio convergence rates for Euclidean first-passage percolation: Applications to the graph infinity Laplacian

In this paper we prove the first quantitative convergence rates for the graph infinity Laplace equation for length scales at the connectivity threshold. In the graph-based semi-supervised learning community this equation is also known as…

Probability · Mathematics 2024-02-23 Leon Bungert , Jeff Calder , Tim Roith

Learning a Sparse Representation of Barron Functions with the Inverse Scale Space Flow

This paper presents a method for finding a sparse representation of Barron functions. Specifically, given an $L^2$ function $f$, the inverse scale space flow is used to find a sparse measure $\mu$ minimising the $L^2$ loss between the…

Machine Learning · Statistics 2023-12-06 Tjeerd Jan Heeringa , Tim Roith , Christoph Brune , Martin Burger

Polarized consensus-based dynamics for optimization and sampling

In this paper we propose polarized consensus-based dynamics in order to make consensus-based optimization (CBO) and sampling (CBS) applicable for objective functions with several global minima or distributions with many modes, respectively.…

Optimization and Control · Mathematics 2023-10-10 Leon Bungert , Tim Roith , Philipp Wacker

Resolution-Invariant Image Classification based on Fourier Neural Operators

In this paper we investigate the use of Fourier Neural Operators (FNOs) for image classification in comparison to standard Convolutional Neural Networks (CNNs). Neural operators are a discretization-invariant generalization of neural…

Computer Vision and Pattern Recognition · Computer Science 2023-04-05 Samira Kabri , Tim Roith , Daniel Tenbrinck , Martin Burger

Uniform Convergence Rates for Lipschitz Learning on Graphs

Lipschitz learning is a graph-based semi-supervised learning method where one extends labels from a labeled to an unlabeled data set by solving the infinity Laplace equation on a weighted graph. In this work we prove uniform convergence…

Numerical Analysis · Mathematics 2023-01-31 Leon Bungert , Jeff Calder , Tim Roith

CLIP: Cheap Lipschitz Training of Neural Networks

Despite the large success of deep neural networks (DNN) in recent years, most neural networks still lack mathematical guarantees in terms of stability. For instance, DNNs are vulnerable to small or even imperceptible input perturbations, so…

Machine Learning · Computer Science 2022-11-02 Leon Bungert , René Raab , Tim Roith , Leo Schwinn , Daniel Tenbrinck

A Bregman Learning Framework for Sparse Neural Networks

We propose a learning framework based on stochastic Bregman iterations, also known as mirror descent, to train sparse neural networks with an inverse scale space approach. We derive a baseline algorithm called LinBreg, an accelerated…

Machine Learning · Computer Science 2022-08-16 Leon Bungert , Tim Roith , Daniel Tenbrinck , Martin Burger

Continuum Limit of Lipschitz Learning on Graphs

Tackling semi-supervised learning problems with graph-based methods has become a trend in recent years since graphs can represent all kinds of data and provide a suitable framework for studying continuum limits, e.g., of differential…

Machine Learning · Computer Science 2022-02-07 Tim Roith , Leon Bungert

Neural Architecture Search via Bregman Iterations

We propose a novel strategy for Neural Architecture Search (NAS) based on Bregman iterations. Starting from a sparse neural network our gradient-based one-shot algorithm gradually adds relevant parameters in an inverse scale space manner.…

Machine Learning · Computer Science 2021-06-07 Leon Bungert , Tim Roith , Daniel Tenbrinck , Martin Burger