Related papers: Latent Iterative Refinement for Modular Source Sep…

Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

Deep Learning architectures, and in particular Transformers, are conventionally viewed as a composition of layers. These layers are actually often obtained as the sum of two contributions: a residual path that copies the input and the…

Machine Learning · Computer Science 2026-03-03 Jonathan Lys , Vincent Gripon , Bastien Pasdeloup , Axel Marmoret , Lukas Mauch , Fabien Cardinaux , Ghouthi Boukli Hacene

Two-Step Sound Source Separation: Training on Learned Latent Targets

In this paper, we propose a two-step training procedure for source separation via a deep neural network. In the first step we learn a transform (and it's inverse) to a latent space where masking-based separation performance using oracles is…

Machine Learning · Computer Science 2021-05-12 Efthymios Tzinis , Shrikant Venkataramani , Zhepei Wang , Cem Subakan , Paris Smaragdis

Verify Distributed Deep Learning Model Implementation Refinement with Iterative Relation Inference

Distributed machine learning training and inference is common today because today's large models require more memory and compute than can be provided by a single GPU. Distributed models are generally produced by programmers who take a…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-14 Zhanghan Wang , Ding Ding , Hang Zhu , Haibin Lin , Aurojit Panda

RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement

Despite the remarkable success of deep learning systems over the last decade, a key difference still remains between neural network and human decision-making: As humans, we cannot only form a decision on the spot, but also ponder,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-15 Gregor Koehler , Tassilo Wald , Constantin Ulrich , David Zimmerer , Paul F. Jaeger , Jörg K. H. Franke , Simon Kohl , Fabian Isensee , Klaus H. Maier-Hein

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

State-of-the-art, high capacity deep neural networks not only require large amounts of labelled training data, they are also highly susceptible to label errors in this data, typically resulting in large efforts and costs and therefore…

Machine Learning · Computer Science 2020-07-20 Christian Haase-Schütz , Rainer Stal , Heinz Hertlein , Bernhard Sick

Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model

Extracting individual elements from music mixtures is a valuable tool for music production and practice. While neural networks optimized to mask or transform mixture spectrograms into the individual source(s) have been the leading approach,…

Sound · Computer Science 2025-11-26 Genís Plaja-Roglans , Yun-Ning Hung , Xavier Serra , Igor Pereira

Progressive Latent Replay for efficient Generative Rehearsal

We introduce a new method for internal replay that modulates the frequency of rehearsal based on the depth of the network. While replay strategies mitigate the effects of catastrophic forgetting in neural networks, recent works on…

Computer Vision and Pattern Recognition · Computer Science 2022-07-07 Stanisław Pawlak , Filip Szatkowski , Michał Bortkiewicz , Jan Dubiński , Tomasz Trzciński

Training-Free Multi-Step Audio Source Separation

Audio source separation aims to separate a mixture into target sources. Previous audio source separation systems usually conduct one-step inference, which does not fully explore the separation ability of models. In this work, we reveal that…

Sound · Computer Science 2025-05-27 Yongyi Zang , Jingyi Li , Qiuqiang Kong

Object Representations as Fixed Points: Training Iterative Refinement Algorithms with Implicit Differentiation

Iterative refinement -- start with a random guess, then iteratively improve the guess -- is a useful paradigm for representation learning because it offers a way to break symmetries among equally plausible explanations for the data. This…

Machine Learning · Computer Science 2023-01-03 Michael Chang , Thomas L. Griffiths , Sergey Levine

Scattered or Connected? An Optimized Parameter-efficient Tuning Approach for Information Retrieval

Pre-training and fine-tuning have achieved significant advances in the information retrieval (IR). A typical approach is to fine-tune all the parameters of large-scale pre-trained models (PTMs) on downstream tasks. As the model size and the…

Information Retrieval · Computer Science 2022-08-23 Xinyu Ma , Jiafeng Guo , Ruqing Zhang , Yixing Fan , Xueqi Cheng

Multilevel Initialization for Layer-Parallel Deep Neural Network Training

This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal…

Machine Learning · Computer Science 2019-12-20 Eric C. Cyr , Stefanie Günther , Jacob B. Schroder

Operator Learning Meets Numerical Analysis: Improving Neural Networks through Iterative Methods

Deep neural networks, despite their success in numerous applications, often function without established theoretical foundations. In this paper, we bridge this gap by drawing parallels between deep learning and classical numerical analysis.…

Machine Learning · Computer Science 2023-10-04 Emanuele Zappala , Daniel Levine , Sizhuang He , Syed Rizvi , Sacha Levy , David van Dijk

Iterative training of neural networks for intra prediction

This paper presents an iterative training of neural networks for intra prediction in a block-based image and video codec. First, the neural networks are trained on blocks arising from the codec partitioning of images, each paired with its…

Image and Video Processing · Electrical Eng. & Systems 2021-02-24 Thierry Dumas , Franck Galpin , Philippe Bordes

Loop Neural Networks for Parameter Sharing

The success of large-scale language models like GPT can be attributed to their ability to efficiently predict the next token in a sequence. However, these models rely on constant computational effort regardless of the complexity of the…

Artificial Intelligence · Computer Science 2024-11-11 Kei-Sing Ng , Qingchen Wang

ItNet: iterative neural networks with small graphs for accurate, efficient and anytime semantic segmentation

Deep neural networks have usually to be compressed and accelerated for their usage in low-power, e.g. mobile, devices. Recently, massively-parallel hardware accelerators were developed that offer high throughput and low latency at low power…

Machine Learning · Computer Science 2021-08-04 Thomas Pfeil

Introspective Learning : A Two-Stage Approach for Inference in Neural Networks

In this paper, we advocate for two stages in a neural network's decision making process. The first is the existing feed-forward inference framework where patterns in given data are sensed and associated with previously learned patterns. The…

Machine Learning · Computer Science 2022-09-20 Mohit Prabhushankar , Ghassan AlRegib

SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models

Large reasoning models (LRMs) excel at complex reasoning tasks but typically generate lengthy sequential chains-of-thought, resulting in long inference times before arriving at the final answer. To address this challenge, we introduce…

Artificial Intelligence · Computer Science 2025-12-04 Emil Biju , Shayan Talaei , Zhemin Huang , Mohammadreza Pourreza , Azalia Mirhoseini , Amin Saberi

Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning

Reasoning models improve their problem-solving ability through inference-time scaling, allocating more compute via longer token budgets. Identifying which reasoning traces are likely to succeed remains a key opportunity: reliably predicting…

Artificial Intelligence · Computer Science 2025-10-14 Martina G. Vilas , Safoora Yousefi , Besmira Nushi , Eric Horvitz , Vidhisha Balachandran

Improving Location-based Thermal Emission Side-Channel Analysis Using Iterative Transfer Learning

This paper proposes the use of iterative transfer learning applied to deep learning models for side-channel attacks. Currently, most of the side-channel attack methods train a model for each individual byte, without considering the…

Machine Learning · Computer Science 2024-12-31 Tun-Chieh Lou , Chung-Che Wang , Jyh-Shing Roger Jang , Henian Li , Lang Lin , Norman Chang

Discriminative and Generative Learning for Linear Estimation of Random Signals [Lecture Notes]

Inference tasks in signal processing are often characterized by the availability of reliable statistical modeling with some missing instance-specific parameters. One conventional approach uses data to estimate these missing parameters and…

Signal Processing · Electrical Eng. & Systems 2023-04-25 Nir Shlezinger , Tirza Routtenberg