Related papers: Latent Iterative Refinement for Modular Source Sep…

200 papers

Deep Learning architectures, and in particular Transformers, are conventionally viewed as a composition of layers. These layers are actually often obtained as the sum of two contributions: a residual path that copies the input and the…

In this paper, we propose a two-step training procedure for source separation via a deep neural network. In the first step we learn a transform (and its inverse) to a latent space where masking-based separation performance using oracles is…

Machine Learning · Computer Science 2021-05-12 Efthymios Tzinis, Shrikant Venkataramani, Zhepei Wang, Cem Subakan, Paris Smaragdis

Distributed machine learning training and inference is common today because today's large models require more memory and compute than can be provided by a single GPU. Distributed models are generally produced by programmers who take a…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-14 Zhanghan Wang, Ding Ding, Hang Zhu, Haibin Lin, Aurojit Panda

Despite the remarkable success of deep learning systems over the last decade, a key difference still remains between neural network and human decision-making: As humans, we can not only form a decision on the spot, but also ponder,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-15 Gregor Koehler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K. H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein

State-of-the-art, high capacity deep neural networks not only require large amounts of labelled training data, they are also highly susceptible to label errors in this data, typically resulting in large efforts and costs and therefore…

Machine Learning · Computer Science 2020-07-20 Christian Haase-Schütz, Rainer Stal, Heinz Hertlein, Bernhard Sick

Extracting individual elements from music mixtures is a valuable tool for music production and practice. While neural networks optimized to mask or transform mixture spectrograms into the individual source(s) have been the leading approach,…

Sound · Computer Science 2025-11-26 Genís Plaja-Roglans, Yun-Ning Hung, Xavier Serra, Igor Pereira

We introduce a new method for internal replay that modulates the frequency of rehearsal based on the depth of the network. While replay strategies mitigate the effects of catastrophic forgetting in neural networks, recent works on…

Computer Vision and Pattern Recognition · Computer Science 2022-07-07 Stanisław Pawlak, Filip Szatkowski, Michał Bortkiewicz, Jan Dubiński, Tomasz Trzciński

Audio source separation aims to separate a mixture into target sources. Previous audio source separation systems usually conduct one-step inference, which does not fully explore the separation ability of models. In this work, we reveal that…

Sound · Computer Science 2025-05-27 Yongyi Zang, Jingyi Li, Qiuqiang Kong

Iterative refinement -- start with a random guess, then iteratively improve the guess -- is a useful paradigm for representation learning because it offers a way to break symmetries among equally plausible explanations for the data. This…

Machine Learning · Computer Science 2023-01-03 Michael Chang, Thomas L. Griffiths, Sergey Levine

Pre-training and fine-tuning have achieved significant advances in information retrieval (IR). A typical approach is to fine-tune all the parameters of large-scale pre-trained models (PTMs) on downstream tasks. As the model size and the…

Information Retrieval · Computer Science 2022-08-23 Xinyu Ma, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Xueqi Cheng

This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal…

Machine Learning · Computer Science 2019-12-20 Eric C. Cyr, Stefanie Günther, Jacob B. Schroder

Deep neural networks, despite their success in numerous applications, often function without established theoretical foundations. In this paper, we bridge this gap by drawing parallels between deep learning and classical numerical analysis.…

Machine Learning · Computer Science 2023-10-04 Emanuele Zappala, Daniel Levine, Sizhuang He, Syed Rizvi, Sacha Levy, David van Dijk

This paper presents an iterative training of neural networks for intra prediction in a block-based image and video codec. First, the neural networks are trained on blocks arising from the codec partitioning of images, each paired with its…

Image and Video Processing · Electrical Eng. & Systems 2021-02-24 Thierry Dumas, Franck Galpin, Philippe Bordes

The success of large-scale language models like GPT can be attributed to their ability to efficiently predict the next token in a sequence. However, these models rely on constant computational effort regardless of the complexity of the…

Artificial Intelligence · Computer Science 2024-11-11 Kei-Sing Ng, Qingchen Wang

Deep neural networks usually have to be compressed and accelerated for use in low-power (e.g., mobile) devices. Recently, massively-parallel hardware accelerators were developed that offer high throughput and low latency at low power…

Machine Learning · Computer Science 2021-08-04 Thomas Pfeil

In this paper, we advocate for two stages in a neural network's decision making process. The first is the existing feed-forward inference framework where patterns in given data are sensed and associated with previously learned patterns. The…

Machine Learning · Computer Science 2022-09-20 Mohit Prabhushankar, Ghassan AlRegib

Large reasoning models (LRMs) excel at complex reasoning tasks but typically generate lengthy sequential chains-of-thought, resulting in long inference times before arriving at the final answer. To address this challenge, we introduce…

Artificial Intelligence · Computer Science 2025-12-04 Emil Biju, Shayan Talaei, Zhemin Huang, Mohammadreza Pourreza, Azalia Mirhoseini, Amin Saberi

Reasoning models improve their problem-solving ability through inference-time scaling, allocating more compute via longer token budgets. Identifying which reasoning traces are likely to succeed remains a key opportunity: reliably predicting…

Artificial Intelligence · Computer Science 2025-10-14 Martina G. Vilas, Safoora Yousefi, Besmira Nushi, Eric Horvitz, Vidhisha Balachandran

This paper proposes the use of iterative transfer learning applied to deep learning models for side-channel attacks. Currently, most of the side-channel attack methods train a model for each individual byte, without considering the…

Machine Learning · Computer Science 2024-12-31 Tun-Chieh Lou, Chung-Che Wang, Jyh-Shing Roger Jang, Henian Li, Lang Lin, Norman Chang

Inference tasks in signal processing are often characterized by the availability of reliable statistical modeling with some missing instance-specific parameters. One conventional approach uses data to estimate these missing parameters and…

Signal Processing · Electrical Eng. & Systems 2023-04-25 Nir Shlezinger, Tirza Routtenberg