Related papers: Embodied Self-supervised Learning by Coordinated S…

Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

Learning based methods are now ubiquitous for solving inverse problems, but their deployment in real-world applications is often hindered by the lack of ground truth references for training. Recent self-supervised learning strategies offer…

Image and Video Processing · Electrical Eng. & Systems 2026-02-27 Victor Sechaud , Laurent Jacques , Patrice Abry , Julián Tachella

Equivariant Splitting: Self-supervised learning from incomplete data

Self-supervised learning for inverse problems allows to train a reconstruction network from noise and/or incomplete data alone. These methods have the potential of enabling learning-based solutions when obtaining ground-truth references for…

Computer Vision and Pattern Recognition · Computer Science 2026-05-14 Victor Sechaud , Jérémy Scanvic , Quentin Barthélemy , Patrice Abry , Julián Tachella

Self-Supervised Learning from Noisy and Incomplete Data

Many important problems in science and engineering involve inferring a signal from noisy and/or incomplete observations, where the observation process is known. Historically, this problem has been tackled using hand-crafted regularization…

Machine Learning · Statistics 2026-01-07 Julián Tachella , Mike Davies

Equivariance-based self-supervised learning for audio signal recovery from clipped measurements

In numerous inverse problems, state-of-the-art solving strategies involve training neural networks from ground truth and associated measurement datasets that, however, may be expensive or impossible to collect. Recently, self-supervised…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-25 Victor Sechaud , Laurent Jacques , Patrice Abry , Julián Tachella

Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation

We propose a computational model of speech production combining a pre-trained neural articulatory synthesizer able to reproduce complex speech stimuli from a limited set of interpretable articulatory parameters, a DNN-based internal forward…

Sound · Computer Science 2022-04-06 Marc-Antoine Georges , Julien Diard , Laurent Girin , Jean-Luc Schwartz , Thomas Hueber

Adversarially learned iterative reconstruction for imaging inverse problems

In numerous practical applications, especially in medical image reconstruction, it is often infeasible to obtain a large ensemble of ground-truth/measurement pairs for supervised learning. Therefore, it is imperative to develop unsupervised…

Image and Video Processing · Electrical Eng. & Systems 2021-03-31 Subhadip Mukherjee , Ozan Öktem , Carola-Bibiane Schönlieb

Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

Learning good representations without supervision is still an open issue in machine learning, and is particularly challenging for speech signals, which are often characterized by long sequences with a complex hierarchical structure. Some…

Machine Learning · Computer Science 2019-04-09 Santiago Pascual , Mirco Ravanelli , Joan Serrà , Antonio Bonafonte , Yoshua Bengio

From perception to production: how acoustic invariance facilitates articulatory learning in a self-supervised vocal imitation model

Human infants face a formidable challenge in speech acquisition: mapping extremely variable acoustic inputs into appropriate articulatory movements without explicit instruction. We present a computational model that addresses the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-09-16 Marvin Lavechin , Thomas Hueber

Self-supervised learning of inverse problem solvers in medical imaging

In the past few years, deep learning-based methods have demonstrated enormous success for solving inverse problems in medical imaging. In this work, we address the following question:\textit{Given a set of measurements obtained from real…

Image and Video Processing · Electrical Eng. & Systems 2019-05-24 Ortal Senouf , Sanketh Vedula , Tomer Weiss , Alex Bronstein , Oleg Michailovich , Michael Zibulevsky

Self-Supervised Speech Representation Learning: A Review

Although supervised deep learning has revolutionized speech and audio processing, it has necessitated the building of specialist models for individual tasks and application scenarios. It is likewise difficult to apply this to dialects and…

Computation and Language · Computer Science 2022-11-23 Abdelrahman Mohamed , Hung-yi Lee , Lasse Borgholt , Jakob D. Havtorn , Joakim Edin , Christian Igel , Katrin Kirchhoff , Shang-Wen Li , Karen Livescu , Lars Maaløe , Tara N. Sainath , Shinji Watanabe

Self-supervised Learning with Physics-aware Neural Networks I: Galaxy Model Fitting

Estimating the parameters of a model describing a set of observations using a neural network is in general solved in a supervised way. In cases when we do not have access to the model's true parameters this approach can not be applied.…

Astrophysics of Galaxies · Physics 2020-09-30 Miguel A. Aragon-Calvo

Self-supervised learning for phase retrieval

In recent years, deep neural networks have emerged as a solution for inverse imaging problems. These networks are generally trained using pairs of images: one degraded and the other of high quality, the latter being called 'ground truth'.…

Information Retrieval · Computer Science 2025-10-01 Victor Sechaud , Patrice Abry , Laurent Jacques , Julián Tachella

Self-supervised Learning for Speech Enhancement

Supervised learning for single-channel speech enhancement requires carefully labeled training examples where the noisy mixture is input into the network and the network is trained to produce an output close to the ideal target. To relax the…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-19 Yu-Che Wang , Shrikant Venkataramani , Paris Smaragdis

Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision

The goal of this work is to train discriminative cross-modal embeddings without access to manually annotated data. Recent advances in self-supervised learning have shown that effective representations can be learnt from natural cross-modal…

Sound · Computer Science 2020-11-05 Soo-Whan Chung , Hong Goo Kang , Joon Son Chung

An iterative framework for self-supervised deep speaker representation learning

In this paper, we propose an iterative framework for self-supervised speaker representation learning based on a deep neural network (DNN). The framework starts with training a self-supervision speaker embedding network by maximizing…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-29 Danwei Cai , Weiqing Wang , Ming Li

Continual Robot Learning using Self-Supervised Task Inference

Endowing robots with the human ability to learn a growing set of skills over the course of a lifetime as opposed to mastering single tasks is an open problem in robot learning. While multi-task learning approaches have been proposed to…

Robotics · Computer Science 2023-09-19 Muhammad Burhan Hafez , Stefan Wermter

In-Context Symmetries: Self-Supervised Learning through Contextual World Models

At the core of self-supervised learning for vision is the idea of learning invariant or equivariant representations with respect to a set of data transformations. This approach, however, introduces strong inductive biases, which can render…

Machine Learning · Computer Science 2024-05-29 Sharut Gupta , Chenyu Wang , Yifei Wang , Tommi Jaakkola , Stefanie Jegelka

Visually Guided Self Supervised Learning of Speech Representations

Self supervised representation learning has recently attracted a lot of research interest for both the audio and visual modalities. However, most works typically focus on a particular modality or feature alone and there has been very…

Audio and Speech Processing · Electrical Eng. & Systems 2020-02-21 Abhinav Shukla , Konstantinos Vougioukas , Pingchuan Ma , Stavros Petridis , Maja Pantic

Learning to Compute the Articulatory Representations of Speech with the MIRRORNET

Most organisms including humans function by coordinating and integrating sensory signals with motor actions to survive and accomplish desired tasks. Learning these complex sensorimotor mappings proceeds simultaneously and often in an…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-26 Yashish M. Siriwardena , Carol Espy-Wilson , Shihab Shamma

Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking

Self-supervised pre-training using so-called "pretext" tasks has recently shown impressive performance across a wide range of modalities. In this work, we advance self-supervised learning from permutations, by pre-training a model to…

Sound · Computer Science 2021-05-05 Andrew N Carr , Quentin Berthet , Mathieu Blondel , Olivier Teboul , Neil Zeghidour