Related papers: DDSP: Differentiable Digital Signal Processing

DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing

Controlling the variations of sound effects using neural audio synthesis models has been a difficult task. Differentiable digital signal processing (DDSP) provides a lightweight solution that achieves high-quality sound synthesis while…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-18 Yunyi Liu , Craig Jin , David Gunawan

Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds

A differentiable digital signal processing (DDSP) autoencoder is a musical sound synthesizer that combines a deep neural network (DNN) and spectral modeling synthesis. It allows us to flexibly edit sounds by changing the fundamental…

Sound · Computer Science 2022-02-02 Masaya Kawamura , Tomohiko Nakamura , Daichi Kitamura , Hiroshi Saruwatari , Yu Takahashi , Kazunobu Kondo

Modulation Discovery with Differentiable Digital Signal Processing

Modulations are a critical part of sound design and music production, enabling the creation of complex and evolving audio. Modern synthesizers provide envelopes, low frequency oscillators (LFOs), and more parameter automation tools that…

Sound · Computer Science 2025-10-08 Christopher Mitcheltree , Hao Hao Tan , Joshua D. Reiss

Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis

Differentiable digital signal processing (DDSP) techniques, including methods for audio synthesis, have gained attention in recent years and lend themselves to interpretability in the parameter space. However, current differentiable…

Sound · Computer Science 2023-09-14 Jordie Shier , Franco Caspe , Andrew Robertson , Mark Sandler , Charalampos Saitis , Andrew McPherson

Real-time Timbre Transfer and Sound Synthesis using DDSP

Neural audio synthesis is an actively researched topic, having yielded a wide range of techniques that leverages machine learning architectures. Google Magenta elaborated a novel approach called Differential Digital Signal Processing (DDSP)…

Sound · Computer Science 2021-03-15 Francesco Ganis , Erik Frej Knudesn , Søren V. K. Lyster , Robin Otterbein , David Südholt , Cumhur Erkut

DDSP Guitar Amp: Interpretable Guitar Amplifier Modeling

Neural network models for guitar amplifier emulation, while being effective, often demand high computational cost and lack interpretability. Drawing ideas from physical amplifier design, this paper aims to address these issues with a new…

Sound · Computer Science 2024-08-22 Yen-Tung Yeh , Yu-Hua Chen , Yuan-Chiao Cheng , Jui-Te Wu , Jun-Jie Fu , Yi-Fan Yeh , Yi-Hsuan Yang

MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling

Musical expression requires control of both what notes are played, and how they are performed. Conventional audio synthesizers provide detailed expressive controls, but at the cost of realism. Black-box neural audio synthesis and…

Sound · Computer Science 2022-03-21 Yusong Wu , Ethan Manilow , Yi Deng , Rigel Swavely , Kyle Kastner , Tim Cooijmans , Aaron Courville , Cheng-Zhi Anna Huang , Jesse Engel

DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition

The performances of automatic speech recognition (ASR) systems degrade drastically under noisy conditions. Explicit distortion modelling (EDM), as a feature compensation step, is able to enhance ASR systems under such conditions by…

Audio and Speech Processing · Electrical Eng. & Systems 2022-08-02 Z. Guo , C. Chen , E. S. Chng

Vocal Timbre Effects with Differentiable Digital Signal Processing

We explore two approaches to creatively altering vocal timbre using Differentiable Digital Signal Processing (DDSP). The first approach is inspired by classic cross-synthesis techniques. A pretrained DDSP decoder predicts a filter for a…

Sound · Computer Science 2023-06-21 David Südholt , Cumhur Erkut

DDX7: Differentiable FM Synthesis of Musical Instrument Sounds

FM Synthesis is a well-known algorithm used to generate complex timbre from a compact set of design primitives. Typically featuring a MIDI interface, it is usually impractical to control it from an audio source. On the other hand,…

Sound · Computer Science 2022-08-15 Franco Caspe , Andrew McPherson , Mark Sandler

A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis

The term "differentiable digital signal processing" describes a family of techniques in which loss function gradients are backpropagated through digital signal processors, facilitating their integration into neural networks. This article…

Sound · Computer Science 2023-08-30 Ben Hayes , Jordie Shier , György Fazekas , Andrew McPherson , Charalampos Saitis

Continuous descriptor-based control for deep audio synthesis

Despite significant advances in deep models for music generation, the use of these techniques remains restricted to expert users. Before being democratized among musicians, generative models must first provide expressive control over the…

Sound · Computer Science 2023-02-28 Ninon Devis , Nils Demerlé , Sarah Nabi , David Genova , Philippe Esling

Differentiable Dictionary Search: Integrating Linear Mixing with Deep Non-Linear Modelling for Audio Source Separation

This paper describes several improvements to a new method for signal decomposition that we recently formulated under the name of Differentiable Dictionary Search (DDS). The fundamental idea of DDS is to exploit a class of powerful deep…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-29 Lukáš Samuel Marták , Rainer Kelz , Gerhard Widmer

Latent Space Explorations of Singing Voice Synthesis using DDSP

Machine learning based singing voice models require large datasets and lengthy training times. In this work we present a lightweight architecture, based on the Differentiable Digital Signal Processing (DDSP) library, that is able to output…

Sound · Computer Science 2021-03-15 Juan Alonso , Cumhur Erkut

Style Transfer of Audio Effects with Differentiable Signal Processing

We present a framework that can impose the audio effects and production style from one recording to another by example with the goal of simplifying the audio production process. We train a deep neural network to analyze an input recording…

Sound · Computer Science 2022-07-19 Christian J. Steinmetz , Nicholas J. Bryan , Joshua D. Reiss

Generative Deep Learning and Signal Processing for Data Augmentation of Cardiac Auscultation Signals: Improving Model Robustness Using Synthetic Audio

Accurately interpreting cardiac auscultation signals plays a crucial role in diagnosing and managing cardiovascular diseases. However, the paucity of labelled data inhibits classification models' training. Researchers have turned to…

Sound · Computer Science 2025-06-18 Leigh Abbott , Milan Marocchi , Matthew Fynn , Yue Rong , Sven Nordholm

Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes

Multi-speaker speech synthesis is a technique for modeling multiple speakers' voices with a single model. Although many approaches using deep neural networks (DNNs) have been proposed, DNNs are prone to overfitting when the amount of…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-10 Kentaro Mitsui , Tomoki Koriyama , Hiroshi Saruwatari

Generating sound effects with controllable variations is a challenging task, traditionally addressed using sophisticated physical models that require in-depth knowledge of signal processing parameters and algorithms. In the era of…

Sound · Computer Science 2024-12-30 Yunyi Liu , Craig Jin

Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model

The task of bandwidth extension addresses the generation of missing high frequencies of audio signals based on knowledge of the low-frequency part of the sound. This task applies to various problems, such as audio coding or audio…

Sound · Computer Science 2023-11-28 Pierre-Amaury Grumiaux , Mathieu Lagrange

Problems using deep generative models for probabilistic audio source separation

Recent advancements in deep generative modeling make it possible to learn prior distributions from complex data that subsequently can be used for Bayesian inference. However, we find that distributions learned by deep generative models for…

Machine Learning · Computer Science 2020-11-04 Maurice Frank , Maximilian Ilse