English
Related papers

Related papers: Guitar Tone Morphing by Diffusion-based Model

200 papers

We introduce Mix2Morph, a text-to-audio diffusion model fine-tuned to perform sound morphing without a dedicated dataset of morphs. By finetuning on noisy surrogate mixes at higher diffusion timesteps, Mix2Morph yields stable, perceptually…

Sound · Computer Science 2026-01-29 Annie Chu , Hugo Flores García , Oriol Nieto , Justin Salamon , Bryan Pardo , Prem Seetharaman

Sound morphing is the process of gradually and smoothly transforming one sound into another to generate novel and perceptually hybrid sounds that simultaneously resemble both. Recently, diffusion-based text-to-audio models have produced…

Audio and Speech Processing · Electrical Eng. & Systems 2024-08-15 Purnima Kamath , Chitralekha Gupta , Suranga Nanayakkara

Breakthroughs in text-to-music generation models are transforming the creative landscape, equipping musicians with innovative tools for composition and experimentation like never before. However, controlling the generation process to…

Sound · Computer Science 2025-06-19 Teysir Baoueb , Xiaoyu Bie , Xi Wang , Gaël Richard

We present SoundMorpher, an open-world sound morphing method designed to generate perceptually uniform morphing trajectories. Traditional sound morphing techniques typically assume a linear relationship between the morphing factor and sound…

Sound · Computer Science 2024-12-17 Xinlei Niu , Jing Zhang , Charles Patrick Martin

We present FreeMorph, the first tuning-free method for image morphing that accommodates inputs with different semantics or layouts. Unlike existing methods that rely on finetuning pre-trained diffusion models and are limited by time…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Yukang Cao , Chenyang Si , Jinghao Wang , Ziwei Liu

Music generation in the audio domain using artificial intelligence (AI) has witnessed steady progress in recent years. However for some instruments, particularly the guitar, controllable instrument synthesis remains limited in expressivity.…

Sound · Computer Science 2025-10-28 Jackson Loth , Pedro Sarmento , Mark Sandler , Mathieu Barthet

We study timbre transfer as an inference-time editing problem for music audio. Starting from a strong pre-trained latent diffusion model, we introduce a lightweight procedure that requires no additional training: (i) a dimension-wise noise…

Sound · Computer Science 2026-01-29 Ching Ho Lee , Javier Nistal , Stefan Lattner , Marco Pasini , George Fazekas

Diffusion models have achieved remarkable image generation quality surpassing previous generative models. However, a notable limitation of diffusion models, in comparison to GANs, is their difficulty in smoothly interpolating between two…

Computer Vision and Pattern Recognition · Computer Science 2023-12-13 Kaiwen Zhang , Yifan Zhou , Xudong Xu , Xingang Pan , Bo Dai

Temporal envelope morphing, the process of interpolating between the amplitude dynamics of two audio signals, is an emerging problem in generative audio systems that lacks sufficient perceptual grounding. Morphing of temporal envelopes in a…

Sound · Computer Science 2025-11-25 Satvik Dixit , Sungjoon Park , Chris Donahue , Laurie M. Heller

We investigate a dynamically adapting tuning scheme for microtonal tuning of musical instruments, allowing the performer to play music in just intonation in any key. Unlike other methods, which are based on a procedural analysis of the…

Popular Physics · Physics 2018-06-12 Karolin Stange , Christoph Wick , Haye Hinrichsen

Guitar tablature transcription consists in deducing the string and the fret number on which each note should be played to reproduce the actual musical part. This assignment should lead to playable string-fret combinations throughout the…

With the development of diffusion models, text-guided image style transfer has demonstrated high-quality controllable synthesis results. However, the utilization of text for diverse music style transfer poses significant challenges,…

Sound · Computer Science 2024-02-22 Sifei Li , Yuxin Zhang , Fan Tang , Chongyang Ma , Weiming dong , Changsheng Xu

Deep learning has recently empowered and democratized generative modeling of images and text, with additional concurrent works exploring the possibility of generating more complex forms of data, such as audio. However, the high…

Audio and Speech Processing · Electrical Eng. & Systems 2023-02-02 Josue Casco-Rodriguez

Guitar tablatures enrich the structure of traditional music notation by assigning each note to a string and fret of a guitar in a particular tuning, indicating precisely where to play the note on the instrument. The problem of generating…

Sound · Computer Science 2024-08-12 Drew Edwards , Xavier Riley , Pedro Sarmento , Simon Dixon

We generalized a voice morphing algorithm capable of handling temporally variable, multiple-attributes, and multiple instances. The generalized morphing provides a new strategy for investigating speech diversity. However, excessive…

Human-Computer Interaction · Computer Science 2024-04-23 Hideki Kawahara , Masanori Morise

In this paper we present mathematical and physical models to be used in the analysis of the problem of intonation of musical instruments such as guitars, mandolins and the like, i.e., we study how to improve the tuning on these instruments.…

Classical Physics · Physics 2010-01-26 Gabriele U. Varieschi , Christina M. Gower

We propose a timbre conversion model based on the Diffusion architecture de-signed to precisely translate music played by various instruments into piano ver-sions. The model employs a Pitch Encoder and Loudness Encoder to extract pitch and…

Deep learning has boosted the performance of many music information retrieval (MIR) systems in recent years. Yet, the complex hierarchical arrangement of music makes end-to-end learning hard for some MIR tasks - a very deep and flexible…

Sound · Computer Science 2018-12-11 Anders Elowsson

Building upon Diff-A-Riff, a latent diffusion model for musical instrument accompaniment generation, we present a series of improvements targeting quality, diversity, inference speed, and text-driven control. First, we upgrade the…

Sound · Computer Science 2024-10-31 Javier Nistal , Marco Pasini , Stefan Lattner

Electric guitar tone modeling typically focuses on the non-linear transformation from clean to amplifier-rendered audio. Traditional methods rely on one-to-one mappings, incorporating device parameters into neural models to replicate…

‹ Prev 1 2 3 10 Next ›