Related papers: Guitar Tone Morphing by Diffusion-based Model

Mix2Morph: Learning Sound Morphing from Noisy Mixes

We introduce Mix2Morph, a text-to-audio diffusion model fine-tuned to perform sound morphing without a dedicated dataset of morphs. By finetuning on noisy surrogate mixes at higher diffusion timesteps, Mix2Morph yields stable, perceptually…

Sound · Computer Science 2026-01-29 Annie Chu , Hugo Flores García , Oriol Nieto , Justin Salamon , Bryan Pardo , Prem Seetharaman

MorphFader: Enabling Fine-grained Controllable Morphing with Text-to-Audio Models

Sound morphing is the process of gradually and smoothly transforming one sound into another to generate novel and perceptually hybrid sounds that simultaneously resemble both. Recently, diffusion-based text-to-audio models have produced…

Audio and Speech Processing · Electrical Eng. & Systems 2024-08-15 Purnima Kamath , Chitralekha Gupta , Suranga Nanayakkara

Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models

Breakthroughs in text-to-music generation models are transforming the creative landscape, equipping musicians with innovative tools for composition and experimentation like never before. However, controlling the generation process to…

Sound · Computer Science 2025-06-19 Teysir Baoueb , Xiaoyu Bie , Xi Wang , Gaël Richard

SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model

We present SoundMorpher, an open-world sound morphing method designed to generate perceptually uniform morphing trajectories. Traditional sound morphing techniques typically assume a linear relationship between the morphing factor and sound…

Sound · Computer Science 2024-12-17 Xinlei Niu , Jing Zhang , Charles Patrick Martin

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

We present FreeMorph, the first tuning-free method for image morphing that accommodates inputs with different semantics or layouts. Unlike existing methods that rely on finetuning pre-trained diffusion models and are limited by time…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Yukang Cao , Chenyang Si , Jinghao Wang , Ziwei Liu

GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer

Music generation in the audio domain using artificial intelligence (AI) has witnessed steady progress in recent years. However for some instruments, particularly the guitar, controllable instrument synthesis remains limited in expressivity.…

Sound · Computer Science 2025-10-28 Jackson Loth , Pedro Sarmento , Mark Sandler , Mathieu Barthet

Diffusion Timbre Transfer Via Mutual Information Guided Inpainting

We study timbre transfer as an inference-time editing problem for music audio. Starting from a strong pre-trained latent diffusion model, we introduce a lightweight procedure that requires no additional training: (i) a dimension-wise noise…

Sound · Computer Science 2026-01-29 Ching Ho Lee , Javier Nistal , Stefan Lattner , Marco Pasini , George Fazekas

DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing

Diffusion models have achieved remarkable image generation quality surpassing previous generative models. However, a notable limitation of diffusion models, in comparison to GANs, is their difficulty in smoothly interpolating between two…

Computer Vision and Pattern Recognition · Computer Science 2023-12-13 Kaiwen Zhang , Yifan Zhou , Xudong Xu , Xingang Pan , Bo Dai

Learning Perceptually Relevant Temporal Envelope Morphing

Temporal envelope morphing, the process of interpolating between the amplitude dynamics of two audio signals, is an emerging problem in generative audio systems that lacks sufficient perceptual grounding. Morphing of temporal envelopes in a…

Sound · Computer Science 2025-11-25 Satvik Dixit , Sungjoon Park , Chris Donahue , Laurie M. Heller

Playing Music in Just Intonation - A Dynamically Adapting Tuning Scheme

We investigate a dynamically adapting tuning scheme for microtonal tuning of musical instruments, allowing the performer to play music in just intonation in any key. Unlike other methods, which are based on a procedural analysis of the…

Popular Physics · Physics 2018-06-12 Karolin Stange , Christoph Wick , Haye Hinrichsen

A Machine Learning Approach for MIDI to Guitar Tablature Conversion

Guitar tablature transcription consists in deducing the string and the fret number on which each note should be played to reproduce the actual musical part. This assignment should lead to playable string-fret combinations throughout the…

Sound · Computer Science 2025-10-15 Maximos Kaliakatsos-Papakostas , Gregoris Bastas , Dimos Makris , Dorien Herremans , Vassilis Katsouros , Petros Maragos

Music Style Transfer with Time-Varying Inversion of Diffusion Models

With the development of diffusion models, text-guided image style transfer has demonstrated high-quality controllable synthesis results. However, the utilization of text for diverse music style transfer poses significant challenges,…

Sound · Computer Science 2024-02-22 Sifei Li , Yuxin Zhang , Fan Tang , Chongyang Ma , Weiming dong , Changsheng Xu

Rock Guitar Tablature Generation via Natural Language Processing

Deep learning has recently empowered and democratized generative modeling of images and text, with additional concurrent works exploring the possibility of generating more complex forms of data, such as audio. However, the high…

Audio and Speech Processing · Electrical Eng. & Systems 2023-02-02 Josue Casco-Rodriguez

MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling

Guitar tablatures enrich the structure of traditional music notation by assigning each note to a string and fret of a guitar in a particular tuning, indicating precisely where to play the note on the instrument. The problem of generating…

Sound · Computer Science 2024-08-12 Drew Edwards , Xavier Riley , Pedro Sarmento , Simon Dixon

Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education

We generalized a voice morphing algorithm capable of handling temporally variable, multiple-attributes, and multiple instances. The generalized morphing provides a new strategy for investigating speech diversity. However, excessive…

Human-Computer Interaction · Computer Science 2024-04-23 Hideki Kawahara , Masanori Morise

Intonation and Compensation of Fretted String Instruments

In this paper we present mathematical and physical models to be used in the analysis of the problem of intonation of musical instruments such as guitars, mandolins and the like, i.e., we study how to improve the tuning on these instruments.…

Classical Physics · Physics 2010-01-26 Gabriele U. Varieschi , Christina M. Gower

Research on Piano Timbre Transformation System Based on Diffusion Model

We propose a timbre conversion model based on the Diffusion architecture de-signed to precisely translate music played by various instruments into piano ver-sions. The model employs a Pitch Encoder and Loudness Encoder to extract pitch and…

Sound · Computer Science 2026-01-15 Chun-Chieh Hsu , Tsai-Ling Hsu , Chen-Chen Yeh , Shao-Chien Lu , Cheng-Han Wu , Bing-Ze Liu , Timothy K. Shih , Yu-Cheng Lin

Deep Layered Learning in MIR

Deep learning has boosted the performance of many music information retrieval (MIR) systems in recent years. Yet, the complex hierarchical arrangement of music makes end-to-end learning hard for some MIR tasks - a very deep and flexible…

Sound · Computer Science 2018-12-11 Anders Elowsson

Improving Musical Accompaniment Co-creation via Diffusion Transformers

Building upon Diff-A-Riff, a latent diffusion model for musical instrument accompaniment generation, we present a series of improvements targeting quality, diversity, inference speed, and text-driven control. First, we upgrade the…

Sound · Computer Science 2024-10-31 Javier Nistal , Marco Pasini , Stefan Lattner

Demo of Zero-Shot Guitar Amplifier Modelling: Enhancing Modeling with Hyper Neural Networks

Electric guitar tone modeling typically focuses on the non-linear transformation from clean to amplifier-rendered audio. Traditional methods rely on one-to-one mappings, incorporating device parameters into neural models to replicate…

Sound · Computer Science 2024-10-08 Yu-Hua Chen , Yuan-Chiao Cheng , Yen-Tung Yeh , Jui-Te Wu , Yu-Hsiang Ho , Jyh-Shing Roger Jang , Yi-Hsuan Yang