Related papers: DSP Based System for Real time Voice Synthesis App…

Speech Synthesis and Control Using Differentiable DSP

Modern text-to-speech systems are able to produce natural and high-quality speech, but speech contains factors of variation (e.g. pitch, rhythm, loudness, timbre) that text alone cannot contain. In this work we move towards a speech…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-29 Giorgio Fabbro, Vladimir Golkov, Thomas Kemp, Daniel Cremers

On-device neural speech synthesis

Recent advances in text-to-speech (TTS) synthesis, such as Tacotron and WaveRNN, have made it possible to construct a fully neural network based TTS system, by coupling the two components together. Such a system is conceptually simple as it…

Audio and Speech Processing · Electrical Eng. & Systems 2021-09-21 Sivanand Achanta, Albert Antony, Ladan Golipour, Jiangchuan Li, Tuomo Raitio, Ramya Rasipuram, Francesco Rossi, Jennifer Shi, Jaimin Upadhyay, David Winarsky, Hepeng Zhang

An Intuitive Design Approach For Implementing Real Time Audio Effects

Audio effect implementation on a random musical signal is a basic application of digital signal processors. In this paper, the compatibility features of MATLAB R2008a with Code Composer Studio 3.3 have been exploited to develop Simulink models…

Sound · Computer Science 2013-11-05 Mayukh Mukhopadhyay, Om Ranjan

An overview of text-to-speech systems and media applications

Producing synthetic voice, similar to human-like sound, is an emerging novelty of modern interactive media systems. Text-To-Speech (TTS) systems try to generate synthetic and authentic voices via text input. Besides, well known and familiar…

Audio and Speech Processing · Electrical Eng. & Systems 2023-10-24 Mohammad Reza Hasanabadi

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control

In this paper, a text-to-rapping/singing system is introduced, which can be adapted to any speaker's voice. It utilizes a Tacotron-based multispeaker acoustic model trained on read-only speech data and which provides prosody control at the…

Sound · Computer Science 2021-11-18 Konstantinos Markopoulos, Nikolaos Ellinas, Alexandra Vioni, Myrsini Christidou, Panos Kakoulidis, Georgios Vamvoukakis, Georgia Maniati, June Sig Sung, Hyoungmin Park, Pirros Tsiakoulis, Aimilios Chalamandaris

High-Quality Vocoding Design with Signal Processing for Speech Synthesis and Voice Conversion

This Ph.D. thesis focuses on developing a system for high-quality speech synthesis and voice conversion. Vocoder-based speech analysis, manipulation, and synthesis plays a crucial role in various kinds of statistical parametric speech…

Sound · Computer Science 2021-01-26 Mohammed Salah Al-Radhi

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

Speech synthesis and music audio generation from symbolic input differ in many aspects but share some similarities. In this study, we investigate how text-to-speech synthesis techniques can be used for piano MIDI-to-audio synthesis tasks.…

Sound · Computer Science 2022-02-25 Erica Cooper, Xin Wang, Junichi Yamagishi

DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing

Controlling the variations of sound effects using neural audio synthesis models has been a difficult task. Differentiable digital signal processing (DDSP) provides a lightweight solution that achieves high-quality sound synthesis while…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-18 Yunyi Liu, Craig Jin, David Gunawan

Techniques and Challenges in Speech Synthesis

The aim of this project was to develop and implement an English language Text-to-Speech synthesis system. This involved a study of mechanisms of human speech production, a review of techniques in speech synthesis, and analysis of tests used…

Sound · Computer Science 2017-09-25 David Ferris

High-level synthesis under I/O Timing and Memory constraints

The design of complex Systems-on-Chips requires taking into account communication and memory access constraints for the integration of dedicated hardware accelerators. In this paper, we present a methodology and a tool that allow the…

Hardware Architecture · Computer Science 2016-08-16 Philippe Coussy, Gwenolé Corre, Pierre Bomel, Eric Senn, Eric Martin

Deep Voice: Real-time Neural Text-to-Speech

We present Deep Voice, a production-quality text-to-speech system constructed entirely from deep neural networks. Deep Voice lays the groundwork for truly end-to-end neural speech synthesis. The system comprises five major building blocks:…

Computation and Language · Computer Science 2017-03-09 Sercan O. Arik, Mike Chrzanowski, Adam Coates, Gregory Diamos, Andrew Gibiansky, Yongguo Kang, Xian Li, John Miller, Andrew Ng, Jonathan Raiman, Shubho Sengupta, Mohammad Shoeybi

Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks

The present paper describes singing voice synthesis based on convolutional neural networks (CNNs). Singing voice synthesis systems based on deep neural networks (DNNs) are currently being proposed and are improving the naturalness of…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-23 Kazuhiro Nakamura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda

ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis

In recent years, neural network based methods for multi-speaker text-to-speech synthesis (TTS) have made significant progress. However, the current speaker encoder models used in these methods still cannot capture enough speaker…

Sound · Computer Science 2022-03-29 Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang

Real-time Timbre Transfer and Sound Synthesis using DDSP

Neural audio synthesis is an actively researched topic, having yielded a wide range of techniques that leverage machine learning architectures. Google Magenta elaborated a novel approach called Differentiable Digital Signal Processing (DDSP)…

Sound · Computer Science 2021-03-15 Francesco Ganis, Erik Frej Knudsen, Søren V. K. Lyster, Robin Otterbein, David Südholt, Cumhur Erkut

Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds

A differentiable digital signal processing (DDSP) autoencoder is a musical sound synthesizer that combines a deep neural network (DNN) and spectral modeling synthesis. It allows us to flexibly edit sounds by changing the fundamental…

Sound · Computer Science 2022-02-02 Masaya Kawamura, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo

A Survey on Recent Deep Learning-driven Singing Voice Synthesis Systems

Singing voice synthesis (SVS) is a task that aims to generate audio signals according to musical scores and lyrics. With its multifaceted nature concerning music and language, producing singing voices indistinguishable from that of human…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-07 Yin-Ping Cho, Fu-Rong Yang, Yung-Chuan Chang, Ching-Ting Cheng, Xiao-Han Wang, Yi-Wen Liu

Exact Enumeration of Two-Dimensional Closed Random Paths Using a DSP Processor

The aim of this paper is to show that Digital Signal Processors (DSPs) can be used to efficiently implement complex algorithms. As an example we have chosen the problem of enumerating closed two-dimensional random paths. An Evaluation…

Computational Physics · Physics 2007-05-23 B. Afsari, N. Sadeghi-Meybodi, S. Rouhani

Generating sound effects with controllable variations is a challenging task, traditionally addressed using sophisticated physical models that require in-depth knowledge of signal processing parameters and algorithms. In the era of…

Sound · Computer Science 2024-12-30 Yunyi Liu, Craig Jin

Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters

Vocoders have received renewed attention as main components in statistical parametric text-to-speech (TTS) synthesis and speech transformation systems. Even though there are vocoding techniques that give almost acceptable synthesized speech, their high…

Sound · Computer Science 2021-06-22 Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh

Embedded digital phase noise analyzer for optical frequency metrology

Digital signal processing (DSP) is supporting novel in-field applications of optical interferometry, such as in laser ranging and distributed acoustic sensing. While the highest performances are achieved with field-programmable gate arrays…

Signal Processing · Electrical Eng. & Systems 2023-08-08 Simone Donadello, Elio K. Bertacco, Davide Calonico, Cecilia Clivati