Related papers: Towards Error-Resilient Neural Speech Coding

200 papers

Error resilient tools like Packet Loss Concealment (PLC) and Forward Error Correction (FEC) are essential to maintain a reliable speech communication for applications like Voice over Internet Protocol (VoIP), where packets are frequently…

Audio and Speech Processing · Electrical Eng. & Systems 2025-05-23 Kishan Gupta, Nicola Pia, Srikanth Korse, Andreas Brendel, Guillaume Fuchs, Markus Multrus

As deep speech enhancement algorithms have recently demonstrated capabilities greatly surpassing their traditional counterparts for suppressing noise, reverberation and echo, attention is turning to the problem of packet loss concealment…

Audio and Speech Processing · Electrical Eng. & Systems 2022-05-13 Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy

Packet loss concealment (PLC) is a tool for enhancing speech degradation caused by poor network conditions or underflow/overflow in audio processing pipelines. We propose a real-time recurrent method that leverages previous outputs to…

Sound · Computer Science 2023-05-15 Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong

The packet loss problem seriously affects the quality of service in Voice over IP (VoIP) scenarios. In this paper, we investigated online receiver-based packet loss concealment which is much more portable and applicable. For ensuring the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-03-29 Yao Zhou, Changchun Bao

This paper introduces a real-time time-domain packet loss concealment (PLC) neural-network (tPLCnet). It efficiently predicts lost frames from a short context buffer in a sequence-to-one (seq2one) fashion. Because of its seq2one structure,…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-05 Nils L. Westhausen, Bernd T. Meyer

This paper focuses on the AMR-WB G.722.2 speech codec, and discusses the unused bandwidth resources of the senders by using a Word16 (16-bit) to encode the sent frames. A packet loss concealment (PLC) method for G.722.2 speech codec is proposed…

Networking and Internet Architecture · Computer Science 2019-02-07 Tarek Gueham, Fatiha Merazka

Neural audio/speech coding has recently demonstrated its capability to deliver high quality at much lower bitrates than traditional methods. However, existing neural audio/speech codecs employ either acoustic features or learned blind…

Sound · Computer Science 2025-10-16 Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu

Real-time communications in packet-switched networks have become widely used in daily communication, while they inevitably suffer from network delays and data losses in constrained real-time conditions. To solve these problems, audio packet…

Sound · Computer Science 2022-07-05 Yuansheng Guan, Guochen Yu, Andong Li, Chengshi Zheng, Jie Wang

This paper explores the integration of model-based and data-driven approaches within the realm of neural speech and audio coding systems. It highlights the challenges posed by the subjective evaluation processes of speech and audio codecs…

Sound · Computer Science 2025-01-08 Minje Kim, Jan Skoglund

Packet loss is a common problem in data transmission, including speech data transmission. This may affect a wide range of applications that stream audio data, like streaming applications or speech emotion recognition (SER). Packet Loss…

Audio and Speech Processing · Electrical Eng. & Systems 2020-05-19 Mostafa M. Mohamed, Björn W. Schuller

Conventional audio coding technologies commonly leverage human perception of sound, or psychoacoustics, to reduce the bitrate while preserving the perceptual quality of the decoded audio signals. For neural audio codecs, however, the…

Sound · Computer Science 2021-01-05 Kai Zhen, Mi Suk Lee, Jongmo Sung, Seungkwon Beack, Minje Kim

Audio coding is an essential module in the real-time communication system. Neural audio codecs can compress audio samples with a low bitrate due to the strong modeling and generative capabilities of deep neural networks. To address the poor…

Sound · Computer Science 2023-10-18 Wenzhe Liu, Wei Xiao, Meng Wang, Shan Yang, Yupeng Shi, Yuyong Kang, Dan Su, Shidong Shang, Dong Yu

Error correction codes are a crucial part of the physical communication layer, ensuring the reliable transfer of data over noisy channels. The design of optimal linear block codes capable of being efficiently decoded is of major concern,…

Information Theory · Computer Science 2024-05-08 Yoni Choukroun, Lior Wolf

Neural audio codecs have revolutionized audio processing by enabling speech tasks to be performed on highly compressed representations. Recent work has shown that speech separation can be achieved within these compressed domains, offering…

Audio and Speech Processing · Electrical Eng. & Systems 2024-11-28 Jia Qi Yip, Chin Yuen Kwok, Bin Ma, Eng Siong Chng

Communication technologies like voice over IP operate under constrained real-time conditions, with voice packets being subject to delays and losses from the network. In such cases, the packet loss concealment (PLC) algorithm reconstructs…

Sound · Computer Science 2021-07-09 Santiago Pascual, Joan Serrà, Jordi Pons

Estimating time-frequency domain masks for single-channel speech enhancement using deep learning methods has recently become a popular research field with promising results. In this paper, we propose a novel components loss (CL) for the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-15 Ziyi Xu, Samy Elshamy, Ziyue Zhao, Tim Fingscheidt

In this paper, we propose a neural-based coding scheme in which an artificial neural network is exploited to automatically compress and decompress speech signals by a trainable approach. Having a two-stage training phase, the system can be…

Sound · Computer Science 2016-01-25 Mahmood Yousefi-Azar, Farbod Razzazi

This paper proposes a novel approach for speech signal prediction based on a recurrent neural network (RNN). Unlike existing RNN-based predictors, which operate on parametric features and are trained offline on a large collection of such…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-17 Reza Lotfidereshgi, Philippe Gournay

Neural speech codecs have demonstrated their ability to compress high-quality speech and audio by converting them into discrete token representations. Most existing methods utilize Residual Vector Quantization (RVQ) to encode speech into…

Sound · Computer Science 2024-10-22 Peiji Yang, Fengping Wang, Yicheng Zhong, Huawei Wei, Zhisheng Wang

Real-time speech communication over wireless networks remains challenging, as conventional channel protection mechanisms cannot effectively counter packet loss under stringent bandwidth and latency constraints. Semantic communication has…

Sound · Computer Science 2025-12-10 Zhuohang Han, Jincheng Dai, Shengshi Yao, Junyi Wang, Yanlong Li, Kai Niu, Wenjun Xu, Ping Zhang