Related papers: Towards Error-Resilient Neural Speech Coding

On Improving Error Resilience of Neural End-to-End Speech Coders

Error resilient tools like Packet Loss Concealment (PLC) and Forward Error Correction (FEC) are essential to maintain a reliable speech communication for applications like Voice over Internet Protocol (VoIP), where packets are frequently…

Audio and Speech Processing · Electrical Eng. & Systems · 2025-05-23 · Kishan Gupta, Nicola Pia, Srikanth Korse, Andreas Brendel, Guillaume Fuchs, Markus Multrus

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model

As deep speech enhancement algorithms have recently demonstrated capabilities greatly surpassing their traditional counterparts for suppressing noise, reverberation and echo, attention is turning to the problem of packet loss concealment…

Audio and Speech Processing · Electrical Eng. & Systems · 2022-05-13 · Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy

Improving performance of real-time full-band blind packet-loss concealment with predictive network

Packet loss concealment (PLC) is a tool for enhancing speech degradation caused by poor network conditions or underflow/overflow in audio processing pipelines. We propose a real-time recurrent method that leverages previous outputs to…

Sound · Computer Science · 2023-05-15 · Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong

A Neural Vocoder Based Packet Loss Concealment Algorithm

The packet loss problem seriously affects the quality of service in Voice over IP (VoIP) scenarios. In this paper, we investigated online receiver-based packet loss concealment which is much more portable and applicable. For ensuring the…

Audio and Speech Processing · Electrical Eng. & Systems · 2022-03-29 · Yao Zhou, Changchun Bao

tPLCnet: Real-time Deep Packet Loss Concealment in the Time Domain Using a Short Temporal Context

This paper introduces a real-time time-domain packet loss concealment (PLC) neural-network (tPLCnet). It efficiently predicts lost frames from a short context buffer in a sequence-to-one (seq2one) fashion. Because of its seq2one structure,…

Audio and Speech Processing · Electrical Eng. & Systems · 2022-04-05 · Nils L. Westhausen, Bernd T. Meyer

An Enhanced Interleaving Frame Loss Concealment Method for Voice Over IP Network Services

This paper focuses on the AMR-WB (G.722.2) speech codec, and discusses the unused bandwidth resources of the senders by using a Word16 (16-bit) to encode the sent frames. A packet loss concealment (PLC) method for the G.722.2 speech codec is proposed…

Networking and Internet Architecture · Computer Science · 2019-02-07 · Tarek Gueham, Fatiha Merazka

Latent-Domain Predictive Neural Speech Coding

Neural audio/speech coding has recently demonstrated its capability to deliver high quality at much lower bitrates than traditional methods. However, existing neural audio/speech codecs employ either acoustic features or learned blind…

Sound · Computer Science · 2025-10-16 · Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu

TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network

Real-time communications in packet-switched networks have become widely used in daily communication, while they inevitably suffer from network delays and data losses in constrained real-time conditions. To solve these problems, audio packet…

Sound · Computer Science · 2022-07-05 · Yuansheng Guan, Guochen Yu, Andong Li, Chengshi Zheng, Jie Wang

Neural Speech and Audio Coding: Modern AI Technology Meets Traditional Codecs

This paper explores the integration of model-based and data-driven approaches within the realm of neural speech and audio coding systems. It highlights the challenges posed by the subjective evaluation processes of speech and audio codecs…

Sound · Computer Science · 2025-01-08 · Minje Kim, Jan Skoglund

ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition

Packet loss is a common problem in data transmission, including speech data transmission. This may affect a wide range of applications that stream audio data, like streaming applications or speech emotion recognition (SER). Packet Loss…

Audio and Speech Processing · Electrical Eng. & Systems · 2020-05-19 · Mostafa M. Mohamed, Björn W. Schuller

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding

Conventional audio coding technologies commonly leverage human perception of sound, or psychoacoustics, to reduce the bitrate while preserving the perceptual quality of the decoded audio signals. For neural audio codecs, however, the…

Sound · Computer Science · 2021-01-05 · Kai Zhen, Mi Suk Lee, Jongmo Sung, Seungkwon Beack, Minje Kim

A High Fidelity and Low Complexity Neural Audio Coding

Audio coding is an essential module in the real-time communication system. Neural audio codecs can compress audio samples with a low bitrate due to the strong modeling and generative capabilities of deep neural networks. To address the poor…

Sound · Computer Science · 2023-10-18 · Wenzhe Liu, Wei Xiao, Meng Wang, Shan Yang, Yupeng Shi, Yuyong Kang, Dan Su, Shidong Shang, Dong Yu

Learning Linear Block Error Correction Codes

Error correction codes are a crucial part of the physical communication layer, ensuring the reliable transfer of data over noisy channels. The design of optimal linear block codes capable of being efficiently decoded is of major concern,…

Information Theory · Computer Science · 2024-05-08 · Yoni Choukroun, Lior Wolf

Speech Separation using Neural Audio Codecs with Embedding Loss

Neural audio codecs have revolutionized audio processing by enabling speech tasks to be performed on highly compressed representations. Recent work has shown that speech separation can be achieved within these compressed domains, offering…

Audio and Speech Processing · Electrical Eng. & Systems · 2024-11-28 · Jia Qi Yip, Chin Yuen Kwok, Bin Ma, Eng Siong Chng

Adversarial Auto-Encoding for Packet Loss Concealment

Communication technologies like voice over IP operate under constrained real-time conditions, with voice packets being subject to delays and losses from the network. In such cases, the packet loss concealment (PLC) algorithm reconstructs…

Sound · Computer Science · 2021-07-09 · Santiago Pascual, Joan Serrà, Jordi Pons

Components Loss for Neural Networks in Mask-Based Speech Enhancement

Estimating time-frequency domain masks for single-channel speech enhancement using deep learning methods has recently become a popular research field with promising results. In this paper, we propose a novel components loss (CL) for the…

Audio and Speech Processing · Electrical Eng. & Systems · 2019-08-15 · Ziyi Xu, Samy Elshamy, Ziyue Zhao, Tim Fingscheidt

A Robust Frame-based Nonlinear Prediction System for Automatic Speech Coding

In this paper, we propose a neural-based coding scheme in which an artificial neural network is exploited to automatically compress and decompress speech signals by a trainable approach. Having a two-stage training phase, the system can be…

Sound · Computer Science · 2016-01-25 · Mahmood Yousefi-Azar, Farbod Razzazi

Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment

This paper proposes a novel approach for speech signal prediction based on a recurrent neural network (RNN). Unlike existing RNN-based predictors, which operate on parametric features and are trained offline on a large collection of such…

Audio and Speech Processing · Electrical Eng. & Systems · 2021-11-17 · Reza Lotfidereshgi, Philippe Gournay

Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Neural speech codecs have demonstrated their ability to compress high-quality speech and audio by converting them into discrete token representations. Most existing methods utilize Residual Vector Quantization (RVQ) to encode speech into…

Sound · Computer Science · 2024-10-22 · Peiji Yang, Fengping Wang, Yicheng Zhong, Huawei Wei, Zhisheng Wang

Error-Resilient Semantic Communication for Speech Transmission over Packet-Loss Networks

Real-time speech communication over wireless networks remains challenging, as conventional channel protection mechanisms cannot effectively counter packet loss under stringent bandwidth and latency constraints. Semantic communication has…

Sound · Computer Science · 2025-12-10 · Zhuohang Han, Jincheng Dai, Shengshi Yao, Junyi Wang, Yanlong Li, Kai Niu, Wenjun Xu, Ping Zhang