A sinusoidal voice over packet coder tailored for the frame-erasure channel

15 August 2005

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing

Vol. 13 (5), 787-798
https://doi.org/10.1109/tsa.2005.851913

Abstract

A speech coder tailored especially for the frame-erasure channel, the sinusoidal voice over packet coder (SVOPC), is proposed. Based on a classified approach, avoiding interframe coding techniques, and synthesizing its output from slowly varying parameters, the coder is inherently robust to packet loss. SVOPC is based on quasi-harmonic modeling of the linear prediction (LP) residual. Both the sinusoidal amplitudes and phases are explicitly encoded using new methods based on Gaussian mixture models. A wide-band (16-kHz sampling frequency) implementation of the coder provides synthesized speech of good subjective quality at around 20 kbps. SVOPC is evaluated by means of subjective listening tests and compared to a reference system based on G.722.2 (the AMR wide-band codec). Under frame-erasure conditions (5% to 30% frame erasures generated according to a Gilbert model), SVOPC clearly outperforms G.722.2.
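
The abstract states that frame erasures were generated according to a Gilbert model but gives no parameters. As an illustration only, the sketch below shows one common way to simulate such a channel: a two-state Markov chain with a "good" state (frame received) and a "bad" state (frame erased). The function name `gilbert_erasures`, the `mean_burst` parameter, and the choice to parameterize by target loss rate and mean burst length are assumptions made for this example, not details taken from the paper.

```python
import random


def gilbert_erasures(n_frames, loss_rate, mean_burst=2.0, seed=0):
    """Return a list of booleans, True where a frame is erased.

    Two-state Markov chain (an illustrative parameterization):
      p = P(good -> bad), q = P(bad -> good)
    The stationary erasure rate is p / (p + q) and the mean
    burst length is 1 / q, so p and q follow from the arguments.
    """
    if not 0.0 < loss_rate < 1.0:
        raise ValueError("loss_rate must be strictly between 0 and 1")
    if mean_burst < 1.0:
        raise ValueError("mean_burst must be at least 1 frame")

    q = 1.0 / mean_burst
    p = q * loss_rate / (1.0 - loss_rate)
    if p > 1.0:
        raise ValueError("loss_rate too high for this mean_burst")

    rng = random.Random(seed)
    # Draw the initial state from the stationary distribution.
    bad = rng.random() < loss_rate
    pattern = []
    for _ in range(n_frames):
        pattern.append(bad)
        if bad:
            bad = rng.random() >= q  # stay in the bad state with prob. 1 - q
        else:
            bad = rng.random() < p   # enter the bad state with prob. p
    return pattern


if __name__ == "__main__":
    # Erasure rates spanning the 5% to 30% range quoted in the abstract.
    for rate in (0.05, 0.10, 0.20, 0.30):
        pattern = gilbert_erasures(100_000, rate)
        print(rate, sum(pattern) / len(pattern))
```

Unlike independent (Bernoulli) losses, this chain produces erasures in bursts, which is the property that makes it a harder test for concealment than uniformly scattered losses at the same average rate.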
