A sinusoidal voice over packet coder tailored for the frame-erasure channel
- 15 August 2005
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing
- Vol. 13 (5) , 787-798
- https://doi.org/10.1109/tsa.2005.851913
Abstract
A speech coder tailored especially for the frame-erasure channel-the sinusoidal voice over packet coder (SVOPC)-is proposed. Based on a classified approach, avoiding interframe coding techniques, and synthesizing its output from slowly varying parameters, the coder is inherently robust to packet loss. SVOPC is based on quasi-harmonic modeling of the linear prediction (LP) residual. Both the sinusoidal amplitudes and phases are explicitly encoded using new methods based on Gaussian mixture models. A wide-band (16-kHz sampling frequency) implementation of the coder provides synthesized speech of good subjective quality at around 20 kbps. SVOPC is evaluated by means of subjective listening tests, and compared to a reference system based on G.722.2 (the AMR wide-band codec). Under frame erasure conditions (5%-30% frame erasures generated according to a Gilbert model), SVOPC clearly outperforms G.722.2.Keywords
This publication has 26 references indexed in Scilit:
- Variable-dimension quantization of sinusoidal amplitudes using Gaussian mixture modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Error protection and packet loss concealment based on a signal matched sinusoidal vocoderPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- PDF optimized parametric vector quantization of speech line spectral frequenciesIEEE Transactions on Speech and Audio Processing, 2003
- LSP quantization in wideband speech codersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for the Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio CodecsPublished by RFC Editor ,2002
- Coding of variable dimension speech spectral vectors using weighted nonsquare transform vector quantizationIEEE Transactions on Speech and Audio Processing, 2001
- Vector quantization based on Gaussian mixture modelsIEEE Transactions on Speech and Audio Processing, 2000
- Measurement and modelling of the temporal dependence in packet lossPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- Efficient vector quantization of LPC parameters at 24 bits/frameIEEE Transactions on Speech and Audio Processing, 1993
- Vector Quantization and Signal CompressionPublished by Springer Nature ,1992