Sine-wave phase coding at low data rates

Abstract
In the context of a sinusoidal representation for speech waveforms, it is shown that synthetic speech of high quality can be obtained using a parametric model for the sine-wave phases, hence obviating the need to code the phases at low data rates. It was found that if a synthetic linear phase term was computed based on the time of occurrence of an artificially generated sequence of pitch pulses, then high-quality voiced speech reconstruction was possible. For unvoiced speech, the modeling study showed that the sine-wave phases were essentially uniformly distributed random variables.
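The two phase models described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The function names (`next_onset`, `model_phases`, `synthesize`), the fixed pitch period, and the harmonic amplitudes are hypothetical. The sketch also assumes that the sine-wave frequencies are harmonics of the pitch and that any vocal-tract (system) phase contribution is omitted. Voiced frames receive a linear phase of -n0 * w at each frequency w, where n0 is the onset time of an artificially generated pitch pulse. Unvoiced frames receive phases drawn uniformly from [-pi, pi].

```python
import math
import random


def next_onset(prev_onset, pitch_period, frame_start):
    """Advance an artificial pitch-pulse train by whole pitch periods
    until a pulse falls at or after frame_start (all values in samples)."""
    onset = prev_onset
    while onset < frame_start:
        onset += pitch_period
    return onset


def model_phases(freqs, onset, voiced, rng):
    """Return one phase per sine wave (radians).

    Voiced: synthetic linear phase -onset * w, which aligns every
    sine wave at the pitch-pulse onset time.
    Unvoiced: uniformly distributed random phase.
    (Assumption: system phase is ignored in this sketch.)"""
    if voiced:
        return [-onset * w for w in freqs]
    return [rng.uniform(-math.pi, math.pi) for _ in freqs]


def synthesize(amps, freqs, phases, start, length):
    """Sum of sinusoids s(n) = sum_k A_k cos(n w_k + theta_k)
    for n = start, ..., start + length - 1."""
    return [
        sum(a * math.cos(n * w + p) for a, w, p in zip(amps, freqs, phases))
        for n in range(start, start + length)
    ]


# Hypothetical example values: 80-sample pitch period, five harmonics.
period = 80
w0 = 2 * math.pi / period                      # fundamental, rad/sample
freqs = [k * w0 for k in range(1, 6)]
amps = [1.0, 0.5, 0.25, 0.125, 0.0625]
rng = random.Random(0)

onset = next_onset(prev_onset=30, pitch_period=period, frame_start=160)
voiced_frame = synthesize(amps, freqs, model_phases(freqs, onset, True, rng), 160, period)
unvoiced_frame = synthesize(amps, freqs, model_phases(freqs, onset, False, rng), 160, period)
```

In the voiced case every cosine has zero argument at the onset sample, so the sine waves add coherently there and the frame peaks at the sum of the amplitudes, giving a pulse-like waveform. In the unvoiced case the random phases remove that alignment. Only the pitch and a voicing decision are needed to regenerate the phases, which is why they need not be coded.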
