Digital coding of speech waveforms: PCM, DPCM, and DM quantizers
- 1 May 1974
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of the IEEE
- Vol. 62 (5) , 611-632
- https://doi.org/10.1109/proc.1974.9484
Abstract
A study is presented on the digital coding of speech by means of a straightforward approximation of the time waveform. In particular, the closely related discrete-time discrete-amplitude signal representations that are rather well known as pulse-code modulation (PCM), differential pulse-code modulation (DPCM), and delta modulation (DM) are discussed. Speech is recognized as a nonstationary signal, and emphasis is therefore placed on "companding" and "adaptive" strategies for waveform quantization and prediction. With signal-to-quantization-error ratio SNR as a performance measure, techniques are suggested which are most likely to be appropriate for given specifications of information rate. It is pointed out that error waveforms in speech quantization cannot be regarded as additive white noise, in general. This means that for finer assessments of speech coders, either relative or absolute, one needs to supplement SNR-based observations with corrections for subjective and perceptual factors. The latter seem to defy quantification as a rule. Invaluable, therefore, are explicit preference tests for direct comparisons of coders from a perceptual standpoint, and notions such as isopreference and multidimensional scaling are naturally appropriate in interpreting the results of such tests. Final points of concern are communication questions such as multiple encodings of speech by tandem coder-decoder pairs; conversions among different digital code formats; and the effects of additive and multiplicative noise in the communication channel, as manifest in the erroneous reception of speech-carrying bits. Information on these topics tends to be heterogeneous and nontheoretical, and the present digression into the subject is cursory by intent. The gramophone record accompanying this paper demonstrates some of the manipulations of speech that are discussed.Keywords
This publication has 47 references indexed in Scilit:
- Delta modulation of pitch, formant, and amplitude signals for the synthesis of voiced speechIEEE Transactions on Audio and Electroacoustics, 1973
- Slope Overload Noise in Linear Delta Modulators With Gaussian InputsBell System Technical Journal, 1973
- On Delta ModulationBell System Technical Journal, 1972
- Subjective Effects of Channel Transmission Errors on PCM and DPCM Voice Communication SystemsIEEE Transactions on Communications, 1972
- Adaptive Delta Modulation with a One-Bit MemoryBell System Technical Journal, 1970
- Delta Modulation Granular Quantizing NoiseBell System Technical Journal, 1969
- Effects of Channel Errors on Delta ModulationIEEE Transactions on Communications, 1966
- Delta Modulation Quantizing Noise Analytical and Computer Simulation Results for Gaussian and Television Input SignalsBell System Technical Journal, 1966
- Articulation-Testing Methods: Consonantal Differentiation with a Closed-Response SetThe Journal of the Acoustical Society of America, 1965
- High information delta modulationProceedings of the IEEE, 1963