A Comparison of Four Methods for Analog Speech Privacy
- 1 January 1981
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Communications
- Vol. 29 (1), 18-23
- https://doi.org/10.1109/tcom.1981.1094870
Abstract
Four well-known procedures for analog speech privacy have been compared in terms of residual intelligibility, bandwidth expansion, and encoding delay. Intelligibility scores have been determined from a perceptual experiment in which about 70 untrained listeners were given the task of recognizing each of 200 spoken digits that occurred in a balanced set of 50 encrypted four-digit utterances, and by averaging the resulting probabilities of correct digit recognition. Bandwidth expansion has been expressed in terms of a new segmental measure that is more sensitive to short-time bandwidth manipulations than a conventional, long-time-averaged power spectrum measurement. Encoding delay is a straightforward function of analog scrambler parameters. The scrambling procedures that have been compared are sample permutation (S), block permutation (B), frequency inversion (F), and a combination of methods B and F, denoted by [BF]. Sample permutations involved a contiguous set of L_S (2 to 128) 8 kHz samples, while block permutations operated on a contiguous set of N_B (4 to 128) speech segments, each of which was L_B (8 to 256) samples long. Frequency inversion is obtained by simply inverting the sign of every other Nyquist (8 kHz) sample. The parameters L_S, N_B, and L_B determine residual intelligibility as well as transmission properties such as encoding delay and bandwidth. The comparisons in our study provide a quantitative justification for the popular approach [BF]. For example, with N_B = 8 and L_B = 128, although the encoding delay is as much as 128 ms, the bandwidth expansion is only about 100 Hz (using the new segmental measure), and the digit intelligibility I is 20 percent. Note that in the specific problem of recognizing ten digits, purely random (input-independent) listener responses correspond to I = 10 percent.
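The three scrambling operations named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of a fixed permutation per frame, the dropping of a trailing partial frame, and the order of B and F in the combined scheme are all assumptions made for illustration. The delay formula (frame length divided by the sampling rate) is likewise an assumption, chosen because it reproduces the 128 ms quoted for N_B = 8 and L_B = 128 at 8 kHz.

```python
FS_HZ = 8000  # sampling rate stated in the abstract


def frequency_inversion(samples):
    """Method F: invert the sign of every other sample."""
    return [-x if i % 2 else x for i, x in enumerate(samples)]


def sample_permutation(samples, l_s, perm):
    """Method S: permute samples within each contiguous frame of l_s samples.

    Assumption: a trailing partial frame is dropped.
    """
    out = []
    for start in range(0, len(samples) - l_s + 1, l_s):
        frame = samples[start:start + l_s]
        out.extend(frame[p] for p in perm)
    return out


def block_permutation(samples, n_b, l_b, perm):
    """Method B: permute n_b segments of l_b samples within each frame.

    Assumption: a trailing partial frame is dropped.
    """
    frame_len = n_b * l_b
    out = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        blocks = [samples[start + k * l_b:start + (k + 1) * l_b]
                  for k in range(n_b)]
        for p in perm:
            out.extend(blocks[p])
    return out


def invert_permutation(perm):
    """Return the permutation that undoes perm."""
    inv = [0] * len(perm)
    for i, p in enumerate(perm):
        inv[p] = i
    return inv


def encoding_delay_ms(n_b, l_b, fs_hz=FS_HZ):
    """Assumed delay model: duration of one frame of n_b * l_b samples."""
    return 1000.0 * n_b * l_b / fs_hz


def scramble_bf(samples, n_b, l_b, perm):
    """Combined [BF]; the order (B first, then F) is an assumption."""
    return frequency_inversion(block_permutation(samples, n_b, l_b, perm))


def descramble_bf(samples, n_b, l_b, perm):
    """Undo scramble_bf: F is its own inverse, then apply the inverse of B."""
    return block_permutation(frequency_inversion(samples), n_b, l_b,
                             invert_permutation(perm))
```

Under these assumptions, `encoding_delay_ms(8, 128)` evaluates to 128.0, matching the figure in the abstract, and `descramble_bf` recovers the input of `scramble_bf` whenever the signal length is a multiple of N_B × L_B.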