On the number of channels needed to understand speech
- 1 October 1999
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 106 (4) , 2097-2103
- https://doi.org/10.1121/1.427954
Abstract
Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through number of channels, synthesized as a sum of n sine waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of five channels was needed to achieve a high level (90%) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92%) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels).
Keywords
This publication has 20 references indexed in Scilit:
- Effects of noise and spectral resolution on vowel and consonant recognition: Acoustic and electric hearingThe Journal of the Acoustical Society of America, 1998
- Mimicking the human earIEEE Signal Processing Magazine, 1998
- The recognition of vowels produced by men, women, boys, and girls by cochlear implant patients using a six-channel CIS processorThe Journal of the Acoustical Society of America, 1998
- Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputsThe Journal of the Acoustical Society of America, 1997
- Temporal envelope and fine structure cues for speech intelligibilityThe Journal of the Acoustical Society of America, 1995
- Two left-hemisphere mechanisms in speech perceptionPerception & Psychophysics, 1974
- Speech Recognition as a Function of Channel Capacity in a Discrete Set of ChannelsThe Journal of the Acoustical Society of America, 1968
- Naturalness and Distortion in Speech-Processing DevicesThe Journal of the Acoustical Society of America, 1956
- An Experimental Study of the Acoustic Determinants of Vowel Color; Observations on One- and Two-Formant Vowels Synthesized from Spectrographic PatternsWORD, 1952
- Remaking SpeechThe Journal of the Acoustical Society of America, 1939