Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs
- 1 October 1997
- journal article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 102 (4) , 2403-2411
- https://doi.org/10.1121/1.419603
Abstract
Vowels, consonants, and sentences were processed through software emulations of cochlear-implant signal processors with 2-9 output channels. The signals were then presented, as either the sum of sine waves at the center of the channels or as the sum of noise bands the width of the channels, to normal-hearing listeners for identification. The results indicate, as previous investigations have suggested, that high levels of speech understanding can be obtained using signal processors with a small number of channels. The number of channels needed for high levels of performance varied with the nature of the test material. For the most difficult material--vowels produced by men, women, and girls--no statistically significant differences in performance were observed when the number of channels was increased beyond 8. For the least difficult material--sentences--no statistically significant differences in performance were observed when the number of channels was increased beyond 5. The nature of the output signal, noise bands or sine waves, made only a small difference in performance. The mechanism mediating the high levels of speech recognition achieved with only few channels of stimulation may be the same one that mediates the recognition of signals produced by speakers with a high fundamental frequency, i.e., the levels of adjacent channels are used to determine the frequency of the input signal. The results of an experiment in which frequency information was altered but temporal information was not altered indicates that vowel recognition is based on information in the frequency domain even when the number of channels of stimulation is small.Keywords
This publication has 10 references indexed in Scilit:
- Modeling formant frequency discrimination of female vowelsThe Journal of the Acoustical Society of America, 1996
- Speech Recognition with Primarily Temporal CuesScience, 1995
- Acoustic characteristics of American English vowelsThe Journal of the Acoustical Society of America, 1995
- Development of the Hearing In Noise Test for the measurement of speech reception thresholds in quiet and in noiseThe Journal of the Acoustical Society of America, 1994
- The use of speech knowledge in automatic speech recognitionProceedings of the IEEE, 1985
- Speech Recognition as a Function of Channel Capacity in a Discrete Set of ChannelsThe Journal of the Acoustical Society of America, 1968
- On Vowel Duration in EnglishThe Journal of the Acoustical Society of America, 1961
- Tempo of frequency change as a cue for distinguishing classes of speech sounds.Journal of Experimental Psychology, 1956
- Some Experiments on the Perception of Synthetic Speech SoundsThe Journal of the Acoustical Society of America, 1952
- Control Methods Used in a Study of the VowelsThe Journal of the Acoustical Society of America, 1952