On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception
- 1 September 2001
- journal article
- conference paper
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 110 (3) , 1628-1640
- https://doi.org/10.1121/1.1396325
Abstract
Studies in neurophysiology and in psychophysics provide evidence for the existence of temporal integration mechanisms in the auditory system. These auditory mechanisms may be viewed as “detectors,” parametrized by their cutoff frequencies. There is an interest in quantifying those cutoff frequencies by direct psychophysical measurement, in particular for tasks that are related to speech perception. In this study, the inherent difficulties in synthesizing speech signals with prescribed temporal envelope bandwidth at the output of the listener’s cochlea have been identified. In order to circumvent these difficulties, a dichotic synthesis technique is suggested with interleaving critical-band envelopes. This technique is capable of producing signals which generate cochlear temporal envelopes with prescribed bandwidth. Moreover, for unsmoothed envelopes, the synthetic signal is perceptually indistinguishable from the original. With this technique established, psychophysical experiments have been conducted to quantify the upper cutoff frequency of the auditory critical-band envelope detectors at threshold, using high-quality, wideband speech signals (bandwidth of 7 kHz) as test stimuli. These experiments show that in order to preserve speech quality (i.e., for inaudible distortions), the minimum bandwidth of the envelope information for a given auditory channel is considerably smaller than a critical-band bandwidth (roughly one-half of one critical band). Difficulties encountered in using the dichotic synthesis technique to measure the cutoff frequencies relevant to intelligibility of speech signals with fair quality levels (e.g., above MOS level 3) are also discussed.Keywords
This publication has 14 references indexed in Scilit:
- The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriersThe Journal of the Acoustical Society of America, 2000
- Spectro-temporal modulation transfer functions and speech intelligibilityThe Journal of the Acoustical Society of America, 1999
- Intrinsic envelope fluctuations and modulation-detection thresholds for narrow-band noise carriersThe Journal of the Acoustical Society of America, 1999
- Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriersThe Journal of the Acoustical Society of America, 1997
- Modeling auditory processing of amplitude modulation. II. Spectral and temporal integrationThe Journal of the Acoustical Society of America, 1997
- On the perceptual distance between speech segmentsThe Journal of the Acoustical Society of America, 1997
- Effect of temporal envelope smearing on speech receptionThe Journal of the Acoustical Society of America, 1994
- Suggested formulae for calculating auditory-filter bandwidths and excitation patternsThe Journal of the Acoustical Society of America, 1983
- The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tonesThe Journal of the Acoustical Society of America, 1980
- Parametric coding of speech spectraThe Journal of the Acoustical Society of America, 1980