Speech recognition with amplitude and frequency modulations
Top Cited Papers
- 27 January 2005
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 102 (7) , 2293-2298
- https://doi.org/10.1073/pnas.0406460102
Abstract
Amplitude modulation (AM) and frequency modulation (FM) are commonly used in communication, but their relative contributions to speech recognition have not been fully explored. To bridge this gap, we derived slowly varying AM and FM from speech sounds and conducted listening tests using stimuli with different modulations in normal-hearing and cochlear-implant subjects. We found that although AM from a limited number of spectral bands may be sufficient for speech recognition in quiet, FM significantly enhances speech recognition in noise, as well as speaker and tone recognition. Additional speech reception threshold measures revealed that FM is particularly critical for speech recognition with a competing voice and is independent of spectral resolution and similarity. These results suggest that AM and FM provide independent yet complementary contributions to support robust speech recognition under realistic listening situations. Encoding FM may improve auditory scene analysis, cochlear-implant, and audiocoding performance.Keywords
This publication has 48 references indexed in Scilit:
- Encoding Frequency Modulation to Improve Cochlear Implant Performance in NoiseIEEE Transactions on Biomedical Engineering, 2004
- Temporal Envelope Processing in the Human Left and Right Auditory CorticesCerebral Cortex, 2004
- The Power of SpeechScience, 2003
- The role of frequency modulation in the perceptual segregation of concurrent vowelsThe Journal of the Acoustical Society of America, 1995
- A common neural code for frequency- and amplitude-modulated soundsNature, 1995
- Auditory Scene Analysis: The Perceptual Organization of SoundThe Journal of the Acoustical Society of America, 1994
- Perceptual separation of simultaneous vowels: Within and across-formant grouping by FThe Journal of the Acoustical Society of America, 1993
- Segregation of concurrent sounds. II: Effects of spectral envelope tracing, frequency modulation coherence, and frequency modulation widthThe Journal of the Acoustical Society of America, 1991
- A cochlear frequency-position function for several species—29 years laterThe Journal of the Acoustical Society of America, 1990
- Optimizing digital speech coders by exploiting masking properties of the human earThe Journal of the Acoustical Society of America, 1979