Static, dynamic, and relational properties in vowel perception
- 1 May 1989
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 85 (5) , 2088-2113
- https://doi.org/10.1121/1.397861
Abstract
The present work reviews theories and empirical findings, including results from two new experiments, that bear on the perception of English vowels, with an emphasis on the comparison of data analytic ‘‘machine recognition’’ approaches with results from speech perceptionexperiments. Two major sources of variability (viz., speaker differences and consonantal context effects) are addressed from the classical perspective of overlap between vowel categories in F1×F2 space. Various approaches to the reduction of this overlap are evaluated. Two types of speaker normalization are considered. ‘‘Intrinsic’’ methods based on relationships among the steady‐state properties (F0, F1, F2, and F3) within individual vowel tokens are contrasted with ‘‘extrinsic’’ methods, involving the relationships among the formant frequencies of the entire vowel system of a single speaker. Evidence from a new experiment supports Ainsworth’s (1975) conclusion [W. Ainsworth, Auditory Analysis and Perception of Speech (Academic, London, 1975)] that both types of information have a role to play in perception. The effects of consonantal context on formant overlap are also considered. A new experiment is presented that extends Lindblom and Studdert‐Kennedy’s (1967) finding [B. Lindblom and M. Studdert‐Kennedy, J. Acoust. Soc. Am. 4 3, 840–843)] of perceptual effects of consonantal context on vowel perception to /dVd/ and /bVb/ contexts. Finally, the role of vowel‐inherent dynamic properties, including duration and diphthongization, is briefly reviewed. All of the above factors are shown to have reliable influences on vowel perception, although the relative weight of such effects and the circumstances that alter these weights remain far from clear. It is suggested that the design of more complex perceptual experiments, together with the development of quantitative pattern recognitionmodels of human vowel perception, will be necessary to resolve these issues.Keywords
This publication has 5 references indexed in Scilit:
- Review of text-to-speech conversion for EnglishThe Journal of the Acoustical Society of America, 1987
- Perception of front vowels: The role of harmonics in the first formant regionThe Journal of the Acoustical Society of America, 1987
- Vowel-space normalization procedures and phone-preserving transformations of synthetic vowelsThe Journal of the Acoustical Society of America, 1983
- The Examination and Analysis of ResidualsTechnometrics, 1963
- Acoustic PhoneticsLanguage, 1948