Unsupervised learning of vowel categories from infant-directed speech

14 August 2007

journal article
research article
Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences

Vol. 104 (33) , 13273-13278
https://doi.org/10.1073/pnas.0705369104

Abstract

Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation-Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (h, i, c, e/ for English, /i, i:, e, e:/for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, also was able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.

Keywords

This publication has 44 references indexed in Scilit:

Learning phonetic categories by tracking movements
Published by Elsevier ,2006
Cross-language speech perception: Evidence for perceptual reorganization during the first year of life
Published by Elsevier ,2004
Resonant neural dynamics of speech perception
Journal of Phonetics, 2003
Bayesian approaches to Gaussian mixture modeling
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1998
Developmental changes in perception of nonnative vowel contrasts.
Journal of Experimental Psychology: Human Perception and Performance, 1994
Spectral-shape features versus formants as acoustic correlates for vowels
The Journal of the Acoustical Society of America, 1993
On the sufficiency of compound target specification of isolated vowels and vowels in /bVb/ syllables
The Journal of the Acoustical Society of America, 1992
A cross-language study of prosodic modifications in mothers' and fathers' speech to preverbal infants
Journal of Child Language, 1989
Simplified neuron model as a principal component analyzer
Journal of Mathematical Biology, 1982