Learning to Segment Speech Using Multiple Cues: A Connectionist Model
- 1 June 1998
- journal article
- research article
- Published by Taylor & Francis in Language and Cognitive Processes
- Vol. 13 (2-3) , 221-268
- https://doi.org/10.1080/016909698386528
Abstract
Considerable research in language acquisition has addressed the extent to which basic aspects of linguistic structure might be identified on the basis of probabilistic cues in caregiver speech to children. This type of learning mechanism presents classic learnability issues: there are aspects of language for which the input is thought to provide no evidence, and the evidence that does exist tends to be unreliable. We address these issues in the context of the specific problem of learning to identify lexical units in speech. A simple recurrent network was trained on a phoneme prediction task. The model was explicitly provided with information about phonemes, relative lexical stress, and boundaries between utterances. Individually these sources of information provide relatively unreliable cues to word boundaries and no direct evidence about actual word boundaries. After training on a large corpus of childdirected speech, the model was able to use these cues to reliably identify word boundaries. The model shows that aspects of linguistic structure that are not overtly marked in the input can be derived by efficiently combining multiple probabilistic cues. Connectionist networks provide a plausible mechanism for acquiring, representing, and combining such probabilistic information.Keywords
This publication has 36 references indexed in Scilit:
- Learning and development in neural networks: the importance of starting smallPublished by Elsevier ,2002
- Segmentation problems, rhythmic solutionsLingua, 1994
- Sequence Recognition with Recurrent Neural NetworksConnection Science, 1993
- Learning Simple Arithmetic ProceduresConnection Science, 1993
- Linguistic Experience Alters Phonetic Perception in Infants by 6 Months of AgeScience, 1992
- Connectionism, Learning and MeaningConnection Science, 1992
- The Child Language Data Exchange System: an updateJournal of Child Language, 1990
- A cross-language study of prosodic modifications in mothers' and fathers' speech to preverbal infantsJournal of Child Language, 1989
- Structural packaging in the input to language learning: Contributions of prosodic and morphological marking of phrases to the acquisition of languageCognitive Psychology, 1987
- A Universal Prior for Integers and Estimation by Minimum Description LengthThe Annals of Statistics, 1983