Concatenated phoneme models for text-variable speaker recognition
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 391-394 vol.2
- https://doi.org/10.1109/icassp.1993.319321
Abstract
Methods that create models to specify both speaker and phonetic information accurately by using only a small amount of training data for each speaker are investigated. For a text-dependent speaker recognition method, in which arbitrary key texts are prompted from the recognizer, speaker-specific phoneme models are necessary to identify the key text and recognize the speaker. Two methods of making speaker-specific phoneme models are discussed: phoneme-adaptation of a phoneme-independent speaker model and speaker-adaptation of universal phoneme models. The authors also investigate supplementing these methods by adding a phoneme-independent speaker model to make up for the lack of speaker information. This combination achieves a rejection rate as high as 98.5% for speech that differs from the key text and a speaker verification rate of 100.0%.Keywords
This publication has 3 references indexed in Scilit:
- Speaker verification using randomized phrase promptingDigital Signal Processing, 1991
- Speaker verification: a tutorialIEEE Communications Magazine, 1990
- Automatic Speech RecognitionPublished by Springer Nature ,1989