On the use of instantaneous and transitional spectral information in speaker recognition

1 June 1988

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Acoustics, Speech, and Signal Processing

Vol. 36 (6), 871-879
https://doi.org/10.1109/29.1598

Abstract

The use of instantaneous and transitional spectral representations of spoken utterances for speaker recognition is investigated. Linear-predictive-coding (LPC)-derived cepstral coefficients are used to represent instantaneous spectral information, and best linear fits of each cepstral coefficient over a specified time window are used to represent transitional information. An evaluation has been carried out using a database of isolated digit utterances spoken by 10 talkers over dialed-up telephone lines. Two vector quantization (VQ) codebooks, instantaneous and transitional, were constructed from each speaker's training utterances. The experimental results show that the instantaneous and transitional representations are relatively uncorrelated, thus providing complementary information for speaker recognition. A rectangular window of approximately 100 ms duration provides an effective estimate of the transitional spectral features for speaker recognition. Also, simple transmission channel variations are shown to affect both the instantaneous spectral representations and the corresponding recognition performance significantly, while the transitional representations and performance are relatively resistant.
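
The "best linear fit of each cepstral coefficient over a time window" can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `transitional_features`, the edge-padding policy, and the 10 ms frame step implied by the default `half_width=5` (an 11-frame window, roughly 100 ms) are all assumptions made for illustration. For a symmetric rectangular window, the least-squares slope reduces to a weighted sum of the frames divided by the sum of squared frame offsets.

```python
import numpy as np

def transitional_features(cepstra, half_width=5):
    """Least-squares slope of each cepstral coefficient over a rectangular window.

    cepstra    : array of shape (num_frames, num_coeffs), one cepstral vector per frame
    half_width : K frames on each side of the centre frame (window = 2K + 1 frames).
                 K = 5 with an ASSUMED 10 ms frame step spans roughly 100 ms.
    """
    cepstra = np.asarray(cepstra, dtype=float)
    k = np.arange(-half_width, half_width + 1)   # frame offsets -K..K
    denom = np.sum(k ** 2)                       # sum of squared offsets

    # ASSUMED edge policy: repeat the first and last frames so that every
    # frame has a full window. The source does not describe edge handling.
    padded = np.pad(cepstra, ((half_width, half_width), (0, 0)), mode="edge")

    slopes = np.empty_like(cepstra)
    for t in range(cepstra.shape[0]):
        window = padded[t : t + 2 * half_width + 1]   # shape (2K + 1, num_coeffs)
        # Because the offsets sum to zero, the fitted slope is sum(k * c) / sum(k^2).
        slopes[t] = k @ window / denom
    return slopes


# Usage: a synthetic two-coefficient track that changes linearly in time.
frames = np.arange(30, dtype=float)
track = np.stack([2.0 * frames + 1.0, -0.5 * frames], axis=1)
delta = transitional_features(track)

# A fixed offset added to every frame leaves the slopes unchanged.
delta_shifted = transitional_features(track + 3.0)
```

Since the window offsets sum to zero, a constant added to every cepstral vector cancels out of the slope. A fixed linear channel contributes such an additive constant in the cepstral domain, which is consistent with the abstract's observation that the transitional representation is relatively resistant to simple channel variations.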
