On the use of instantaneous and transitional spectral information in speaker recognition
- 1 June 1988
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Acoustics, Speech, and Signal Processing
- Vol. 36 (6) , 871-879
- https://doi.org/10.1109/29.1598
Abstract
The use of instantaneous and transitional spectral representations of spoken utterances for speaker recognition is investigated. Linear-predictive-coding (LPC)-derived cepstral coefficients are used to represent instantaneous spectral information, and best linear fits of each cepstral coefficient over a specified time window are used to represent transitional information. An evaluation has been carried out using a database of isolated digit utterances over dialed-up telephone lines by 10 talkers. Two vector quantization (VQ) codebooks, instantaneous and transitional, were constructed from each speaker's training utterances. The experimental results show that the instantaneous and transitional representations are relatively uncorrelated, thus providing complementary information for speaker recognition. A rectangular window of approximately 100 ms duration provides an effective estimate of the transitional spectral features for speaker recognition. Also, simple transmission channel variations are shown to affect both the instantaneous spectral representations and the corresponding recognition performance significantly, while the transitional representations and performance are relatively resistant.Keywords
This publication has 14 references indexed in Scilit:
- Linear predictive hidden Markov models and the speech signalPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- An approach to text-independent speaker recognition with short utterancesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Prediction of perceived phonetic distance from critical-band spectra: A first stepPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Evaluation of a vector quantization talker recognition system in text independent and text dependent modesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A vector quantization approach to speaker recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Comparative study of several distortion measures for speech recognitionSpeech Communication, 1985
- Text-independent speaker recognition experiments using codebooks in vector quantizationThe Journal of the Acoustical Society of America, 1985
- Direct (nonrecursive) relations between cepstrum and predictor coefficientsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1981
- An Algorithm for Vector Quantizer DesignIEEE Transactions on Communications, 1980
- Adaptive transform coding of speech signalsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1977