Speaker verification using temporal decorrelation post-processing

Abstract
A text-dependent method of speaker verification processing which utilizes the statistical correlation between measured features of speech across whole words is described. The correlation is used in a linear discriminant analysis to define uncorrelated world-level features as a metric. Initial results indicate that this method can significantly reduce the amount of storage necessary for speaker-specific speech information. Furthermore, this method provides promise of improved verification performance compared to methods based on hidden Markov model (HMM) state level observation metrics. Since the linear discriminant analysis yields features which are decorrelated over entire words, this method should be more robust to signal distortions which are consistent over the entire utterance.

This publication has 5 references indexed in Scilit: