Radial basis function networks for speaker recognition

Abstract
A speaker recognition system, using a modified form of feedforward neural network based on radial basis functions (RBFs), is presented. Each person to be recognized has his/her own neural model which is trained to recognise spectral feature vectors representative of his/her speech. Experimental results on a 40-speaker database indicate that the modified neural approach significantly outperforms both a standard multilayer perceptron and a vector quantization based system. The best performance for 4 digit test utterances is obtained from an RBF network with 384 RBF nodes in the hidden layer, given an 8% true talker rejection rate for a fixed 1% imposter acceptance rate. Additional advantages include a substantial reduction in training time over an MLP approach, and the ability to readily interpret the resulting model.

This publication has 3 references indexed in Scilit: