Robust talker-independent audio document retrieval

24 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1, 311-314 vol. 1
https://doi.org/10.1109/icassp.1996.541094

Abstract

The goal of the video mail retrieval (VMR) project is to integrate state-of-the-art document retrieval methods with speech recognition to yield a robust and efficient retrieval system. The work presented extends VMR towards an open-vocabulary, talker-independent system for retrieving spontaneously-spoken audio and video messages. We present results showing successful retrieval using a standard large-vocabulary (LV) recogniser, despite the lack of a matched language model and vocabulary. We further show that integrating a LV recogniser with conventional word spotting (WS) gives more robust retrieval performance than either method alone. This paper gives details of the message archive used, the speech recognition methodologies, the information retrieval methods, and experimental results.

Keywords

This publication has 11 references indexed in Scilit:

Large vocabulary continuous speech recognition using HTK
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Reducing word error rate on conversational speech from the Switchboard corpus
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Video mail retrieval: the effect of word spotting accuracy on precision
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Experiments in spoken document retrieval
Information Processing & Management, 1996
Combining the evidence of multiple query representations for information retrieval
Information Processing & Management, 1995
Tree-based state tying for high accuracy acoustic modelling
Published by Association for Computational Linguistics (ACL) ,1994
Speech-based retrieval using semantic co-occurrence filtering
Published by Association for Computational Linguistics (ACL) ,1994
Application of large vocabulary continuous speech recognition to topic and speaker identification using telephone speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1993
An algorithm for suffix stripping
Program: electronic library and information systems, 1980