Robust talker-independent audio document retrieval
- 24 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, pp. 311-314
- https://doi.org/10.1109/icassp.1996.541094
Abstract
The goal of the video mail retrieval (VMR) project is to integrate state-of-the-art document retrieval methods with speech recognition to yield a robust and efficient retrieval system. The work presented extends VMR towards an open-vocabulary, talker-independent system for retrieving spontaneously spoken audio and video messages. We present results showing successful retrieval using a standard large-vocabulary (LV) recogniser, despite the lack of a matched language model and vocabulary. We further show that integrating an LV recogniser with conventional word spotting (WS) gives more robust retrieval performance than either method alone. This paper gives details of the message archive used, the speech recognition methodologies, the information retrieval methods, and experimental results.