Metadata for integrating speech documents in a text retrieval system
- 1 December 1994
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 23 (4), 57-63
- https://doi.org/10.1145/190627.190645
Abstract
We present an information retrieval system that allows text and speech documents to be searched simultaneously. The retrieval system accepts vague queries and performs a best-match search to find the documents that are relevant to the query. The output of the retrieval system is a ranked list of documents in which the documents at the top of the list best satisfy the user's information need. The relevance of the documents is estimated by means of metadata (document description vectors). The metadata is generated automatically and is organized so that queries can be processed efficiently. We introduce a controlled indexing vocabulary for both speech and text documents. The new indexing vocabulary is small (1,000 features) compared with the indexing vocabularies of conventional text retrieval (10,000 to 100,000 features). We show that the retrieval effectiveness achieved with such a small indexing vocabulary is similar to that of a Boolean retrieval system.
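To make the idea of best-match ranking over document description vectors concrete, here is a minimal Python sketch. It is an illustration only, not the system from the article: the five-feature vocabulary, the sample documents, the function names (`describe`, `cosine`, `rank`), and the use of raw feature counts with cosine similarity are all assumptions chosen for brevity. The abstract does not state which weighting or matching function the authors use.

```python
import math
from collections import Counter

# Hypothetical controlled indexing vocabulary. The article reports about
# 1,000 features; five are enough to show the mechanics.
VOCABULARY = ["speech", "retrieval", "index", "query", "markov"]


def describe(tokens):
    """Build a description vector: counts of vocabulary features only."""
    counts = Counter(t for t in tokens if t in VOCABULARY)
    return [counts[feature] for feature in VOCABULARY]


def cosine(u, v):
    """Cosine similarity (an assumed scoring function); 0.0 for a zero vector."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank(query_tokens, documents):
    """Return document ids ordered from best to worst match."""
    query_vector = describe(query_tokens)
    scored = [
        (cosine(query_vector, describe(tokens)), doc_id)
        for doc_id, tokens in documents.items()
    ]
    # Sort by descending score, breaking ties by document id.
    return [doc_id for _, doc_id in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Made-up documents; a speech document would contribute the features
# detected in its audio instead of words read from text.
DOCUMENTS = {
    "text-1": "inverted index update for text retrieval".split(),
    "speech-1": "speech retrieval with markov model".split(),
    "text-2": "weather report for the weekend".split(),
}

RANKING = rank("speech retrieval query".split(), DOCUMENTS)
print(RANKING)
```

Because text and speech documents are both reduced to vectors over the same small vocabulary, a single query vector can be scored against either kind of document, and documents sharing no features with the query fall to the bottom of the list.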