Metadata for mixed-media access
- 1 December 1994
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 23 (4), 64-71
- https://doi.org/10.1145/190627.190646
Abstract
In this paper, we discuss mixed-media access, an information access paradigm for multimedia data in which the media type of a query may differ from that of the data. The types of media considered in this paper are speech, images of text, and full-length text. Some examples of metadata for mixed-media access are locations of keywords in speech and images, identification of speakers, locations of emphasized regions in speech, and locations of topic boundaries in text. Algorithms for automatically generating this metadata are described, including word spotting, speaker segmentation, emphatic speech detection, and subtopic boundary location. We illustrate queries composed of diverse media types in an example of access to recorded meetings, via speaker and keyword location.
This publication has 10 references indexed in Scilit:
- Isolated word intonation recognition using hidden Markov models. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Marquee. Published by Association for Computing Machinery (ACM), 1994
- Multi-paragraph segmentation of expository text. Published by Association for Computational Linguistics (ACL), 1994
- Speech-based retrieval using semantic co-occurrence filtering. Published by Association for Computational Linguistics (ACL), 1994
- Document analysis-from pixels to contents. Proceedings of the IEEE, 1992
- Wordspotting for voice editing and audio indexing. Published by Association for Computing Machinery (ACM), 1992
- A practical part-of-speech tagger. Published by Association for Computational Linguistics (ACL), 1992
- An introduction to speech and speaker recognition. Computer, 1990
- Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
- An analysis of controlled vocabulary and free text search statements in online searches. Online Review, 1980