The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing
- 21 August 2006
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence
- Vol. 28 (10), 1678-1689
- https://doi.org/10.1109/tpami.2006.212
Abstract
This paper presents the semantic pathfinder architecture for generic indexing of multimedia archives. The semantic pathfinder extracts semantic concepts from video by exploring different paths through three consecutive analysis steps, which we derive from the observation that produced video is the result of an authoring-driven process. We exploit this authoring metaphor for machine-driven understanding. The pathfinder starts with the content analysis step. In this analysis step, we follow a data-driven approach of indexing semantics. The style analysis step is the second analysis step. Here, we tackle the indexing problem by viewing a video from the perspective of production. Finally, in the context analysis step, we view semantics in context. The virtue of the semantic pathfinder is its ability to learn the best path of analysis steps on a per-concept basis. To show the generality of this novel indexing approach, we develop detectors for a lexicon of 32 concepts and we evaluate the semantic pathfinder against the 2004 NIST TRECVID video retrieval benchmark, using a news archive of 64 hours. Top ranking performance in the semantic concept detection task indicates the merit of the semantic pathfinder for generic indexing of multimedia archives.
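The per-concept path selection described in the abstract can be illustrated with a minimal sketch. It assumes that a path ends after the content, style, or context step, and that each path has already been scored per concept on held-out validation data. The path names, the function `select_best_paths`, the concept names, and all scores are hypothetical placeholders. They are not taken from the paper and do not reproduce its method or results.

```python
from typing import Dict, Tuple

# Hypothetical path names, ordered from shortest to longest.
# Assumption: a path stops after the content, style, or context step.
PATHS = ("content", "content+style", "content+style+context")


def select_best_paths(
    validation_scores: Dict[str, Dict[str, float]]
) -> Dict[str, Tuple[str, float]]:
    """For each concept, pick the path with the highest validation score."""
    best = {}
    for concept, scores in validation_scores.items():
        # max() returns the first maximal item, so ties go to the shorter path.
        path = max(PATHS, key=lambda p: scores[p])
        best[concept] = (path, scores[path])
    return best


# Made-up validation scores, for illustration only (not results from the paper).
example_scores = {
    "concept_a": {"content": 0.40, "content+style": 0.35, "content+style+context": 0.38},
    "concept_b": {"content": 0.20, "content+style": 0.31, "content+style+context": 0.31},
    "concept_c": {"content": 0.10, "content+style": 0.12, "content+style+context": 0.25},
}

selected = select_best_paths(example_scores)
for concept, (path, score) in selected.items():
    print(f"{concept}: {path} ({score:.2f})")
```

In this sketch each concept keeps whichever path scored best on validation data, so a concept that gains nothing from the later steps simply stops after the content step.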