Unsupervised clustering of ambulatory audio and video
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 6 (ISSN 1520-6149), pp. 3037-3040
- https://doi.org/10.1109/icassp.1999.757481
Abstract
A truly personal and reactive computer system should have access to the same information as its user, including the ambient sights and sounds. To this end, we have developed a system for extracting events and scenes from natural audio/visual input. We find our system can (without any prior labeling of data) cluster the audio/visual data into events, such as passing through doors and crossing the street. Also, we hierarchically cluster these events into scenes and get clusters that correlate with visiting the supermarket, or walking down a busy street.
This publication has 4 references indexed in Scilit:
- Visual contextual awareness in wearable computing. Published by Institute of Electrical and Electronics Engineers (IEEE), 1998
- Automatic Indexing of a Sound Database Using Self-Organizing Neural Nets. Computer Music Journal, 1994
- Auditory Scene Analysis. Published by MIT Press, 1990
- A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 1989