A framework for audio analysis based on classification and temporal segmentation
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (10896503) , 61-67 vol.2
- https://doi.org/10.1109/eurmic.1999.794763
Abstract
Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes a framework for experimenting evaluating and integrating such techniques. As a test for the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing classification and annotation of audio files.Keywords
This publication has 11 references indexed in Scilit:
- Experiments in syllable-based recognition of continuous speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A perceptual pitch detectorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A hidden Markov model framework for video segmentation using audio and image featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Construction and evaluation of a robust multifeature speech/music discriminatorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- An overview of audio information retrievalMultimedia Systems, 1999
- Tempo and beat analysis of acoustic musical signalsThe Journal of the Acoustical Society of America, 1998
- SpeechSkimmerACM Transactions on Computer-Human Interaction, 1997
- Content-based classification, search, and retrieval of audioIEEE MultiMedia, 1996
- A comparative performance study of several pitch detection algorithmsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1976
- Linear prediction: A tutorial reviewProceedings of the IEEE, 1975