Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study
- 1 April 1998
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 103 (4), 2185-2196
- https://doi.org/10.1121/1.421364
Abstract
The performance of two techniques is compared for automated recognition of bird song units from continuous recordings. The advantages and limitations of dynamic time warping (DTW) and hidden Markov models (HMMs) are evaluated on a large database of male songs of zebra finches (Taeniopygia guttata) and indigo buntings (Passerina cyanea), which have different types of vocalizations and have been recorded under different laboratory conditions. Depending on the quality of recordings and complexity of song, the DTW-based technique gives excellent to satisfactory performance. Under challenging conditions such as noisy recordings or presence of confusing short-duration calls, good performance of the DTW-based technique requires careful selection of templates that may demand expert knowledge. Because HMMs are trained, equivalent or even better performance of HMMs can be achieved based only on segmentation and labeling of constituent vocalizations, albeit with many more training examples than DTW templates. One weakness in HMM performance is the misclassification of short-duration vocalizations or song units with more variable structure (e.g., some calls, and syllables of plastic songs). To address these and other limitations, new approaches for analyzing bird vocalizations are discussed.
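As a rough illustration of the template-matching idea behind the DTW-based technique, the sketch below aligns a segment against labeled templates and picks the closest one. It is a minimal textbook DTW, not the authors' implementation: the one-scalar-per-frame features, the absolute-difference local cost, and the names `dtw_distance`, `classify`, `syllable_a`, and `call_b` are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of scalar frame features.

    Assumption: each frame is a single number and the local cost is the
    absolute difference; a real system would likely use spectral vectors.
    """
    n, m = len(a), len(b)
    # cost[i][j] = cheapest cumulative cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Allowed steps: advance in a only, in b only, or in both.
            cost[i][j] = local + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m]


def classify(segment, templates):
    """Return the label of the template with the smallest DTW distance."""
    return min(templates, key=lambda label: dtw_distance(segment, templates[label]))


# Hypothetical templates: one hand-picked example per song unit.
templates = {
    "syllable_a": [1.0, 2.0, 3.0],
    "call_b": [5.0, 5.0, 1.0],
}

# A time-stretched version of syllable_a still matches its template.
label = classify([1.0, 1.0, 2.0, 3.0, 3.0], templates)
```

Because the result depends entirely on which examples are stored in `templates`, the sketch also hints at why template selection matters under the challenging conditions the abstract describes.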