The Timbre Toolbox: Extracting audio descriptors from musical signals
Top Cited Papers
- 1 November 2011
- journal article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 130 (5) , 2902-2916
- https://doi.org/10.1121/1.3642604
Abstract
The analysis of musical signals to extract audio descriptors that can potentially characterize their timbre has been disparate and often too focused on a particular small set of sounds. The Timbre Toolbox provides a comprehensive set of descriptors that can be useful in perceptual research, as well as in music information retrieval and machine-learning approaches to content-based retrieval in large sound databases. Sound events are first analyzed in terms of various input representations (short-term Fourier transform, harmonic sinusoidal components, an auditory model based on the equivalent rectangular bandwidth concept, the energy envelope). A large number of audio descriptors are then derived from each of these representations to capture temporal, spectral, spectrotemporal, and energetic properties of the sound events. Some descriptors are global, providing a single value for the whole sound event, whereas others are time-varying. Robust descriptive statistics are used to characterize the time-varying descriptors. To examine the information redundancy across audio descriptors, correlational analysis followed by hierarchical clustering is performed. This analysis suggests ten classes of relatively independent audio descriptors, showing that the Timbre Toolbox is a multidimensional instrument for the measurement of the acoustical structure of complex sound signals.Keywords
This publication has 27 references indexed in Scilit:
- A sawtooth waveform inspired pitch estimator for speech and musicThe Journal of the Acoustical Society of America, 2008
- Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tonesThe Journal of the Acoustical Society of America, 2005
- YIN, a fundamental frequency estimator for speech and musicThe Journal of the Acoustical Society of America, 2002
- Feature dependence in the automatic identification of musical woodwind instrumentsThe Journal of the Acoustical Society of America, 2001
- Isolating the dynamic attributes of musical timbrea)The Journal of the Acoustical Society of America, 1993
- Transform coding of audio signals using perceptual noise criteriaIEEE Journal on Selected Areas in Communications, 1988
- The perceptual attack time of musical tonesThe Journal of the Acoustical Society of America, 1987
- Comparing partitionsJournal of Classification, 1985
- Perceptual effects of spectral modifications on musical timbresThe Journal of the Acoustical Society of America, 1978
- Multidimensional perceptual scaling of musical timbresThe Journal of the Acoustical Society of America, 1977