Sparse and shift-Invariant representations of music
- 19 December 2005
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Audio, Speech, and Language Processing
- Vol. 14 (1) , 50-57
- https://doi.org/10.1109/tsa.2005.860346
Abstract
Redundancy reduction has been proposed as the main computational process in the primary sensory pathways in the mammalian brain. This idea has led to the development of sparse coding techniques, which are exploited in this article to extract salient structure from musical signals. In particular, we use a sparse coding formulation within a generative model that explicitly enforces shift-invariance. Previous work has applied these methods to relatively small problem sizes. In this paper, we present a subset selection step to reduce the computational complexity of these methods, which then enables us to use the sparse coding approach for many real world applications. We demonstrate the algorithm's potential on two tasks in music analysis: the extraction of individual notes from polyphonic piano music and single-channel blind source separation.Keywords
This publication has 11 references indexed in Scilit:
- Sparse representations of polyphonic musicSignal Processing, 2006
- Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic InputsPublished by Springer Nature ,2004
- Adaptive sparseness for supervised learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Dictionary Learning Algorithms for Sparse RepresentationNeural Computation, 2003
- Efficient coding of natural soundsNature Neuroscience, 2002
- Learning Overcomplete RepresentationsNeural Computation, 2000
- Probabilistic framework for the adaptation and comparison of image codesJournal of the Optical Society of America A, 1999
- An affine scaling methodology for best basis selectionIEEE Transactions on Signal Processing, 1999
- Emergence of simple-cell receptive field properties by learning a sparse code for natural imagesNature, 1996
- Forming sparse representations by local anti-Hebbian learningBiological Cybernetics, 1990