YIN, a fundamental frequency estimator for speech and music
Top Cited Papers
- 1 April 2002
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 111 (4) , 1917-1930
- https://doi.org/10.1121/1.1458024
Abstract
An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that combine to prevent errors. The algorithm has several desirable features. Error rates are about three times lower than the best competing methods, as evaluated over a database of speech recorded together with a laryngograph signal. There is no upper limit on the frequency search range, so the algorithm is suited for high-pitched voices and music. The algorithm is relatively simple and may be implemented efficiently and with low latency, and it involves few parameters that must be tuned. It is based on a signal model (periodic signal) that may be extended in several ways to handle various forms of aperiodicity that occur in particular applications. Finally, interesting parallels may be drawn with models of auditory processing.Keywords
This publication has 28 references indexed in Scilit:
- Multiple period estimation and pitch perception modelSpeech Communication, 1999
- Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in soundsSpeech Communication, 1999
- Virtual pitch and phase sensitivity of a computer model of the auditory periphery. I: Pitch identificationThe Journal of the Acoustical Society of America, 1991
- Musical frequency tracking using the methods of conventional and “narrowed” autocorrelationThe Journal of the Acoustical Society of America, 1991
- Pitch detection with a neural-net classifierIEEE Transactions on Signal Processing, 1991
- Calculation of a ‘‘narrowed’’ autocorrelation functionThe Journal of the Acoustical Society of America, 1989
- Measurement of pitch in speech: An implementation of Goldstein’s theory of pitch perceptionThe Journal of the Acoustical Society of America, 1982
- Average magnitude difference function pitch extractorIEEE Transactions on Acoustics, Speech, and Signal Processing, 1974
- Existence Region of the Tonal Residue. IThe Journal of the Acoustical Society of America, 1962
- The Perception of Repeated Bursts of NoiseThe Journal of the Acoustical Society of America, 1948