An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 554-557 vol.2
- https://doi.org/10.1109/icassp.1993.319366
Abstract
A concept of waveform similarity for tackling the problem of time-scale modification of speech is proposed. It is worked out in the context of short-time Fourier transform representations. The resulting WSOLA (waveform-similarity-based synchronized overlap-add) algorithm produces high-quality speech output, is algorithmically and computationally efficient and robust, and allows for online processing with arbitrary time-scaling factors that may be specified in a time-varying fashion and can be chosen over a wide continuous range of values.Keywords
This publication has 4 references indexed in Scilit:
- High quality time-scale modification for speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- On the quality of speech produced by impulse driven linear systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphonesSpeech Communication, 1990
- Signal estimation from modified short-time Fourier transformIEEE Transactions on Acoustics, Speech, and Signal Processing, 1984