An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, pp. 554-557 (ISSN 1520-6149)
https://doi.org/10.1109/icassp.1993.319366

Abstract

A concept of waveform similarity for tackling the problem of time-scale modification of speech is proposed. It is worked out in the context of short-time Fourier transform representations. The resulting WSOLA (waveform-similarity-based synchronized overlap-add) algorithm produces high-quality speech output, is algorithmically and computationally efficient and robust, and allows for online processing with arbitrary time-scaling factors that may be specified in a time-varying fashion and can be chosen over a wide continuous range of values.
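
The abstract names the technique but does not spell out its steps. The following is a minimal, dependency-free sketch of one common reading of WSOLA, not the authors' implementation. The function name `wsola` and the parameters `alpha`, `frame_len`, and `tol` are illustrative assumptions, as are the Hann window, the 50% overlap, the normalized cross-correlation used as the similarity measure, and the restriction to a constant scaling factor (the paper also allows time-varying factors). For each output frame, the sketch searches within `tol` samples of the nominal read position for the input segment that best matches the natural continuation of the previously copied segment, then overlap-adds it.

```python
import math

def wsola(x, alpha, frame_len=256, tol=32):
    """Time-scale x by alpha (alpha > 1 lengthens it) without resampling."""
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    hop = frame_len // 2  # output frames overlap by 50%
    # Periodic Hann window: copies shifted by `hop` sum to 1.
    win = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
           for n in range(frame_len)]
    out_len = int(round(len(x) * alpha))
    n_frames = out_len // hop + 1
    # Zero-pad so every candidate segment and template stays in range.
    xp = [0.0] * tol + list(x) + [0.0] * (tol + hop + frame_len)
    out = [0.0] * ((n_frames - 1) * hop + frame_len)
    norm = [0.0] * len(out)

    def similarity(a, b):
        """Normalized cross-correlation of the frames starting at a and b."""
        dot = ea = eb = 0.0
        for n in range(frame_len):
            u, v = xp[a + n], xp[b + n]
            dot += u * v
            ea += u * u
            eb += v * v
        return dot / (math.sqrt(ea * eb) + 1e-12)

    prev = tol  # start (in padded coordinates) of the last copied segment
    for k in range(n_frames):
        # Nominal read position for output frame k, clamped to the input.
        center = tol + min(int(round(k * hop / alpha)), len(x))
        start = center
        if k > 0:
            template = prev + hop  # natural continuation of the last segment
            best = -math.inf
            # Visit small shifts first so that ties keep the nominal position.
            for delta in sorted(range(-tol, tol + 1), key=abs):
                score = similarity(center + delta, template)
                if score > best:
                    best, start = score, center + delta
        for n in range(frame_len):
            out[k * hop + n] += win[n] * xp[start + n]
            norm[k * hop + n] += win[n]
        prev = start
    # Divide by the summed window weight; sample 0 has zero weight and stays 0.
    return [o / w if w > 1e-8 else 0.0
            for o, w in zip(out[:out_len], norm[:out_len])]
```

Under these assumptions, `wsola(samples, 1.5)` returns a list about 1.5 times as long as `samples` while leaving the local waveform shape, and hence the pitch, roughly unchanged. The search range `2 * tol + 1` should span at least one pitch period of the input. Practical code would likely vectorize the similarity search, for example with NumPy.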
