Vocal quality factors: Analysis, synthesis, and perception
- 1 November 1991
- journal article
- research article
- Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America
- Vol. 90 (5), 2394-2410
- https://doi.org/10.1121/1.402044
Abstract
The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important for characterizing the glottal excitations for the four voice types: the glottal pulse width, the glottal pulse skewness, the abruptness of glottal closure, and the turbulent noise component. The significance of these factors for voice synthesis was studied and a new voice source model that accounted for certain physiological aspects of vocal fold motion was developed and tested using speech synthesis. Perceptual listening tests were conducted to evaluate the auditory effects of the source model parameters upon synthesized speech. The effects of the spectral slope of the source excitation, the shape of the glottal excitation pulse, and the characteristics of the turbulent noise source were considered. Applications for these research results include synthesis of natural sounding speech, synthesis and modeling of vocal disorders, and the development of speaker independent (or adaptive) speech recognition systems.
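As a rough illustration of how three of the four factors named in the abstract can be turned into synthesis parameters, the sketch below generates one period of a glottal flow pulse. It is not the source model developed in the paper, which the abstract does not specify; it assumes a generic Rosenberg-style pulse (raised-cosine opening, quarter-cosine closing). The function name `glottal_cycle` and the parameters `open_quotient` (pulse width), `speed_quotient` (skewness, the ratio of opening time to closing time), and `noise_level` (turbulent noise) are hypothetical names chosen for this example. Abruptness of closure is not modeled separately here; in this simple pulse it is tied to the length of the closing phase.

```python
import math
import random


def glottal_cycle(n_samples, open_quotient=0.6, speed_quotient=2.0,
                  noise_level=0.0, rng=None):
    """Return one period of a Rosenberg-style glottal flow pulse.

    Illustrative only: a generic textbook pulse shape, not the
    model from the paper. All parameter names are assumptions.
    """
    rng = rng or random.Random(0)  # fixed seed keeps the sketch repeatable

    # Pulse width: portion of the period during which the glottis is open.
    open_len = open_quotient * n_samples
    # Skewness: split the open phase into opening and closing parts.
    rise_len = open_len * speed_quotient / (1.0 + speed_quotient)
    fall_len = open_len - rise_len

    cycle = []
    for n in range(n_samples):
        if n < rise_len:
            # Opening phase: raised cosine from 0 up to 1.
            flow = 0.5 * (1.0 - math.cos(math.pi * n / rise_len))
        elif n < open_len:
            # Closing phase: quarter cosine from 1 down to 0.
            flow = math.cos(0.5 * math.pi * (n - rise_len) / fall_len)
        else:
            # Closed phase: no flow.
            flow = 0.0
        # Turbulent noise, assumed here to scale with the instantaneous
        # flow, so it vanishes while the glottis is closed.
        flow += noise_level * flow * rng.gauss(0.0, 1.0)
        cycle.append(flow)
    return cycle
```

For example, `glottal_cycle(100)` gives a noise-free pulse that is open for 60 of 100 samples, while `glottal_cycle(100, open_quotient=0.8, noise_level=0.1)` gives a wider pulse with added noise. These values are chosen only for illustration and are not measurements reported for any of the four voice types.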