The use of emphasis to automatically summarize a spoken discourse
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 229-232 vol.1
- https://doi.org/10.1109/icassp.1992.225930
Abstract
The authors describe a method for exploiting prosodic information in natural, conventional speed for the purpose of automatically creating an audio summary. The method is based on identifying emphasized speech and then using proximity measures on the emphasized regions to select summarizing excerpts. Emphasized speech is recognized using a hidden Markov model using only non-spectral, periodic information. Syllable-based models were created and the models trained on spontaneous speech in which words had been labeled by a panel of listeners for degree of emphasis. Emphatic speech from one speaker was automatically detected and summarizing excerpts were identified, with no noticeable difference when compared to excerpts selected by individual subjects. The extensibility of the emphasis detector to other speakers was tested on a small sample of telephone speech by ten other speakers.Keywords
This publication has 12 references indexed in Scilit:
- An integrated pitch tracking algorithm for speech systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A statistical approach to the segmentation and broad classification of continuous speech into phrase-sized information unitsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Isolated word intonation recognition using hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A real-time Mandarin dictation machine for Chinese language with unlimited texts and very large vocabularyPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The frequency scale of speech intonationThe Journal of the Acoustical Society of America, 1991
- Fundamental frequency and perceived prominence of accented syllablesThe Journal of the Acoustical Society of America, 1991
- Lexical stress estimation and phonological knowledgeComputer Speech & Language, 1990
- Recognition of isolated prosodic patterns using Hidden Markov ModelsComputer Speech & Language, 1987
- Speech intonation and focus location in matched statements and questionsThe Journal of the Acoustical Society of America, 1986
- Phonological Features of Intonational PeaksLanguage, 1983