An approach based on phonemes to large vocabulary Chinese sign language recognition
- 25 June 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Hitherto, the major challenge to sign language recognition is how to develop approaches that scale well with increasing vocabulary size. We present an approach to large vocabulary, continuous Chinese sign language (CSL) recognition that uses phonemes instead of whole signs as the basic units. Since the number of phonemes is limited, HMM-based training and recognition of the CSL signal becomes more tractable and has the potential to recognize enlarged vocabularies. Furthermore, the proposed method facilitates the CSL recognition when the finger-alphabet is blended with gestures. About 2400 phonemes are defined for CSL. One HMM is built for each phoneme, and then the signs are encoded based on these phonemes. A decoder that uses a tree-structured network is presented. Clustering of the Gaussians on the states, the language model and N-best-pass is used to improve the performance of the system. Experiments on a 5119 sign vocabulary are carried out, and the result is exciting.Keywords
This publication has 9 references indexed in Scilit:
- A real-time continuous gesture recognition system for sign languagePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- ASL recognition based on a coupling between HMMs and 3D motion analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Adapting hidden Markov models for ASL recognition by using three-dimensional computer vision methodsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- SIGN LANGUAGE RECOGNITION BASED ON HMM/ANN/DPInternational Journal of Pattern Recognition and Artificial Intelligence, 2000
- A review of large-vocabulary continuous-speechIEEE Signal Processing Magazine, 1996
- Recognition of space-time hand-gestures using hidden Markov modelPublished by Association for Computing Machinery (ACM) ,1996
- Glove-Talk: a neural network interface between a data-glove and a speech synthesizerIEEE Transactions on Neural Networks, 1993
- Image processing system for interpreting motion in American Sign LanguageJournal of Biomedical Engineering, 1992
- Estimation of probabilities from sparse data for the language model component of a speech recognizerIEEE Transactions on Acoustics, Speech, and Signal Processing, 1987