Multimodal integration-a statistical view
- 1 January 1999
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Multimedia
- Vol. 1 (4) , 334-341
- https://doi.org/10.1109/6046.807953
Abstract
This paper presents a statistical approach to developing multimodal recognition systems and, in particular, to integrating the posterior probabilities of parallel input signals involved in the multimodal system. We first identify the primary factors that influence multimodal recognition performance by evaluating the multimodal recognition probabilities. We then develop two techniques, an estimate approach and a learning approach, which are designed to optimize accurate recognition during the multimodal integration process. We evaluate these methods using Quickset, a speech/gesture multimodal system, and report evaluation results based on an empirical corpus collected with Quickset. From an architectural perspective, the integration technique presented here offers enhanced robustness. It also is premised on more realistic assumptions than previous multimodal systems using semantic fusion. From a methodological standpoint, the evaluation techniques that we describe provide a valuable tool for evaluating multimodal systems.Keywords
This publication has 17 references indexed in Scilit:
- Toward multimodal human-computer interfaceProceedings of the IEEE, 1998
- On combining classifiersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1998
- Integration and synchronization of input modes during multimodal human-computer interactionPublished by Association for Computing Machinery (ACM) ,1997
- Unification-based multimodal integrationPublished by Association for Computational Linguistics (ACL) ,1997
- Multimodal interfacesArtificial Intelligence Review, 1996
- Multimodal interfaces for dynamic interactive mapsPublished by Association for Computing Machinery (ACM) ,1996
- “Finger-Pointer”: Pointing interface by image processingComputers & Graphics, 1994
- An Overview of Predictive Learning and Function ApproximationPublished by Springer Nature ,1994
- Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern RecognitionPublished by Springer Nature ,1990
- “Put-that-there”ACM SIGGRAPH Computer Graphics, 1980