Multimodal people ID for a multimedia meeting browser

30 October 1999

conference paper
Published by Association for Computing Machinery (ACM)

p. 159-168
https://doi.org/10.1145/319463.319484

Abstract

A meeting browser is a system that allows users to review a multimedia meeting record from a variety of indexing methods. Identification of meeting participants is essential for creating such a multimedia meeting record. Moreover, knowing who is speaking can enhance the performance of speech recognition and indexing meeting transcription. In this paper, we present an approach that identifies meeting participants by fusing multimodal inputs. We use face ID, speaker ID, color appearance ID, and sound source directional ID to identify and track meeting. After describing the different modules in detail, we will discuss a framework for combining the information sources. Integration of the multimodal people ID into the multimedia meeting browser is in its preliminary stage.

Keywords

This publication has 11 references indexed in Scilit:

Face recognition using eigenfaces
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Visual tracking for multimodal human computer interaction
Published by Association for Computing Machinery (ACM) ,1998
Fusion of audio and video information for multi modal person authentication
Pattern Recognition Letters, 1997
Pfinder: real-time tracking of the human body
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1997
Human and machine recognition of faces: a survey
Proceedings of the IEEE, 1995
Robust text-independent speaker identification using Gaussian mixture speaker models
IEEE Transactions on Speech and Audio Processing, 1995
View-based and modular eigenspaces for face recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1994
Color indexing
International Journal of Computer Vision, 1991
Divergence measures based on the Shannon entropy
IEEE Transactions on Information Theory, 1991
Low-dimensional procedure for the characterization of human faces
Journal of the Optical Society of America A, 1987