Probabilistic fusion of stereo with color and contrast for bilayer segmentation
- 24 July 2006
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence
- Vol. 28 (9) , 1480-1492
- https://doi.org/10.1109/tpami.2006.193
Abstract
This paper describes models and algorithms for the real-time segmentation of foreground from background layers in stereo video sequences. Automatic separation of layers from color/contrast or from stereo alone is known to be error-prone. Here, color, contrast, and stereo matching information are fused to infer layers accurately and efficiently. The first algorithm, layered dynamic programming (LDP), solves stereo in an extended six-state space that represents both foreground/background layers and occluded regions. The stereo-match likelihood is then fused with a contrast-sensitive color model that is learned on-the-fly and stereo disparities are obtained by dynamic programming. The second algorithm, layered graph cut (LGC), does not directly solve stereo. Instead, the stereo match likelihood is marginalized over disparities to evaluate foreground and background hypotheses and then fused with a contrast-sensitive color model like the one used in LDP. Segmentation is solved efficiently by ternary graph cut. Both algorithms are evaluated with respect to ground truth data and found to have similar performance, substantially better than either stereo or color/contrast alone. However, their characteristics with respect to computational efficiency are rather different. The algorithms are demonstrated in the application of background substitution and shown to give good quality composite video outputKeywords
This publication has 26 references indexed in Scilit:
- Efficient Dense Stereo with Occlusions for New View-Synthesis by Four-State Dynamic ProgrammingInternational Journal of Computer Vision, 2006
- Estimating Disparity and Occlusions in Stereo Video SequencesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- "GrabCut"ACM Transactions on Graphics, 2004
- Gaze manipulation for one-to-one teleconferencingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Interactive graph cuts for optimal boundary & region segmentation of objects in N-D imagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Fast approximate energy minimization via graph cutsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2001
- A pixel dissimilarity measure that is insensitive to image samplingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1998
- A Bayesian approach to binocular steropsisInternational Journal of Computer Vision, 1996
- A Maximum Likelihood Stereo AlgorithmComputer Vision and Image Understanding, 1996
- Occlusions and binocular stereoInternational Journal of Computer Vision, 1995