Segmenting Highly Articulated Video Objects with Weak-Prior Random Forests

Hwann-Tzong Chen¹⁹,
Tyng-Luh Liu¹⁹ &
Chiou-Shann Fuh²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3954))

Included in the following conference series:

European Conference on Computer Vision

4668 Accesses
5 Citations

Abstract

We address the problem of segmenting highly articulated video objects in a wide variety of poses. The main idea of our approach is to model the prior information of object appearance via random forests. To automatically extract an object from a video sequence, we first build a random forest based on image patches sampled from the initial template. Owing to the nature of using a randomized technique and simple features, the modeled prior information is considered weak, but on the other hand appropriate for our application. Furthermore, the random forest can be dynamically updated to generate prior probabilities about the configurations of the object in subsequent image frames. The algorithm then combines the prior probabilities with low-level region information to produce a sequence of figure-ground segmentations. Overall, the proposed segmentation technique is useful and flexible in that one can easily integrate different cues and efficiently select discriminating features to model object appearance and handle various articulations.

Download to read the full chapter text

Chapter PDF

Semi-supervised Video Segmentation Using Decision Forests

Latent-Class Hough Forests for 3D Object Detection and Pose Estimation

Easy Minimax Estimation with Random Forests for Human Pose Estimation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Amit, Y., Geman, D.: Shape quantization and recognition with randomized trees. Neural Computation 9, 1545–1588 (1997)
Article Google Scholar
Blake, A., Isard, M.: Active Contours: The Application of Techniques from Graphics, Vision, Control Theory and Statistics to Visual Tracking of Shapes in Motion. Springer, Heidelberg (1998)
Book Google Scholar
Blake, A., Rother, C., Brown, M., Perez, P., Torr, P.: Interactive image segmentation using an adaptive GMMRF model. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 428–441. Springer, Heidelberg (2004)
Chapter Google Scholar
Borenstein, E., Ullman, S.: Class-specific, top-down segmentation. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2351, pp. 109–122. Springer, Heidelberg (2002)
Chapter Google Scholar
Boykov, Y., Veksler, O., Zabih, R.: Markov random fields with efficient approximations. In: CVPR, pp. 648–655 (1998)
Google Scholar
Boykov, Y.Y., Jolly, M.P.: Interactive graph cuts for optimal boundary and region segmentation of objects in n-d images. In: ICCV, vol. 1, pp. 105–112 (2001)
Google Scholar
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Article MATH Google Scholar
Chan, T., Zhu, W.: Level set based shape prior segmentation. In: CVPR, vol. 2, pp. 1164–1170 (2005)
Google Scholar
Cremers, D., Tischhauser, F., Weickert, J., Schnorr, C.: Diffusion snakes: Introducing statistical shape knowledge into the Mumford-Shah functional. IJCV 50(3), 295–313 (2002)
Article MATH Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Chichester (2001)
MATH Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. IJCV 61(1), 55–79 (2005)
Article Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR, vol. 2, pp. 264–271 (2003)
Google Scholar
Fowlkes, C.C., Martin, D.R., Malik, J.: Learning affinity functions for image segmentation: Combining patch-based and gradient-based approaches. In: CVPR, vol. 2, pp. 54–61 (2003)
Google Scholar
Gavrila, D.M.: Pedestrian detection from a moving vehicle. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 37–49. Springer, Heidelberg (2000)
Chapter Google Scholar
Geman, S., Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. PAMI 6(6), 721–741 (1984)
Article MATH Google Scholar
The Berkeley Segmentation Dataset and Benchmark, http://www.cs.berkeley.edu/projects/vision/grouping/segbench/
The Berkeley Segmentation Engine, http://www.cs.berkeley.edu/~fowlkes/BSE/cvpr-segs/
Kumar, M.P., Torr, P.H.S., Zisserman, A.: OBJ CUT. In: CVPR, vol. 1, pp. 18–25 (2005)
Google Scholar
Lepetit, V., Lagger, P., Fua, P.: Randomized trees for real-time keypoint recognition. In: CVPR, vol. 2, pp. 775–781 (2005)
Google Scholar
Malik, J., Belongie, S., Leung, T., Shi, J.: Contour and texture analysis for image segmentation. IJCV 43(1), 7–27 (2001)
Article MATH Google Scholar
Martin, D.R., Fowlkes, C.C., Tal, D., Malik, J.: A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: ICCV, vol. 2, pp. 416–423 (2001)
Google Scholar
McInerney, T., Terzopoulos, D.: Deformable models in medical image analysis: A survey. MIA 1(2), 91–108 (1996)
Google Scholar
Mori, G., Ren, X., Efros, A.A., Malik, J.: Recovering human body configurations: Combining segmentation and recognition. In: CVPR, vol. 2, pp. 326–333 (2004)
Google Scholar
Mumford, D., Shah, J.: Boundary detection by minimizing functionals. In: CVPR, pp. 22–26 (1985)
Google Scholar
Paragios, N., Deriche, R.: Coupled geodesic active regions for image segmentation: A level set approach. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 224–240. Springer, Heidelberg (2000)
Chapter Google Scholar
Ren, X., Fowlkes, C., Malik, J.: Cue integration for figure/ground labeling. In: NIPS 18 (2005)
Google Scholar
Rother, C., Kolmogorov, V., Blake, A.: GrabCut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)
Article Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. PAMI 22(8), 888–905 (2000)
Article Google Scholar
Toyama, K., Blake, A.: Probabilistic tracking in a metric space. In: ICCV, vol. 2, pp. 50–57 (2001)
Google Scholar
Tu, Z., Chen, X., Yuille, A.L., Zhu, S.C.: Image parsing: Unifying segmentation, detection, and recognition. In: ICCV, pp. 18–25 (2003)
Google Scholar
Viola, P., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: CVPR, vol. 1, pp. 511–518 (2001)
Google Scholar
Wu, Y., Zhang, A.: Adaptive pattern discovery for interactive multimedia retrieval. In: CVPR, vol. 2, pp. 649–655 (2003)
Google Scholar
Yedidia, J.S., Freeman, W.T., Weiss, Y.: Understanding belief propagation and its generalizations. In: Exploring Artificial Intelligence in the New Millennium, pp. 239–269 (2003)
Google Scholar
Yu, S.X., Shi, J.: Object-specific figure-ground segregation. In: CVPR, vol. 2, pp. 39–45 (2003)
Google Scholar
Yuille, A.L., Cohen, D.S., Hallinan, P.W.: Feature extraction from faces using deformable templates. IJCV 8(2), 99–111 (1992)
Article Google Scholar
Zhu, S.C., Yuille, A.: Region competition: Unifying snakes, region growing, and Bayes/MDL for multiband image segmentation. PAMI 18(9), 884–900 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan
Hwann-Tzong Chen & Tyng-Luh Liu
Department of CSIE, National Taiwan University, Taipei, 106, Taiwan
Chiou-Shann Fuh

Authors

Hwann-Tzong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tyng-Luh Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chiou-Shann Fuh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria
Axel Pinz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, HT., Liu, TL., Fuh, CS. (2006). Segmenting Highly Articulated Video Objects with Weak-Prior Random Forests. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744085_29

Download citation

DOI: https://doi.org/10.1007/11744085_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33838-3
Online ISBN: 978-3-540-33839-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Segmenting Highly Articulated Video Objects with Weak-Prior Random Forests

Abstract

Chapter PDF

Similar content being viewed by others

Semi-supervised Video Segmentation Using Decision Forests

Latent-Class Hough Forests for 3D Object Detection and Pose Estimation

Easy Minimax Estimation with Random Forests for Human Pose Estimation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Segmenting Highly Articulated Video Objects with Weak-Prior Random Forests

Abstract

Chapter PDF

Similar content being viewed by others

Semi-supervised Video Segmentation Using Decision Forests

Latent-Class Hough Forests for 3D Object Detection and Pose Estimation

Easy Minimax Estimation with Random Forests for Human Pose Estimation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation