Abstract
In this paper, we propose a gesture modeling system based on computer vision in order to recognize a gesture naturally without any trouble between a system and a user using real-time 3D modeling information on multiple objects. It recognizes a gesture after 3D modeling and analyzing the information pertaining to the user’s body shape in stereo views for human movement. In the 3D-modeling step, 2D information is extracted from each view by using an adaptive color difference detector. Potential objects such as faces, hands, and feet are labeled by using the information from 2D detection. We identify reliable objects by comparing the similarities of the potential objects that are obtained from both the views. We acquire information on 2D tracking from the selected objects by using the Kalman filter and reconstruct it as a 3D gesture. A joint of each part of a body is generated in the combined objects. We experimented on ambiguities using occlusion, clutter, and irregular 3D gestures to analyze the efficiency of the proposed system. In this experiment, the proposed gesture modeling system showed a good detection and a processing time of 30 frames per second, which can be used in a real-time.
Chapter PDF
Similar content being viewed by others
References
Venkatesh Babu, R., Ramakrishnan, K.R.: Recognition of human actions using motion history information extracted from the compressed video. Image and vision computing 22, 597–607 (2004)
Aggarwal, J.K., Cai, Q.: Human motion analysis: a review. Computer Vision and Image Understanding 73(3), 428–440 (1999)
Ayers, D., Shah, M.: Monitoring human behavior from video taken in an office environment. Image and Vision Computing 19, 833–846 (2001)
Davis, J.: Hierarchical motion history images for recognizing human motion. IEEE Workshop on Detection and Recognition of Event in Video, 39–46 (2001)
Tanie, H., Yamane, K., Nakamura, Y.: High Marker Density Motion Capture by Retroreflective Mesh Suit. In: International Conference on Robotics and Automation, pp. 2884–2889 (2005)
Hashi, S., Tokunaga, Y., Yabukami, S., Toyada, M., Ishiyama, K., Okazaki, Y., Arai, K.I.: Development of real-time and highly accurate wireless motion capture system utilizing soft magnetic core. IEEE Transactions on Magnetics 41, 4191–4193 (2005)
Miller, N., Jenkins, O.C., Kallmann, M., Mataric, M.J.: Motion capture from inertial sensing for untethered humanoid teleoperation. In: IEEE/RAS International Conference on Humanoid Robots, vol. 2, pp. 547–562 (2004)
Yabukami, S., Kikuchi, H., Yamaguchi, M.: Motion Capture System of Magnetic Makers Using Three-Axial Magnetic Field Sensor. IEEE Transactions on magnetics 36, 3646–3648 (2000)
Wren, C.R., Azarbayejani, A., Darrell, T., Pentland, A.P.: Pfinder: Real-Time Tracking of the Human Body. Pattern Analysis and Machine Intelligence 19, 780–785 (1997)
Date, N., Arita, D., Yonemoto, S., Taniguchi, R.: Performance Evaluation of Vision-based Real-time Motion Capture. In: Proceedings of the International Parallel and Distributed Processing Symposium, vol. 8 (2003)
Moeslund, T.B., Granum, E.: Multiple Cues used in Model-Based Human Motion Capture. Automatic Face and Gesture Recognition, 362–367 (2000)
Esteban, C.H., Schmitt, F.: Silhouette and stereo fusion for 3D object modeling. Computer Vision and Image Understanding 96, 367–392 (2004)
Li, L., Hilton, A., Illingworth, J.: A relaxation algorithm for real-time multiple view 3D-tracking. Image and vision computing 20, 841–859 (2002)
Sundaresan, A., Chellappa, R.: Markerless Motion Capture using Multiple Cameras. Computer Vision for Interactive and Intelligent Environment, pp. 15–26 (2005)
Saito, H., Baba, S., Kimura, M., Vedula, S., Kanade, T.: Appearance-Based Virtual View Generation of Temporally-Varying Events from Multi-Camera Images in the 3D Room. IEEE Transactions on 5, 303–316 (2003)
Cheung, K.M., Baker, S., Kanade, T.: Shape-From-Silhouette Across Time Part II: Applications to Human Modeling and Markerless Motion Tracking. International Journal of Computer Vision 63, 225–245 (2005)
Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, pp. 356–395. Prentice-Hall, Englewood Cliffs (2002)
Vapnik, V.: The Nature of Statistical Learning Theory, 2nd edn. Springer, Heidelberg (2001)
Kang, B.D., Kim, J.H., Kim, S.K., et al.: Effective Face Detection using a Small Quantity of Training Data. In: Chang, L.-W., Lie, W.-N. (eds.) PSIVT 2006. LNCS, vol. 4319, pp. 553–562. Springer, Heidelberg (2006)
Welch, G., Bishop, G.: An Introduction to the Kalman filter. University of North Carolina at Chapel Hill, Department of Computer Science, TR 95-041 (2004)
Jain, R., Kasturi, R., Schunck, B.G.: Machine vision, pp. 289–298. McGraw-Hill, Inc., New York (1995)
Sonka, M., Hlavac, V., Boyle, R.: Image processing analysis and machine vision. PWS, 441–507 (1998)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, BD. et al. (2007). Human Motion Modeling Using Multivision. In: Jacko, J.A. (eds) Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments. HCI 2007. Lecture Notes in Computer Science, vol 4552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73110-8_72
Download citation
DOI: https://doi.org/10.1007/978-3-540-73110-8_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73108-5
Online ISBN: 978-3-540-73110-8
eBook Packages: Computer ScienceComputer Science (R0)