Abstract
Vergence ability is an important visual behavior observed on living creatures when they use vision to interact with the environment. The notion of active observer is equally useful for robotic vision systems on tasks like object tracking, fixation and 3D environment structure recovery. Humanoid robotics are a potential playground for such behaviors. This paper describes the implementation of a real time binocular vergence behavior using cepstral filtering to estimate stereo disparities. By implementing the cepstral filter on a graphics processing unit (GPU) using Compute Unified Device Architecture (CUDA) we demonstrate that robust parallel algorithms that used to require dedicated hardware are now available on common computers. The overall system is implemented in the binocular vision system IMPEP (IMPEP Integrated Multimodal Perception Experimental Platform) to illustrate the system performance experimentally.
Chapter PDF
Similar content being viewed by others
References
Yeshurun, Y., Schwartz, E.L.: Cepstral Filtering on a Columnar Image Architecture: A Fast Algorithm for Binocular Stereo Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 11(7), 759–767 (1989)
Coombs, D.: Real-time Gaze Holding in Binocular Robot Vision, PhD. Thesis, Department of Computer Science, University of Rochester (June 1992)
Kwon, K.-C., Lee, H.-S., Kim, N.: Hybrid Cepstral Filter for Rapid and Precise Vergence. Control of Parallel Stereoscopic Camera. Journal of the Research Institute for Computer and Information Communication 12(3) (December 2004)
NVIDIA CUDA C ProgrammingGuide 3.1, NVIDIA (2010)
OpenCV (Open Source Computer Vision)
Taylor, J.R., Olson, T., Martin, W.N.: Accurate vergence control in complex scenes. In: Proc. Computer Vision and Pattern Recognition, Seattle, USA, pp. 540–545 (June 1994)
Ferreira, J.F., Lobo, J., Dias, J.: Bayesian Real-Time Perception Algorithms on GPU - Real-Time Implementation of Bayesian Models for Multimodal Perception Using CUDA. Journal of Real-Time Image Processing (February 26, 2010); Special Issue, Springer Berlin/Heidelberg, published online (ISSN: 1861-8219)
Garland, M., Le Grand, S., Nickolls, J., Anderson, J., Hardwick, J., Morton, S., Phillips, E., Zhang, Y., Volkov, V.: Parallel computing experiences with CUDA. IEEE Micro. 28(4), 13–27 (2008)
CUDA: CUFFT Library, NVIDIA Corp (2010)
Olson, T.J., Coombs, D.: Real-Time Vergence Control for Binocular Robots. IJCV 7(1), 67–89 (1991)
Ballard, D.H.: Reference Frames for Animate Vision. In: International Joint Conference on Artificial Intelligence. AAAI, Menlo Park (1989)
Dias, J., Paredes, C., Fonseca, I., Araujo, H., Batista, J., Almeida, A.T.: Simulating Pursuit with Machine Experiments with Robots and Artificial Vision. IEEE Transactions on Robotics and Automation 3(1), 1–18 (1998)
Batista, J., Dias, J., Araujo, H., de Almeida, A.: The ISR Multi-Degree of Freedom Active Vision Robot Head: Design and Calibration. In: SMART Program Workshop, Instituto Superior Técnico, Lisboa, Portugal, April 27-28 (1995)
Brown, C.M.: The Rochester robot, Tech. Rep. 257, Dept. Comp. Sci., Univ. Rochester, NY (1988)
Mowforth, P., Siebert, J., Jin, Z., Urquhart, C.: A head called Richard. In: Proceedings of the British Machine Vision Conference 1990, Oxford, UK, pp. 361–366 (1990)
Betsis, D., Lavest, J.: Kinematic calibration of the KTH head-eye system. In: ISRN KTH (1994)
Natale, L., Metta, G., Sandini, G.: Development of auditory-evoked reflexes: Visuo-acoustic cues integration in a binocular head. Robotics and Autonomous Systems 39, 87–106 (2002)
Eklundh, J.-O., Bjrkman, M.: Recognition of objects in the real world from a systems perspectiveâ. Kuenstliche Intelligenz 19(2), 12–17 (2005)
Bernardino, A., Santos Victor, J.: Binocular tracking: integrating perception and control. IEEE Transactions on Robotics & Automation 15(6), 1080–1094 (1999)
Bogert, B., Healy, M., Tukey, J.W.: The Quefrency Alanysis of Time Series for Echoes: Cepstrum, Pseudo-autocovariance, Cross-Cepstrurn, and Saphe Cracking. In: Rosenblatt, M. (ed.) Proc. Symp. Time Series Analysis, pp. 209–243. John Wiley and Sons, Chichester (1963)
Scharstein, D., Szeliski, R.: A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms. International Journal of Computer Vision 47(1-3), 7–42 (2002)
Brown, M.Z., Burschka, D., Hager, G.D.: Advances in Computational Stereo. IEEE Trans. on Pattern Analysis and Machine Intelligence (PAMI) 25(8), 993–1008 (2003)
POP project (Perception on Purpose), number FP6-IST - 2004-027268, http://perception.inrialpes.fr/POP/
Wilming, N., Wolfsteller, F., König, P., Caseiro, R., Xavier, J., Araújo, H.: Attention Models for Vergence Movements based on the JAMF Framework and the POPEYE Robot. VISSAPP 2, 429–435 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 IFIP International Federation for Information Processing
About this paper
Cite this paper
Almeida, L., Menezes, P., Dias, J. (2011). Vergence Using GPU Cepstral Filtering. In: Camarinha-Matos, L.M. (eds) Technological Innovation for Sustainability. DoCEIS 2011. IFIP Advances in Information and Communication Technology, vol 349. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19170-1_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-19170-1_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19169-5
Online ISBN: 978-3-642-19170-1
eBook Packages: Computer ScienceComputer Science (R0)