Extracting the Movement of Lip and Tongue During Articulation

Hanhoon Park¹⁸,
Seung-Wook Hong¹⁸,
Jong-Il Park¹⁸,
Sung-Kyun Moon¹⁹ &
…
Hyeongseok Ko²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3767))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1234 Accesses

Abstract

A method that extracts the 3-D shape and movement of lip and tongue and displays them simultaneously is presented. Lip movement is easily observable and thus extractable using a camera. However, it is difficult to extract the real movement of tongue exactly because the tongue may be occluded by the lip and teeth. In this paper, we use a magnetic resonance imaging (MRI) device to extract the sagittal view of the movement of tongue during articulation. Since the frame rate of the available MRI device is very low (5 fps), we obtain a smooth video sequence (20 fps) by a new contour-based interpolation method. The overall procedure of extracting the movement of lip and tongue is as follows. First, fiducial color markers attached on the lip are detected, and then the data of 3D movement of the lip are computed using a 3D reconstruction technique. Next, to extract the movement of tongue image, we applied a series of simple image processing algorithms to MRI images of tongue and then extracted the contour of tongue interactively. Finally, the data of lip and tongue are synchronized and temporally interpolated. An OpenGL based program is implemented to visualize the data interactively. We performed the experiment using the Korean basic syllables and some of the data are presented. It is confirmed that a lot of experiments using the results support theoretical and empirical observation of linguistics. The acquired data can be used not only as a fundamental database for scientific purpose but also as an educative material for language rehabilitation of the hearing-impaired. Also it can be used for making a high-quality lip-synchronized animation including tongue movement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Inner lips feature extraction based on CLNF with hybrid dynamic template for Cued Speech

Article Open access 19 December 2017

Tongue Mesh Extraction from 3D MRI Data of the Human Vocal Tract

An adaptive approach for lip-reading using image and depth data

Article 09 July 2015

References

Provine, J.A., Bruton, L.T.: Lip Synchronization in 3-D Model Based Coding for Video-Conferencing. In: Proc. of ISCAS, 1st edn., pp. 453–456 (1995)
Google Scholar
Chen, T., Rao, R.R.: Audio-visual Integration in Multimodal Communication. In: Proc. of the IEEE, vol. 86, pp. 837–852 (1998)
Google Scholar
Guenter, B., Grimm, C., Wood, D., Malvar, H., Pighin, F.: Making Faces. In: Proc. of SIGGRAPH., pp. 55–66 (1998)
Google Scholar
Hager, G.D., Belhumeur, P.N.: Real-time Tracking of Image Region with Changes in Geometry and Illumination. In: Proc. of CVPR., pp. 403–410 (1996)
Google Scholar
Matsino, K., Lee, C.W., Tsuji, S.: Automatic Recognition of Human Facial Expressions. In: Proc. of the IEEE., pp. 352–359 (1995)
Google Scholar
Pighin, F., Szeliski, R., Salesin, D.: Resynthesizing Facial Animation through 3D Model-Based Tracking. In: Proc. of ICCV., pp. 130–150 (1999)
Google Scholar
Laprie, Y., Berger, M.-O.: Extraction of Tongue Contours in X-ray Images with Minimal User Interaction. In: Proc. of International Conference on Spoken Language Processing, vol. 1, pp. 268–271 (1996)
Google Scholar
Akgul, Y.S., Kambhamettu, C., Stone, M.: Automatic Extraction and Tracking of the Tongue Contours. IEEE Trans. on Medical Imaging, 1035-1045 (1999)
Google Scholar
Unay, D.: Analysis of Tongue Motion Using Tagged Cine-MRI. Master Thesis, Bogazici University (2001)
Google Scholar
Stone, M., Davis, E., Nessaiver, M., Gullipalli, R., Levine, W., Lundberg, A.: Modeling Motion of the Internal Tongue from Tagged Cine-MRI Images. Journal of the Acoustical Society of America, 109(6), 2974–2982 (2001)
Article Google Scholar
Engwall, O., Beskow, J.: Resynthesis of 3D Tongue Movements from Facial Data. In: Proc. of Eurospeech., pp. 2261–2264 (2003)
Google Scholar
Engwall, O.: A 3D Tongue Model Based on MRI Data. In: Proc. of ICSLP, pp. 901–904 (2000)
Google Scholar
Hartley, R., Zisserman, A.: Mutiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2003)
Google Scholar
Dragonfly Technical Reference Manual, http://www.ptgrey.com
Zhang, Z.: A Flexible New Technique for Camera Calibration. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(11), 1330–1334 (2000)
Article Google Scholar
MAGNETOM Sonata*, http://www.healthcare.siemens.com
Korean Standard Pronunciation, http://natogi.new21.org/han/pyojun/p202.htm
The National Institute of the Korean Language, http://www.korean.go.kr

Download references

Author information

Authors and Affiliations

Division of Electrical and Computer Engineering, Hanyang University, Seoul, Korea
Hanhoon Park, Seung-Wook Hong & Jong-Il Park
Department of Otolaryngology, Ajou University, Suwon, Korea
Sung-Kyun Moon
School of Electrical Engineering, Seoul National University, Seoul, Korea
Hyeongseok Ko

Authors

Hanhoon Park
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Wook Hong
View author publications
You can also search for this author in PubMed Google Scholar
Jong-Il Park
View author publications
You can also search for this author in PubMed Google Scholar
Sung-Kyun Moon
View author publications
You can also search for this author in PubMed Google Scholar
Hyeongseok Ko
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu, 500-712, Gwangju, Korea
Yo-Sung Ho
Multimedia Security Lab, Korea University, Science Campus, 136-701, Seoul, Korea
Hyoung Joong Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, H., Hong, SW., Park, JI., Moon, SK., Ko, H. (2005). Extracting the Movement of Lip and Tongue During Articulation. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_75

Download citation

DOI: https://doi.org/10.1007/11581772_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Extracting the Movement of Lip and Tongue During Articulation

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Inner lips feature extraction based on CLNF with hybrid dynamic template for Cued Speech

Tongue Mesh Extraction from 3D MRI Data of the Human Vocal Tract

An adaptive approach for lip-reading using image and depth data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Extracting the Movement of Lip and Tongue During Articulation

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Inner lips feature extraction based on CLNF with hybrid dynamic template for Cued Speech

Tongue Mesh Extraction from 3D MRI Data of the Human Vocal Tract

An adaptive approach for lip-reading using image and depth data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation