Abstract
A novel solution to the challenge of automatic image annotation is described. Given an image together with the GPS coordinates of its capture location, our system returns a semantically rich annotation comprising tags which both identify the landmark in the image and provide an interesting fact about it, e.g. “A view of the Eiffel Tower, which was built in 1889 for an international exhibition in Paris”. The approach combines visual and textual web mining with content-based image analysis and natural language processing. In the first stage, an input image is matched against a set of community-contributed images (with keyword tags) on the basis of its GPS information and image classification techniques. The depicted landmark is inferred from the keyword tags of the matched set. The system then exploits the information about landmarks available on the web at large to extract a fact about the landmark in the image. We report component evaluation results from an implementation of our solution on a mobile device. Image localisation and matching achieves 93.6% classification accuracy; the selection of appropriate tags for use in annotation performs well (F1M of 0.59), and a correct toponym for use in captioning and fact extraction is subsequently identified automatically in 69.0% of the tested cases; finally, fact extraction returns an interesting caption in 78% of cases.
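To make the first annotation stage described above concrete, the following is a minimal, illustrative Python sketch, not the authors' implementation: community-contributed photos are filtered by GPS proximity to the query image, and a landmark toponym is inferred from the keyword tags of the matched set. The CommunityPhoto structure, the 1 km radius, and the tag-frequency heuristic are assumptions made purely for illustration; the visual matching step used in the actual system (e.g. local-feature comparison) is omitted here.

```python
# Illustrative sketch of the first pipeline stage: GPS-based filtering of
# community-contributed photos, then landmark inference from their tags.
# Data structures, radius and the majority-tag heuristic are assumptions.

from collections import Counter
from dataclasses import dataclass
from math import radians, sin, cos, asin, sqrt
from typing import List, Tuple

@dataclass
class CommunityPhoto:
    lat: float
    lon: float
    tags: List[str]  # keyword tags contributed by the community

def haversine_km(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(radians, (*a, *b))
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = sin(dlat / 2) ** 2 + cos(lat1) * cos(lat2) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))

def infer_landmark(query_gps: Tuple[float, float],
                   photos: List[CommunityPhoto],
                   radius_km: float = 1.0) -> str:
    """Return the most frequent tag among photos taken near the query location.

    A fuller system would also rank the GPS-filtered candidates by visual
    similarity before aggregating their tags; that step is omitted here.
    """
    nearby = [p for p in photos
              if haversine_km(query_gps, (p.lat, p.lon)) <= radius_km]
    tag_counts = Counter(tag.lower() for p in nearby for tag in p.tags)
    if not tag_counts:
        return "unknown landmark"
    return tag_counts.most_common(1)[0][0]

# Usage with made-up data:
photos = [
    CommunityPhoto(48.8584, 2.2945, ["eiffel tower", "paris"]),
    CommunityPhoto(48.8582, 2.2950, ["eiffel tower", "night"]),
    CommunityPhoto(48.8606, 2.3376, ["louvre", "paris"]),
]
print(infer_landmark((48.8583, 2.2944), photos))  # -> "eiffel tower"
```

In the full system, the second stage would then mine text from the web about the winning toponym and select an interesting fact from it to complete the caption.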
Cite this paper
Jones, G.J.F., Byrne, D., Hughes, M., O’Connor, N.E., Salway, A. (2011). Automated Annotation of Landmark Images Using Community Contributed Datasets and Web Resources. In: Declerck, T., Granitzer, M., Grzegorzek, M., Romanelli, M., Rüger, S., Sintek, M. (eds) Semantic Multimedia. SAMT 2010. Lecture Notes in Computer Science, vol 6725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23017-2_8