Abstract
Many web documents refer to specific geographic localities and many people include geographic context in queries to web search engines. Standard web search engines treat the geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This can be overcome by associating text indexing with spatial indexing methods that exploit geo-tagging procedures to categorise documents with respect to geographic space. We describe three methods for spatio-textual indexing based on multiple spatially indexed text indexes, attaching spatial indexes to the document occurrences of a text index, and merging text index access results with results of access to a spatial index of documents. These schemes are compared experimentally with a conventional text index search engine, using a collection of geo-tagged web documents, and are shown to be able to compete in speed and storage performance with pure text indexing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amitay, E., et al.: Web-a-where: geotagging web content. In: 27th ACM SIGIR Conference, pp. 273–280 (2004)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley, Reading (1999)
Buyukokkten, O., et al.: Exploiting geographical location information of web pages. In: WebDB 1999 (with ACM SIGMOD 1999) (1999)
Cunningham, H., et al.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: 40th Anniversary Meeting of Assoc. for Computational Linguistics, ACL 2002 (2002)
Ding, J., Gravano, L., Shivakumar, N.: Computing Geographical Scopes of Web Resources. In: 26th Int. Conf. on Very Large Data Bases (VLDB), pp. 545–556 (2000)
GoogleLocal, http://www.local.google.com
Jones, C.B., Abdelmoty, A.I., Finch, D., Fu, G., Vaid, S.: The SPIRIT Spatial Search Engine:Architecture, Ontologies and Spatial Indexing. In: Egenhofer, M.J., Freksa, C., Miller, H.J. (eds.) GIScience 2004. LNCS, vol. 3234, pp. 125–139. Springer, Heidelberg (2004)
Jones, C.B., Abdelmoty, A.I., Fu, G.: Maintaining ontologies for geographical information retrieval on the web. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) CoopIS 2003, DOA 2003, and ODBASE 2003. LNCS, vol. 2888, pp. 934–951. Springer, Heidelberg (2003)
Jones, C.B., et al.: Spatial information retrieval and geographical ontologies an overview of the SPIRIT project. In: Proc ACM SIGIR 2002, pp. 387–388 (2002)
Kornai, A., Sundheim, B. (eds.): HLT-NAACL Workshop on Analysis of Geographic References (2003)
van Kreveld, M., Reinbacher, I., Arampatzis, A., van Zwol, R.: Distributed Ranking Methods for Geographic Information Retrieval. In: Fisher, P.F. (ed.) Developments in Spatial Data Handling, pp. 231–243. Springer, Heidelberg (2004)
McCurley, K.S.: Geospatial mapping and navigation on the web. In: WWW10 Conference (2001), http://www10.org/cdrom/papers/278/
Mirago, http://www.mirago.com
NorthernLight, http://www.northernlight.com
Purves, R., Jones, C.B.: Workshop on Geographic Information Retrieval, SIGIR (2004), http://www.sigir.org/forum/2004D/purves_sigirforum_2004d.pdf
Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In: ACM SIGIR 1994, pp. 232–241 (1994)
Sagara, T., Kitsuregawa, M.: Yellow Page driven Methods of Collecting and Scoring Spatial Web Documents. In: SIGIR Workshop on Geographical Information Retrieval (2004), http://www.geo.unizh.ch/~rsp/gir/
Sanderson, M., Kohler, J.: Analyzing geographic queries. In: SIGIR Workshop on Geographic Information Retrieval (2004), http://www.geo.unizh.ch/~rsp/gir/
Silva, M.J., et al.: Adding Geographic Scopes to Web Resources. In: SIGIR Workshop on Geographical Information Retrieval (2004), http://www.geo.unizh.ch/~rsp/gir/
SPIRIT, http://www.geo-spirit.org/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vaid, S., Jones, C.B., Joho, H., Sanderson, M. (2005). Spatio-textual Indexing for Geographical Search on the Web. In: Bauzer Medeiros, C., Egenhofer, M.J., Bertino, E. (eds) Advances in Spatial and Temporal Databases. SSTD 2005. Lecture Notes in Computer Science, vol 3633. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11535331_13
Download citation
DOI: https://doi.org/10.1007/11535331_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28127-6
Online ISBN: 978-3-540-31904-7
eBook Packages: Computer ScienceComputer Science (R0)