Abstract
As directories of named places, gazetteers link the names to geographic footprints and place types. Most existing gazetteers are managed strictly top-down: entries can only be added or changed by the responsible toponymic authority. The covered vocabulary is therefore often limited to an administrative view on places, using only official place names. In this paper, we propose a bottom-up approach for gazetteer building based on geotagged photos harvested from the web. We discuss the building blocks of a geotag and how they relate to each other to formally define the notion of a geotag. Based on this formalization, we introduce an extraction process for gazetteer entries that captures the emergent semantics of collections of geotagged photos and provides a group-cognitive perspective on named places. Using an experimental setup based on clustering and filtering algorithms, we demonstrate how to identify place names and assign adequate geographic footprints. The results for three different place names (Soho, Camino de Santiago and Kilimanjaro), representing different geographic feature types, are evaluated and compared to the results obtained from traditional gazetteers. Finally, we sketch how our approach can be combined with other (for example, linguistic) approaches and discuss how such a bottom-up gazetteer can complement existing gazetteers.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Point Cloud
- Delaunay Triangulation
- Volunteer Geographic Information
- Place Type
- Spatial Data Infrastructure
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Jones, C.B., Purves, R.S., Clough, P.D., Joho, H.: Modelling vague places with knowledge from the web. International Journal of Geographical Information Science 22(10), 1045–1065 (2008)
Larson, R.R.: Geographic information retrieval and spatial browsing. GIS and Libraries: Patrons, Maps and Spatial Information, 81–124 (April 1996)
Goodchild, M.F.: Citizens as voluntary sensors: Spatial data infrastructure in the world of web 2.0. International Journal of Spatial Data Infrastructures Research 2, 24–32 (2007)
Bennett, B., Mallenby, D., Third, A.: An ontology for grounding vague geographic terms. In: Eschenbach, C., Gruninger, M. (eds.) Proceedings of the 5th International Conference on Formal Ontology in Information Systems (FOIS 2008). IOS Press, Amsterdam (2008)
Henrich, A., Lüdecke, V.: Determining geographic representations for arbitrary concepts at query time. In: LOCWEB 2008: Proceedings of the first international workshop on Location and the web, pp. 17–24. ACM, New York (2008)
McConchie, A.: The great pop vs. soda controversy (2002), http://popvssoda.com (last visited august 1st, 2009)
Keßler, C., Janowicz, K., Bishr, M.: An agenda for the next generation gazetteer: Geographic information contribution and retrieval. In: ACM GIS 2009, Seattle, WA, USA, November 4–6. ACM, New York (2009)
Wilske, F.: Approximation of neighborhood boundaries using collaborative tagging systems. In: Pebesma, E., Bishr, M., Bartoschek, T. (eds.) GI-Days 2008. ifgiPrints, vol. 32, pp. 179–187 (2008)
Guo, Q., Liu, Y., Wieczorek, J.: Georeferencing locality descriptions and computing associated uncertainty using a probabilistic approach. International Journal of Geographical Information Science 22(10), 1067–1090 (2008)
Heuer, J.T., Dupke, S.: Towards a spatial search engine using geotags. In: Probst, F., Keßler, C. (eds.) GI-Days 2007 – Young Researchers Conference. ifgiPrints, vol. 30, pp. 199–204 (2007)
Aberer, K., Mauroux, P.C., Ouksel, A.M., Catarci, T., Hacid, M.S., Illarramendi, A., Kashyap, V., Mecella, M., Mena, E., Neuhold, E.J., et al.: Emergent semantics principles and issues. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 25–38. Springer, Heidelberg (2004)
Stahl, G.: Group Cognition: Computer Support for Building Collaborative Knowledge (Acting with Technology). MIT Press, Cambridge (2006)
Raubal, M.: Cognitive engineering for geographic information science. Geography Compass 3(3), 1087–1104 (2009)
Surowiecki, J.: The Wisdom of Crowds. Anchor, New York (2005)
Schlieder, C.: Modeling collaborative semantics with a geographic recommender. In: Hainaut, J.-L., Rundensteiner, E.A., Kirchberg, M., Bertolotto, M., Brochhausen, M., Chen, Y.-P.P., Cherfi, S.S.-S., Doerr, M., Han, H., Hartmann, S., Parsons, J., Poels, G., Rolland, C., Trujillo, J., Yu, E., Zimányie, E. (eds.) ER Workshops 2007. LNCS, vol. 4802, pp. 338–347. Springer, Heidelberg (2007)
Janowicz, K., Keßler, C.: The role of ontology in improving gazetteer interaction. International Journal of Geographical Information Science 22(10), 1129–1157 (2008)
Hill, L.L.: Georeferencing: The Geographic Associations of Information (Digital Libraries and Electronic Publishing). MIT Press, Cambridge (2006)
Casati, R., Varzi, A.C.: Parts and Places. The Structures of Spatial Representation. MIT Press, Cambridge (1999)
Goodchild, M.F., Hill, L.L.: Introduction to digital gazetteer research. International Journal of Geographical Information Science 22(10), 1039–1044 (2008)
Hastings, J.T.: Automated conflation of digital gazetteer data. International Journal of Geographical Information Science 22, 1109–1127 (2008)
Uryupina, O.: Semi-supervised learning of geographical gazetteers from the internet. In: Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references, Morristown, NJ, USA, Association for Computational Linguistics, pp. 18–25 (2003)
Goldberg, D.W., Wilson, J.P., Knoblock, C.A.: Extracting geographic features from the internet to automatically build detailed regional gazetteers. International Journal of Geographical Information Science 23(1), 93–128 (2009)
Bishr, M., Kuhn, W.: Geospatial information bottom-up: A matter of trust and semantics. In: Fabrikant, S., Wachowicz, M. (eds.) The European Information Society – Leading the Way with Geo-information (Proceedings of AGILE 2007), Aalborg, DK. Lecture Notes in Geoinformation and Cartography, pp. 365–387. Springer, Heidelberg (2007)
Guszlev, A., Lukács, L.: Folksonomy & landscape regions. In: Probst, F., Keßler, C. (eds.) GI-Days 2007 – Young Researchers Conference. ifgiPrints 30, pp. 193–197 (2007)
Gruber, T.: Ontology of folksonomy: A mash-up of apples and oranges. International Journal on Semantic Web & Information Systems 3 (2007), http://tomgruber.org/writing/ontology-of-folksonomy.htm (November 2005)
Frank, A.: Ontology for spatio-temporal databases. In: Sellis, T.K., Koubarakis, M., Frank, A., Grumbach, S., Güting, R.H., Jensen, C., Lorentzos, N.A., Manolopoulos, Y., Nardelli, E., Pernici, B., Theodoulidis, B., Tryfona, N., Schek, H.-J., Scholl, M.O. (eds.) Spatio-Temporal Databases. LNCS, vol. 2520, pp. 9–77. Springer, Heidelberg (2003)
Goodchild, M.F.: Geographical data modeling. Computational Geosciences 18(4), 401–408 (1992)
Saeed, J.I.: Semantics (Introducing Linguistics). Wiley-Blackwell (2003)
Searle, J.R.: Proper names. Mind 67(266), 166–173 (1958)
Codd, E.F.: A relational model of data for large shared data banks. Communications of the ACM 13(6), 377–387 (1970)
O’connor, M., Tu, S., Nyulas, C., Das, A., Musen, M.: Querying the semantic web with SWRL, pp. 155–159 (2007)
Shirky, C.: Ontology is overrated – categories, links, and tags. Essay (2005), http://shirky.com/writings/ontology_overrated.html
Edelsbrunner, H., Kirkpatrick, D., Seidel, R.: On the shape of a set of points in the plane. IEEE Transactions on Information Theory 29(4), 551–559 (1983)
Edelsbrunner, H., Mücke, E.: Three-dimensional alpha shapes. ACM Transactions on Graphics 13(1), 43–72 (1994)
Allen, R.: A query interface for an event gazetteer. In: Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, pp. 72–73 (2004)
Mostern, R., Johnson, I.: From named place to naming event: creating gazetteers for history. International Journal of Geographical Information Science 22(10), 1091–1108 (2008)
Hägerstrand, T.: What about people in regional science? Papers in Regional Science 24(1), 6–21 (1970)
Miller, H.J.: A measurement theory for time geography. Geographical Analysis 37, 17–45 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Keßler, C., Maué, P., Heuer, J.T., Bartoschek, T. (2009). Bottom-Up Gazetteers: Learning from the Implicit Semantics of Geotags. In: Janowicz, K., Raubal, M., Levashkin, S. (eds) GeoSpatial Semantics. GeoS 2009. Lecture Notes in Computer Science, vol 5892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10436-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-10436-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10435-0
Online ISBN: 978-3-642-10436-7
eBook Packages: Computer ScienceComputer Science (R0)