skip to main content
10.1145/1316948.1316968acmconferencesArticle/Chapter ViewAbstractPublication PagesgirConference Proceedingsconference-collections
research-article

Geographic co-occurrence as a tool for gir.

Published:09 November 2007Publication History

ABSTRACT

In this paper we describe the development of a geographic co-occurrence model and how it can be applied to geographic information retrieval. The model consists of mining co-occurrences of placenames from Wikipedia, and then mapping these placenames to locations in the Getty Thesaurus of Geographical Names. We begin by quantifying the accuracy of our model and compute theoretical bounds for the accuracy achievable when applied to placename disambiguation in free text. We conclude with a discussion of the improvement such a model could provide for placename disambiguation and geographic relevance ranking over traditional methods.

References

  1. E. Agichtein and S. Cucerzan. Predicting accuracy of extracting information from unstructured text collections. In Proceedings of CIKM, pages 567--568, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. Amitay, N. Har'El, R. Silvan, and A. Soffer. Web-a-where: Geotagging web content. In Proceedings of SIGIR, pages 273--280, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Bunescu and M. Paşca. Using encycopedic knowledge for named entity disambiguation. In Proceedings of EACL, pages 9--16, 2006.Google ScholarGoogle Scholar
  4. D. Buscaldi, P. Rosso, and P. Garcia. Inferring geographic ontologies from multiple resources for geographic information retrieval. In SIGIR Workshop on GIR, pages 52--55, 2006.Google ScholarGoogle Scholar
  5. P. Clough, M. Sanderson, and H. Joho. Extraction of semantic annotations from textual web pages. Technical report, University of Sheffield, 2004.Google ScholarGoogle Scholar
  6. T. Cover and J. Thomas. Elements of Information Theory. Wiley, 1st edition, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Cucerzan. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of EMNLP-CoNLL, 2007.Google ScholarGoogle Scholar
  8. M. Egenhofer and D. Mark. Naive geography. In Proceedings of COSIT, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  9. M. Egenhofer and A. Shariff. Metric details for natural-language spatial relations. Journal of the ACM TOIS, 4:295--321, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proceedings of IJCAI, pages 1606--1611, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. W. Gale, K. Church, and D. Yarowsky. One sense per discourse. In DARPA Speech and Natural Language Workshop, pages 233--237, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. E. Garbin and I. Mani. Disambiguating toponyms in news. In Proceedings of HLT/EMNLP, pages 363--370, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. P. Harping. User's Guide to the TGN Data Releases. The Getty Vocabulary Program, 2.0 edition, 2000.Google ScholarGoogle Scholar
  14. L. Hill. Core elements of digital gazetteers: Placenames, categories, and footprints. In Proceedings of ECDL, pages 280--290, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Leveling, S. Hartrumpf, and D. Veiel. University of Hagen at GeoCLEF 2005: Using semantic networks for interpreting geographical queries. In Working Notes for the CLEF Workshop, 2005.Google ScholarGoogle Scholar
  16. H. Li, R. Srihari, C. Niu, and W. Li. InfoXtract location normalization: A hybrid approach to geographic references in information extraction. In HLT-NAACL Workshop on Analysis of Geographic References, pages 39--44, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. B. Martins, N. Cardoso, M. Chaves, L. Andrade, and M. Silva. The University of Lisbon at GeoCLEF 2006. In Working Notes for the CLEF Workshop, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. National Geospatial-Intelligence Agency. http://earth-info.nga.mil/gns/html/. Accessed 15 June 2007.Google ScholarGoogle Scholar
  19. National Geospatial-Intelligence Agency. http://www.nga.mil/. Accessed 15 June 2007.Google ScholarGoogle Scholar
  20. S. Overell and S. Rüger. Identifying and grounding descriptions of places. In SIGIR Workshop on GIR, pages 14--16, 2006.Google ScholarGoogle Scholar
  21. H. Raghavan, J. Allan, and A. McCallum. An exploration of entity models, collective classification and relation description. In KDD Workshop on Link Analysis and Group Detection, pages 1--10, 2004.Google ScholarGoogle Scholar
  22. E. Rauch, M. Bukatin, and K. Baker. A confidence-based framework for disambiguating geographic terms. In HLT-NAACL Workshop on Analysis of Geographic References, pages 50--54, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Sanderson and J. Kohler. Analyzing geographic queries. In SIGIR Workshop on GIR, 2004.Google ScholarGoogle Scholar
  24. C. Schlieder, T. Vögele, and U. Visser. Qualitative spatial representations for information retrieval by gazetteers. In {\em Proceedings of COSIT}, pages 336--351, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Smith and G. Mann. Bootstrapping toponym classifiers. In HLT-NAACL Workshop on Analysis of Geographic References, pages 45--49, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. T. Steinberg. http://www.placeopedia.com. Accessed 15 June 2007.Google ScholarGoogle Scholar
  27. M. Strube and S. P. Ponzetto. WikiRelate! Computing semantic relatedness using Wikipedia. In Proceedings of AAAI--06, pages 1419--1424, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. N. Wacholder, Y. Ravin, and M. Choi. Disambiguation of proper names in text. In Proceedings of ANLP, pages 202--208, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. G. Weaver, B. Strickland, and G. Crane. Quantifying the accuracy of relational statements in Wikipedia: a methodology. In Proceedings of JCDL, pages 358--358, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Wikipedia. http://www.wikipedia.org. Accessed 15 June 2007.Google ScholarGoogle Scholar
  31. D. Yarowsky. One sense per collocation. In ARPA Human Language and Technology Workshop, pages 266--271, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. W. Zong, D. Wu, A. Sun, E. Lim, and D. Goh. On assigning place names to geography related web pages. In Proceedings of JCDL, pages 354--362, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Geographic co-occurrence as a tool for gir.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GIR '07: Proceedings of the 4th ACM workshop on Geographical information retrieval
      November 2007
      104 pages
      ISBN:9781595938282
      DOI:10.1145/1316948

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 November 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate46of61submissions,75%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader