DOI: 10.1145/1772690.1772732
research-article

Equip tourists with knowledge mined from travelogues

Published: 26 April 2010

ABSTRACT

With the prosperity of tourism and Web 2.0 technologies, more and more people are willing to share their travel experiences on the Web (e.g., weblogs, forums, or Web 2.0 communities). These so-called travelogues contain rich information, particularly location-representative knowledge such as attractions (e.g., Golden Gate Bridge), styles (e.g., beach, history), and activities (e.g., diving, surfing). The location-representative information in travelogues can greatly facilitate other tourists' trip planning if it can be correctly extracted and summarized. However, since most travelogues are unstructured and contain much noise, it is difficult for ordinary users to utilize such knowledge effectively. In this paper, to mine location-representative knowledge from a large collection of travelogues, we propose a probabilistic topic model, named the Location-Topic model. This model has two advantages: (1) it differentiates between two kinds of topics, i.e., local topics, which characterize locations, and global topics, which represent other common themes shared by various locations, and (2) it represents locations in the local topic space to encode both location-representative knowledge and similarities between locations. Some novel applications are developed based on the proposed model, including (1) destination recommendation for flexible queries, (2) characteristic summarization for a given destination with representative tags and snippets, and (3) identification of informative parts of a travelogue and enrichment of such highlights with related images. Based on a large collection of travelogues, the proposed framework is evaluated using both objective and subjective evaluation methods and shows promising results.
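As a rough illustration of what "representing locations in the local topic space" could look like in practice, the sketch below treats each location as a vector of weights over a few local topics and ranks other locations by similarity to a query location. Everything in it is an assumption made for illustration: the topic labels, the numbers, the function names, and the use of cosine similarity are not taken from the paper, which may define similarity differently.

```python
import math

# Hypothetical local-topic weights per location (made-up numbers).
# Each vector is read as weights over three assumed local topics:
# [beach, history, diving]. A real model would learn these from travelogues.
LOCATION_TOPICS = {
    "Honolulu": [0.70, 0.10, 0.20],
    "Cancun": [0.60, 0.10, 0.30],
    "Rome": [0.05, 0.90, 0.05],
}


def cosine_similarity(p, q):
    """Cosine similarity between two equal-length topic vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    if norm_p == 0.0 or norm_q == 0.0:
        return 0.0
    return dot / (norm_p * norm_q)


def similar_locations(query, location_topics, top_k=2):
    """Rank all other locations by similarity to the query location."""
    query_vec = location_topics[query]
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in location_topics.items()
        if name != query
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    for name, score in similar_locations("Honolulu", LOCATION_TOPICS):
        print(f"{name}: {score:.3f}")
```

With these toy vectors, a beach-heavy query location ranks the other beach-heavy location above the history-heavy one, which is the kind of "similar destination" behavior the abstract describes at a high level.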


Published in

WWW '10: Proceedings of the 19th international conference on World wide web
April 2010
1407 pages
ISBN: 9781605587998
DOI: 10.1145/1772690

Copyright © 2010 International World Wide Web Conference Committee (IW3C2)

Publisher

Association for Computing Machinery
New York, NY, United States

Acceptance Rates

Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
