skip to main content
research-article

Efficient Processing of Spatial Group Keyword Queries

Published:30 June 2015Publication History
Skip Abstract Section

Abstract

With the proliferation of geo-positioning and geo-tagging techniques, spatio-textual objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However, the queries studied so far generally focus on finding individual objects that each satisfy a query rather than finding groups of objects where the objects in a group together satisfy a query.

We define the problem of retrieving a group of spatio-textual objects such that the group's keywords cover the query's keywords and such that the objects are nearest to the query location and have the smallest inter-object distances. Specifically, we study three instantiations of this problem, all of which are NP-hard. We devise exact solutions as well as approximate solutions with provable approximation bounds to the problems. In addition, we solve the problems of retrieving top-k groups of three instantiations, and study a weighted version of the problem that incorporates object weights. We present empirical studies that offer insight into the efficiency of the solutions, as well as the accuracy of the approximate solutions.

References

  1. Einat Amitay, Nadav Harel, Ron Sivan, and Aya Soffer. 2004. Web-a-where: Geotagging web content. In Proceedings of the 27th Annual ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR'04). 273--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Esther M. Arkin and Refael Hassin. 2000. Minimum-diameter covering problems. Netw. 36, 3, 147--155.Google ScholarGoogle ScholarCross RefCross Ref
  3. Franz Aurenhammer and Herbert Edelsbrunner. 1984. An optimal algorithm for constructing the weighted voronoi diagram in the plane. Patt. Recogn. 17, 2, 251--257.Google ScholarGoogle ScholarCross RefCross Ref
  4. Kenneth Bøgh, Anders Skovsgaard, and Christian S. Jensen. 2013. GroupFinder: A new approach to op-k point-of-interest group retrieval. Proc. VLDB Endow. 6, 12, 1226--1229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Xin Cao, Lisi Chen, Gao Cong, Christian S. Jensen, Qiang Qu, Anders Skovsgaard, Dingming Wu, and Man Lung Yiu. 2012b. Spatial keyword querying. In Proceedings of the 31st International Conference on Conceptual Modelling (ER'12). 16--29. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Xin Cao, Lisi Chen, Gao Cong, and Xiaokui Xiao. 2012a. Keyword-aware optimal route search. Proc. VLDB Endow. 5, 11, 1136--1147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Xin Cao, Gao Cong, and Christian S. Jensen. 2010. Retrieving top-k prestige-based relevant spatial web objects. Proc. VLDB Endow. 3, 1, 373--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xin Cao, Gao Cong, Christian S. Jensen, and Beng Chin Ooi. 2011. Collective spatial keyword querying. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'11). 373--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Xin Cao, Gao Cong, Christian S. Jensen, and Man Lung Yiu. 2014. Retrieving regions of interest for user exploration. Proc. VLDB Endow. 7, 9, 733--744. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lisi Chen, Gao Cong, Christian S. Jensen, and Dingming Wu. 2013. Spatial keyword query processing: An experimental evaluation. Proc. VLDB Endow. 6, 3, 217--228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Yen-Yu Chen, Torsten Suel, and Alexander Markowetz. 2006. Efficient query processing in geographic web search engines. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'06). 277--288. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Vaclav Chvatal. 1979. A greedy heuristic for the set-covering problem. Math. Oper. Res. 4, 233--235.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Gao Cong, Christian S. Jensen, and Dingming Wu. 2009. Efficient retrieval of the top-k most relevant spatial web objects. Proc. VLDB Endow. 2, 1, 337--348. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Bin Cui, Hong Mei, and Beng Chin Ooi. 2014. Big data: The driver for innovation in databases. Nat. Sci. Rev. 1, 1, 27--30.Google ScholarGoogle ScholarCross RefCross Ref
  15. Ian De Felipe, Vagelis Hristidis, and Naphtali Rishe. 2008. Keyword search on spatial databases. In Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE'08). 656--665. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Junyan Ding, Luis Gravano, and Narayanan Shivakumar. 2000. Computing geographical scopes of webresources. In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB'00). 545--556. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Tao Guo, Xin Cao, and Gao Cong. 2015. Efficient algorithms for answering the m-closest keywords query. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'15). To appear. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Antonin Guttman. 1984. R-trees: A dynamic index structure for spatial searching. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'84). 47--57. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Ramaswamy Hariharan, Bijit Hore, Chen Li, and Sharad Mehrotra. 2007. Processing spatial-keyword (sk) queries in geographic information retrieval (gir) systems. In Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM'07). 16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Han Hu, Yonggang Wen, Tat-Seng Chua, and Xuelong Li. 2014. Toward scalable systems for big data analytics: A technology tutorial. IEEE Access 2, 652--687.Google ScholarGoogle ScholarCross RefCross Ref
  21. Ali Khodaei, Cyrus Shahabi, and Chen Li. 2010. Hybrid indexing and seamless ranking of spatial and textual features of web documents. In Proceedings of the 21st International Conference on Database and Expert Systems Applications (DEXA'10). 450--466. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Theodoros Lappas, Kun Liu, and Evimaria Terzi. 2009. Finding a team of experts in social networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). 467--476. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Feifei Li, Dihan Cheng, Marios Hadjieleftheriou, George Kollios, and Shang-Hua Teng. 2005. On trip planning queries in spatial databases. In Proceedings of the 9th International Conference on Advances in Spatial and Temporal Databases (SSTD'05). 273--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhisheng Li, Ken C. K. Lee, Baihua Zheng, Wang-Chien Lee, Dik Lun Lee, and Xufa Wang. 2011. IR-tree: An efficient index for geographic document search. IEEE Trans. Knowl. Data Engin. 23, 4, 585--599. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Cheng Long, Raymond Chi-Wing Wong, Ke Wang, and Ada Wai-Chee Fu. 2013. Collective spatial keyword queries: A distance owner-driven approach. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'13). 689--700. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Kevin S. McCurley. 2001. Geospatial mapping and navigation of the web. In Proceedings of the 10th International Conference on World Wide Web (WWW'01). 221--229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Joao B. Rocha-Junior and Kjetil Nørvag. 2012. Top-k spatial keyword queries on road networks. In Proceedings of the 15th International Conference on Extending Database Technology (EDBT'12). 168--179. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Mehdi Sharifzadeh, Mohammad R. Kolahdouzan, and Cyrus Shahabi. 2008. The optimal sequenced route query. VLDB J. 17, 4, 765--787. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Subodh Vaid, Christopher B. Jones, Hideo Joho, and Mark Sanderson. 2005. Spatio-textual indexing for geographical search on the Web. In Proceedings of the 9th International Conference on Advances in Spatial and Temporal Databases (SSTD'05). 218--235. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Dingming Wu, Gao Cong, and Christian S. Jensen. 2012a. A framework for efficient spatial web object retrieval. VLDB J. 21, 6, 797--822. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Dingming Wu, Man Lung Yiu, Gao Cong, and Christian S. Jensen. 2012b. Joint top-k spatial keyword query processing. IEEE Trans. Knowl. Data Engin. 24, 10, 1889--1903. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Dingming Wu, Man Lung Yiu, Christian S. Jensen, and Gao Cong. 2011. Efficient continuously moving top-k spatial keyword query processing. In Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE'11). 541--552. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. De-Nian Yang, Chih-Ya Shen, Wang-Chien Lee, and Ming-Syan Chen. 2012. On socio-spatial group query for location-based social networks. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'12). 949--957. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Chengxiang Zhai and John Lafferty. 2004. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. 22, 2, 179--214. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Dongxiang Zhang, Yeow Meng Chee, Anirban Mondal, Anthony K. H. Tung, and Masaru Kitsuregawa. 2009. Keyword search in spatial databases: Towards searching by document. In Proceedings of the 25th International Conference on Data Engineering (ICDE'09). 688--699. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Dongxiang Zhang, Beng Chin Ooi, and Anthony K. H. Tung. 2010. Locating mapped resources in Web 2.0. In Proceedings of the 26th International Conference on Data Engineering (ICDE'10). 521--532.Google ScholarGoogle Scholar
  37. Yinghua Zhou, Xing Xie, Chuang Wang, Yuchang Gong, and Wei-Ying Ma. 2005. Hybrid index structures for location-based web search. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM'05). 155--162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Justin Zobel and Alistair Moffat. 2006. Inverted files for text search engines. ACM Comput. Surv. 38, 2, 6. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient Processing of Spatial Group Keyword Queries

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Database Systems
        ACM Transactions on Database Systems  Volume 40, Issue 2
        June 2015
        283 pages
        ISSN:0362-5915
        EISSN:1557-4644
        DOI:10.1145/2799368
        Issue’s Table of Contents

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 June 2015
        • Revised: 1 March 2015
        • Accepted: 1 March 2015
        • Received: 1 October 2013
        Published in tods Volume 40, Issue 2

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader