Abstract
With the proliferation of geo-positioning and geo-tagging techniques, spatio-textual objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However, the queries studied so far generally focus on finding individual objects that each satisfy a query rather than finding groups of objects where the objects in a group together satisfy a query.
We define the problem of retrieving a group of spatio-textual objects such that the group's keywords cover the query's keywords and such that the objects are nearest to the query location and have the smallest inter-object distances. Specifically, we study three instantiations of this problem, all of which are NP-hard. We devise exact solutions as well as approximate solutions with provable approximation bounds to the problems. In addition, we solve the problems of retrieving top-k groups of three instantiations, and study a weighted version of the problem that incorporates object weights. We present empirical studies that offer insight into the efficiency of the solutions, as well as the accuracy of the approximate solutions.
- Einat Amitay, Nadav Harel, Ron Sivan, and Aya Soffer. 2004. Web-a-where: Geotagging web content. In Proceedings of the 27th Annual ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR'04). 273--280. Google ScholarDigital Library
- Esther M. Arkin and Refael Hassin. 2000. Minimum-diameter covering problems. Netw. 36, 3, 147--155.Google ScholarCross Ref
- Franz Aurenhammer and Herbert Edelsbrunner. 1984. An optimal algorithm for constructing the weighted voronoi diagram in the plane. Patt. Recogn. 17, 2, 251--257.Google ScholarCross Ref
- Kenneth Bøgh, Anders Skovsgaard, and Christian S. Jensen. 2013. GroupFinder: A new approach to op-k point-of-interest group retrieval. Proc. VLDB Endow. 6, 12, 1226--1229. Google ScholarDigital Library
- Xin Cao, Lisi Chen, Gao Cong, Christian S. Jensen, Qiang Qu, Anders Skovsgaard, Dingming Wu, and Man Lung Yiu. 2012b. Spatial keyword querying. In Proceedings of the 31st International Conference on Conceptual Modelling (ER'12). 16--29. Google ScholarDigital Library
- Xin Cao, Lisi Chen, Gao Cong, and Xiaokui Xiao. 2012a. Keyword-aware optimal route search. Proc. VLDB Endow. 5, 11, 1136--1147. Google ScholarDigital Library
- Xin Cao, Gao Cong, and Christian S. Jensen. 2010. Retrieving top-k prestige-based relevant spatial web objects. Proc. VLDB Endow. 3, 1, 373--384. Google ScholarDigital Library
- Xin Cao, Gao Cong, Christian S. Jensen, and Beng Chin Ooi. 2011. Collective spatial keyword querying. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'11). 373--384. Google ScholarDigital Library
- Xin Cao, Gao Cong, Christian S. Jensen, and Man Lung Yiu. 2014. Retrieving regions of interest for user exploration. Proc. VLDB Endow. 7, 9, 733--744. Google ScholarDigital Library
- Lisi Chen, Gao Cong, Christian S. Jensen, and Dingming Wu. 2013. Spatial keyword query processing: An experimental evaluation. Proc. VLDB Endow. 6, 3, 217--228. Google ScholarDigital Library
- Yen-Yu Chen, Torsten Suel, and Alexander Markowetz. 2006. Efficient query processing in geographic web search engines. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'06). 277--288. Google ScholarDigital Library
- Vaclav Chvatal. 1979. A greedy heuristic for the set-covering problem. Math. Oper. Res. 4, 233--235.Google ScholarDigital Library
- Gao Cong, Christian S. Jensen, and Dingming Wu. 2009. Efficient retrieval of the top-k most relevant spatial web objects. Proc. VLDB Endow. 2, 1, 337--348. Google ScholarDigital Library
- Bin Cui, Hong Mei, and Beng Chin Ooi. 2014. Big data: The driver for innovation in databases. Nat. Sci. Rev. 1, 1, 27--30.Google ScholarCross Ref
- Ian De Felipe, Vagelis Hristidis, and Naphtali Rishe. 2008. Keyword search on spatial databases. In Proceedings of the 24th IEEE International Conference on Data Engineering (ICDE'08). 656--665. Google ScholarDigital Library
- Junyan Ding, Luis Gravano, and Narayanan Shivakumar. 2000. Computing geographical scopes of webresources. In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB'00). 545--556. Google ScholarDigital Library
- Tao Guo, Xin Cao, and Gao Cong. 2015. Efficient algorithms for answering the m-closest keywords query. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'15). To appear. Google ScholarDigital Library
- Antonin Guttman. 1984. R-trees: A dynamic index structure for spatial searching. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'84). 47--57. Google ScholarDigital Library
- Ramaswamy Hariharan, Bijit Hore, Chen Li, and Sharad Mehrotra. 2007. Processing spatial-keyword (sk) queries in geographic information retrieval (gir) systems. In Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM'07). 16. Google ScholarDigital Library
- Han Hu, Yonggang Wen, Tat-Seng Chua, and Xuelong Li. 2014. Toward scalable systems for big data analytics: A technology tutorial. IEEE Access 2, 652--687.Google ScholarCross Ref
- Ali Khodaei, Cyrus Shahabi, and Chen Li. 2010. Hybrid indexing and seamless ranking of spatial and textual features of web documents. In Proceedings of the 21st International Conference on Database and Expert Systems Applications (DEXA'10). 450--466. Google ScholarDigital Library
- Theodoros Lappas, Kun Liu, and Evimaria Terzi. 2009. Finding a team of experts in social networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). 467--476. Google ScholarDigital Library
- Feifei Li, Dihan Cheng, Marios Hadjieleftheriou, George Kollios, and Shang-Hua Teng. 2005. On trip planning queries in spatial databases. In Proceedings of the 9th International Conference on Advances in Spatial and Temporal Databases (SSTD'05). 273--290. Google ScholarDigital Library
- Zhisheng Li, Ken C. K. Lee, Baihua Zheng, Wang-Chien Lee, Dik Lun Lee, and Xufa Wang. 2011. IR-tree: An efficient index for geographic document search. IEEE Trans. Knowl. Data Engin. 23, 4, 585--599. Google ScholarDigital Library
- Cheng Long, Raymond Chi-Wing Wong, Ke Wang, and Ada Wai-Chee Fu. 2013. Collective spatial keyword queries: A distance owner-driven approach. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'13). 689--700. Google ScholarDigital Library
- Kevin S. McCurley. 2001. Geospatial mapping and navigation of the web. In Proceedings of the 10th International Conference on World Wide Web (WWW'01). 221--229. Google ScholarDigital Library
- Joao B. Rocha-Junior and Kjetil Nørvag. 2012. Top-k spatial keyword queries on road networks. In Proceedings of the 15th International Conference on Extending Database Technology (EDBT'12). 168--179. Google ScholarDigital Library
- Mehdi Sharifzadeh, Mohammad R. Kolahdouzan, and Cyrus Shahabi. 2008. The optimal sequenced route query. VLDB J. 17, 4, 765--787. Google ScholarDigital Library
- Subodh Vaid, Christopher B. Jones, Hideo Joho, and Mark Sanderson. 2005. Spatio-textual indexing for geographical search on the Web. In Proceedings of the 9th International Conference on Advances in Spatial and Temporal Databases (SSTD'05). 218--235. Google ScholarDigital Library
- Dingming Wu, Gao Cong, and Christian S. Jensen. 2012a. A framework for efficient spatial web object retrieval. VLDB J. 21, 6, 797--822. Google ScholarDigital Library
- Dingming Wu, Man Lung Yiu, Gao Cong, and Christian S. Jensen. 2012b. Joint top-k spatial keyword query processing. IEEE Trans. Knowl. Data Engin. 24, 10, 1889--1903. Google ScholarDigital Library
- Dingming Wu, Man Lung Yiu, Christian S. Jensen, and Gao Cong. 2011. Efficient continuously moving top-k spatial keyword query processing. In Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE'11). 541--552. Google ScholarDigital Library
- De-Nian Yang, Chih-Ya Shen, Wang-Chien Lee, and Ming-Syan Chen. 2012. On socio-spatial group query for location-based social networks. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'12). 949--957. Google ScholarDigital Library
- Chengxiang Zhai and John Lafferty. 2004. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. 22, 2, 179--214. Google ScholarDigital Library
- Dongxiang Zhang, Yeow Meng Chee, Anirban Mondal, Anthony K. H. Tung, and Masaru Kitsuregawa. 2009. Keyword search in spatial databases: Towards searching by document. In Proceedings of the 25th International Conference on Data Engineering (ICDE'09). 688--699. Google ScholarDigital Library
- Dongxiang Zhang, Beng Chin Ooi, and Anthony K. H. Tung. 2010. Locating mapped resources in Web 2.0. In Proceedings of the 26th International Conference on Data Engineering (ICDE'10). 521--532.Google Scholar
- Yinghua Zhou, Xing Xie, Chuang Wang, Yuchang Gong, and Wei-Ying Ma. 2005. Hybrid index structures for location-based web search. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM'05). 155--162. Google ScholarDigital Library
- Justin Zobel and Alistair Moffat. 2006. Inverted files for text search engines. ACM Comput. Surv. 38, 2, 6. Google ScholarDigital Library
Index Terms
- Efficient Processing of Spatial Group Keyword Queries
Recommendations
Collective spatial keyword querying
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of dataWith the proliferation of geo-positioning and geo-tagging, spatial web objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description ...
Semantic-aware top-k spatial keyword queries
The fast development of GPS equipped devices has aroused widespread use of spatial keyword querying in location based services nowadays. Existing spatial keyword query methodologies mainly focus on the spatial and textual similarities, while leaving the ...
WISK: A Workload-aware Learned Index for Spatial Keyword Queries
PACMMODSpatial objects often come with textual information, such as Points of Interest (POIs) with their descriptions, which are referred to as geo-textual data. To retrieve such data, spatial keyword queries that take into account both spatial proximity and ...
Comments