Analysis and evaluation of the top- $$k$$ most influential location selection query

Chen, Jian; Huang, Jin; Wen, Zeyi; He, Zhen; Taylor, Kerry; Zhang, Rui

doi:10.1007/s10115-013-0720-0

Analysis and evaluation of the top-$k$ most influential location selection query

Regular Paper
Published: 16 January 2014

Volume 43, pages 181–217, (2015)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Jian Chen¹,
Jin Huang²,
Zeyi Wen²,
Zhen He³,
Kerry Taylor⁴ &
…
Rui Zhang²

404 Accesses
7 Citations
Explore all metrics

Abstract

In this paper, we propose a new type of queries to retrieve the top-k most influential locations from a candidate set $C$ given sets of customers $M$ and existing facilities $F$. The influence models the popularity of a facility. Such queries have wide applications in decision support systems. A naive solution sequentially scans (SS) all data sets, which is expensive, and hence, we investigate two branch-and-bound algorithms for the query, namely Estimate Expanding Pruning (EEP) and Bounding Influence Pruning (BIP). Both algorithms follow the best first traverse. On determining the traversal order, while EEP leverages distance metrics between nodes, BIP relies on half plane pruning which avoids the repetitive estimations in EEP. As our experiments shown, BIP is much faster than SS which outperforms EEP, while the worst-case complexity of EEP and BIP is worse than that of SS. To improve the efficiency, we further propose a Nearest Facility Circle Join (NFCJ) algorithm. NFCJ builds an influence R-tree on the influence relationship between customers and existing facilities and joins the candidate R-tree with the influence R-tree to obtain the results. We compare all algorithms and conclude that NFCJ is the best solution, which outperforms SS, EEP, and BIP by orders of magnitude.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

Article 12 April 2024

Rashmin Gajera, Suresh Patel, … Ayush Solanki

The p-Median Problem

Large Neighborhood Search

References

Achtert E, Kriegel HP, Krger P, Renz M, Zfle A (2009) Reverse k-nearest neighbor search in dynamic and general metric databases. In: Proceedings of EDBT
ANNLibrary (2011) http://www.cs.umd.edu/~mount/ann/
Aronovich L, Spiegler I (2009) Bulk construction of dynamic clustered metric trees. Knowl Inf Syst 22(2):211–244
Article Google Scholar
Brinkhoff T, Kriegel HP, Seeger B (1993) Efficient processing of spatial joins using r-trees. In: Proceedings of SIGMOD
Cabello S, Díaz-Báñez JM, Langerman S, Seara C, Ventura I (2006) Reverse facility location problems. In: Proceedings of CCCG
Cheema MA, Lin X, Wang W, Zhang W, Pei J (2010) Probabilistic reverse nearest neighbor queries on uncertain data. IEEE TKDE 22(4):550–564
Google Scholar
Cheema MA, Lin X, Zhang W, Zhang Y (2011) Influence zone : efficiently processing reverse k nearest neighbors queries. In: Proceedings of ICDE
Cheema MA, Zhang W, Lin X, Zhang Y (2012) Efficiently processing snapshot and continuous reverse k nearest neighbors queries. VLDB J
Chen H, Liu J, Furuse K, Yu JX, Ohbo N (2010) Indexing expensive functions for efficient multi-dimensional similarity search. Knowl Inf Syst 27(2):165–192
Article Google Scholar
CloudMade (2013) http://downloads.cloudmade.com/
Du Y, Zhang D, Xia T (2005) The optimal-location query. Adv Sp Temp Databases 3633:163–180
Article Google Scholar
Gao Y, Zheng B, Chen G, Li Q (2009) Optimal-location-selection query processing in spatial databases. IEEE TKDE 68(8):1162–1177
Google Scholar
Ghaemi P, Shahabi K, Wilson JP, Banaei-Kashani F (2010) Optimal network location queries. In: Proceedings of GIS
Guttman A (1984) R-trees: a dynamic index structure for spatial searching. In: Proceedings of SIGMOD, pp 47–57
Huang J, Wen Z, Qi J, Zhang R, Chen J, He Z (2011) Top-k most influential location selection. In: Proceedings of CIKM
Korn F, Muthukrishnan S (2000) Influence sets based on reverse nearest neighbor queries. In: Proceedings of SIGMOD
Mouratidis K, Papadias D, Papadimitriou S (2005) Medoid queries in large spatial databases. In: Proceedings of SSTD, pp 55–72
OpenStreetMap (2013) http://www.openstreetmap.org/
Qi J, Zhang R, Kulik L, Lin D, Xue Y (2012) The min-dist location selection query. In: Proceedings of ICDE
Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: Proceedings of SIGMOD, pp 71–79
Shang S, Yuan B, Deng K, Xie K, Zhou X (2011) Finding the most accessible locations-reverse path nearest neighbor query in road networks categories and subject descriptors. In: Proceedings of GIS
SouFang (2013) http://www.soufun.com
Stanoi I, Riedewald M, Agrawal D, Abbadi AE (2001) Discovery of influence sets in frequently updated database. In: Proceedings of VLDB
Sun Y, Huang J, Chen Y, Zhang R, Du X (2012) Location selection for utility maximization with capacity constraints. In: Proceedings of CIKM
Tao Y, Lian X (2004) Reverse kNN search in arbitrary dimensionality. In: Proceedings of VLDB
Trulia (2013) http://trulia.com
Vaidya PM (1989) AnO(n logn) algorithm for the all-nearest-neighbors problem. Discret Comput Geom 4(1):101–115
Google Scholar
Wong RCW, Özsu MT, Fu AWC, Yu PS, Liu L, Liu Y (2011) Maximizing bichromatic reverse nearest neighbor for L p -norm in two- and three-dimensional spaces. VLDB J 20(6):893–919
Article Google Scholar
Wong RCW, Ozsu MT, Yu PS, Fu AWC, Liu L (2009) Efficient method for maximizing bichromatic reverse nearest neighbor. In: Proceedings of VLDB
Wu W, Yang F, Chan CY, Tan KL (2008) FINCH: evaluating reverse k-nearest-neighbor queries on location data. In: Proceedings of VLDB
Xia T, Zhang D, Kanoulas E, Du Y (2005) On computing top-t most influential spatial sites. In: Proceedings of VLDB
Yan D, Wong RCW, Ng W (2011) Efficient methods for finding influential locations with adaptive grids. In: Proceedings of CIKM, pp 1475–1484
Yang C, Lin KI (2001) An index structure for efficient reverse nearest neighbor queries. In: Proceedings of ICDE, pp 485–492
Zhang D, Du Y, Xia T, Tao Y (2006) Progressive computation of the min-dist optimal location query. In: Proceedings of VLDB
Zhang J, Mamoulis N, Papadias D, Tao Y (2004) All-nearest-neighbors queries in spatial databases. In: Proceedings of SSDM, pp 297–306
Zheng K, Huang Z, Zhou A, Zhou X (2011) Discovering the most influential sites over uncertain data: a rank based approach. IEEE TKDE

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (No. 61272065) and the Natural Science Foundation of Guangdong Province, China (No. S2012010009311), the Fundamental Research Funds for the Central Universities, SCUT(Grant No. 2012ZZ0088), and the Australian Research Council (ARC) Discovery Project DP130104587. Dr. Rui Zhang was supported by the ARC Future Fellowships Project FT120100832. Zeyi Wen was supported by the Commonwealth Scientific and Industrial Research Organisation (CSIRO).

Author information

Authors and Affiliations

School of Software Engineering, South China University of Technology, Guangzhou, China
Jian Chen
Department of Computing and Information Systems, University of Melbourne, Melbourne, Australia
Jin Huang, Zeyi Wen & Rui Zhang
Department of Computer Science, La Trobe University, Bundoora, Australia
Zhen He
The Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, Australia
Kerry Taylor

Authors

Jian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jin Huang
View author publications
You can also search for this author in PubMed Google Scholar
Zeyi Wen
View author publications
You can also search for this author in PubMed Google Scholar
Zhen He
View author publications
You can also search for this author in PubMed Google Scholar
Kerry Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Rui Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rui Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, J., Huang, J., Wen, Z. et al. Analysis and evaluation of the top-$k$ most influential location selection query. Knowl Inf Syst 43, 181–217 (2015). https://doi.org/10.1007/s10115-013-0720-0

Download citation

Received: 22 August 2012
Revised: 08 November 2013
Accepted: 27 December 2013
Published: 16 January 2014
Issue Date: April 2015
DOI: https://doi.org/10.1007/s10115-013-0720-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis and evaluation of the top-\(k\) most influential location selection query

Abstract

Access this article

Similar content being viewed by others

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

The p-Median Problem

Large Neighborhood Search

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Analysis and evaluation of the top-\(k\) most influential location selection query

Abstract

Access this article

Similar content being viewed by others

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

The p-Median Problem

Large Neighborhood Search

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation