skip to main content
10.1145/1142473.1142531acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Efficient reverse k-nearest neighbor search in arbitrary metric spaces

Published:27 June 2006Publication History

ABSTRACT

The reverse k-nearest neighbor (RkNN) problem, i.e. finding all objects in a data set the k-nearest neighbors of which include a specified query object, is a generalization of the reverse 1-nearest neighbor problem which has received increasing attention recently. Many industrial and scientific applications call for solutions of the RkNN problem in arbitrary metric spaces where the data objects are not Euclidean and only a metric distance function is given for specifying object similarity. Usually, these applications need a solution for the generalized problem where the value of k is not known in advance and may change from query to query. However, existing approaches, except one, are designed for the specific R1NN problem. In addition - to the best of our knowledge - all previously proposed methods, especially the one for generalized RkNN search, are only applicable to Euclidean vector data but not for general metric objects. In this paper, we propose the first approach for efficient RkNN search in arbitrary metric spaces where the value of k is specified at query time. Our approach uses the advantages of existing metric index structures but proposes to use conservative and progressive distance approximations in order to filter out true drops and true hits. In particular, we approximate the k-nearest neighbor distance for each data object by upper and lower bounds using two functions of only two parameters each. Thus, our method does not generate any considerable storage overhead. We show in a broad experimental evaluation on real-world data the scalability and the usability of our novel approach.

References

  1. {1} A. M. Andrew. Another efficient algorithm for convex hulls in two dimensions. Information Processing Letters, 9, 1979.Google ScholarGoogle Scholar
  2. {2} N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-Tree: An efficient and robust access method for points and rectangles. In Proc. SIGMOD, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. {3} S. Berchtold, D. A. Keim, and H.-P. Kriegel. The X-Tree: An index structure for high-dimensional data. In Proc. VLDB, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. {4} P. Ciaccia, M. Patella, and P. Zezula. M-Tree: an efficient access method for similarity search in metric spaces. In Proc. VLDB, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. {5} C. Ding and H. Peng. Minimum redundancy feature selection from microarray gene expression data. In CSB03, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. {6} A. Guttman. R-Trees: A dynamic index structure for spatial searching. In Proc. SIGMOD, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. {7} F. Korn and S. Muthukrishnan. Influenced sets based on reverse nearest neighbor queries. In Proc. SIGMOD, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. {8} M. Schroeder. Fractals, Chaos, Power Laws: Minutes from an infinite paradise. W.H. Freeman and company, New York, 1991.Google ScholarGoogle Scholar
  9. {9} A. Singh, H. Ferhatosmanoglu, and A. S. Tosun. High dimensional reverse nearest neighbor queries. In Proc. CIKM, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. {10} I. Stanoi, D. Agrawal, and A. E. Abbadi. Reverse nearest neighbor queries for dynamic databases. In Proc. DMKD, 2000.Google ScholarGoogle Scholar
  11. {11} Y. Tao, D. Papadias, and X. Lian. Reverse kNN search in arbitrary dimensionality. In Proc. VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. {12} C. Yang and K.-I. Lin. An index structure for efficient reverse nearest neighbor queries. In Proc. ICDE, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient reverse k-nearest neighbor search in arbitrary metric spaces

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
      June 2006
      830 pages
      ISBN:1595934340
      DOI:10.1145/1142473

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 June 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader